ADH1B*47His allele map’s distribution and frequency reveals origin and path of expansion of rice domestication from South China’s Zhejiang center to Japan

Fig. 1

Fig. 1

Peng et al. “The ADH1B Arg47His polymorphism in East Asian
populations and expansion of rice domestication
in history
“, BMC Evolutionary Biology 2010, 10:15

Background: The emergence of agriculture about 10,000 years ago marks a dramatic change in human evolutionary history. The diet shift in agriculture societies might have a great impact on the genetic makeup of Neolithic human populations. The regionally restricted enrichment of the class I alcohol dehydrogenase sequence polymorphism (ADH1BArg47His) in southern China and the adjacent areas suggests Darwinian positive selection on this genetic locus during Neolithic time though the driving force is yet to be disclosed.
Results: We studied a total of 38 populations (2,275 individuals) including Han Chinese, Tibetan and other ethnic populations across China. The geographic distribution of the ADH1B*47His allele in these populations indicates a clear east-to-west cline, and it is dominant in south-eastern populations but rare in Tibetan populations. The molecular dating suggests that the emergence of the ADH1B*47His allele occurred about 10,000~7,000 years ago.
Conclusion: We present genetic evidence of selection on the ADH1BArg47His polymorphism caused by the emergence and expansion of rice domestication in East Asia. The geographic distribution of the ADH1B*47His allele in East Asia is consistent with the unearthed culture relic sites of rice domestication in China. The estimated origin time of ADH1B*47His allele in those populations coincides with the time of origin and expansion of Neolithic agriculture in southern China.


The major diet shift in recent human history was caused by domestication of plants and animals[1]. During human evolution, diet shifts may create different selective pressures acting on the genetic variations of human populations. Two well-studied examples are the copy number variation of amylase gene for starchy food and the regulatory sequence variations of lactase for milk [2-4]. In southern China, the earliest agriculture started to flourish due to the domestication of rice about 10,000 years ago [5]. Hence, like the amylase gene selected for high copy numbers in agricultural societies including East Asia, the rice-culture-related selection could have been acting on populations living in southern China. Rice has been used as the material to produce fermented food and beverages for a long time insouthern China since early Neolithic time. The fermentation helps to preserve and enhance the nutritional value of foods and beverages[6]. However, alcohol can lead to addiction and cause damages to human bodies, including nervous system dysfunction, tumor genesis, innate immune system modulation and fetal alcohol syndrome [7-11]. Therefore, genes involved in the ethanol metabolic pathway might become the target of selection when the ethanol-containing food and beverages had been routinely consumed by Neolithic populations in southern China.
The Class I alcohol dehydrogenase (ADH) is the major enzyme that catalyzes alcohol to acetaldehyde in liver.
The Class I ADH genes (ADH1A, ADH1B, and ADH1C) encode three subunits of Class I ADH isoenzymes, i.e. a, b and g. The well studied sequence polymorphism, ADH1BArg47His (rs1229984) is located in ADH1B. The change of amino acid from Arg to His causes enzymatic activity alteration. The derived allele, ADH1B*47His, changes the pKa of the enzyme from 8.5 to 10.0 which is associated with 40 to 100 fold increase in Km and Vmax of alcohol metabolism [12,13]. A global investigation of the ADH1B*47His allele frequency shows a strong geographic distribution. It is dominant in East Asian populations, but rare in European and African populations[14]. The molecular signature of positive selection on ADH1B have been reported [15,16], and the culture-related selective forces were proposed [17] though no correlation with rice domestication has been tested. We hypothesize that the emergence and expansion of rice domestication during Neolithic time is the driving force, leading to the current regional distribution of the ADH1BArg47His polymorphism in East Asia.

ADH1B*47His allele frequency in East Asian populations We analyzed a total of 2,275 individuals from 38 East Asian populations, especially those not included in the previous reports (northern Han Chinese, Tibetan and southern ethnic populations in China). Table 1 lists the frequencies of ADH1B*47His in the 38 populations. In general, the distribution pattern is consistent with the previous reports[14,17], and most of the populations (31/38) have frequencies higher than 50%. In Han Chinese, the highest frequency is detected in Zhejiang province of south-eastern China (98.5%), and those in the west have relatively low frequencies (60-70%). The same pattern is also observed for the other ethnic populations from China and Southeast Asia (Cambodia and Thailand) except for Tibetan (14.1% on average), Bulang (1.7%, an ethnic population from south-western China) and Cambodian (20.6%). All the five Tibetan populations from different geographic regions have low frequencies (13-21%). We created a contour map based on the data from the 38 populations and those published before (Figure. 1). The distribution of the frequencies of ADH1B*47His confirms its prevalence in East Asia and a clear east-to-west cline is observed.

Selection on the ADH1B gene
To detect the molecular signature of recent selection on the ADH1B*47 polymorphism, we applied the LRH method and the iHS statistics using the genotype data from the HapMap project. The obtained iHS value for the core SNP (rs1229984) is -2.189 (the empirical pvalue is 0.0269), an indication of selection. We then define the core region of ADH1B on the basis of five SNPs (rs4147536, rs1229984, rs1353621, rs1159918 and rs6810842) which determine the East Asian-dominant haplotype. We also select the flanking SNPs, extending both upstream and downstream to 250 Kb, to study the decay of LD from the core haplotype. We plot the haplotype-bifurcation diagrams[18] for the two East Asian populations (Figure. 2) from HapMap (JPT: Japanese in Tokyo, Japan; CHB: Han Chinese in Beijing, China). At a minimum threshold of 9%, we define two core-region haplotypes in the JPT+CHB population. The haplotype CTTCG, which covers the derived variant of ADH1B*47His has an extended predominance by showing a thick branch in the haplotype-bifurcation diagram, clearly suggesting a long-range LD.
The EHH and REHH of the major core haplotypes (≥9%) are plotted against the distance away from the core for the JPT+CHB population (Figure. 2). The EHH of the CTTCG core haplotype decays more slowly than that of the other core haplotype (containing the ancestral variant ADH1B*47Arg) does. In addition, the upstream REHH value of the CTTCG is 17.329 (P = 0.01, by using 1-NORMSDIST). Again, this result is highly consistent with the previous studies, in which the molecular signature of selection was suggested in a wider genomic region containing the ADH1B locus among East Asian populations[15,17]. The selection on ADH1B was also reported previously when the global populations were screened[16]. Additionally, a strong signature of positive selection was detected for the ADHgene cluster in a genome-wide analysis[19]. Collectively, the distribution of ADH1B*47His allele frequency in the populations studied cannot be explained by random genetic drift, and recent selection needs to be invoked.

The time of selection
Previous studies suggested a culture-related selection on the ADH1B*47His[17]. To test this, we superimposed the unearthed culture relic sites of rice domestication in East Asia and we observed a significant correlation of the ADH1B*47His allele frequencies with the ages of rice domestication (r = 0.769, p < 0.01, two-tailed t test; Figure 3; see Additional file 1). The origin of rice domestication occurred along the Yangtze River of southern China about 10,000 years ago[20,21]. Based on the culture relics, the earliest rice sites are located in southern and south-eastern China (8,000-12,000 YBP), and then expanded to the central parts of China about 3,000-6,000 years ago, reaching Korea and Japan less than 3,000 years ago [22,23].

Fig. 3

The spread of rice domestication

agrees well with the distribution of ADH1B*47His, implying that rice domestication is likely the force driving up the frequency and expansion of ADH1B*47His in East Asia during the past 10,000 years.
To see if the initial increase of ADH1B*47His in East Asia occurred during the same period as the emergence of rice domestication in early Neolithic time, we conducted molecular dating[24] by typing the nearest STR loci (a CATA repeat STR located about 14 Kb upstream to the ADH1B locus, and a ATTC repeat STR located about 35 Kb downstream to the ADH1B locus) in 598 individuals randomly selected from the 38 populations.
For phase reconstruction, only homozygous individuals with the ADH1B*47His alleles are included (see Additional file 2). The estimated ages based on the STRs are 5,525 YBP (CATA repeat), and 9,200 YBP (ATTC repeat). Considering that the two STR loci are still far away from the ADH1B locus, we also estimate the age of the ADH1B*47His based on the phased SNP haplotypes from the HapMap dataset (see Additional file 3).
With the fine-scale genetic map, we selected 19 contiguous polymorphic SNPs to estimate the age (Table 2).
Surprisingly, the estimated ages are extremely different between the upstream SNPs (114,693-208,919 yrs, 95% confidence interval) and the downstream SNPs (7,338-9,948 yrs, 95% confidence interval), which is due to the dramatic change of recombination rates in the studied genomic region. As suggested, the method based on the moments estimator[24] is not suitable for the region of low average recombination rates. The previous genomic study based on the HapMap SNPs also excluded the regions with low average recombination rate[25]. Therefore, the age estimated based on the downstream SNPs seems to reflect the real age of ADH1B*47His allele, which is also consistent with the ages estimated from the STR variations. Taken together, the age of the derived allele at the ADH1B locus falls in the range of 10,000-7,000 years before present.

Having established that the rice culture is likely the driving force of selection on the ADH1BArg47His polymorphism, the left question would be to explain the selective advantage of the ADH1B*47His allele. In southern China, people began to make fermented beverages long time ago. The potential benefits of having fermented beverage (or foods) can be explained by ethanol’s combined analgesic, disinfectant and profound mind-altering effects[26]. In addition, fermentation helps to preserve and enhance the nutritional value of foods and beverages. Chemical analyses of ancient organics absorbed into pottery jars suggests that the earliest production of rice fermentation was carried out by the Neolithic people who lived in southern China about 9,000 years ago[6], not long after the origin of rice domestication in the same region. We believe that the custom could have prevailed rapidly among those early agriculture populations in southern China during the Neolithic time, which have lasted thousands of years.
The ADH I has a low Km for ethanol, found in the liver, which metabolizes the most part of ethanol in the body.
The derived ADH1B*47His allele is known to metabolize ethanol up to 100 times quicker than the ancestral ADH1B*47Arg allele, providing support that quick eradication of ethanol, and therefore lower local exposure should be protective. The recent case-control studies also suggested that the ADH1B*47His allele is the protective variant [27-30]. The higher metabolic rate of ADH1B*47His may also lead to the accumulation of the toxic aldehyde intermediate that has been commonly associated with the flushing phenotype[31]. An association study in Han Chinese indicates that the individuals carrying ADH1B*47His have the lowest risk for alcoholism[32].
It was suggested that the flushing phenotype is biochemically equivalent to the effects of disulfiram (a drug used to prevent relapse)[33], which can influence drinking behaviour as a way of protection from over consumption of alcohol. It can also protect against the damage to human bodies caused by alcohol consumptions.

In summary, we provide a plausible explanation about the high frequency of the derived ADH1B*47His allele in East Asia. The distribution of the derived ADH1B*47His allele in East Asia can be well explained by the origin and expansion of the Neolithic rice culture, which is so far one of the few cases demonstrating the genetic adaptation of human populations to the dramatic change during Neolithic time. The ethanol intake increased with the origin of rice agriculture in southern China creates a selective pressure on the Neolithic populations, which is similar with the convergent adaptation of human lactase persistence in Africa and Europe along with the emergence of Neolithic cattle farming[4].

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s