Professional Documents
Culture Documents
ORIGINAL INVESTIGATION
Abstract Allelic frequencies for up to five short tandem tion of human evolution (Bowcock et al. 1994). In some
repeat systems (HumTH01, HumVWA, HumF13B, cases, they have been found to cause, or to be associated,
HumCD4, HumD21S11) were analyzed in seven popula- with disease (for review see Richards and Sutherland
tion samples from Asia using the polymerase chain reac- 1996). Population genetic data based on these systems of-
tion and gel electrophoresis. No deviations from Hardy- fer a tool to demonstrate parallels or differences between
Weinberg equilibrium were observed. Two new alleles of ethnic groups in an evolutionary context (Meyer et al.
the CD4 and TH01 loci were detected, and sequenced and 1995; Brinkmann et al. 1996).
their molecular structure is presented. A phylogenetic tree Asia is the largest continent and was peopled by archa-
based on Thai, Han Chinese (from the northeast of China), ic Homo species before the arrival of the modern Homo
Japanese, German and Ovambo allelic frequencies was sapiens sapiens (Cavalli-Sforza et al. 1994). Today, it is
constructed and demonstrates the close relationship of the generally accepted that at least two independent waves
Asian populations. Additionally, allelic frequency data for brought modern humans into Asia. A southern wave,
the VWA and TH01 systems were determined for the 50,000–70,000 years ago, brought people along the south-
south Chinese minorities Bai, Dai and Qiang and for Kore- ern coastal line to New Guinea and Australia. A second
ans and compared with the above data. The Bai and Dai expansion, starting in western Asia/northern Africa 50,000
populations were clear outliers of the cluster of all other years ago, brought people to the whole of Asia, the Arctic
Asians, indicating an unexpected pattern of genetic hetero- and also to America (Cavalli-Sforza et al. 1994), where the
geneity of the Chinese nation. Two clusters of Asian popu- ancestors of today’s Amerinds arrived 20,000–25,000
lations could be established: the Koreans and Japanese years ago (Forster et al. 1996). Peoples of this second
together with the Han and Qiang Chinese, and, forming a wave might have mixed with descendants of the first,
separate cluster, the Bai and Dai populations. southern wave out of Africa into Asia but not into high-
land Papua New Guinea and Australia.
Population genetic data based on protein polymor-
phisms have revealed that differences in eastern Asia are
Introduction mainly observed between the northern and the southern
populations. Cavalli-Sforza et al. (1994) found divergence
Short tandem repeat (STR) polymorphic systems are used between north and south Chinese and a clustering of both
in human identification/parentage testing (Gill et al. 1994), of these with neighboring populations. Zhao and Lee
linkage analysis (Hearn et al. 1992), and for the investiga- (1989) assumed that the modern Chinese nation originated
from two distinct populations during early Neolithic times,
one population originating from the northern Yellow river
valley and one originating from the southern Yangtze river
B. Rolf · B. Horst · B. Brinkmann (✉) valley, respectively. Nevertheless, information on nuclear
Institut für Rechtsmedizin, Westfälische Wilhelms-Universität,
von Esmarch Str. 86, D-48149 Münster, Germany DNA polymorphisms, such as, STR allelic frequencies, of
Fax: +49-251-8355158 these populations are still rare. Only a few reports dealing
A. Eigel · J. Horst with STR allelic frequencies in Asian populations, mainly
Institut für Humangenetik, Westfälische Wilhelms-Universität, Japanese and Chinese, have been published (Brinkmann et
Vesaliusweg 12-14, D-48149 Münster, Germany al. 1996; Hou and Walter 1996; Takahashi et al. 1996).
T. Sanguansermsri Considering the size and the variety of Asian populations,
Human Genetic Unit, Faculty of Medicine, the amount of information is poor. The aim of the present
Chiang Mai 50002, Thailand work was to determine the allelic frequencies of up to five
Table 1 Allelic frequencies in the five short tandem repeat (STR) polymorphic systems HumTH01, HumVWA, HumF13B, HumCD4 and HumD21S11. (n number of chromosomes investi-
648
gated)
VWA 11 13 14 14’ 15 16 17 18 19 20 21 22 n
Ovambosa 0.055 0.035 0.065 0 0.2 0.225 0.195 0.115 0.085 0.025 0 0 200
Germansa 0 0.001 0.097 0 0.101 0.207 0.279 0.213 0.086 0.015 0.001 0.001 1298
Japanesea 0 0 0.214 0 0.025 0.178 0.278 0.239 0.062 0 0.004 0 256
Hanb 0 0 0.287 0 0.033 0.143 0.253 0.192 0.055 0.037 0 0 181
Thais 0 0 0.263 0.004 0.04 0.194 0.202 0.194 0.087 0.012 0.004 0 248
Baia 0 0 0.23 0 0.039 0.236 0.197 0.191 0.084 0.017 0.006 0 178
Dai 0 0 0.219 0 0.036 0.158 0.23 0.214 0.128 0.015 0 0 196
Qiang 0 0 0.205 0 0.026 0.234 0.234 0.186 0.091 0.026 0 0 280
Koreans 0 0.026 0.225 0 0.051 0.126 0.321 0.1506 0.076 0 0.026 0 40
CD4 4 5 6 7 8 9 10 11 12 13 14 15 n
Ovambosb 0 0.228 0.156 0.067 0.091 0.01 0.095 0.21 0.086 0.014 0.038 0.005 210
Germansb 0 0.348 0.305 0 0.003 0.003 0.307 0.023 0.009 0.002 0 0 600
Japaneseb 0 0.733 0.033 0 0 0.003 0.23 0.001 0 0 0 0 274
Hanb 0 0.715 0.005 0 0 0.005 0.265 0.01 0 0 0 0 200
Thais 0.012 0.606 0.016 0 0 0.004 0.35 0.012 0 0 0 0 248
649
206
816
296
194
248
n
mCD4 and HumD21S11) in Chinese and Japanese as well
>33.2
as in the three Chinese minority populations, Bai, Dai and
0.081
0.005
0.003
0.005
0.012
Qiang, and in the neighboring Thai and Korean popula-
tions, for the reconstruction of the phylogenetic history of
several Asian populations.
0.001
0.027
0.043
0.057
0.076
33.2
0.008
33
Population samples
0.084
0.097
0.177
0.129
0.184
32.2
0.036
0.008
0.04
ang Chinese from Sichuan, Thais from the Chiang Mai Province,
32
Koreans from Seoul, South Korea, and Japanese from the Shiga area.
0.034
0.095
0.062
0.064
31.2
0.06
DNA extraction
0.005
0.048
30.2
202
804
274
200
248
HumD21S11 (Sharma and Litt 1992; Möller et al. 1994b) and Hu-
0
0
0
0
0.001
0.004
29.2
0
0
lelic ladder.
0.187
0.205
0.197
0.301
0.195
0.001
0.001
0.005
10K
29
Sequence analysis
0.433
0.795
0.644
28.2
0.02
0.05
0
0
0.032
0.063
0.005
0.001
0.004
0.26
0.03
0
0
0.385
0.225
0.146
0.228
27.2
0.22
Statistics
0
0
0
0
0.005
0.045
0.224
0.055
0.075
0.104
was used (Guo and Thompson 1992) with HWE Analysis 3.2 soft-
27
0.116
0.012
0.004
0.004
0
0
1993) and the Da distance measure (Nei et al. 1983). Based on this
Table 1 (continued)
0.398
0.103
0.012
0
0
0
0
0
0
Ovambosb
Ovambosb
Germansb
Germansb
Japaneseb
Japaneseb
D21S11
Thaisc
Thais
Hanb
Hanb
F13B
b
a
c
650
Table 2 Distance matrix for five
populations based on five loci Ovambos Germans Japanese Han Thai Bai Dai Qiang
(61 alleles, upper right triangle)
or for nine populations based on Ovambos - 0.1492 0.2644 0.2553 0.2246
two loci (23 alleles, lower left
Germans 0.1286 - 0.0965 0.1082 0.0809
triangle). Calculations were
done using the program DISPAN Japanese 0.1461 0.0763 - 0.0195 0.0273
(Ota 1993). The measurement of Han 0.1550 0.1144 0.0238 - 0.0240
distance used is Da (Nei et al. Thais 0.1300 0.0881 0.0196 0.0241 -
1983) Bai 0.1111 0.1218 0.0698 0.0609 0.0462 -
Dai 0.1232 0.0820 0.0505 0.0639 0.0357 0.0169 -
Qiang 0.1224 0.0803 0.0267 0.0180 0.0172 0.0594 0.0465 -
Koreans 0.1545 0.1241 0.0254 0.0275 0.0385 0.0698 0.664 0.0380
Results
pattern is still the same as that observed above for the five The major ethnic group in China are the Han, making
populations using the five STR systems. The addition of up 93% of the population. The remaining 7% comprise 55
four populations and the use of fewer loci did not change minority ethnic groups (Matsumoto et al. 1995). The Bai
the basic phylogenetic relationship. The Koreans are clus- and Dai minorities are from the Yunnan Province in China,
tered together with the Han Chinese from the Shenyang ar- close to Thailand. The Bai and the Qiang from Sichuan
ea in 75% of the bootstraps; these two are then clustered Province are speakers of a Tibetoburmese language. The
together with the Japanese with the same bootstrap proba- Dai population consists of two, closely related groups, the
bility. The Chinese minorities Bai and Dai are clear outli- southern Dai, who are related to the northern Thai, and the
ers to the cluster of the other Asians. The high bootstrap western Dai, who are related to the Shan of Burma and ex-
value of 100 indicates the significance of this finding. The hibit a higher frequency of the HbE gene (Yongvanit et al.
Thai and the Chinese minority population Qiang from Si- 1989, G. Flatz, personal communication). The Dai sample
chuan Province are not included in this branch. Although in this study is from the cities of Dali and Weishan in the
the bootstrap probability for the Qiang is not significant, western part of the province, 150 km west of Kunming.
they are placed outside the northeast Asian cluster (Han, Therefore, they belong to the Shan. Interestingly, the Bai
Koreans and Japanese) and the cluster of the Bai and Dai. and Dai populations are placed outside all other Asian
populations with a bootstrap probability of 100%, al-
though both the Thai and the Dai populations speak a Daic
language. The Dai population comprises 760,000 people
Discussion and the Bai 1,000,000 (Etler 1992); thus, recent effects of
drift due to the small size of these populations are proba-
The proposed model for the mutation of microsatellites is bly not the reason for the phylogenetic difference. Selec-
the so-called single step mutation model, in which muta- tion resulting from climatic influence on genetic composi-
tions change the number of repeats mainly by one unit tion has not been described for the neutral STR markers.
(Weber and Wong 1993; Valdes et al. 1993). The two new As the STRs described here do not code for proteins and
alleles observed here in this study in the Thai sample, al- are mainly in the introns of genes, hitchhiking would be
lele 4 at the CD4 locus and allele 12 at the TH01 locus, the only possible mechanism of selection, but this has not
differ from the known allelic size range by one repeat unit. been described so far.
This finding might support the single step mutation model, The genetic diversity observed reflects only in part the
although other mechanisms are also possible. Pedigree geographical and linguistic relationships of the Asian pop-
analysis revealed mutation rates of 10–5–10–2 for the sys- ulations. China is genetically very heterogeneous, and the
tems (Weber and Wong 1993). The two new alleles de- Chinese are much more closely related to their neighbors,
scribed here could have originated recently de novo in such as the Koreans or Japanese, than to a minority living
Asia/Thailand. However, we cannot exclude the possibility in China.
that these alleles originated in an African precursor popu-
lation and that they were then subject to genetic drift, and
are thus no longer present in all populations. References
The tree provides strong evidence for the close relation-
ship between the Asian populations. The branching pattern Allen CA, Graves G, Budowle B (1989) Polymerase chain reaction
observed in the tree in Fig. 1 reflects the geographic loca- amplification products separated on rehydratable polyacrylamide
gels and stained with silver. Biotechniques 7:736–744
tion of the population samples, as the Han Chinese sample Bowcock AM, Ruiz-Linares A, Tomfohrde J, Minch E, Kidd JR,
is from the northeast of China, much closer to Japan than Cavalli-Sforza LL (1994) High resolution of human evolutionary
to Thailand. In the two-loci tree in Fig. 2, the Koreans are trees with polymorphic microsatellites. Nature 368:455–457
clustered together with the Han Chinese in 75% of the Brinkmann B, Sajantila A, Goedde HW, Matsumoto H, Nishi K,
Wiegand P (1996) Population genetic comparisons among eight
bootstraps and these two are then clustered together with populations using allele frequency and sequence data from three
the Japanese with the same bootstrap probability. Studies microsatellite loci. Eur J Hum Genet 4:175–182
on mtDNA polymorphisms indicate that a major part of Brinkmann B, Junge A, Meyer E, Wiegand P (1998) Population ge-
the Japanese gene pool is derived from Yayoi people, who netic diversity in relation to microsatellite heterogeneity. Hum
Mutat 11:135–144
migrated to Japan from the Korean peninsula/northeast Budowle B, Chakraborty R, Giusti AM, Eisenberg AJ, Allen RC
China about 2300 years ago (Horai et al. 1996). Genes of (1991) Analysis of the variable number of tandem repeat locus
the older Neolithic Japanese population, the Jomon peo- D1S80 by the polymerase chain reaction followed by high reso-
ple, are less frequent in modern Japanese. These findings lution polyacrylamide gel electrophoresis. Am J Hum Genet
are also supported by studies on Y-chromosomal variation 48:137–144
Cavalli-Sforza LL, Menozzi P, Piazza A (1994) The history and ge-
in Japanese males (Hammer and Horai 1995). The phylo- ography of human genes. Princeton University Press, Princeton
genetic relationship based on data for the autosomal STRs NJ
reveals a similar pattern. Koreans and Japanese are speak- Edwards A, Hammond HA, Jin L, Caskey CT, Chakraborty R (1992)
ers of an Altaic language (according to Ruhlen 1987), Genetic variation at five trimeric and tetrameric tandem repeat
loci in four human population groups. Genomics 12:241–253
whereas the Chinese speak a Sinotibetan language; thus at Edwards MC, Clemens PR, Tristan M, Pizzuti A, Gibb RA (1991)
least the genetic relationship between Koreans and Japa- Pentanucleotide repeat length polymorphism at the human CD4
nese is supported by linguistic evidence. locus. Nucleic Acids Res 19:4791
652
Etler DA (1992) Recent developments in the study of human biology Nei M, Tajima F, Tateno Y (1983) Accuracy of estimated phyloge-
in China: a review. Hum Biol 64:567–585 netic trees from molecular data. II Gene frequency data. J Mol
Forster P, Harding R, Torronio A, Bandelt H-J (1996) Origin and Evol 19:153–170
evolution of native American mtDNA variation. Am J Hum Nishimura DY, Murray JC (1992) A tetranucleotide repeat for the
Genet 59:935–945 F13B locus. Nucleic Acids Res 20:1167
Gill P, Ivanov PL, Kimpton C, Piercy R, Benson N, Tully G, Evett I, Richards RI, Sutherland GR (1996) Repeat offenders: simple repeat
Hagelberg E, Sullivan K (1994) Identification of the remains of sequences and complex genetic problems. Hum Mutat 8:1–7
the Romanov family by DNA analysis. Nat Genet 6:130–135 Saitou N, Nei M (1987) The neighbor-joining method: a new method
Guo SW, Thompson EA (1992) Performing the exact test of Hardy- for reconstructing phylogenetic trees. Mol Biol Evol 4:406–425
Weinberg proportion for multiple alleles. Biometrics 48:361–372 Sharma V, Litt M (1992) Tetranucleotide repeat polymorphism at the
Hammer MF, Horai S (1995) Y chromosomal variation and the D21S11 locus. Hum Mol Genet 1:67
peopling of Japan. Am J Hum Genet 56:951–962 Takahashi M, Kato Y, Miyakawa G, Kuroso A, Kamiyama S (1996)
Hearn CM, Ghosh S, Todd JA (1992) Microsatellites for linkage Allele detection and population study in Japanese using two STR
analysis of genetic traits. Trends Genet 8:288–294 loci (Cyp19 and TH01). Int J Legal Med 108:323–326
Horai S, Murayama K, Hayasaka K, Matsubayashi S, Hattori Y, Ota T (1993) Program Dispan: genetic distance and phylogenetic
Fucharoen G, Harihara S, Park KS, Omoto K, Pan I-H (1996) analysis. Copyright of the Pennsylvania State University, Univer-
mtDNA polymorphism in East Asian populations, with special sity Park, PA 16802, USA
reference to the peopling of Japan. Am J Hum Genet 59:579–590 Ruhlen M (1987) A guide to the world’s languages. Stanford Univer-
Horst B, Eigel A, Sanguansermsri T, Rolf B (1997) Analysis of the sity Press, Stanford, California
short tandem repeat systems HumVWA and HumF13B in a pop- Valdes AM, Slatkin M, Freimer M (1993) Allele frequencies at
ulation sample from northern Thailand. Int J Legal Med microsatellite loci: the stepwise mutation model revisited.
110:235–237 Genetics 133:737–749
Hou Y, Walter H (1996) Genetic substructure at the STR loci Walsh PS, Metzger DA, Higuchi R (1991) Chelex 100 as a medium
HumTH01 and VWA in Han populations, China. In: Carracedo for simple extraction of DNA for PCR-based typing from foren-
A, Brinkmann B, Bär W(eds) Advances in forensic haemo- sic material. BioTechniques 10:506–513
genetics 6. Springer, Berlin Heidelberg New York Waye JS, Lawrence PA, Budowle B, Shuttler GG, Fourney RM
Kimpton C, Walton A, Gill P (1992) A further tetranucleotide repeat (1989) A simple and sensitive method for quantifying human ge-
polymorphism in the vWF gene. Hum Mol Genet 1:287 nomic DNA in forensic specimen extracts. Biotechniques
Matsumoto H, Miyazaki T, Suzuki K (1995) Characteristics of Japa- 7:852–855
nese, Korean, and Chinese populations based on the genetic Weber JL, Wong C (1993) Mutation of human short tandem repeats.
markers of human immunoglobolins. Z Morphol Anthropol Hum Mol Genet 2:1123–1128
83:23–32 Wiegand P, Budowle B, Rand S, Brinkmann B (1993) Forensic vali-
Meyer E, Wiegand P, Brinkmann B (1995) Phenotype differences of dation of the STR systems SE33 and TC11. Int J Legal Med
STRs in 7 human populations. Int J Legal Med 107:314–322 105:315–320
Miller SA, Dykes DD, Polsky HF (1986) A simple salting out proce- Yongvanit P, Sriboonlue P, Mularlee N, Karnthong T, Areeji-
dure for extracting DNA from human nucleated cells. Nucleic tranusorn P, Hundrieser J, Limberg R, Schulze B, Laig M, Flatz
Acids Res 31:1215 SD (1989) DNA haplotypes and frameworks linked to the beta-
Möller A, Brinkmann B (1994) Locus ACTBP2 (SE33): sequencing globin locus in an Austro-Asiatic population with a high
data reveal considerable polymorphism. Int J Legal Med prevalence of hemoglobin E. Hum Genet 83:171–174
106:262–267 Zhao T, Lee DL (1989) Gm and Km allotypes in 74 Chinese popula-
Möller A, Wiegand P, Grüschow CP, Seuchter, SA, Baur MP, tions: a hypothesis of the origin of the Chinese nation. Hum
Brinkmann B (1994a) Population data and forensic efficiency Genet 83:101–110
values for the STR systems HumVWA, HumMBP and Hum-
FABP. Int J Legal Med 106:183–189
Möller A, Meyer E, Brinkmann B (1994b) Different types of struc-
tural variation in STRs: HumFES/FPS, HumVWA and
HumD21S11. Int J Legal Med 106:319–323