Received 4 July 2000; received in revised form 14 November 2000; accepted 14 November 2000
Abstract
In many indigenous minority populations, and among migrants from Asian and African populations now resident in western
Europe, North America and Australia, there is a strong tradition of endogamy and a preference for consanguineous unions.
These marriage practices can result in FST values greatly in excess of the maximum value (0.01) currently recommended for
forensic DNA purposes under guidelines established by the National Research Council (NRC) of the USA. To examine the
possible extent of deviation from this accepted norm, three co-resident Pakistani communities were studied using 10
autosomal dinucleotide markers and six tetranucleotide markers on the Y-chromosome. The mean population subdivision
coefficient (FST) value was 0.13 for the autosomal loci, and Y-chromosome loci exhibited even stronger differentiation with
unique alleles identified in all three communities. The data indicate that even when sub-populations are virtually
indistinguishable in terms of anthropology, geography, ethnicity or culture, they may still exhibit major genetic differentiation.
Where significant population stratification is known to exist, more detailed genetic databases should be developed for forensic
DNA purposes, based on reference data from each of the appropriate sub-populations and not on random or combined
samples. © 2001 Elsevier Science Ireland Ltd. All rights reserved.
Keywords: Forensic DNA; Population genetic structure; Endogamous communities; FST; STR markers
1. Introduction
During the last two decades the application of genomic
analysis to forensic investigation has resulted in levels of
certainty in individual identification and paternity testing
previously impossible with exclusion-based criteria. However, the extent and effects of genetic differentiation within
different populations are an important facet of these investigations that has yet to be satisfactorily addressed.
The degree of uncertainty surrounding this topic was
evident in the creation of the ceiling principle, formulated
by the National Research Council (NRC) of the USA, which
sought to establish a maximum threshold for human allele
0379-0738/01/$ see front matter © 2001 Elsevier Science Ireland Ltd. All rights reserved.
PII: S0379-0738(00)00442-4
such as native American and Inuit tribes, it was recommended that a substitute database should be chosen, composed of other groups living in the same geographical region
and/or selected on the recommendation of appropriately
qualified physical anthropologists ([6], p. 123). While random sampling can provide credible estimates for large-scale,
inter-population differentiation, it effectively ignores intra-population subdivisions. But, since substantial genetic subdivision occurs in many human societies and recognised
population isolates, the FST values generated by these
internal differences may prove to be significantly larger
than the inter-population differentiation.
3. Results
As indicated in Table 1, for the autosomal dinucleotide
loci the mean FST value across all three communities
was 0.13, ranging from a minimum of 0.05 (D13S270
and D15S101) to a maximum of 0.21 (D15S108). The Y-chromosome loci exhibited even greater differentiation, with
unique alleles identified in all three communities (Table 2).
Indeed, at the DYS390 locus the alleles were community-specific, and so the estimate of FST matched the theoretical
maximum of 1.00. This is especially interesting from a forensic
perspective, given the extensive use of Y-chromosome allele
variability in forensic studies [14].
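
Why a locus with community-specific alleles yields FST = 1.00 can be seen from a simple heterozygosity-based calculation. The sketch below is illustrative only: it uses the ratio (HT - HS)/HT with unweighted community frequencies rather than the estimator computed by the GDA software that underlies the reported values, and the function names are hypothetical. The DYS390 counts are those listed in Table 2.

```python
def gene_diversity(freqs):
    """Expected heterozygosity (gene diversity): 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs.values())


def fst_haploid(counts_by_pop):
    """Heterozygosity-based FST = (HT - HS) / HT for one haploid locus.

    counts_by_pop maps community name -> {allele size: number of individuals}.
    Communities are weighted equally (an assumption of this sketch).
    """
    pop_freqs = []
    for counts in counts_by_pop.values():
        n = sum(counts.values())
        pop_freqs.append({allele: c / n for allele, c in counts.items()})

    k = len(pop_freqs)
    # HS: mean gene diversity within communities
    h_s = sum(gene_diversity(f) for f in pop_freqs) / k
    # HT: gene diversity of the averaged allele frequencies
    alleles = set().union(*pop_freqs)
    mean_freqs = {a: sum(f.get(a, 0.0) for f in pop_freqs) / k for a in alleles}
    h_t = gene_diversity(mean_freqs)
    if h_t == 0.0:
        raise ValueError("locus is monomorphic overall; FST is undefined")
    return (h_t - h_s) / h_t


# DYS390 allele counts from Table 2: each community carries a single, different allele
dys390 = {
    "Awan": {223: 41},
    "Khattar": {215: 24},
    "Rajpoot": {211: 22},
}
fst_dys390 = fst_haploid(dys390)
print(fst_dys390)
```

Because every community is fixed for its own allele, the within-community diversity HS is zero and the ratio reaches its maximum regardless of how the communities are weighted.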
It could be argued that STR dinucleotide markers have
not been used in forensic settings because of potential errors
in genotyping, and the expectation that mutation rates may
be higher than at tetranucleotide loci [15–17]. If these
caveats proved correct the net result could be greater allelic
variation and hence larger genetic differences between
diverged populations at dinucleotide than tetranucleotide
loci. Since FST is treated as a measure of differentiation
among populations that is relative to the total degree of
population diversity, there should, however, be little difference between the FST values calculated for markers with
di- or tetranucleotide repeats.
To test this supposition, the degree of FST variation found
with different types of STR markers was compared with RST
values (an analog of FST for microsatellites [18]) calculated
for previously reported global data [19,20]. The resultant
mean values in samples from comparable geographical
regions were 0.14 for dinucleotide loci and 0.12 for tetranucleotide loci, levels of differentiation which correspond
well with those reported for other genetic markers ([21],
Table 1
Mean values of genetic differentiation FST at autosomal DNA markers among three co-resident communities in the province of Punjab, Pakistan^a,b

Locus      FST
D13S192    0.16
D13S126    0.09
D13S133    0.11
D13S270    0.05
D15S11     0.10
D15S97     0.17
GABRB3     0.17
D15S101    0.05
D15S108    0.21
D15S98     0.20
Average    0.13 ± 0.019

^a FST values have been estimated as θ [12] using the GDA software [13].
^b Samples from two communities (Khattar and Rajpoot) are identical to those from [9]; the Awan sample used in this study differs from that in [9].
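
The average in Table 1 can be checked directly from the per-locus values. In the short sketch below, the second figure (0.019) is assumed to be the standard error of the mean across the 10 loci; this interpretation is not stated in the table itself, although the rounded values are consistent with it.

```python
import math
import statistics

# Per-locus FST values as listed in Table 1
fst_by_locus = {
    "D13S192": 0.16, "D13S126": 0.09, "D13S133": 0.11, "D13S270": 0.05,
    "D15S11": 0.10, "D15S97": 0.17, "GABRB3": 0.17, "D15S101": 0.05,
    "D15S108": 0.21, "D15S98": 0.20,
}

values = list(fst_by_locus.values())
mean_fst = statistics.mean(values)
# Standard error of the mean: sample standard deviation / sqrt(number of loci)
se_fst = statistics.stdev(values) / math.sqrt(len(values))

print(f"mean FST = {mean_fst:.2f}, SE = {se_fst:.3f}")
```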
Table 2
Allele content in the three Punjab communities at Y-chromosome tetranucleotide loci (data from [9])^a

Locus        Awan        Khattar     Rajpoot
DYS19        202 (41)    194 (24)    186 (7), 190 (15)
DYS389-I     253 (41)    253 (24)    249 (12), 253 (10)
DYS389-II    373 (41)    373 (24)    365 (15), 369 (7)
DYS390       223 (41)    215 (24)    211 (22)
DYS391       287 (41)    283 (24)    283 (22)
DYS393       124 (41)    124 (24)    116 (15), 124 (7)

^a Italics indicate the allele notation (size in bp); the number in parentheses shows the total number of individuals with this allele; unique alleles are underlined.
4. Discussion
In the present example, and in many other Asian and
African populations, community endogamy is the rule. In
addition, consanguinity acts as a major determinant of
genetic differentiation, and of pregnancy outcome [22].
For example, unions between first cousins (coefficient of
inbreeding, F = 0.0625) currently account for 49.4% of all
marriages in Pakistan [10], and in the southern states of India
29.5% of marriages are contracted either between uncle and
niece (F = 0.125) or first cousins [23]. What these general
statistics fail to indicate is that marriages are not merely
contracted within families, but also occur almost exclusively
within wider endogamous mating groups and hence separate
breeding pools, examples being castes in India, biradaris in
Pakistan, and tribal groupings in Arab populations. Therefore, in such communities the interpretation of DNA
evidence can be complicated both by a high level of
within-gene pool inbreeding and significant between-gene
pool differentiation [24,25].
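
The two inbreeding coefficients quoted above follow from Wright's path-counting formula, in which each common ancestor of the parents contributes (1/2)^(n1 + n2 + 1), where n1 and n2 are the numbers of generations separating each parent from that ancestor. The sketch below is a minimal illustration; the function name is hypothetical and the common ancestors are assumed to be non-inbred unless stated otherwise.

```python
def inbreeding_coefficient(paths, ancestor_f=0.0):
    """Wright's path formula for the offspring of related parents.

    paths: one (n1, n2) tuple per common ancestor, giving the number of
           generations from each parent back to that ancestor.
    ancestor_f: inbreeding coefficient of the common ancestors
                (assumed identical for all, 0.0 by default).
    """
    return sum(0.5 ** (n1 + n2 + 1) * (1.0 + ancestor_f) for n1, n2 in paths)


# First cousins share two grandparents; each parent is 2 generations from them
f_first_cousins = inbreeding_coefficient([(2, 2), (2, 2)])

# Uncle and niece share the uncle's parents: 1 generation for him, 2 for her
f_uncle_niece = inbreeding_coefficient([(1, 2), (1, 2)])

print(f_first_cousins, f_uncle_niece)
```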
The importance of including co-ancestry in the calculation of probability estimates has been emphasised [26], and
some doubt has been expressed ([27], p. 586) as to the
applicability of the FST maxima suggested by the NRC for
forensic purposes, i.e. F ST 0:01 to 0.03 [6]. The present
study clearly demonstrates that within specific communities,
endogamy reinforced by a preference for consanguineous
unions can result in FST values greatly in excess of these
maxima (Table 1). In fact even a value for FST of 0.05, which
was suggested to correct for the effects of population subdivision [28], could result in overstatement of forensic
indices and corresponding match probabilities by two orders
of magnitude based on an assumption of no differentiation
([26], p. 7; [29], Tables 4 and 5).
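
The sensitivity of a multi-locus match probability to the assumed FST can be illustrated with the conditional genotype match probabilities of Balding and Nichols, which are also given in the 1996 NRC report [6]. The sketch below is an illustration under stated assumptions rather than the calculation behind the figures cited above: the ten heterozygous loci and their allele frequencies of 0.1 are hypothetical, and theta stands for FST.

```python
def match_prob_het(p_i, p_j, theta):
    """Conditional match probability for a heterozygous genotype (alleles i and j)."""
    denom = (1 + theta) * (1 + 2 * theta)
    return 2 * (theta + (1 - theta) * p_i) * (theta + (1 - theta) * p_j) / denom


def match_prob_hom(p_i, theta):
    """Conditional match probability for a homozygous genotype (allele i twice)."""
    denom = (1 + theta) * (1 + 2 * theta)
    return (2 * theta + (1 - theta) * p_i) * (3 * theta + (1 - theta) * p_i) / denom


def profile_match_prob(genotypes, theta):
    """Multiply per-locus probabilities; (p_i, None) denotes a homozygote."""
    prob = 1.0
    for p_i, p_j in genotypes:
        if p_j is None:
            prob *= match_prob_hom(p_i, theta)
        else:
            prob *= match_prob_het(p_i, p_j, theta)
    return prob


# Hypothetical profile: ten heterozygous loci, every allele at frequency 0.1
profile = [(0.1, 0.1)] * 10

baseline = profile_match_prob(profile, 0.0)  # theta = 0 reduces to the product rule
for theta in (0.0, 0.01, 0.03, 0.05, 0.13):
    prob = profile_match_prob(profile, theta)
    print(f"theta = {theta:.2f}: match probability = {prob:.3e}, "
          f"ratio to theta = 0: {prob / baseline:.1f}")
```

With theta set to zero the formulae reduce to the usual Hardy-Weinberg products (2 p_i p_j and p_i squared), so the printed ratio shows how far a calculation that assumes no differentiation would overstate the strength of a match.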
It should be emphasised that the estimates of genetic
differentiation refer to differences between extended
families (pedigrees) recruited from different communities.
Although the FST estimates formally include possible differences between families in each community, it is appropriate to regard the family and not the entire community as
the forensic population unit, since in addition to customary
endogamy most marriages are also intra-familial, reflecting
the perceived social and economic benefits of such unions
[8,30].
Although low values of FST were observed in databases
composed of random samples drawn from large US populations of Caucasian, Black or Hispanic origin ([29], Table 7;
Acknowledgements
We are grateful to an anonymous reviewer for helpful
and constructive suggestions. The work was supported in
part by the Australian Research Council (grant A-350-629),
Edith Cowan University, the Russian Foundation of Basic
Research (grant 98-04-49292), the National Institutes of
Health (grant 1 R03 TW00491-01), and the Morrison
Institute for Population and Resource Studies, Stanford
University.
References
[1] National Research Council, DNA Technology in Forensic Science, National Academy Press, Washington, DC, 1992, pp. 74–96.
[2] J. Cohen, The ceiling principle is not always conservative in assigning genotype frequencies for forensic DNA testing, Am. J. Hum. Genet. 51 (1992) 1165–1168.
[3] B.S. Weir, Forensic population genetics and the National Research Council (NRC), Am. J. Hum. Genet. 52 (1993) 437–440.
[4] I.W. Evett, J. Scranage, R. Pinchin, An illustration of the advantages of efficient statistical methods for RFLP analysis in forensic science, Am. J. Hum. Genet. 52 (1993) 498–505.
[5] N.E. Morton, Genetic structure of forensic populations, Am. J. Hum. Genet. 55 (1994) 587–588.
[6] National Research Council, The Evaluation of Forensic DNA Evidence, National Academy Press, Washington, DC, 1996, pp. 89–124.
[7] K.L. Monson, B. Budowle, Effect of reference database on frequency estimates of polymerase chain reaction (PCR)-based DNA profiles, J. Forensic Sci. 43 (1998) 483–488.
[8] S.A. Shami, J.C. Grant, A.H. Bittles, Consanguineous marriage within social/occupational boundaries in Pakistan, J. Biosoc. Sci. 26 (1994) 91–96.
[9] W. Wang, S.G. Sullivan, S. Ahmed, D. Chandler, L.A. Zhivotovsky, A.H. Bittles, A genome-based study of consanguinity in three co-resident endogamous Pakistan communities, Ann. Hum. Genet. 64 (2000) 41–49.
[10] J.C. Grant, A.H. Bittles, The comparative role of consanguinity in infant and child mortality in Pakistan, Ann. Hum. Genet. 61 (1997) 143–149.
[11] R. Hussain, A.H. Bittles, The prevalence and demographic characteristics of consanguineous marriages in Pakistan, J. Biosoc. Sci. 30 (1998) 261–275.
[12] B.S. Weir, Genetic Data Analysis, II, Sinauer Associates, Sunderland, MA, 1996, pp. 161–201.
[13] P.O. Lewis, D. Zaykin, Genetic data analysis, Computer Program for the Analysis of Allelic Data, Version 1.0, 1997, http://chee.unm.edu/gda.
[14] M.A. Kayser, D. Caglia, N. Corach, C. Fretwell, G. Gehring, Evaluation of Y-chromosomal STRs: a multi-centre study, Int. J. Leg. Med. 110 (1997) 125–133.
[15] R. Chakraborty, M. Kimmel, D.N. Stivers, L.J. Davison, R. Deka, Relative mutation rates at di, tri and tetranucleotide microsatellite loci, Proc. Natl. Acad. Sci. U.S.A. 94 (1997) 1041–1046.
[16] M.W. Feldman, J. Kumm, J.K. Pritchard, Mutation and migration in models of microsatellite evolution, in: D.G. Goldstein, C. Schlotterer (Eds.), Microsatellites: Evolution and Applications, Oxford University Press, Oxford, 1999, pp. 98–115.
[17] L.A. Zhivotovsky, L. Bennett, A.M. Bowcock, M.W. Feldman, Human population expansion and microsatellite variation, Mol. Biol. Evol. 17 (2000) 757–767.