Ancient Ethiopian genome reveals humans, by providing the baseline against which other extensive Eurasian admixture throughout events are defined. In the absence of ancient DNA, the African continent geneticists rely on contemporary African populations, but a number of historic events, in M. Gallego Llorente,1*† E. R. Jones,2*† A. Eriksson,1,3 V. Siska,1 particular a genetic backflow K. W. Arthur, J. W. Arthur, M. C. Curtis, J. T. Stock, 4 4 5,6 7 from West Eurasia into Eastern Africa (3, 4), act as confounding M. Coltorti, P. Pieruccini, S. Stretton, F. Brock, 8 8 9 10,11 T. Higham, 10 factors. Y. Park,12 M. Hofreiter,13,14 D. G. Bradley,2 J. Bhak,15 R. Pinhasi,16* Here, we present an ancient A. Manica1* human genome from Africa, and 1 Department of Zoology, University of Cambridge, Downing Street, Cambridge CB2 3EJ, UK. 2Smurfit Institute of Genetics, use it to disentangle the effects Trinity College Dublin, Dublin, Ireland. 3Integrative Systems Biology Laboratory, King Abdullah University of Science and of recent population movement
Downloaded from www.sciencemag.org on October 8, 2015
Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia. 4Department of Society, Culture, and Language, into Africa. By sampling the pe- University of South Florida St. Petersburg, 140 7th Avenue South, St. Petersburg, FL 33701, USA. 5Department of trous bone (5), we sequenced the Anthropology, Ventura College, 4667 Telegraph Road, Ventura, CA 93003, USA. 6Humanities and Social Sciences Program, UCLA Extension, University of California Los Angeles, 10995 Le Conte Avenue, Los Angeles, CA 90095, USA. genome of a male from Mota 7 Department of Archaeology and Anthropology, University of Cambridge, Pembroke Street, Cambridge CB2 3QG, UK. Cave (herein referred to as ‘Mo- 8 Department of Physical Sciences, Earth and Environment, University of Siena, Via di Laterina, 8-53100 Siena, Italy. ta’) in the southern Ethiopian 9 Department of Anthropology, University of Illinois at Urbana-Champaign, Public Service Archaeology and Architecture highlands, with a mean coverage Program, 109 Davenport Hall, 607 South Mathews Avenue, Urbana, IL 61801, USA. 10Oxford Radiocarbon Accelerator Unit, Research Laboratory for Archaeology and the History of Art, University of Oxford, Dyson Perrins Building, South Parks of 12.5x (6). Contamination was Road, Oxford OX1 3QY, UK. 11Cranfield Forensic Institute, Cranfield University, Defence Academy of the United Kingdom, estimated to be between 0.29 Shrivenham, Oxon SN6 8LA, UK. 12Theragen BiO Institute, 2nd Floor B-dong, AICT bldg, Iui-dong, Youngtong-gu, Suwon, and 1.26% (6). Mota’s remains 443-270, Republic of Korea. 13Institute for Biochemistry and Biology, Faculty for Mathematics and Natural Sciences, were dated to ~4,500 years ago University of Potsdam, Karl-Liebknechtstraße 24–25, 14476 Potsdam Golm, Germany. 14Department of Biology, University 15 of York, Wentworth Way, Heslington, York YO10 5DD, UK. The Genomics Institute, UNIST, Ulsan, 689-798, Republic of (direct calibrated radiocarbon Korea. 16School of Archaeology, University College Dublin, Belfield, Dublin 4, Ireland. date (6)), and thus predate both *Corresponding author. E-mail: mg632@cam.ac.uk (M.G.L.); joneser@tcd.ie (E.R.J.); ron.pinhasi@ucd.ie (R.P.); the Bantu expansion (7), and, am315@cam.ac.uk (A.M.) more importantly, the 3ky-old †These authors contributed equally to this work. West Eurasian backflow which has left strong genetic signatures Characterizing genetic diversity in Africa is a crucial step for most analyses in the whole of Eastern and, to a reconstructing the evolutionary history of anatomically modern humans. lesser extent, Southern Africa (3, However, historic migrations from Eurasia into Africa have affected many 4). contemporary populations, confounding inferences. Here, we present a We compared Mota to con- 12.5x coverage ancient genome of an Ethiopian male (‘Mota’) who lived temporary human populations approximately 4,500 years ago. We use this genome to demonstrate that (6). Both Principal Component the Eurasian backflow into Africa came from a population closely related to Analysis (Fig. 1A) and outgroup Early Neolithic farmers, who had colonized Europe 4,000 years earlier. The f3 analysis using Ju|’hoansi extent of this backflow was much greater than previously reported, (Khoisan) from Southern Africa reaching all the way to Central, West and Southern Africa, affecting even as the outgroup (Fig. 1B,C) place populations such as Yoruba and Mbuti, previously thought to be relatively this ancient individual close to unadmixed, who harbor 6-7% Eurasian ancestry. contemporary Ethiopian popula- tions, and more specifically to The ability to sequence ancient genomes has revolutionized the Ari, a group of Omotic speakers from southern Ethiopia, our understanding of human evolution. However, genetic to the West of the highland region where Mota lived. Our analyses of ancient material have focused on individuals ancient genome confirms the view that the divergence of from temperate and arctic regions, where ancient DNA is this language family results from the relative isolation of its preserved over longer time frames (1). Africa has so far speakers (8), and indicates population continuity over the failed to yield skeletal remains with much aDNA, with the last ~4,500 years in this region of Eastern Africa. exception of a few poorly preserved specimens from which The age of Mota means that he should predate the West only mitochondrial DNA could be extracted (2). This is Eurasian backflow, which has been dated to ~3,000 years particularly unfortunate, as African genetic diversity is ago (3, 4). We formally tested this by using an f4 ratio esti- crucial to most analyses reconstructing the evolutionary mating the West Eurasian component (6), following the ap-
proach adopted by Pickrell et al. (3). As expected, we failed African populations is not enough to change conclusions to find any West Eurasian component in Mota (Table S5), qualitatively (estimates of Neanderthal ancestry in French thus providing support for previous dating of that event (3, and Han only increased marginally when tested with Mota 4). as a reference), it should be accounted for when looking for Given that Mota predates the backflow, we searched for specific introgressed haplotypes (17) or searching for un- its most likely source by modelling the Ari, the contempo- known ancient hominins who might have hybridized with rary population closest to our ancient genome, as a mixture African populations (18). of Mota and another West Eurasian population (6). We in- We also investigated the Mota genome for a number of vestigated both contemporary sources (3) as well as other phenotypes of interest (6). As expected, Mota lacked any of Eurasian ancient genomes (5, 9). In this analysis, contempo- the derived alleles found in Eurasian populations for eye rary Sardinians and the early Neolithic LBK (Stuttgart) ge- and skin color, suggesting that he had brown eyes and dark nome stand out (Fig. 2A). Previous analyses have shown skin. Mota lacked any of the currently known alleles that Sardinians to be the closest modern representatives of early give lactose tolerance, which may have implications con- Neolithic farmers (10, 11), implying that the backflow came cerning when pastoralism appeared in southwestern Ethio- from the same genetic source that fuelled the Neolithic ex- pia. In addition, Mota did possess all three selected alleles pansion into Europe from the Near East/Anatolia, before that have been recently shown to play a role in the adapta- recent historic events changed the genetic makeup of popu- tion to altitude in contemporary highland Ethiopian popula- lations living in that region. An analysis with haplotype tions (19). The presence of these mutations supports our sharing also identified a connection between contemporary conclusion that Mota is the descendant of highland dwell- Ethiopians and Anatolia (4, 12). Interestingly, archaeological ers, who have lived in this environment long enough to ac- evidence dates the arrival of Near Eastern domesticates cumulate adaptations to the altitude (20, 21). (such as wheat, barley and lentils) to the same time period Until now, it has been necessary to use contemporary Af- (circa 3,000 years ago) (13, 14), suggesting that the direct rican populations as the baseline against which events dur- descendants of the farmers that earlier brought agriculture ing the worldwide expansion of Anatomically Modern into Europe may have also played a role in the development Humans are defined (16, 22–24). By obtaining an ancient of new forms of food production in the Horn of Africa. whole genome from this continent, we have shown that hav- Using Mota as an unadmixed African reference and the ing an unadmixed reference that predates the large number early farmer LBK as the source of the West Eurasian com- of recent historical migrations can greatly improve our in- ponent, it is possible to reassess the magnitude and geo- ference. This result stresses the importance of obtaining graphic extent of historical migrations, avoiding the unadmixed baseline data to reconstruct demographic complications of using admixed contemporary populations events, and the limitations of analyses that are solely based (6). We estimated a substantially higher Eurasian backflow on contemporary populations. Even older African genomes admixture than previously detected (3), with an additional will thus be needed to investigate key demographic events 4-7% of the genome of most African populations tracing that predate Mota, such as earlier instances of back-flows back to a Eurasian source, and, more importantly, we de- into Africa (25). tected a much broader geographical impact of the backflow, REFERENCES AND NOTES going all the way to West and Southern Africa (Fig. 2B). 1. M. Hofreiter, J. L. Paijmans, H. Goodchild, C. F. Speller, A. Barlow, G. G. Fortes, J. A. Even though the West Eurasian component in these regions Thomas, A. Ludwig, M. J. Collins, The future of ancient DNA: Technical advances is smaller than in Eastern Africa, it is still sizeable, with Yo- and conceptual shifts. BioEssays 37, 284–293 (2015). Medline doi:10.1002/bies.201400160 ruba and Mbuti, who are often used as African references 2. A. G. Morris, A. Heinze, E. K. F. Chan, A. B. Smith, V. M. Hayes, First ancient (15, 16), showing 7% and 6%, respectively, of their genomes mitochondrial human genome from a prepastoralist southern African. Genome to be of Eurasian origin (Table S5). Biol. Evol. 6, 2647–2653 (2014). Medline doi:10.1093/gbe/evu202 Since Mota predates recent demographic events, his ge- 3. J. K. Pickrell, N. Patterson, P. R. Loh, M. Lipson, B. Berger, M. Stoneking, B. Pakendorf, D. Reich, Ancient west Eurasian ancestry in southern and eastern nome can act as an ideal African reference to understand Africa. Proc. Natl. Acad. Sci. U.S.A. 111, 2632–2637 (2014). Medline episodes during the out-of-Africa expansion. We used him as doi:10.1073/pnas.1313787111 the African reference to quantify Neanderthal introgression 4. L. Pagani, T. Kivisild, A. Tarekegn, R. Ekong, C. Plaster, I. Gallego Romero, Q. Ayub, in a number of contemporary genomes (6). Both Yoruba and S. Q. Mehdi, M. G. Thomas, D. Luiselli, E. Bekele, N. Bradman, D. J. Balding, C. Tyler-Smith, Ethiopian genetic diversity reveals linguistic stratification and Mbuti, which are routinely used as African references for complex influences on the Ethiopian gene pool. Am. J. Hum. Genet. 91, 83–96 this type of analysis (15, 16), show a marginally closer affini- (2012). Medline doi:10.1016/j.ajhg.2012.05.015 ty with Neanderthal than Mota based on D statistics, and an 5. C. Gamba, E. R. Jones, M. D. Teasdale, R. L. McLaughlin, G. Gonzalez-Fortes, V. f4 ratio analysis detected a small Neanderthal component in Mattiangeli, L. Domboróczki, I. Kővári, I. Pap, A. Anders, A. Whittle, J. Dani, P. Raczky, T. F. Higham, M. Hofreiter, D. G. Bradley, R. Pinhasi, Genome flux and these genomes at around 0.2-0.7%; greater than previously stasis in a five millennium transect of European prehistory. Nat. Commun. 5, suggested (16), and consistent with our estimates of the 5257 (2014). 10.1038/ncomms6257 Medline doi:10.1038/ncomms6257 magnitude of their Western Eurasian ancestry (6). While the 6. See supporting material on Science Online. magnitude of Neanderthal ancestry in these contemporary 7. S. Li, C. Schlebusch, M. Jakobsson, Genetic variation reveals large-scale
population expansion and migration during the expansion of Bantu-speaking 15128 (2011). Medline doi:10.1073/pnas.1109300108 peoples. Proc. Biol. Sci. 281, 20141448 (2014). 10.1098/rspb.2014.1448 Medline 19. N. Udpa, R. Ronen, D. Zhou, J. Liang, T. Stobdan, O. Appenzeller, Y. Yin, Y. Du, L. doi:10.1098/rspb.2014.1448 Guo, R. Cao, Y. Wang, X. Jin, C. Huang, W. Jia, D. Cao, G. Guo, V. E. Claydon, R. 8. R. Blench, in SemitoHamitic Festschrift for A.B. Dolgopolsky and H. Jungraithmayr Hainsworth, J. L. Gamboa, M. Zibenigus, G. Zenebe, J. Xue, S. Liu, K. A. Frazer, Y. (Dietich Reimer Verlag, Berlin, 2008), pp. 63–78. Li, V. Bafna, G. G. Haddad, Whole genome sequencing of Ethiopian highlanders 9. I. Lazaridis, N. Patterson, A. Mittnik, G. Renaud, S. Mallick, K. Kirsanow, P. H. reveals conserved hypoxia tolerance genes. Genome Biol. 15, R36 (2014). Sudmant, J. G. Schraiber, S. Castellano, M. Lipson, B. Berger, C. Economou, R. Medline Bollongino, Q. Fu, K. I. Bos, S. Nordenfelt, H. Li, C. de Filippo, K. Prüfer, S. 20. E. Huerta-Sánchez, M. Degiorgio, L. Pagani, A. Tarekegn, R. Ekong, T. Antao, A. Sawyer, C. Posth, W. Haak, F. Hallgren, E. Fornander, N. Rohland, D. Delsate, M. Cardona, H. E. Montgomery, G. L. Cavalleri, P. A. Robbins, M. E. Weale, N. Francken, J. M. Guinet, J. Wahl, G. Ayodo, H. A. Babiker, G. Bailliet, E. Bradman, E. Bekele, T. Kivisild, C. Tyler-Smith, R. Nielsen, Genetic signatures Balanovska, O. Balanovsky, R. Barrantes, G. Bedoya, H. Ben-Ami, J. Bene, F. reveal high-altitude adaptation in a set of ethiopian populations. Mol. Biol. Evol. Berrada, C. M. Bravi, F. Brisighelli, G. B. Busby, F. Cali, M. Churnosov, D. E. Cole, 30, 1877–1888 (2013). Medline doi:10.1093/molbev/mst089 D. Corach, L. Damba, G. van Driem, S. Dryomov, J. M. Dugoujon, S. A. Fedorova, 21. G. Alkorta-Aranburu, C. M. Beall, D. B. Witonsky, A. Gebremedhin, J. K. Pritchard, I. Gallego Romero, M. Gubina, M. Hammer, B. M. Henn, T. Hervig, U. Hodoglugil, A. Di Rienzo, The genetic architecture of adaptations to high altitude in Ethiopia. A. R. Jha, S. Karachanak-Yankova, R. Khusainova, E. Khusnutdinova, R. Kittles, T. PLOS Genet. 8, e1003110 (2012). Medline doi:10.1371/journal.pgen.1003110 Kivisild, W. Klitz, V. Kučinskas, A. Kushniarevich, L. Laredj, S. Litvinov, T. 22. A. Eriksson, L. Betti, A. D. Friend, S. J. Lycett, J. S. Singarayer, N. von Cramon- Loukidis, R. W. Mahley, B. Melegh, E. Metspalu, J. Molina, J. Mountain, K. Taubadel, P. J. Valdes, F. Balloux, A. Manica, Late Pleistocene climate change Näkkäläjärvi, D. Nesheva, T. Nyambo, L. Osipova, J. Parik, F. Platonov, O. and the global expansion of anatomically modern humans. Proc. Natl. Acad. Sci. Posukh, V. Romano, F. Rothhammer, I. Rudan, R. Ruizbakiev, H. Sahakyan, A. U.S.A. 109, 16089–16094 (2012). Medline doi:10.1073/pnas.1209494109 Sajantila, A. Salas, E. B. Starikovskaya, A. Tarekegn, D. Toncheva, S. Turdikulova, 23. A. Eriksson, A. Manica, Effect of ancient population structure on the degree of I. Uktveryte, O. Utevska, R. Vasquez, M. Villena, M. Voevoda, C. A. Winkler, L. polymorphism shared between modern human populations and ancient Yepiskoposyan, P. Zalloua, T. Zemunik, A. Cooper, C. Capelli, M. G. Thomas, A. hominins. Proc. Natl. Acad. Sci. U.S.A. 109, 13956–13960 (2012). Medline Ruiz-Linares, S. A. Tishkoff, L. Singh, K. Thangaraj, R. Villems, D. Comas, R. doi:10.1073/pnas.1200567109 Sukernik, M. Metspalu, M. Meyer, E. E. Eichler, J. Burger, M. Slatkin, S. Pääbo, J. 24. L. Pagani, S. Schiffels, D. Gurdasani, P. Danecek, A. Scally, Y. Chen, Y. Xue, M. Kelso, D. Reich, J. Krause, Ancient human genomes suggest three ancestral Haber, R. Ekong, T. Oljira, E. Mekonnen, D. Luiselli, N. Bradman, E. Bekele, P. populations for present-day Europeans. Nature 513, 409–413 (2014). Medline Zalloua, R. Durbin, T. Kivisild, C. Tyler-Smith, Tracing the route of modern doi:10.1038/nature13673 humans out of Africa by using 225 human genome sequences from Ethiopians 10. M. Sikora, M. L. Carpenter, A. Moreno-Estrada, B. M. Henn, P. A. Underhill, F. and Egyptians. Am. J. Hum. Genet. 96, 986–991 (2015). Medline Sánchez-Quinto, I. Zara, M. Pitzalis, C. Sidore, F. Busonero, A. Maschio, A. doi:10.1016/j.ajhg.2015.04.019 Angius, C. Jones, J. Mendoza-Revilla, G. Nekhrizov, D. Dimitrova, N. Theodossiev, 25. J. A. Hodgson, C. J. Mulligan, A. Al-Meeri, R. L. Raaum, Early back-to-Africa T. T. Harkins, A. Keller, F. Maixner, A. Zink, G. Abecasis, S. Sanna, F. Cucca, C. D. migration into the Horn of Africa. PLOS Genet. 10, e1004393 (2014). Medline Bustamante, Population genomic analysis of ancient and modern genomes doi:10.1371/journal.pgen.1004393 yields new insights into the genetic ancestry of the Tyrolean Iceman and the 26. F. Brock, T. Higham, P. Ditchfield, C. B. Ramsey, Current Pretreatment Methods genetic structure of Europe. PLOS Genet. 10, e1004353 (2014). Medline for AMS Radiocarbon Dating at the Oxford Radiocarbon Accelerator Unit doi:10.1371/journal.pgen.1004353 (ORAU). Radiocarbon 52, 103–112 (2010). 11. P. Skoglund, H. Malmström, M. Raghavan, J. Storå, P. Hall, E. Willerslev, M. T. 27. J. E. Buikstra, D. H. Ubelaker, Standards for data collection from human skeletal Gilbert, A. Götherström, M. Jakobsson, Origins and genetic legacy of Neolithic remains (1994), vol. 44 of Arkansas Archaeological Survey Research Series. farmers and hunter-gatherers in Europe. Science 336, 466–469 (2012). Medline 28. M. R. Feldesman, R. L. Fountain, “Race” specificity and the femur/stature ratio. doi:10.1126/science.1216304 Am. J. Phys. Anthropol. 100, 207–224 (1996). Medline doi:10.1002/(SICI)1096- 12. T. Kivisild, M. Reidla, E. Metspalu, A. Rosa, A. Brehm, E. Pennarun, J. Parik, T. 8644(199606)100:2<207::AID-AJPA4>3.0.CO;2-U Geberhiwot, E. Usanga, R. Villems, Ethiopian mitochondrial DNA heritage: 29. M. H. Raxter, C. B. Ruff, A. Azab, M. Erfan, M. Soliman, A. El-Sawaf, Stature Tracking gene flow across and around the gate of tears. Am. J. Hum. Genet. 75, estimation in ancient Egyptians: A new technique based on anatomical 752–770 (2004). Medline doi:10.1086/425161 reconstruction of stature. Am. J. Phys. Anthropol. 136, 147–155 (2008). Medline 13. M. C. Curtis, in The Oxford Handbook of African Archaeology (Oxford University doi:10.1002/ajpa.20790 Press, 2013), pp. 571–584. 30. C. B. Ruff, E. Trinkaus, T. W. Holliday, Body mass and encephalization in 14. M. Harrower, M. McCorriston, A. D’Andrea, General/Specific, Local/Global: Pleistocene Homo. Nature 387, 173–176 (1997). Medline doi:10.1038/387173a0 Comparing the Beginnings of Agriculture in the Horn of Africa (Ethiopia/Eritrea) 31. R. Pinhasi, D. Fernandes, K. Sirak, M. Novak, S. Connell, S. Alpaslan-Roodenberg, and Southwest Arabia (Yemen). Am. Antiq. 75, 452–472 (2010). F. Gerritsen, V. Moiseyev, A. Gromov, P. Raczky, A. Anders, M. Pietrusewsky, G. doi:10.7183/0002-7316.75.3.452 Rollefson, M. Jovanovic, H. Trinhhoang, G. Bar-Oz, M. Oxenham, H. Matsumura, 15. Q. Fu, M. Hajdinjak, O. T. Moldovan, S. Constantin, S. Mallick, P. Skoglund, N. M. Hofreiter, Optimal ancient DNA yields from the inner ear part of the human Patterson, N. Rohland, I. Lazaridis, B. Nickel, B. Viola, K. Prüfer, M. Meyer, J. petrous bone. PLOS ONE 10, e0129102 (2015). Medline Kelso, D. Reich, S. Pääbo, An early modern human from Romania with a recent doi:10.1371/journal.pone.0129102 Neanderthal ancestor. Nature 524, 216–219 (2015). Medline 32. D. Y. Yang, B. Eng, J. S. Waye, J. C. Dudar, S. R. Saunders, Improved DNA doi:10.1038/nature14558 extraction from ancient bones using silica-based spin columns. Am. J. Phys. 16. K. Prüfer, F. Racimo, N. Patterson, F. Jay, S. Sankararaman, S. Sawyer, A. Heinze, Anthropol. 105, 539–543 (1998). Medline doi:10.1002/(SICI)1096- G. Renaud, P. H. Sudmant, C. de Filippo, H. Li, S. Mallick, M. Dannemann, Q. Fu, 8644(199804)105:4<539::AID-AJPA10>3.0.CO;2-1 M. Kircher, M. Kuhlwilm, M. Lachmann, M. Meyer, M. Ongyerth, M. Siebauer, C. 33. M. Meyer, M. Kircher, Illumina sequencing library preparation for highly Theunert, A. Tandon, P. Moorjani, J. Pickrell, J. C. Mullikin, S. H. Vohr, R. E. multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010, Green, I. Hellmann, P. L. Johnson, H. Blanche, H. Cann, J. O. Kitzman, J. pdb.prot5448 (2010). Medline doi:10.1101/pdb.prot5448 Shendure, E. E. Eichler, E. S. Lein, T. E. Bakken, L. V. Golovanova, V. B. 34. G. Renaud, U. Stenzel, J. Kelso, leeHom: Adaptor trimming and merging for Doronichev, M. V. Shunkov, A. P. Derevianko, B. Viola, M. Slatkin, D. Reich, J. Illumina sequencing reads. Nucleic Acids Res. 42, e141 (2014). Medline Kelso, S. Pääbo, The complete genome sequence of a Neanderthal from the Altai doi:10.1093/nar/gku699 Mountains. Nature 505, 43–49 (2014). Medline doi:10.1038/nature12886 35. H. Li, R. Durbin, Fast and accurate short read alignment with Burrows-Wheeler 17. S. Sankararaman, S. Mallick, M. Dannemann, K. Prüfer, J. Kelso, S. Pääbo, N. transform. Bioinformatics 25, 1754–1760 (2009). Medline Patterson, D. Reich, The genomic landscape of Neanderthal ancestry in present- doi:10.1093/bioinformatics/btp324 day humans. Nature 507, 354–357 (2014). Medline doi:10.1038/nature12961 36. A. McKenna, M. Hanna, E. Banks, A. Sivachenko, K. Cibulskis, A. Kernytsky, K. 18. M. F. Hammer, A. E. Woerner, F. L. Mendez, J. C. Watkins, J. D. Wall, Genetic Garimella, D. Altshuler, S. Gabriel, M. Daly, M. A. DePristo, The Genome Analysis evidence for archaic admixture in Africa. Proc. Natl. Acad. Sci. U.S.A. 108, 15123– Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing
data. Genome Res. 20, 1297–1303 (2010). Medline doi:10.1101/gr.107524.110 doi:10.1016/j.ajhg.2008.04.002 37. A. R. Quinlan, I. M. Hall, BEDTools: A flexible suite of utilities for comparing 53. L. Jostins et al., YFitter: Maximum likelihood assignment of Y chromosome genomic features. Bioinformatics 26, 841–842 (2010). Medline haplogroups from low-coverage sequence data. arXiv:1407.7988 [q-bio] (2014) doi:10.1093/bioinformatics/btq033 (available at http://arxiv.org/abs/1407.7988). 38. H. Li, B. Handsaker, A. Wysoker, T. Fennell, J. Ruan, N. Homer, G. Marth, G. 54. O. Semino, C. Magri, G. Benuzzi, A. A. Lin, N. Al-Zahery, V. Battaglia, L. Maccioni, Abecasis, R. Durbin; 1000 Genome Project Data Processing Subgroup, The C. Triantaphyllidis, P. Shen, P. J. Oefner, L. A. Zhivotovsky, R. King, A. Torroni, L. Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 L. Cavalli-Sforza, P. A. Underhill, A. S. Santachiara-Benerecetti, Origin, diffusion, (2009). Medline doi:10.1093/bioinformatics/btp352 and differentiation of Y-chromosome haplogroups E and J: Inferences on the 39. H. Jónsson, A. Ginolhac, M. Schubert, P. L. F. Johnson, L. Orlando, neolithization of Europe and later migratory events in the Mediterranean area. mapDamage2.0: Fast approximate Bayesian estimates of ancient DNA damage Am. J. Hum. Genet. 74, 1023–1034 (2004). Medline doi:10.1086/386295 parameters. Bioinformatics 29, 1682–1684 (2013). Medline 55. B. Trombetta, F. Cruciani, D. Sellitto, R. Scozzari, A new topology of the human Y doi:10.1093/bioinformatics/btt193 chromosome haplogroup E1b1 (E-P2) revealed through the use of newly 40. A. W. Briggs, U. Stenzel, P. L. Johnson, R. E. Green, J. Kelso, K. Prüfer, M. Meyer, characterized binary polymorphisms. PLOS ONE 6, e16073 (2011). Medline J. Krause, M. T. Ronan, M. Lachmann, S. Pääbo, Patterns of damage in genomic doi:10.1371/journal.pone.0016073 DNA sequences from a Neandertal. Proc. Natl. Acad. Sci. U.S.A. 104, 14616– 56. E. I. Gebremeskel, M. E. Ibrahim, Y-chromosome E haplogroups: Their 14621 (2007). Medline doi:10.1073/pnas.0704665104 distribution and implication to the origin of Afro-Asiatic languages and 41. P. Brotherton, P. Endicott, J. J. Sanchez, M. Beaumont, R. Barnett, J. Austin, A. pastoralism. Eur. J. Hum. Genet. 22, 1387–1392 (2014). Medline Cooper, Novel high-resolution characterization of ancient DNA reveals C > U- doi:10.1038/ejhg.2014.41 type base modification events as the sole cause of post mortem miscoding 57. O. Semino, A. S. Santachiara-Benerecetti, F. Falaschi, L. L. Cavalli-Sforza, P. A. lesions. Nucleic Acids Res. 35, 5717–5728 (2007). Medline Underhill, Ethiopians and Khoisan share the deepest clades of the human Y- doi:10.1093/nar/gkm588 chromosome phylogeny. Am. J. Hum. Genet. 70, 265–268 (2002). Medline 42. B. Shapiro, M. Hofreiter, A paleogenomic perspective on evolution and gene doi:10.1086/338306 function: New insights from ancient DNA. Science 343, 1236573 (2014). Medline 58. J. K. Pickrell, N. Patterson, C. Barbieri, F. Berthold, L. Gerlach, T. Güldemann, B. doi:10.1126/science.1236573 Kure, S. W. Mpoloka, H. Nakagawa, C. Naumann, M. Lipson, P. R. Loh, J. 43. T. S. Korneliussen, A. Albrechtsen, R. Nielsen, ANGSD: Analysis of Next Lachance, J. Mountain, C. D. Bustamante, B. Berger, S. A. Tishkoff, B. M. Henn, Generation Sequencing Data. BMC Bioinformatics 15, 356 (2014). Medline M. Stoneking, D. Reich, B. Pakendorf, The genetic prehistory of southern Africa. doi:10.1186/s12859-014-0356-4 Nat. Commun. 3, 1143 (2012). Medline doi:10.1038/ncomms2140 44. M. Rasmussen, X. Guo, Y. Wang, K. E. Lohmueller, S. Rasmussen, A. Albrechtsen, 59. P. Danecek, A. Auton, G. Abecasis, C. A. Albers, E. Banks, M. A. DePristo, R. E. L. Skotte, S. Lindgreen, M. Metspalu, T. Jombart, T. Kivisild, W. Zhai, A. Eriksson, Handsaker, G. Lunter, G. T. Marth, S. T. Sherry, G. McVean, R. Durbin; 1000 A. Manica, L. Orlando, F. M. De La Vega, S. Tridico, E. Metspalu, K. Nielsen, M. C. Genomes Project Analysis Group, The variant call format and VCFtools. Ávila-Arcos, J. V. Moreno-Mayar, C. Muller, J. Dortch, M. T. Gilbert, O. Lund, A. Bioinformatics 27, 2156–2158 (2011). Medline Wesolowska, M. Karmin, L. A. Weinert, B. Wang, J. Li, S. Tai, F. Xiao, T. Hanihara, doi:10.1093/bioinformatics/btr330 G. van Driem, A. R. Jha, F. X. Ricaut, P. de Knijff, A. B. Migliano, I. Gallego 60. S. Purcell, B. Neale, K. Todd-Brown, L. Thomas, M. A. Ferreira, D. Bender, J. Romero, K. Kristiansen, D. M. Lambert, S. Brunak, P. Forster, B. Brinkmann, O. Maller, P. Sklar, P. I. de Bakker, M. J. Daly, P. C. Sham, PLINK: A tool set for Nehlich, M. Bunce, M. Richards, R. Gupta, C. D. Bustamante, A. Krogh, R. A. whole-genome association and population-based linkage analyses. Am. J. Hum. Foley, M. M. Lahr, F. Balloux, T. Sicheritz-Pontén, R. Villems, R. Nielsen, J. Wang, Genet. 81, 559–575 (2007). Medline doi:10.1086/519795 E. Willerslev, An Aboriginal Australian genome reveals separate human 61. N. Patterson, A. L. Price, D. Reich, Population structure and eigenanalysis. PLOS dispersals into Asia. Science 334, 94–98 (2011). Medline Genet. 2, e190 (2006). Medline doi:10.1371/journal.pgen.0020190 45. F. Sánchez-Quinto, H. Schroeder, O. Ramirez, M. C. Avila-Arcos, M. Pybus, I. 62. L. van Dorp, D. Balding, S. Myers, L. Pagani, C. Tyler-Smith, E. Bekele, A. Olalde, A. M. Velazquez, M. E. Marcos, J. M. Encinas, J. Bertranpetit, L. Orlando, Tarekegn, M. G. Thomas, N. Bradman, G. Hellenthal, Evidence for a common M. T. Gilbert, C. Lalueza-Fox, Genomic affinities of two 7,000-year-old Iberian origin of blacksmiths and cultivators in the Ethiopian Ari within the Last 4500 hunter-gatherers. Curr. Biol. 22, 1494–1499 (2012). Medline Years: Lessons for clustering-based inference. PLOS Genet. 11, e1005397 doi:10.1016/j.cub.2012.06.005 (2015). Medline 46. D. Vianello, F. Sevini, G. Castellani, L. Lomartire, M. Capri, C. Franceschi, 63. N. Patterson, P. Moorjani, Y. Luo, S. Mallick, N. Rohland, Y. Zhan, T. Genschoreck, HAPLOFIND: A new method for high-throughput mtDNA haplogroup assignment. T. Webster, D. Reich, Ancient admixture in human history. Genetics 192, 1065– Hum. Mutat. 34, 1189–1194 (2013). Medline doi:10.1002/humu.22356 1093 (2012). Medline doi:10.1534/genetics.112.145037 47. P. Skoglund, J. Storå, A. Götherström, M. Jakobsson, Accurate sex identification 64. P. Moorjani, N. Patterson, J. N. Hirschhorn, A. Keinan, L. Hao, G. Atzmon, E. of ancient human remains using DNA shotgun sequencing. J. Archaeol. Sci. 40, Burns, H. Ostrer, A. L. Price, D. Reich, The history of African gene flow into 4477–4482 (2013). doi:10.1016/j.jas.2013.07.004 Southern Europeans, Levantines, and Jews. PLOS Genet. 7, e1001373 (2011). 48. P. Skoglund, H. Malmström, A. Omrak, M. Raghavan, C. Valdiosera, T. Günther, P. Medline doi:10.1371/journal.pgen.1001373 Hall, K. Tambets, J. Parik, K. G. Sjögren, J. Apel, E. Willerslev, J. Storå, A. 65. V. Fernandes, P. Triska, J. B. Pereira, F. Alshamali, T. Rito, A. Machado, Z. Götherström, M. Jakobsson, Genomic diversity and admixture differs for Stone- Fajkošová, B. Cavadas, V. Černý, P. Soares, M. B. Richards, L. Pereira, Genetic Age Scandinavian foragers and farmers. Science 344, 747–750 (2014). Medline stratigraphy of key demographic events in Arabia. PLOS ONE 10, e0118625 49. D. M. Behar, M. van Oven, S. Rosset, M. Metspalu, E. L. Loogväli, N. M. Silva, T. (2015). Medline doi:10.1371/journal.pone.0118625 Kivisild, A. Torroni, R. Villems, A “Copernican” reassessment of the human 66. R. E. Green, J. Krause, A. W. Briggs, T. Maricic, U. Stenzel, M. Kircher, N. mitochondrial DNA tree from its root. Am. J. Hum. Genet. 90, 675–684 (2012). Patterson, H. Li, W. Zhai, M. H. Fritz, N. F. Hansen, E. Y. Durand, A. S. Malaspinas, Medline doi:10.1016/j.ajhg.2012.03.002 J. D. Jensen, T. Marques-Bonet, C. Alkan, K. Prüfer, M. Meyer, H. A. Burbano, J. 50. P. Soares, F. Alshamali, J. B. Pereira, V. Fernandes, N. M. Silva, C. Afonso, M. D. M. Good, R. Schultz, A. Aximu-Petri, A. Butthof, B. Höber, B. Höffner, M. Costa, E. Musilová, V. Macaulay, M. B. Richards, V. Cerny, L. Pereira, The Siegemund, A. Weihmann, C. Nusbaum, E. S. Lander, C. Russ, N. Novod, J. Expansion of mtDNA Haplogroup L3 within and out of Africa. Mol. Biol. Evol. 29, Affourtit, M. Egholm, C. Verna, P. Rudan, D. Brajkovic, Z. Kucan, I. Gusic, V. B. 915–927 (2012). Medline doi:10.1093/molbev/msr245 Doronichev, L. V. Golovanova, C. Lalueza-Fox, M. de la Rasilla, J. Fortea, A. 51. A. Torroni, A. Achilli, V. Macaulay, M. Richards, H.-J. Bandelt, Harvesting the fruit Rosas, R. W. Schmitz, P. L. Johnson, E. E. Eichler, D. Falush, E. Birney, J. C. of the human mtDNA tree. Trends Genet. 22, 339–345 (2006). Medline Mullikin, M. Slatkin, R. Nielsen, J. Kelso, M. Lachmann, D. Reich, S. Pääbo, A draft doi:10.1016/j.tig.2006.04.001 sequence of the Neandertal genome. Science 328, 710–722 (2010). Medline 52. D. M. Behar, R. Villems, H. Soodyall, J. Blue-Smith, L. Pereira, E. Metspalu, R. doi:10.1126/science.1188021 Scozzari, H. Makkan, S. Tzur, D. Comas, J. Bertranpetit, L. Quintana-Murci, C. 67. M. Meyer, M. Kircher, M. T. Gansauge, H. Li, F. Racimo, S. Mallick, J. G. Schraiber, Tyler-Smith, R. S. Wells, S. Rosset; Genographic Consortium, The dawn of F. Jay, K. Prüfer, C. de Filippo, P. H. Sudmant, C. Alkan, Q. Fu, R. Do, N. Rohland, human matrilineal diversity. Am. J. Hum. Genet. 82, 1130–1140 (2008). Medline A. Tandon, M. Siebauer, R. E. Green, K. Bryc, A. W. Briggs, U. Stenzel, J. Dabney,
J. Shendure, J. Kitzman, M. F. Hammer, M. V. Shunkov, A. P. Derevianko, N. Patterson, A. M. Andrés, E. E. Eichler, M. Slatkin, D. Reich, J. Kelso, S. Pääbo, A high-coverage genome sequence from an archaic Denisovan individual. Science 338, 222–226 (2012). Medline doi:10.1126/science.1224344 68. I. V. Ovchinnikov, A. Götherström, G. P. Romanova, V. M. Kharitonov, K. Lidén, W. Goodwin, Molecular analysis of Neanderthal DNA from the northern Caucasus. Nature 404, 490–493 (2000). Medline doi:10.1038/35006625 69. K. L. Hart, S. L. Kimura, V. Mushailov, Z. M. Budimlija, M. Prinz, E. Wurmbach, Improved eye- and skin-color prediction based on 8 SNPs. Croat. Med. J. 54, 248–256 (2013). Medline doi:10.3325/cmj.2013.54.248 70. S. Walsh, F. Liu, A. Wollstein, L. Kovatsi, A. Ralf, A. Kosiniak-Kamysz, W. Branicki, M. Kayser, The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci. Int. Genet. 7, 98–115 (2013). Medline doi:10.1016/j.fsigen.2012.07.005 71. T. G. K. Jensen, A. Liebert, R. Lewinsky, D. M. Swallow, J. Olsen, J. T. Troelsen, The -14010*C variant associated with lactase persistence is located between an Oct-1 and HNF1α binding site and increases lactase promoter activity. Hum. Genet. 130, 483–493 (2011). Medline doi:10.1007/s00439-011-0966-0 72. S. A. Tishkoff, F. A. Reed, A. Ranciaro, B. F. Voight, C. C. Babbitt, J. S. Silverman, K. Powell, H. M. Mortensen, J. B. Hirbo, M. Osman, M. Ibrahim, S. A. Omar, G. Lema, T. B. Nyambo, J. Ghori, S. Bumpstead, J. K. Pritchard, G. A. Wray, P. Deloukas, Convergent adaptation of human lactase persistence in Africa and Europe. Nat. Genet. 39, 31–40 (2007). Medline doi:10.1038/ng1946 73. B. L. Jones, T. O. Raga, A. Liebert, P. Zmarz, E. Bekele, E. T. Danielsen, A. K. Olsen, N. Bradman, J. T. Troelsen, D. M. Swallow, Diversity of lactase persistence alleles in Ethiopia: Signature of a soft selective sweep. Am. J. Hum. Genet. 93, 538–544 (2013). Medline doi:10.1016/j.ajhg.2013.07.008 74. S. Bergink, S. Jentsch, Principles of ubiquitin and SUMO modifications in DNA repair. Nature 458, 461–467 (2009). Medline doi:10.1038/nature07963 ACKNOWLEDGMENTS A.M. was supported by ERC Consolidator Grant 647787 ‘LocalAdaptation’; R.P. by ERC Starting Grant: ERC- 2010-StG 26344; M.H. by ERC Consolidator Grant 310763 ‘GeneFlow’; J.B. by the 2014 Research Fund (1.140113.01, 1.140064.01) of UNIST (Ulsan National Institute of Science and Technology) and Geromics internal research funding; J.T.S. by ERC Consolidator Grant 617627 'ADaPt'; K.W.A. by NSF award 1027607; D.G.B. by ERC Investigator Grant 295729-CodeX; V.S. by a scholarship from the Gates Cambridge Trust; and M.G. by a BBSRC DTP studentship. Permission for the archaeology was given by the Ethiopian Authority for Research and Conservation of Cultural Heritage and offices of the Ministry of Culture and Tourism for the Southern Nations, Nationalities, and Peoples Region. Raw reads from Mota are available for download through NCBI, BioProject ID PRJNA295861, and the corresponding BAM and VCF files are available at africangenome.org. SUPPLEMENTARY MATERIALS www.sciencemag.org/cgi/content/full/science.aad2879/DC1 Supplementary Text Figs. S1 to S8 Tables S1 to S14 References (26–74)
25 August 2015; accepted 28 September 2015
Published online 8 October 2015 10.1126/science.aad2879
Fig. 1. Mota shows a very high degree of similarity with the highland Ethiopian Ari populations. (A) PCA showing Mota projected onto components loaded on contemporary African and Eurasian populations. The inset magnifies the PCA space occupied by Ethiopian and Eastern African populations. (B) Outgroup f3 quantifying the shared drift between Mota and contemporary African populations, using Ju|’hoansi (Khoisan) as an outgroup; bars represent standard error. (C) Map showing the distribution of outgroup f3 values across the African continent. In (A) and (B), populations speaking Nilo-Saharan languages are marked with blue shades, Omotic speakers with red, Cushitic with orange, Semitic with yellow, and Bantu with green. Mota is denoted by a black symbol.
Fig. 2. Quantifying the geographic extent and origin of the West Eurasian component in Africa. (A) Admixture f3 identifying likely sources of the West Eurasian component (lowest f3 values). Contemporary populations in blue, ancient genomes in red; bars represent standard error. (B) Map showing the proportion of West Eurasian component, λMota,LBK, across the African continent.