You are on page 1of 7

SPECIAL COMMUNICATION

The Anatomy of the Human Genome


A Neo-Vesalian Basis for Medicine in the 21st Century
Victor A. McKusick, MD
Since 1956, the anatomy of the human genome has been described on the
basis of chromosome studies, gene mapping, and DNA sequencing. The gross

T
HE LINEAR ARRANGEMENT OF
genes on our chromosomes is anatomy of Andreas Vesalius, published in 1543, played a leading role in
part of our microanatomy. the development of modern medicine. The objective of this article is to show
When we speak of mapping that knowledge of genomic anatomy is having a comparably strong and per-
genes on chromosomes, we use a carto- vasive influence on all of medicine. The research revealing human genome
graphic metaphor. An equally appropri- anatomy is reviewed. The insight provided by genome anatomy has brought
ate anatomic metaphor is the anatomy
about shifts of focus, both in research and in the clinic, eg, from genomics
of the human genome.1-3 Clinical cyto-
genetics (starting in the late 1950s), map- to proteomic and from the individually rare, single-gene disorders to com-
ping genes on chromosomes (begin- mon disorders. Genomic anatomy permits medicine to become more pre-
ning for autosomes in the late 1960s), dictive and preventive. At the same time, diagnosis and treatment are ren-
and comprehensive DNA sequencing of dered more sensitive, specific, effective, and safe. Hazards in misuse and
the genome (initiated in the late 1980s) misunderstanding of the information exist. Education of both the public and
have provided, in the words of Charles health professionals is vital if the full benefits of neo-Vesalian medicine are
Scriver, MDCM (oral communication, to be realized.
1982), a neo-Vesalian basis for medi-
JAMA. 2001;286:2289-2295 www.jama.com
cine. The influence on medicine is fully
as great as was that of Andreas Vesalius’
de corporis humani Fabrica, which was correct diploid chromosome number of mongolism (mercifully renamed
published in 1543 and was the basis of 46 (not 48, as previously thought) was Down syndrome)8 and others
Harvey’s physiology of the circulation established.5,6 It is remarkable that it was described the numerical abnormalities
(1628) and Morgagni’s morbid anatomy not until 3 years after the determina- of the sex chromosomes in the Turner
(1761). tion of the double-helical structure of and Klinefelter syndromes that same
The history of medical genetics4 can DNA by Watson and Crick7 that the cor- year. In the early 1960s, abnormalities
be discussed in 2 parts, the pre-1956 rect number of chromosomes in hu- in chromosome number and structure
foundations of medical genetics going mans was determined. The advance was were described in other congenital
back to Mendel and the greats of the significant to medicine, not because of malformation syndromes, such as tri-
first half of the 20th century, and the the specific numerology but because of somies 13 and 18, and in a variety of
developments in the period since 1956, the associated simple improvements in translocations, deficiencies, mosaics,
during which medical genetics has technique that made chromosome analy- and, in spontaneously aborted tissue,
evolved into a full-fledged clinical and sis feasible in the study of disease and triploidy. The finding of a rather
academic field. The objective of this ar- in clinical diagnosis. Medical genetics, consistent chromosomal change in
ticle is to trace the influence of chro- which really did not exist as a clinical
Author Affiliation: McKusick-Nathans Institute of Ge-
mosome studies, gene mapping, and specialty before 1956, was given its own netic Medicine, Johns Hopkins University School of
DNA sequencing (the Human Ge- organ, the nucleus, just as cardiology had Medicine, Baltimore, Md.
nome Project [HGP]) on the evolu- the heart, neurology the nervous sys- Disclosure: Dr McKusick was a member of the Pro-
gram Advisory Committee for the National Institutes
tion of medical genetics since 1956. tem, etc. of Health Human Genome Project 1989-1992 and is
Not only was the correct chromo- a member of the Scientific Advisory Board of Celera
Genomics Inc.
Clinical Cytogenetics some number established with the Corresponding Author and Reprints: Victor A. McKu-
(Chromosomology) improved techniques, but also, in sick, MD, McKusick-Nathans Institute of Genetic Medi-
cine, Johns Hopkins Hospital, 600 N Wolfe St, Blalock
1956 was a watershed year in the his- 1959, Jerome Lejeune found the addi- Bldg, Room 1007, Baltimore, MD 21287-4922 (e-mail:
tory of medical genetics. In that year, the tional small chromosome underlying mckusick@peas.welch.jhu.edu).

©2001 American Medical Association. All rights reserved. (Reprinted) JAMA, November 14, 2001—Vol 286, No. 18 2289

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005


THE ANATOMY OF THE HUMAN GENOME

cell division when they are extended, be on the distal end of the long arm of
Figure 1. Progress in Mapping of Genes to
Specific Chromosomes Through the Early banding methods made it possible to rec- the X in 1973.17
Stages of the Human Genome Project ognize small deletions and to interpret It was not until 1968, when 68 loci
chromosomal rearrangements. Corre- were already known to be on the X
2500
X-Chromosomal
lation of the specifically interpreted chromosome, that a gene was mapped
Autosomal karyotype with phenotype led to the de- to a specific autosome, ie, the Duffy
2000 scription of new microdeletion syn- blood group gene to chromosome 1.18
No. of Genes Mapped

dromes, such as Williams syndrome (on- This was achieved by Roger Donahue,
1500 line Mendelian Inheritance in Man then a Johns Hopkins University PhD
[OMIM] 104050), and the related con- candidate in human genetics, through
1000 cept (and designation) of contiguous a linkage study of a chromosome 1 het-
gene syndromes, eg, DiGeorge syn- eromorphism (one chromosome 1 was
500
drome (OMIM 188400). 12 Further- unusually long and appeared in the pre-
more, it allowed the large area of hema- banding karyotypes to have an un-
tologic malignancies that result from coiled region near the centromere) that
0
68 70 72 74 76 78 80 82 84 86 88 90 reciprocal translocations with creation he had found in his own family.
19 19 19 19 19 19 19 19 19 19 19 19
Year
of fusion genes to be studied. The Phila- Progress in gene mapping is shown
delphia chromosome was the first of in FIGURE 1. The largest part of the
these; the total number of examples is progress in the 1970s19 was through
chronic myeloid leukemia9 in 1960 now more than 100.13 study of interspecies somatic cell hy-
provided an early confirmation of In the last 20 years, molecular cyto- brids, particularly cells produced by fus-
Theodor Boveri’s chromosome theory genetics, “chromosome painting,” and ing human and mouse cells.20 In cells
of cancer.10 Named for the city of resi- in situ hybridization for identification derived by cell division from such hy-
dence of its discoverers and patients of deletions and rearrangements are brid cells, the full set of mouse chro-
following the practice of naming only some of the methods used for char- mosomes are retained, whereas indi-
hemoglobin variants, the Philadelphia acterizing the karyotype in clinical ap- vidual human chromosomes are lost
chromosome (Ph1) was thought to plications. more or less at random. The presence
represent a partially deleted chromo- or absence of a particular human cell
some 21. By improved chromosome Gene Mapping trait could be correlated with the pres-
staining techniques, it was shown in The first gene to be mapped to a spe- ence or absence of a particular human
1973 that chromosome 22, not 21, is cific chromosome in any species was chromosome in the derivative cells to
involved and that the change that probably the one for colorblindness. In determine that the gene for that trait
causes the abnormally short Ph1 chro- 1911, cytologist E. B. Wilson14 con- was located on that chromosome. An
mosome is not a deletion but rather a cluded that the characteristic pedigree early example (in 1971) was mapping
reciprocal translocation between chro- pattern of this trait, described by Pliny of the gene that encodes the enzyme
mosomes 9 and 22.11 Earle in Philadelphia, Pa, in 184515 and thymidine kinase to chromosome 17.21
The ability to study the chromo- by Friedrich Horner in Zurich, Swit- Molecular genetics came to gene
somes in cultured cells in the amni- zerland, in 1876, was explained if the mapping about 1980 and contributed
otic fluid inaugurated the field of pre- trait is recessive, the gene is on the X to the field in 3 ways: (1) It provided
natal diagnosis of Down syndrome and chromosome, and humans have a fe- DNA probes for analysis of somatic cell
other chromosomal aberrations by am- male-XX/male-XY sex chromosome hybrids so that one could “go directly
niocentesis beginning about 1966. The constitution. In the following de- for the gene” and not require expres-
characterization in cultured cells of en- cades, a host of disorders were de- sion of the human gene in the hybrid
zyme deficiencies in inborn errors of duced to be X-linked from the charac- cell. (2) It provided DNA probes (at first
metabolism had also progressed to the teristic pedigree pattern, so that by radioactive, later fluorescent) for in situ
point that many of these could like- publication of the second edition of hybridization to chromosomes. This di-
wise be diagnosed prenatally by study Mendelian Inheritance in Man in 1968,12 rect method for direct mapping was first
of amniotic fluid cells. the catalog of X-linked phenotypes had made to work reliably for single-copy
About 1970, various staining meth- 68 asterisked (seemingly confirmed) genes in 1981. 22 (3) Most impor-
ods were developed that showed band- entries, each presumably related to a dif- tantly, molecular genetics provided an
ing of chromosomes. The distinctive pat- ferent gene. The precise localization on abundance of DNA markers that could
tern of this banding permitted unique the X chromosome of these genes was be used for family linkage studies.23 Pre-
identification of each chromosome. not known; the first regional mapping viously, such studies were seriously
When combined with methods for was for the linked genes for colorblind- hampered by the pitifully small hand-
studying the chromosomes at a stage of ness and G6PD deficiency,16 shown to ful of available marker traits, ie, a few
2290 JAMA, November 14, 2001—Vol 286, No. 18 (Reprinted) ©2001 American Medical Association. All rights reserved.

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005


THE ANATOMY OF THE HUMAN GENOME

blood groups and serum or red blood family with the Mendelian disorder. was first shown by Avery, McLeod, and
cell proteins in which allelic variation Sometimes the gene for an enzyme or McCarty in 1944 in pneumococcus.30
could be demonstrated by immuno- other protein had already been mapped They found that the so-called transform-
logic, electrophoretic, or other meth- to the region and, thus, was a candi- ing factor, which converted one pneu-
ods. The abundant DNA markers first date gene; in other cases, a previously mococcus form to another, is DNA.
included restriction fragment length unknown gene was found strictly by its In 1953, Watson and Crick 7 de-
polymorphisms, followed by variable location in the chromosomal region duced the double-helical structure of
number tandem repeats, microsatel- identified by linkage mapping of the ge- DNA from x-ray diffraction data. The
lites or short tandem repeats, and, most netic disorder. genetic code of nucleotide triplets, each
recently, single-nucleotide polymor- It took 10 years for the gene mutant specifying a particular amino acid, was
phisms. in Huntington disease to be isolated by worked out in final detail in 1966. In
By 1985 when the HGP, as an initia- positional cloning, which occurred in the late 1960s, restriction enzymes,
tive to sequence completely the DNA of 1993.26 This was partly because of ge- which cut DNA at specific sites and,
the human genome, was first formally netic peculiarities of that region of 4p thus, could be used as scalpels for the
proposed, about 700 genes had been (ie, many genes and a relatively high rate dissection of the genome, were discov-
mapped to specific chromosomes and, of recombination) and particularly be- ered. In the early 1970s, it was found
for many of these genes, to specific re- cause it was a new type of mutation, an that genes (including those of hu-
gions of chromosomes. The genes expanded trinucleotide repeat. mans) could be cloned in abundance
mapped included those for blood groups, The first 4 successes with positional by splicing DNA into a bacterial plas-
enzymes, clotting factors, structural pro- cloning were chronic granulomatous mid31 and growing the bacteria—the so-
teins, and so on. Importantly, they also disease27 and Duchenne muscular dys- called recombinant DNA technology.
included the genes mutated in mystery trophy28 in 1986 and retinoblastoma In 1977, improved methods of DNA
diseases, so termed because at the time and cystic fibrosis29 in 1989. Cystic fi- sequencing were reported by Maxam
of mapping the nature of the basic de- brosis was the first to be elucidated by and Gilbert32 and by Sanger et al.33 Re-
fect was unknown. The usefulness of positional cloning without the assis- markably, the dideoxy method of
gene mapping to clinical medicine3 was tance of a cytogenetically visible dele- Sanger et al remains the technologic
particularly evident in connection with tion. In the decade that followed, map- cornerstone of the HGP; the method has
these disorders. It was the availability of based gene discovery became a leading been modified extensively with re-
an abundance of DNA markers for fam- paradigm in biomedical research. All spect to automation and efficiency but
ily linkage studies that advanced clini- specialties of medicine used it to study remains fundamentally the same.
cal application of gene mapping. some of their most puzzling disor- The circular bacteriumlike chromo-
The first mystery disease to be mapped ders. Once the disease gene and its some of the cytoplasmic organelle, the
through linkage to DNA markers was mutations were identified, specific mitochondrion, was completely se-
Huntington disease in 1983, located on DNA-based diagnostic tests could be de- quenced, all 16 569 nucleotides, by
the end of the short arm of chromo- signed. Furthermore, scientists were in Sanger’s group in 1981.34 Thus, some in-
some 4.24 The clinical applicability of the a better position to determine patho- vestigators were emboldened to pro-
information was immediately evident. By genetic mechanisms, the steps be- pose sequencing the entire nuclear ge-
the linkage principle, one could now tween gene and phene, ie, between nome. An early proposal for complete
make diagnoses prenatally and premor- genotype and phenotype. That infor- sequencing, ie, the HGP, came from the
bidly, provided that DNA of relatives mation can often help investigators de- US Department of Energy, which had re-
was available for testing and DNA mark- vise methods of intervention for treat- sponsibilities in the area of the muta-
ers near the gene were found to be ment or secondary prevention. tional effects of radiation. Importance in
appropriately heterozygous in specific the solution of problems of cancer was
individuals. Considerable experience The Ultimate Anatomy: cited by Renato Dulbecco35 as the main
with the psychosocial problems sur- The Sequence of reason to undertake the HGP. Indeed,
rounding predictive DNA testing then the Human Genome genomics has had perhaps its greatest
followed. As noted earlier, a considerable num- impact on cancer. Usefulness to the un-
The other useful application of gene ber of genes had been mapped before derstanding of birth defects of map-
mapping was for disease gene discov- the HGP was formally proposed. Fur- ping all the genes had been proposed ear-
ery through positional cloning.25 The thermore, positional cloning for isola- lier.36 In the 1920s, Haldane pointed out
procedure used was to identify mark- tion of disease genes had been con- the usefulness of linkage in diagnosis,
ers (ideally flanking markers) linked to ceived, and proof of principle had been and in his “Croonian Lecture” in 1948
the disease locus to search the region provided in 1986. he wrote that the “final aim . . . should
for genes and to scrutinize those genes DNA was discovered in the 19th cen- be the enumeration and location of all
for a mutation that cosegregates in the tury. That the genetic material is DNA the genes found in normal human
©2001 American Medical Association. All rights reserved. (Reprinted) JAMA, November 14, 2001—Vol 286, No. 18 2291

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005


THE ANATOMY OF THE HUMAN GENOME

beings.”37 The complete sequence was the nonhuman genome projects, ie, computerization. The approach was
needed for finding all the genes. those involving model organisms. Com- validated by the sequencing of Dro-
The HGP was discussed, debated, and parative genomics is a valuable way to sophila. The genomes of 5 specific hu-
planned between 1985 and 1990 and gain understanding of the structure and mans representing 4 different ethnic
had its official start in the United States function of the human genome and its backgrounds and both sexes were se-
on October 1, 1990.38 A National Re- genes. quenced by Celera. 50 The publicly
search Council/National Academy of Expressed sequence tags (ESTs), that funded HGP rose to the competitive
Science (NRC/NAS) committee39 on is, complementary DNA created from challenge and accelerated its sequenc-
mapping and sequencing the human ge- messenger RNA by reverse transcrip- ing, with a coordination of efforts in sev-
nome was commissioned in late 1986 tion, were developed in 199144 as a short- eral laboratories in the United States,
and reporting in February 1988 sug- cut to the coding part of the human ge- United Kingdom, Japan, France, Ger-
gested that complete mapping and se- nome. Large EST databases for humans many, and elsewhere, under the lead-
quencing could be achieved in 10 to 15 and many other species have been valu- ership of Francis Collins.51
years at a cost, in add-on funding, of able to comparative genomics. The data generated by the publicly
about $200 million per year. In retro- The first free-living organism in which funded HGP are available in public da-
spect, this seems in some ways like a the genome was completely sequenced tabases free of charge. The sequence
remarkably rash conclusion. Polymer- was the bacterium Haemophilus influen- data generated by Celera collated with
ase chain reaction was announced at a zae, with 1830137 nucleotides and about the publicly available data and with an-
Cold Spring Harbor, NY, meeting in 1800 protein-coding genes. This se- notation as well as computer-based
1986,40 where the status of the gene map quence was determined by the group of methods of analysis, are available to aca-
of Homo sapiens was reviewed41 and the J. Craig Venter, PhD,45 using a bot- demic researchers by subscription and
HGP was discussed actively in a rump tom-up approach. The DNA of the cir- to pharmaceutical and other nonaca-
session. Yeast artificial chromosomes cular bacterial chromosome was bro- demic laboratories at a substantially
were invented in 1987, and bacterial ar- ken into segments by shearing, the higher subscription rate.
tificial chromosomes and plasmid ar- segments were cloned and sequenced at At the White House on June 26, 2000,
tificial chromosomes were introduced random, and the individual sequences Collins and Venter announced comple-
as other mechanisms for cloning large were then assembled through recogni- tion of initial public and private drafts
DNA segments later. The most poly- tion of identity at overlapping ends. of the human sequence. These were
morphic and, therefore, useful link- Thereafter, the genomes of a consider- published in mid-February 2001—
age markers, the microsatellites, were able number of other microorganisms the publicly funded results from labo-
discovered in 1989 and the early 1990s. were sequenced by the same approach, ratories in the United States, United
In the end, however, the estimates including Helicobacter pylori, Mycobac- Kingdom, and elsewhere in Nature51
proved not far off. terium tuberculosis, and Treponema pal- and the results of Celera in Science.50
The NRC/NAS committee recom- lidum. In 1996, baker’s yeast (Saccharo- In each journal, accompanying ar-
mended “map first, sequence later”39 be- myces cerevisiae) was the first nucleated ticles described some of the implica-
cause the sequencing technology was not (eukaryotic) organism to be com- tions of the new information.
yet at an efficient and economical level pletely sequenced.46 The nematode Cae- When the complete human genome
and because the maps, both genetic (eg, norhabditis elegans was the first multi- sequence was available, the total num-
of microsatellite markers42) and physi- cellular organism to be completely ber of genes was only about half the
cal (eg, of yeast artificial chromosome sequenced, in 199847,48 and the com- number previously estimated52 and little
clones43), would be useful to the final se- plete sequence of the geneticists’ pet, Dro- more than twice the number in a much
quencing. Thus, the HGP of the Na- sophila melanogaster, was reported in simpler organism such as C elegans. The
tional Institutes of Health adopted a March 2000. 49 The first 2 were se- increased complexity of the human, as
top-down approach when it was initi- quenced clone by clone and the third by compared with the worm, for ex-
ated October 1, 1990. James D. Wat- a combination of the clone-by-clone and ample, is achieved by increasing the
son, PhD, was the first director; he was random (“shotgun”) methods. number of different proteins encoded
succeeded by Francis S. Collins, MD, Following the success with random by single genes, through alternative
PhD, in 1993. The sequencing was sequencing of clones, with subse- splicing of messenger RNA, posttran-
performed clone by clone after the quent assembly, in microorganisms, scriptional and posttranslational modi-
construction of genetic and physical Venter and colleagues undertook the fications, formation of heteromeric
maps. Another recommendation of the same in humans. They established a pri- proteins (ie, proteins combining the
NRC/NAS committee was that model or- vate company (Celera Genomics Inc) products of 2 or more different genes),
ganisms be studied in parallel with the to work on a factory scale: DNA and and so on. The estimated 30 000 to
human.39 Some of the most interesting clone preparation, sequencing, and as- 40 000 genes encode more than 10
and contributory parts of the HGP are sembly, all assisted by automation and times that number of proteins. This has
2292 JAMA, November 14, 2001—Vol 286, No. 18 (Reprinted) ©2001 American Medical Association. All rights reserved.

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005


THE ANATOMY OF THE HUMAN GENOME

resulted in a partial shift of focus from archival advantage and that of accessi-
Figure 2. Growth of Mendelian Inheritance
the gene to proteins and from genom- bility in a nonelectronic setting, as well in Man: A Catalog of Human Genes and
ics to proteomics.53-55 as ease of browsing. Genetic Disorders (MIM)12 and Its Online
Another interesting but not new find- Throughout its history, MIM (and Version (OMIM)
ing in the sequences published in Feb- OMIM) has attempted comprehensive 14 000
ruary 200150,51 is the nonuniform den- cataloging of gene mapping, especially 13 083
OMIM
13 000
sity of genes within chromosomes any gene related to disease, and com- October 21, 2001
12 000
(genes tend to be concentrated at the prehensive cataloging of specific disease-
ends of chromosome arms) and be- related mutations. The number of genes 11 000

tween chromosomes. Chromosomes 19 with 1 or more disease-related muta- 10 000

No. of MIM Entries


and 22 are particularly gene-rich and tions passed the 1000 mark about Janu- 9000

chromosomes 13, 18, and 21 are rela- ary 1, 2001.56 FIGURE 3A indicates the 8000
MIM12

tively gene-poor. Related to the latter pace of disease-related gene identifica- 7000
MIM11
observation may be the fact that chro- tion during the last 20 years.53 Figure 3B 6000
MIM10
mosomes 13, 18, and 21 are involved indicates the pace at which specific ge-
5000 MIM9
in the only autosomal trisomies that are netic disorders have been character- MIM8
4000
compatible with live birth. Before ized at the DNA level.53 The total num- MIM7
MIM6
3000
completion of the HGP, the informa- ber of characterized disorders (more MIM5
MIM4
MIM Editions
tion on variation in gene density was than 1600) exceeds the number of dis- 2000 MIM3
MIM2 OMIM
already known on the basis of num- ease-related genes (more than 1200) be- 1000 MIM1

bers of genes mapped and indirectly on cause many such genes are the site of 1965 1975 1985 1995 2005
Year
the basis of GC vs AT content; high GC mutations causing more than one dis-
correlates with high gene count. None- tinct disorder, eg, the ␤-globin gene, Each entry is an essay on a particular phenotype (usu-
theless, precise confirmation by the which is the site of mutations causing ally a disorder) or gene, with extensive bibliographic
references and, in the case of OMIM, links to many
HGP was useful. sickle cell disease, thalassemia, Heinz other sources of information.
body hemolytic anemia, methemoglo-
The Morbid Anatomy binemia, erythremia, and so on. In part,
of the Human Genome: the excess of distinct, molecularly de- We may soon have a complete catalog
How Far Have We Come? fined disorders over the number of genes of the genes, but we do not know all of
The anatomic metaphor is useful be- involved is a corollary of the “one gene, the protein products of all of those
cause it extends to the comparative many proteins” phenomenon sup- genes or the function of all of those pro-
anatomy and evolution of the human ported by the unexpectedly low total teins, or even a majority of them, in iso-
genome, as well as to its functional gene counts from analysis of the hu- lation, let alone in concert with oth-
anatomy, developmental anatomy, and, man genome sequence. ers. We do not know the worldwide
particularly in the medical context, to The counts in Figure 3 include both variation in the genes. We do not know
its morbid anatomy.2 germline (heritable) and somatic mu- the correlation between structural varia-
Progress in the last 40 years in de- tations. They do not include about 100 tion in the genes and variation in func-
fining the morbid anatomy of the hu- disease-related genes first identified as tion as reflected in the phenotype. These
man genome is chronicled in MIM, a translocation-fusion partners in leuke- matters will occupy biology and medi-
catalog of human genes and genetic dis- mias and some solid tumors. The counts cine for a long time to come.
orders (FIGURE 2).12 Computerized do include some genes in which spe- Progress in the HGP has brought sev-
since 1964, MIM was a pioneer in com- cific susceptibility or resistance alleles eral paradigm shifts that are relevant to
puter-based publication (eg, the first have been identified through associa- the neo-Vesalian basis of medicine pro-
edition in 1966) and has been avail- tion studies. vided by this ultimate anatomy of the ge-
able online as OMIM since 1987. The nome. A shift from genomics57 to pro-
periodic print editions, most recently Where Do We Go From Here? teomics,55 or at least an extension to
the 12th, published in 3 volumes in Clearly, there is much we don’t know, proteomics, comes from the realiza-
1998, represent serial cross-sections of as reflected in the quotes: tion that several or even many different
the field of genetic medicine in the last proteins can be encoded by a single gene,
As the radius of knowledge gets longer, the
35 years. OMIM has the advantage of circumference of the unknown expands as must be the case to account for the
daily updating, ease of searching, and even more. increased complexity of humans as com-
ease of linking to related sources of in- —Anonymous pared with C elegans and Drosophila,
formation, such as that on DNA and How is it that we know so little, given that which have one third or one half as many
protein structure and that on the re- we have so much information? genes as humans. A shift from map-
lated biomedical literature; MIM has the —Noam Chomsky based gene discovery to sequence-
©2001 American Medical Association. All rights reserved. (Reprinted) JAMA, November 14, 2001—Vol 286, No. 18 2293

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005


THE ANATOMY OF THE HUMAN GENOME

based gene discovery has occurred be- specific diagnoses at earlier stages. It Better understanding of individual ge-
cause availability of databases with will surely advance gene therapy. In a nomic constitutions should permit drug
expressed sequence tags or complete se- more general way, genomics is likely to therapy to get away from the one-size-
quence information on many different render medicine more predictive and, fits-all approach. It should allow selec-
organisms has made research in silico therefore, more preventive. Compre- tion of drugs likely to be more effec-
(cybergenomics) possible. hensive “genome screens” for recogni- tive in the treatment of a given disorder
A shift of emphasis from relatively tion of individual susceptibilities to in a given individual. Even though a
rare single-gene disorders to common common disorders can be foreseen. Ge- particular drug may be effective in the
disorders of multifactorial causation nomics-based clinical medicine will re- treatment of a disorder in a particular
(complex traits) has occurred, now that quire that primary care physicians be patient, the genomic constitution of the
it is possible that susceptibility alleles competent in the interpretation of gene patient may place him or her at an in-
that collaborate in causation can be screens and in advising appropriate creased risk of adverse drug reaction.
identified. Such alleles may be found health measures. Identifying such risks is part of pick-
through studies of association be- At the same time, on the traditional ing the right treatment for the right pa-
tween the particular disorder and the turf of clinical medicine, diagnosis will tient and can reduce the very consid-
multitude of DNA markers, particu- become more specific and precise, and erable toll of iatrogenic morbidity and
larly single nucleotide polymor- treatment also more specific and safer. mortality.
phisms, now available. As a conse- Genomics-based individualization of As reflected in the large investments
quence, DNA testing will no longer be medical care aims to achieve the right in genomics by pharmaceutical firms, ge-
limited to specific diagnosis of Mende- treatment for the right patient.59 More nomics is anticipated to lead to identi-
lian disorders but can be extended to precise characterization of the ge- fication of new drug targets, ie, genes and
recognition of vulnerability or resis- nome in common disorders, such as hy- gene products involved in physiologic
tance to common disorders. pertension and mental illness, will iden- processes that can be enhanced or down-
tify diagnostic subtypes for which regulated by custom designed drugs,
The Neo-Vesalian Influence different therapies will be more effec- pharmacogenomics. 5 9 Genomics-
on 21st-Century Medicine tive. Cancer medicine is at the fore- based drug development should lead to
As indicated in more detail else- front in the use of genomics to match entirely new medications for disorders
where,58 the availability of the human specific diagnosis with specific treat- not now treatable and to more effective
genome sequence and information on ment. Morphologically indistinguish- and safer replacements for current drugs.
proteomics related to the sequence is able neoplasms have been shown on In summary, chromosome analysis,
likely to change medicine in many ways. “biopsy” of their altered genomes to be gene mapping, and complete sequenc-
It will influence reproductive medi- different, suggesting that different treat- ing of the genome provide an anatomic
cine, for example, permitting ever more ments are indicated. basis for all aspects of clinical medicine.

Figure 3. Pace of Disease Gene Discovery and Molecular Characterization of Clinical Disorders, 1981-2000

A No. of Genes Discovered With Disease-Related Mutations B No. of Clinical Disorders Characterized at the Molecular Level
175 200
(6)
(12) (6)
150

150
125 (5) (11)
(13)
100
(8)
No.

No.

(8) 100
75 (8) (9)
(6)
50 (2)
50

25
(1) (1)
(1) (1)
0 0
82 84 86 88 90 92 94 96 98 00 82 84 86 88 90 92 94 96 98 00
19 19 19 19 19 19 19 19 19 20 19 19 19 19 19 19 19 19 19 20
Year Year

Reprinted with permission from Peltonen L, McKusick VA. Dissecting human disease in the post-genomic era. Science. 2001;291:1224-1229.53 A, The number of
disease genes discovered by the end of 2000 was 1112, including both germline and neoplasia-related somatic mutations. This number does not include all the genes
identified as translocation gene-fusion partners in neoplastic disorders. Numbers in parentheses indicate genes with disease-related polymorphic alleles (susceptibility
genes). B, The number of clinical disorders characterized by the end of 2000 was 1430. This number does not include the many neoplastic disorders caused by translocation-
related fusion genes.

2294 JAMA, November 14, 2001—Vol 286, No. 18 (Reprinted) ©2001 American Medical Association. All rights reserved.

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005


THE ANATOMY OF THE HUMAN GENOME

As with any powerful new develop- netic is one problem. Another is the mis- a particular outcome is not 100% but
ment, there are hazards of misuse of in- conception of determinism, the idea that some lower number means that there
formation or techniques. Privacy and the phenotype is hard-wired to the geno- may be lifestyle, dietary, or medical mea-
confidentiality are issues of impor- type. Most genetic tests will provide an sures that can be used to reduce the prob-
tance. A serious risk may be public mis- answer in probabilistic terms. It is use- ability even further. Clearly, education
understanding that will prevent maxi- ful for persons to understand the prin- of both the public and health care pro-
mum benefit to be realized from the ciples of probability in relation to their fessionals is vital if the full benefits of neo-
information. Blind fear of anything ge- own health. The fact that the chance of Vesalian medicine are to be realized.

REFERENCES
1. McKusick VA. The anatomy of the human ge- 21. Miller OJ, Allderdice PW, Miller DA. Human thy- Human Genome. Washington, DC: National Acad-
nome. Am J Med. 1980;69:267-276. midine kinase gene locus: assignment to chromo- emy Press; 1988.
2. McKusick VA. The human genome through the eyes some 17 in a hybrid of man and mouse cells. Science. 40. Mullis K, Faloona F, Scharf S. Specific enzymatic
of Mercator and Vesalius. Trans Am Clin Climatol As- 1971;173:244-245. amplification of DNA in vitro: the polymerase chain
soc. 1981;42:66-90. 22. Harper ME, Ullrich A, Saunders GF. Localization reaction. Cold Spring Harb Symp Quant Biol. 1986;
3. McKusick VA. The morbid anatomy of the human of the human insulin gene to the distal end of the short 51:265-273.
genome: a review of gene mapping in clinical medi- arm of chromosome 11. Proc Natl Acad Sci U S A. 41. McKusick VA. The gene map of Homo sapiens:
cine. Medicine. 1986;65:1-33, 1987;66:1-63, 1987; 1981;78:4458-4460. status and prospectus. Cold Spring Harb Symp Quant
66:237-206, 1988;67:1-19. 23. Botstein D, White RL, Skolnick M, Davis RW. Con- Biol. 1986;51:15-27, 1123-1208.
4. McKusick VA. History of medical genetics. In: struction of a genetic linkage map in man using re- 42. Dib C, Faure S, Fizames C, et al. A comprehen-
Rimoin DL, Connor JM, Pyeritz RE, eds. Emery- striction fragment length polymorphisms. Am J Hum sive genetic map of the human genome based on 5264
Rimoin Principles and Practice of Medical Genetics. Genet. 1980;32:314-331. microsatellites. Nature. 1996;380:152-154.
4th ed. Edinburgh, Scotland: Churchill Livingstone; 24. Gusella JF, Wexler NS, Conneally PM, et al. A poly- 43. Chumakov IM, Rigault P, Le Gall I, et al. YAC con-
2001:3-36. morphic DNA marker genetically linked to Hunting- tig map of the human genome. Nature. 1995;377
5. Tjio JH, Levan A. The chromosome number in man. ton’s disease. Nature. 1983;306:234-238. (suppl):175-297.
Hereditas. 1956;42:1-6. 25. Collins FS. Positional cloning: let’s not call it re- 44. Adams MD, Kelley JM, Gocayne JD, et al. Comple-
6. Ford CE, Hamerton JC. The chromosomes of man. verse anymore. Nat Genet. 1992;1:3-6. mentary DNA sequencing: expressed sequence tags
Nature. 1956;178:1020-1023. 26. Huntington’s Disease Collaborative Research and human genome project. Science. 1991;252:1651-
7. Watson JD, Crick FH. Molecular structure of nucleic Group. A novel gene containing a trinucleotide re- 1656.
acids: a structure for deoxyribose nucleic acid. Na- peat that is expanded and unstable on Huntington’s 45. Fleischmann RD, Adams MD, White O, et al.
ture. 1953;171:737-738. disease chromosomes. Cell. 1993;72:971-983. Whole-genome random sequencing and assembly of
8. Allen G, Benda CE, Böök JA, et al. “Mongolism.” 27. Royer-Pokora B, Kunkel LM, Monaco AP, et al. Haemophilus influenzae Rd. Science. 1995;269:496-
Lancet. 1961;1:775. Cloning the gene for an inherited human disorder— 512.
9. Nowell PC, Hungerford DA. A minute chromo- chronic granulomatous disease—on the basis of its 46. Goffeau A, Barrell BG, Bussey H, et al. Life with
some in human chronic granulocytic leukemia. Sci- chromosomal location. Nature. 1986;322:32-38. 6000 genes. Science. 1996;274:546-567.
ence. 1960;132:1497. 28. Monaco AP, Neve RL, Colletti-Feener C, Bertel- 47. C elegans Sequencing Consortium. Genome se-
10. McKusick VA. Marcella O’Grady Boveri (1865- son CJ, Kurnit DM, Kunkel LM. Isolation of candi- quence of the nematode C elegans: a platform for in-
1950) and the chromosome theory of cancer. J Med date cDNAs for portions of the Duchenne muscular vestigating biology. Science. 1998;282:2012-2018.
Genet. 1985;22:431-440. dystrophy gene. Nature. 1986;323:646-650. 48. Chervitz SA, Aravinda L, Sherlock G, et al. Com-
11. Rowley JD. A new consistent abnormality in 29. Rommens JM, Iannuzzi MC, Kerem B, et al. Iden- parison of the complete protein sets of worm and yeast:
chronic myelogenous leukemia identified by quina- tification of the cystic fibrosis gene: chromosome walk- orthology and divergence. Science. 1998;282:2022-
crine fluorescence and Giemsa staining. Nature. 1973; ing and jumping. Science. 1989;245:1059-1065. 2028.
243:290-293. 30. Avery OT, MacLeod CM, McCarty M. Studies on 49. Adams MD, Celniker SE, Holt RA, et al. The ge-
12. McKusick VA. Mendelian Inheritance in Man. Bal- the chemical nature of the substance inducing trans- nome sequence of Drosophila melanogaster. Sci-
timore, Md: Johns Hopkins University Press; 1966. [On- formation of pneumococcal types: induction of trans- ence. 2000;287:2185-2195.
line Mendelian Inheritance in Man. 12th ed. 1998. formation by a deoxyribonucleic acid fraction from Pneu- 50. Venter JC, Adams MD, Myers EW, et al. The se-
Available at: http://www.ncbi.nlm.nih.gov/omim.] mococcus type III. J Exp Med. 1944;79:137-158. quence of the human genome. Science. 2001;291:
13. Mitelman F. Catalog of Chromosome Aberrations 31. Cohen S, Cang A, Boyer H, Helling R. Construc- 1304-1351.
in Cancer. 5th ed. New York, NY: Wiley-Liss; 1994. tion of biologically functional bacterial plasmids in vitro. 51. International Human Genome Sequencing Con-
14. Wilson EB. The sex chromosomes. Arch Mikrosk Proc Natl Acad Sci U S A. 1973;70:3240-3244. sortium. Initial sequencing and analysis of the human
Anat Entwicklungsmech. 1911;77:249-271. 32. Maxam AM, Gilbert W. A new method for se- genome. Nature. 2001;409:860-921.
15. Earle P. On the inability to distinguish colors. Am quencing DNA. Proc Natl Acad Sci U S A. 1977;74: 52. Field C, Adams MD, White O, Venter JC. How
J Med Sci. 1845;9:346-354. 1258-1262. many genes in the human genome? Nat Genet. 1994;
16. Porter IH, Schulze J, McKusick VA. Genetical link- 33. Sanger F, Nicklen S, Coulson AR. DNA sequenc- 7:345-346.
age between the loci for glucose-6-phosphate dehy- ing with chain-terminating inhibitors. Proc Natl Acad 53. Peltonen L, McKusick VA. Dissecting human dis-
drogenase deficiency and colour-blindness in Ameri- Sci U S A. 1977;74:5463-5467. ease in the post-genomic era. Science. 2001;291:
can Negroes. Ann Hum Genet. 1962;16:107-122. 34. Anderson S, Bankier AT, Barrell BG, et al. Se- 1224-1229.
17. Ricciuti FC, Ruddle FH. Assignment of three gene quence and organization of the human mitochon- 54. Wilkins MR, Pasquali C, Appel RD, et al. From pro-
loci (PGK, HGPRT, and G6PD) to the long arm of the drial genome. Nature. 1981;290:457-465. teins to proteomes: large scale protein identification
human X-chromosome by somatic cell genetics. Ge- 35. Dulbecco R. A turning point in cancer research: by two-dimensional electrophoresis and amino acid
netics. 1973;74:661-678. sequencing the human genome. Science. 1986;231: analysis. Biotechnology. 1996;14:61-65.
18. Donahue RP, Bias WB, Renwick JH, McKusick VA. 1055-1056. 55. Pandey A, Mann M. Proteomics to study genes
Probable assignment of the Duffy blood group locus 36. McKusick VA. Prospects for progress. In: Fraser and genomes. Nature. 2000;405:837-846.
to chromosome 1 in man. Proc Natl Acad Sci U S A. FC, McKusick VA, eds. Congenital Malformations. Vol 56. Antonarakis SE, McKusick VA. OMIM passes the
1968;61:949-955. 3. Amsterdam, the Netherlands: Excerpta Medica; 1000 disease-gene mark. Nat Genet. 2000;25:11.
19. McKusick VA, Ruddle FH. The status of the gene 1970:407. 57. McKusick VA, Ruddle FH. A new discipline, a new
map of the human chromosomes. Science. 1977;396: 37. Haldane JB. The formal genetics of man. Proc R name, a new journal. Genomics. 1987;1:1-2.
390-405. Soc B. 1948;135:147-170. 58. Collins FS, McKusick VA. Implications of the hu-
20. Weiss M, Green H. Human-mouse hybrid cell lines 38. McKusick VA. Mapping and sequencing the hu- man genome project for medical science. JAMA. 2001;
containing partial complements of human chromo- man genome. N Engl J Med. 1989;320:910-915. 285:540-544.
somes and functioning human genes. Proc Natl Acad 39. Alberts BM, Botstein D, Brenner S, et al. Report 59. Roses AD. Pharamacogenomics and the practice
Sci U S A. 1967;58:1104-1111. of the Committee on Mapping and Sequencing the of medicine. Nature. 2000;405:857-865.

©2001 American Medical Association. All rights reserved. (Reprinted) JAMA, November 14, 2001—Vol 286, No. 18 2295

Downloaded from www.jama.com at Washington University - St Louis, on October 27, 2005

You might also like