Professional Documents
Culture Documents
Human DNA Repair Genes
Human DNA Repair Genes
If you wish to distribute this article to others, you can order high-quality copies for your
colleagues, clients, or customers by clicking here.
Permission to republish or repurpose articles or portions of articles can be obtained by
following the guidelines here.
Updated information and services, including high-resolution figures, can be found in the online
version of this article at:
/content/291/5507/1284.full.html
Science (print ISSN 0036-8075; online ISSN 1095-9203) is published weekly, except the last week in December, by the
American Association for the Advancement of Science, 1200 New York Avenue NW, Washington, DC 20005. Copyright
2001 by the American Association for the Advancement of Science; all rights reserved. The title Science is a
registered trademark of AAAS.
ANALYSIS OF GENOMIC INFORMATION
likely GTPases, as indicated by the activity of CIITA 27. L. Aravind, H. Watanabe, D. J. Lipman, E. V. Koonin, of the Integrated Protein Index and A. Uren for critical
and HET-E [E. V. Koonin, L. Aravind, Trends Biochem. Proc. Natl. Acad. Sci. U.S.A. 97, 11319 (2000). reading of the manuscript and useful comments. The
Sci. 25, 223 (2000)]. 28. J. R. Brown, W. F. Doolittle, Microbiol. Mol. Biol. Rev. release of the unpublished WormPep data set by The
14. T. L. Beattie, W. Zhou, M. O. Robinson, L. Harrington, 61, 456 (1997). Sanger Center is acknowledged and greatly appreciated.
Curr. Biol. 8, 177 (1998). 29. We thank E. Birney and A. Bateman (The Sanger Center,
15. E. Diez, Z. Yaraghi, A. MacKenzie, P. Gros, J. Immunol. Hinxton, UK) for kindly providing the preliminary version 25 October 2000; accepted 18 January 2001
164, 1470 (2000).
16. A. M. Verhagen et al., Cell 102, 43 (2000).
17. L. Goyal, K. McCall, J. Agapite, E. Hartwieg, H. Steller,
18.
EMBO J. 19, 589 (2000).
The eukaryotic crown group is the assemblage of
relatively late-diverging, major eukaryotic taxa
Human DNA Repair Genes
whose exact order of radiation is difficult to deter- Richard D. Wood,1* Michael Mitchell,2 John Sgouros,2
mine with confidence. The crown group includes the
multicellular eukaryotes (animals, fungi, and plants) Tomas Lindahl1
and some unicellular eukaryotic lineages such as
slime molds and Acanthamoebae [A. H. Knoll, Science
256, 622 (1992); S. Kumar, A. Rzhetsky, J. Mol. Evol. Cellular DNA is subjected to continual attack, both by reactive species inside
42, 183 (1996)]. cells and by environmental agents. Toxic and mutagenic consequences are
19. The sister group of the classic animal caspase
family of thiol proteases are the paracaspases that
minimized by distinct pathways of repair, and 130 known human DNA repair
thus far have been identified only in animals and genes are described here. Notable features presently include four enzymes that
Dictyostelium; together, these two families consti- can remove uracil from DNA, seven recombination genes related to RAD51, and
tute the sister group of the metacaspases that many recently discovered DNA polymerases that bypass damage, but only one
have been detected in plants, protists, and bacteria
[A. G. Uren et al., Mol. Cell 6, 961 (2000)]. On the system to remove the main DNA lesions induced by ultraviolet light. More
basis of conserved structural features, Uren et al. human DNA repair genes will be found by comparison with model organisms
showed that the paracaspases and metacaspases and as common folds in three-dimensional protein structures are determined.
are specifically related to the caspases, to the
exclusion of other members of the caspase-gingi- Modulation of DNA repair should lead to clinical applications including im-
pain fold [A. Eichinger et al., EMBO J. 18, 5453 provement of radiotherapy and treatment with anticancer drugs and an ad-
(1999)]. vanced understanding of the cellular aging process.
20. The A20 protein is a regulator of apoptosis that
appears to be involved in the NFkB pathway and
interactions with the TRAFs [R. Beyaert, K. Heyn- The human genome, like other genomes, en- through the accession numbers. Recent re-
inck, S. Van Huffel, Biochem. Pharmacol. 60, 1143 codes information to protect its own integrity view articles on the evolutionary relation-
(2000)]. A20 belongs to a distinct family of pre- (1). DNA repair enzymes continuously mon- ships of DNA repair genes (3) and common
dicted thiol proteases that is conserved in all
crown-group eukaryotes and many viruses. None itor chromosomes to correct damaged nucle- sequence motifs in DNA repair genes (4 )
of the members of this family has a known bio- otide residues generated by exposure to car- may also be helpful.
chemical function, but they share two conserved cinogens and cytotoxic compounds. The The functions required for the three dis-
motifs with the cysteine proteases of arteriviruses,
which led to the prediction of the protease activity.
damage is partly a consequence of environ- tinct forms of excision repair are described
A20 and another protein of this family, cezanne, mental agents such as ultraviolet (UV) light separately. These are base excision repair
contain a specialized finger module that is also from the sun, inhaled cigarette smoke, or (BER), nucleotide excision repair (NER), and
found in some proteins of the ubiquitin pathway.
Together with a fusion of an A20-like protease
incompletely defined dietary factors. Howev- mismatch repair (MMR). Additional sections
domain with a ubiquitin hydrolase that has been er, a large proportion of DNA alterations are discuss direct reversal of DNA damage, re-
detected in C. elegans, this suggests a functional caused unavoidably by endogenous weak combination and rejoining pathways for re-
connection between these predicted proteases and mutagens including water, reactive oxygen pair of DNA strand breaks, and DNA poly-
the ubiquitin system [K. S. Makarova, L. Aravind,
E. V. Koonin, Trends Biochem. Sci. 25, 50 (2000)]. species, and metabolites that can act as alky- merases that can bypass DNA damage.
An additional connection between apoptosis and lating agents. Very slow turnover of DNA The BER proteins excise and replace
the ubiquitin system is indicated by the demon- consequently occurs even in cells that do not damaged DNA bases, mainly those arising
stration that, similar to other RING fingers, the one
in TRAF6 is an E3-like ubiquitin ligase pathway [L. proliferate. Genome instability caused by the from endogenous oxidative and hydrolytic
Deng et al., Cell 103, 351 (2000)]. great variety of DNA-damaging agents would decay of DNA (1). DNA glycosylases initiate
21. The AP-GTPase is a previously undetected predict- be an overwhelming problem for cells and this process by releasing the modified base.
ed GTPase typified by the COOH-terminal domain
of the conserved apoptosis regulator, the DAP pro-
organisms if it were not for DNA repair. This is followed by cleavage of the sugar-
tein kinase [B. Inbal et al., Nature 390, 180 (1997)]. On the basis of searches of the current phosphate chain, excision of the abasic resi-
This predicted GTPase family appears to be the draft of the human genome sequence (2), we due, and local DNA synthesis and ligation.
sister group of the RAS/ARF family GTPases, but compiled a comprehensive list of DNA repair Cell nuclei and mitochondria contain several
differs from them in having a divergent P-loop
motif and a THXD instead of the NKXD signature genes (Table 1). This inventory focuses on related but nonidentical DNA glycosylases
motif. Additional AP-GTPases are found in plants genes whose products have been functionally obtained through alternative splicing of tran-
and animals as multidomain proteins that also linked to the recognition and repair of dam- scripts. Three different nuclear DNA glyco-
contain ankyrin, Lrr, and kinase domains. This do-
main architecture suggests that AP-GTPases par- aged DNA as well as those showing strong sylases counteract oxidative damage, and a
ticipate in GTP-dependent assembly of signaling sequence homology to repair genes in other fourth mainly excises alkylated purines. Re-
complexes. organisms. Readers desiring further infor- markably, four of the eight identified DNA
22. A. G. Uren et al., Proc. Natl. Acad. Sci. U.S.A. 96,
10170 (1999).
mation on specific genes should consult glycosylases can remove uracil from DNA.
23. The ZU5 domain is a previously undetected con- the primary references and links available Each of them has a specialized function,
served domain that is present in receptors (such as however. UNG, which is homologous to the
netrin receptors and vertebrate zona pellucida pro- Escherichia coli Ung enzyme, is associated
teins) and cytoskeletal proteins (such as ankyrins) 1
Imperial Cancer Research Fund, Clare Hall Laborato-
and is predicted to be involved in anchoring receptors with DNA replication forks and corrects
ries, Blanche Lane, South Mimms, Herts EN6 3LD, UK.
to the cytoskeleton. 2
Imperial Cancer Research Fund, 44 Lincoln’s Inn
uracil misincorporated opposite adenine.
24. S. L. Ackerman, B. B. Knowles, Genomics 52, 205 SMUG1, which is unique to higher eu-
Fields, London WC2A 3PX, UK.
(1998).
25. H. Sakahira, M. Enari, S. Nagata, Nature 391, 96 *Present address: University of Pittsburgh Cancer In-
karyotes, probably removes the uracil that
(1998). stitute, S867 Scaife Hall, 3550 Terrace Street, Pitts- arises in DNA by deamination of cytosine.
26. A. M. Aguinaldo et al., Nature 387, 489 (1997). burgh, PA 15261, USA. MBD4 excises uracil and thymine specifical-
Chromosome Accession
Gene name (synonyms) Activity
location number
Chromosome Accession
Gene name (synonyms) Activity
location number
Chromosome Accession
Gene name (synonyms) Activity
location number
The human genome sequence has already involved in the repair of damage caused by lipid volved in human DNA repair, disruptions of
markedly influenced the field of DNA repair. peroxidation (1). Other uncharacterized forms the corresponding murine genes have been
Many of the genes listed were discovered as of DNA damage caused by reactive metabolites reported (14), are in progress, or have recent-
investigators searched the expanding database and catabolites may be found. For example, the ly been constructed. The results are begin-
for sequence similarity to genes discovered in genome is dynamic, and single-stranded re- ning to guide searches for additional DNA
model organisms. This approach will no doubt gions are temporarily exposed during DNA rep- repair enzymes. Knockouts of DNA glycosy-
continue, and new human genes will be identi- lication and gene transcription. Positions that lases in mice have unexpectedly mild conse-
fied as additional repair functions are identified are normally protected by base-pairing within quences by comparison with budding yeast
in other systems. One source that is likely to be the double helical structure are then vulnerable and E. coli models. This implies that more
fruitful is the genome of Deinococcus radio- to group-specific reagents, creating new classes backup systems exist, probably because en-
durans (9). This bacterium has an exceptionally of lesions. Alkylating agents can form the cy- dogenous damage presents a more frequent
high resistance to DNA-damaging agents, espe- totoxic lesions 1-methyladenine and 3-methyl- problem for larger genomes.
cially ionizing radiation, in comparison to other cytosine in single-stranded DNA, and new re- As the genes from the human genome se-
microorganisms. Some of the currently unchar- pair strategies may be needed to remove such quence continue to be cataloged, studying the
acterized genes in D. radiodurans are expected lesions. activity of the protein products will become
to contribute to DNA repair, and it remains to be DNA is assembled into several levels of increasingly important. More effective methods
seen if there will be homologs of such functions ordered chromatin structure, and so DNA for rapid expression of active proteins will be
in the human genome. metabolic processes need a close connection required to test for possible functions. An alter-
The sequence database also makes it in- with proteins that allow chromatin remodel- native approach is to selectively inactivate in-
creasingly straightforward to use mass spec- ing or disassembly. Several human chromatin dividual proteins in vivo. An efficient method
trometry fingerprinting to identify new sub- remodeling complexes are known, for in- for selective proteolytic destruction has been
units of repair protein complexes (10). In this stance, that allow and control access to DNA successful in budding yeast (15) and should be
sensitive technique, isolated proteins are di- during gene transcription (11). The great ma- extendable to mammalian cells. Alternatively,
gested with an enzyme such as trypsin, and jority of enzymological DNA repair studies systematic interference with gene expression
the exact molecular masses of the resulting to date have worked with naked DNA, but with the use of inhibitory RNA molecules, as
fragments are measured. Comparison of these chromatin presents a substantial barrier to employed successfully in C. elegans (16), is
fragments with a computer-simulated tryptic recognition of DNA damage. It is expected proving to be a powerful way to dissect gene
digest of each human gene product can un- that human protein complexes will be found functions.
ambiguously identify the protein. that are dedicated to DNA repair and recom- Intense activity is being devoted to under-
In addition, new genes will be found as bination, facilitating access of DNA repair standing how DNA damage transmits signals
novel biochemical assays are developed for var- enzymes to the genome. to the cell-cycle checkpoint machinery and to
ious aspects of repair. For example, human cells The three-dimensional structures of DNA the monitoring systems that control cellular
can repair cross-links between the two DNA repair proteins are being determined at an apoptosis. There is recent progress on this
strands. Interstrand cross-links are generated by ever-increasing pace (12). Structural biolo- complex extended network, which involves
natural psoralen compounds and their chemo- gists will soon turn their attention to open damage recognition factors, protein kinases,
therapeutic derivatives, by other drugs used for reading frames of unknown function, and and transcription factors such as p53 (17).
cancer treatment such as nitrogen mustards, and new repair genes will become apparent in the Attempts are already being made to obtain an
to some extent by ionizing and ultraviolet radi- process. As an example, the functionally re- integrated picture of DNA repair with regard
ation. Repair of such cross-links involves the lated SMUG1, TDG, and UNG enzymes to signaling (18). The subject is of great
NER genes and the XRCC2 and XRCC3 re- show little or no primary sequence homology interest as some inherited human syndromes
combination genes and is predicted to involve yet have common structural folds and belong associated with sensitivity to DNA-damaging
the DNA polymerase POLQ. In addition, the to a single protein superfamily (13). As the agents result from loss of functions such as
sensitivity of cells from individuals with Fan- structures of new protein folds are document- ATM, which is involved in damage sensing.
coni anemia (FA) points to a role for the FANC ed, more members of DNA repair enzyme New clinical applications relating to hu-
group of genes in cross-link repair. However, families are likely to be found with the aid of man DNA repair genes are certain to emerge.
the mechanism of interstrand DNA cross-link three-dimensional structure prediction mod- Tumor cells often acquire resistance to ther-
repair remains obscure, and further investigation els. In this way, the new field of structural apeutic drugs or radiation. Genomics ap-
may implicate even more gene products. genomics will help guide functional studies proaches such as array technology will be
Several other classes of DNA damage exist of presently uncharacterized open reading used to define any DNA repair genes that
for which repair has been relatively unexplored. frames in the human genome. may be overexpressed in this context. Fur-
New genes may be identified, for instance, For an impressive number of genes in- thermore, it will be important to find ways to