Professional Documents
Culture Documents
DNA methylation
DNA methylation is a biological process by which methyl groups are added to
the DNA molecule. Methylation can change the activity of a DNA segment
without changing the sequence. When located in a gene promoter, DNA
methylation typically acts to repress gene transcription. In mammals, DNA
methylation is essential for normal development and is associated with a
number of key processes including genomic imprinting, X-chromosome
inactivation, repression of transposable elements, aging, and carcinogenesis.
So far (2016) two nucleobases have been found on which natural, enzymatic
DNA methylation takes place: adenine and cytosine. The modified bases are
N6-methyladenine[, 5-methylcytosine and N4-methylcytosine.
Two of DNA's four bases, cytosine and adenine, can be methylated. Cytosine
methylation is widespread in both eukaryotes and prokaryotes, even though
the rate of cytosine DNA methylation can differ greatly between species: 14%
of cytosines are methylated in Arabidopsis thaliana, 4% to 8% in Physarum,[4]
7.6% in Mus musculus, 2.3% in Escherichia coli, 0.03% in Drosophila, 0.006%
in Dictyostelium[5] and virtually none (0.0002 to 0.0003%) in
Caenorhabditis[6] or fungi such as Saccharomyces cerevisiae and S. pombe
(but not N. crassa).[7][8]: 3699 Adenine methylation has been observed in
bacterial, plant, and recently in mammalian DNA,[9][10] but has received
considerably less attention.
CpG islands
In mammals, the only exception for this global CpG depletion resides in a
specific category of GC- and CpG-rich sequences termed CpG islands that
are generally unmethylated and therefore retained the expected CpG content.
[23] CpG islands are usually defined as regions with: 1) a length greater than
200bp, 2) a G+C content greater than 50%, 3) a ratio of observed to expected
CpG greater than 0.6, although other definitions are sometimes used.[24]
Excluding repeated sequences, there are around 25,000 CpG islands in the
human genome, 75% of which being less than 850bp long.[22] They are
major regulatory units and around 50% of CpG islands are located in gene
promoter regions, while another 25% lie in gene bodies, often serving as
alternative promoters. Reciprocally, around 60-70% of human genes have a
CpG island in their promoter region.[25][26] The majority of CpG islands are
constitutively unmethylated and enriched for permissive chromatin
modification such as H3K4 methylation. In somatic tissues, only 10% of CpG
islands are methylated, the majority of them being located in intergenic and
intragenic regions.
Repression of CpG-dense promoters
DNA methylation was probably present at some extent in very early eukaryote
ancestors. In virtually every organism analyzed, methylation in promoter
regions correlates negatively with gene expression.[20][27] CpG-dense
promoters of actively transcribed genes are never methylated, but,
reciprocally, transcriptionally silent genes do not necessarily carry a
methylated promoter. In mouse and human, around 60–70% of genes have a
CpG island in their promoter region and most of these CpG islands remain
unmethylated independently of the transcriptional activity of the gene, in both
differentiated and undifferentiated cell types.[28][29] Of note, whereas DNA
methylation of CpG islands is unambiguously linked with transcriptional
repression, the function of DNA methylation in CG-poor promoters remains
unclear; albeit there is little evidence that it could be functionally relevant.[30]
DNA methylation may affect the transcription of genes in two ways. First, the
methylation of DNA itself may physically impede the binding of transcriptional
proteins to the gene,[31] and second, and likely more important, methylated
DNA may be bound by proteins known as methyl-CpG-binding domain
proteins (MBDs). MBD proteins then recruit additional proteins to the locus,
such as histone deacetylases and other chromatin remodeling proteins that
can modify histones, thereby forming compact, inactive chromatin, termed
heterochromatin. This link between DNA methylation and chromatin structure
is very important. In particular, loss of methyl-CpG-binding protein 2 (MeCP2)
has been implicated in Rett syndrome; and methyl-CpG-binding domain
protein 2 (MBD2) mediates the transcriptional silencing of hypermethylated
genes in "cancer".
Genome expansion
There are many ways that gene expression is controlled in eukaryotes, but
methylation of DNA (not to be confused with histone methylation) is a
common epigenetic signaling tool that cells use to lock genes in the "off"
position. In recent decades, researchers have learned a great deal about DNA
methylation, including how it occurs and where it occurs, and they have also
discovered that methylation is an important component in numerous cellular
processes, including embryonic development, genomic imprinting, X-
chromosome inactivation, and preservation of chromosome stability. Given
the many processes in which methylation plays a part, it is perhaps not
surprising that researchers have also linked errors in methylation to a variety
of devastating consequences, including several human diseases.
Prior to 1980, there were a number of clues that suggested that methylation
might play a role in the regulation of gene expression. For example, J. D.
McGhee and G. D. Ginder compared the methylation status of the beta-globin
locus in cells that did and did not express this gene. Using restriction enzymes
that distinguished between methylated and unmethylated DNA, the duo
showed that the beta-globin locus was essentially unmethylated in cells that
expressed beta-globin but methylated in other cell types (McGhee & Ginder,
1979). This and other evidence of the time were indirect suggestions that
methylation was somehow involved in gene expression.
Shortly after McGhee and Ginder published their results, a more direct
experiment that examined the effects of inhibiting methylation on gene
expression was performed using 5-azacytidine in mouse cells. 5-azacytidine
is one of many chemical analogs for the nucleoside cytidine. When these
analogs are integrated into growing DNA strands, some, including 5-
azacytidine, severely inhibit the action of the DNA methyltransferase enzymes
that normally methylate DNA. (Interestingly, other analogs, like Ara-C, do not
negatively impact methylation.) Because most DNA methylation was known to
occur on cytosine residues, scientists hypothesized that if they inhibited
methylation by flooding cellular DNA with 5-azacytidine, then they could
compare cells before and after treatment to see what impact the loss of
methylation had on gene expression. Knowing that gene expression changes
are responsible for cellular differentiation, these researchers used changes in
cellular phenotypes as a proxy for gene expression changes (Table 1; Jones
& Taylor, 1980).
Today, researchers know that DNA methylation occurs at the cytosine bases
of eukaryotic DNA, which are converted to 5-methylcytosine by DNA
methyltransferase (DNMT) enzymes. The altered cytosine residues are
usually immediately adjacent to a guanine nucleotide, resulting in two
methylated cytosine residues sitting diagonally to each other on opposing
DNA strands. Different members of the DNMT family of enzymes act either as
de novo DNMTs, putting the initial pattern of methyl groups in place on a DNA
sequence, or as maintenance DNMTs, copying the methylation from an
existing DNA strand to its new partner after replication. Methylation can be
observed by staining cells with an immunofluorescently labeled antibody for 5-
methylcytosine. In mammals, methylation is found sparsely but globally,
distributed in definite CpG sequences throughout the entire genome, with the
exception of CpG islands, or certain stretches (approximately 1 kilobase in
length) where high CpG contents are found. The methylation of these
sequences can lead to inappropriate gene silencing, such as the silencing of
tumor suppressor genes in cancer cells.
The roles and targets of DNA methylation vary among the kingdoms of
organisms. As previously noted, among Animalia, mammals tend to have
fairly globally distributed CpG methylation patterns. On the other hand,
invertebrate animals generally have a "mosaic" pattern of methylation, where
regions of heavily methylated DNA are interspersed with nonmethylated
regions. The global pattern of methylation in mammals makes it difficult to
determine whether methylation is targeted to certain gene sequences or is a
default state, but the CpG islands tend to be near transcription start sites,
indicating that there is a recognition system in place.
Plantae are the most highly methylated eukaryotes, with up to 50% of their
cytosine residues exhibiting methylation. Interestingly, in Fungi, only repetitive
DNA sequences are methylated, and in some species, methylation is absent
altogether, or it occurs on the DNA of transposable elements in the genome.
The mechanism by which the transposons are recognized and methylated
appears to involve small interfering RNA (siRNA). The whole silencing
mechanism invoking DNMTs could be a way for these organisms to defend
themselves against viral infections, which could generate transposon-like
sequences. Such sequences can do less harm to the organism if they are
prevented from being expressed, although replicating them can still be a
burden (Suzuki & Bird, 2008). In other fungi, such as fission yeast, siRNA is
involved in gene silencing, but the targets include structural sequences of the
chromosomes, such as the centromeric DNA and the telomeric repeats at the
chromosome ends.
For many years, methylation was believed to play a crucial role in repressing
gene expression, perhaps by blocking the promoters at which activating
transcription factors should bind. Presently, the exact role of methylation in
gene expression is unknown, but it appears that proper DNA methylation is
essential for cell differentiation and embryonic development. Moreover, in
some cases, methylation has observed to play a role in mediating gene
expression. Evidence of this has been found in studies that show that
methylation near gene promoters varies considerably depending on cell type,
with more methylation of promoters correlating with low or no transcription
(Suzuki & Bird, 2008). Also, while overall methylation levels and
completeness of methylation of particular promoters are similar in individual
humans, there are significant differences in overall and specific methylation
levels between different tissue types and between normal cells and cancer
cells from the same tissue.
Researchers have also determined that mice that lack a particular DNMT
have reduced methylation levels and die early in development (Suzuki & Bird,
2008). This is not the case for all eukaryotes, however; some organisms, such
as the yeast Saccharomyces cerevisiae and the nematode worm
Caenorhabditis elegans, are thought to have no methylated DNA at all
(although, at least in yeast, there are sequences in their genomes that are
homologous to those that code for the DNMT enzymes).
Given the critical role of DNA methylation in gene expression and cell
differentiation, it seems obvious that errors in methylation could give rise to a
number of devastating consequences, including various diseases. Indeed,
medical scientists are currently studying the connections between methylation
abnormalities and diseases such as cancer, lupus, muscular dystrophy, and a
range of birth defects that appear to be caused by defective imprinting
mechanisms (Robertson, 2005). The results of these studies will be invaluable
for treating these disorders, as well as for understanding and preventing
complications that can arise during embryonic development due to
abnormalities in X-chromosome methylation and gene imprinting.
Summary
Within the past thirty years, researchers have discovered numerous details
about the process of DNA methylation. For instance, scientists now know that
methylation plays a critical role in the regulation of gene expression, and they
have also determined that this process tends to occur at certain locations
within the genomes of different species. Furthermore, DNA methylation has
been shown to play a vital role in numerous cellular processes, and abnormal
patterns of methylation have been liked to several human diseases.
Nonetheless, as with other topics in the field of epigenetics, gaps remain in
our knowledge of DNA methylation. As new laboratory techniques are
developed and additional genomes are mapped, scientists will no doubt
continue to uncover many of the unknowns of how, when, and where DNA is
methylated, and for what purposes.