Professional Documents
Culture Documents
Genome
• Prokaryotic genomes
• Eukaryotic genomes
– Nuclear genomes
– Mitochondrial genomes
– Chloroplast genomes (Plastome)
Eukaryotes:
•Larger genome size
•Have the feature of exon-intron organization of protein
coding genes
•The variation of repetitive DNA content in eukaryotes is
also extremely high
•In mammals and plants, the major part of genome is
composed by repetitive DNA
Genome size and complexity
• Pseudogenes
– Inactive copies of protein-coding genes
– Often generated by gene duplication, that have become
nonfunctional through the accumulation of inactivating
mutations
– Example, the olfactory receptor gene family is one of the
best-documented examples of pseudogenes in the human
genome; More than 60% of the genes in this family are
non-functional pseudogenes in humans
C-Value
Prediction true:
Prokaryote → Single celled → Invertebrate → Vertebrate
eukaryote
E. coli Yeast Drosophila Human
Prediction incorrect:
Homo sapiens (~3 x 109 bp) and Xenopus laevis (3.1 x 109 bp)
Flowering plants and insects have same amount of DNA
2. C-value of a family should be same as the members
perform same type of functions
Prediction true:
Fungi, algae, worms, fish, reptiles, birds, mammals have
same amount of DNA within the family/phyla
Prediction incorrect:
Insects, amphibians, flowering plants have wide ranges of
C-value
3. As C-value increases, number of genes should increase
Prediction true:
E. coli → Yeast → Drosophila → Human
Prediction incorrect:
Monocots (e.g. Oryza) have more number of genes than
dicots (e.g. Arabidopsis)
Variation in genome size between E. coli and human is
~700X, but the variation in number of genes is only ~7X
C-Value Paradox
• One surprising outcome of analyzing the C-value from
different organisms is the so-called ‘C-value paradox’
• Describes the lack of relationship between the DNA
content (C-value) of an organism and its coding potential
• Refers to the fact that genome sizes (and hence the C-
values) do not always correlate with genetic and/or
morphological complexity
• Means that organisms with similar complexity may have
very different genome sizes and conversely organisms
with similar C-values may not be equally complex
• Means that the larger genome size does not necessarily
contain more number of genes
• Examples of C-value paradox (facts):
– Toad xenopus and man have genomes of essentially the same
size. But we assume that man is more complex in terms of
genetic development
– In some phyla there are extremely large variations in DNA
content between organisms that do not vary much in complexity.
This is especially marked in insects, amphibians and plants
• A cricket has a genome 11X the size of a fruitfly
• In amphibians, the smallest genomes are <109 bp, while the
largest are >1011 bp
• There are unlikely to be a large difference in the number of
genes needed to specify these amphibians
– This variation among phyla does not occur in birds, reptiles and
mammals, which all show little variation within the group, with an
~2X range of genome sizes
• Genome size cannot be used as a predictor of genetic or
morphological complexity (there is no correlation between
genome size and genetic complexity)
– The organism with the largest genome is not necessarily the most
complex
– Genome size is positively correlated with the morphological
complexity among prokaryotes and lower eukaryotes (exception to
C-value paradox)
– However, after mollusks and all the other higher eukaryotes above,
this correlation is no longer effective
• Bigger genome does not mean more genes
• The proportion of non-repetitive DNA may show a decrease
along with the increase of genome size in higher eukaryotes
• Genome sizes of organisms may vary within many phyla