Professional Documents
Culture Documents
Seminario 1
Seminario 1
DOI 10.1007/s00239-015-9722-8
ORIGINAL ARTICLE
Received: 6 July 2015 / Accepted: 11 November 2015 / Published online: 5 January 2016
The Author(s) 2016. This article is published with open access at Springerlink.com
Introduction
The emergence of life on Earth and the subsequent evolution
of the last universal comment ancestor (LUCA) together
represent a long process that passed through several distinct
stages. Life emerged from an unknown geochemical context
in which large proto-biomolecules were likely generated
from smaller precursors. This prebiotic scenario must have
been capable of producing precursor biopolymers that gave
rise to the earliest genetic systems. The majority of evidence
suggests that an RNA world scenario followed, in which a
simple genetic system consisted of RNA genes that encoded
a ribozyme-based metabolism (Gilbert 1986). While this
hypothesis describes a predecessor to the current genetic
system based on RNA or a similar biomolecule, it is possible, and some argue more likely, that many important
metabolic reactions were catalyzed by amino acids, peptides, ions, and geochemical catalysts (Fig. 1b).
Additional evidence suggests that the system of biological protein synthesis by translation emerged from or
co-evolved with this RNA-based genetic system (Freeland
et al. 1999; Cech 2000) and that the establishment of the
DNA genome followed (Forterre 2002; Goldman and
123
18
123
19
through common mechanisms of protein structural innovation, such as recombination or duplication. Previous
work on simulated evolution of RNA sequences inferred
that nucleotide sequence becomes less random over time
through the formation of functional modules (Ancel and
Fontana 2000). This trend was probably the case for early
protein evolution as well, given the modular domain substructure of modern proteins (Chothia et al. 2003). At the
level of secondary structure, however, the repetitive nature
of the TIM barrel structure suggests that it could have
easily been generated from smaller (b/a) proteins such as
quarter and half barrels through some of the same processes responsible for exon shuffling and gene duplication.
It is, therefore, not difficult to imagine that TIM barrel
superfamilies could have easily, and perhaps repeatedly,
arisen from smaller protein-coding genes during the early
evolution of the protein repertoire.
Taxonomic Breadth and Functional Range of TIM
Barrel Superfamilies
As defined by the SCOP database (Andreeva et al. 2008),
members of the same TIM barrel superfamily are related by
a common origin. We evaluated the taxonomic distribution
of TIM barrel superfamilies to approximate which superfamilies were likely present by the time of LUCA and
123
20
123
21
123
22
b Fig. 3 The taxonomic breadth and functional diversity of TIM barrel
We also sought to better understand the metabolic distribution of functions performed by TIM barrel proteins generally and within individual superfamilies with a large
functional range. The patchwork model of metabolic evolution predicts that metabolic pathways composed of multiple enzymes with specific functions evolved from a smaller
number of functionally general enzymes that performed
multiple catalytic functions (Yamada and Bork 2009).
Recent evidence shows that TIM barrel proteins exhibit a
pattern of distribution in modern metabolic pathways consistent of the patchwork model versus other models of
metabolic pathway evolution (Caetano-Anolles et al. 2009).
To further test this hypothesis, we mapped the complete
set of TIM barrel protein functions onto the KEGG global
123
23
Table 1 Correlations between cofactor usage and enzymatic breadth for TIM barrel superfamilies and superfamilies in general
Variables
Correlation
r = 0.56, p = 0.00078
r = 0.62, p = 0.00012
r = 0.16, p = 0.044
r = 0.0083, p = 0.92
Enzyme categories are defined as the first term of Enzyme Commission codes
123
24
Conclusions
The modern form of protein-mediated metabolism emerged
from a less capable system likely composed of ribozymes,
peptide catalysts, and inorganic catalysts (Szathmary and
Maynard Smith 1994; Lazcano and Miller 1999; CaetanoAnolles et al. 2007; Bowman et al. 2015). Previous studies
have proposed that the inorganic cofactors, nucleotide-derived
cofactors, and amino acid- or peptide-derived cofactors that
facilitate modern protein catalysis may have ancient roots in
this transition (Szathmary and Maynard Smith 1994; Wachtershauser 1990; Yarus 1993; Mulkidjanian and Galperin
2009; White 1976). We have conducted a set of analyses
focused on understanding the evolution of TIM barrel proteins in the context of early proteome evolution.
We showed that many TIM barrel superfamilies probably originated by the time of LUCA and that these superfamilies, themselves, demonstrate a high degree of
catalytic diversity. Supporting our hypothesis, we find that
inorganic cofactors and organic cofactors derived from
nucleotides, amino acids, and peptides underlie this functional range in modern TIM barrel proteins and that this
breadth of cofactor usage contributes to TIM barrel functional diversity in a manner unlike proteins, generally. It is
not difficult to imagine that a significant functional
expansion could have taken place prior to LUCA if TIM
barrels had originated even earlier. Thus, the TIM barrel
structure, with its evolutionarily malleable active site
pocket, represents an ideal scaffold to facilitate the transition from ribozyme, peptide, and abiotic catalysts to
modern protein-mediated metabolism.
Methods
Secondary Structure Complexity
Structural complexity was defined as the Shannon entropy
of secondary structure elements along the protein chain
from N-terminus to C-terminus. Real secondary structure
data were acquired from the DSSP database (version 2.0:
July, 2011) (Kabsch and Sander 1983). Domains corresponding to SCOP assignments were extracted from the
PDB-formatted DSSP files and converted into strings of
secondary structure assignments per residue. Secondary
structure assignments of individual residues were then
smoothed and converted to strings of alpha and beta elements. Shannon entropy was calculated on these strings, as
well as the canonical TIM barrel string and all other possible mixed a/b strings of the same size (79 alpha helices
and beta strands, respectively). Shannon entropy was calculated in a nonmetric form (that is, not normalized for
length) using a substring length (k) of 5.
123
References
Anantharaman V, Aravind L, Koonin EV (2003) Emergence of
diverse biochemical activities in evolutionarily conserved structural scaffolds of proteins. Curr Opin Chem Biol 7:1220
25
Ancel LW, Fontana W (2000) Plasticity, evolvability, and modularity
in RNA. J Exp Zool 288:242283
Andreeva A, Howorth D, Chandonia JM, Brenner SE, Hubbard TJ,
Chothia C, Murzin AG (2008) Data growth and its impact on the
SCOP database: new developments. Nucleic Acids Res
36:D419D425
Becerra A, Delaye L, Islas S, Lazcano A (2007) The very early stages
of biological evolution and the nature of the last common
ancestor of the three major cell domains. Annu Rev Ecol Evol
Syst 38:361379
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H,
Shindyalov IN, Bourne PE (2000) The Protein Data Bank.
Nucleic Acids Res 28:235242
Bowman JC, Hud NV, Williams LD (2015) The ribosome challenge
to the RNA world. J Mol Evol 80:143161
Caetano-Anolles G, Kim HS, Mittenthal JE (2007) The origin of
modern metabolic networks inferred from phylogenomic analysis
of protein architecture. Proc Natl Acad Sci USA 104:93589363
Caetano-Anolles G, Yafremava LS, Gee H, Caetano-Anolles D, Kim
HS, Mittenthal JE (2009) The origin and evolution of modern
metabolism. Int J Biochem Cell Biol 41:285297
Caetano-Anolles G, Kim KM, Caetano-Anolles D (2012) The
phylogenomic roots of modern biochemistry: origins of proteins,
cofactors and protein biosynthesis. J Mol Evol 74:134
Catling DC, Claire M (2005) How Earths atmosphere evolved to an
oxic state: a status report. Earth Planet Sci Lett 237:120
Cech TR (2000) The ribosome is a ribozyme. Science 289:878879
Chothia C, Gough J, Vogel C, Teichmann SA (2003) Evolution of the
protein repertoire. Science 300:17011703
Copley RR, Bork P (2000) Homology among (beta, alpha)8 barrels:
Implications for the evolution of metabolic pathways. J Mol Biol
303:627640
DeLano W (2008) The PyMOL molecular graphics system. DeLano
Scientific LLC, Palo Alto
Ertem G, Ferris JP (1998) Formation of RNA oligomers on montmorillonite: site of catalysis. Orig Life Evol Biosph 28:485499
Forterre P (2002) The origin of DNA genomes and DNA replication
proteins. Curr Opin Microbiol 5:525532
Freeland SJ, Knight RD, Landweber LF (1999) Do proteins predate
DNA? Science 286:690692
Frenkel ZM, Trifonov EN (2005) Closed loops of TIM barrel protein
fold. J Biomol Struct Dyn 22:643656
Gilbert W (1986) The RNA world. Nature 319:618
Gilbert W, Glynias M (1993) On the ancient nature of introns. Gene
135:137144
Goldman AD, Landweber LF (2012) Oxytricha as a modern analog of
ancient genome evolution. Trends Genet 28:382388
Goldman AD, Samudrala R, Baross JA (2010) The evolution and
functional repertoire of translation proteins following the origin
of life. Biol Direct 5:15
Goldman AD, Baross JA, Samudrala R (2012) The enzymatic and
metabolic capabilities of life forms during the origin of life.
PLoS One 7:e39912
Goldman AD, Bernhard TM, Dolzhenko E, Landweber LF (2013)
LUCApedia: a database for the study of ancient life. Nucleic
Acids Res 41:D1079D1082
Gough J, Karplus K, Hughey R, Chothia C (2001) Assignment of
homology to genome sequences using a library of hidden
Markov models that represent all proteins of known proteins.
J Mol Biol 313:903919
Henn-Sax M, Hocker B, Wilmanns M, Sterner R (2001) Divergent
evolution of (beta, alpha)8-barrel enzymes. Biol Chem.
382:13151320
Hocker B, Claren J, Sterner R (2004) Mimicking enzyme evolution
by generating new (beta, alpha)8-barrels from (beta, alpha)4half-barrels. Proc Natl Acad Sci USA 101:1644816453
123
26
Huang W, Ferris JP (2006) One-step, regioselective synthesis of up to
50-mers of RNA oligomers by montmorillonite catalysis. J Am
Chem Soc 128:89148919
Huang PS, Feldmeier K, Parmeggiani F, Fernandez Velasco DA,
Hocker B, Baker D (2016) De novo design of a four-fold
symmetric TIM-barrel protein with atomic-level accuracy. Nat
Chem Biol 12:2934
Huber C, Eisenreich W, Hecht S (2003) A possible primordial peptide
cycle. Science. 301:938940
Kabsch W, Sander C (1983) Dictionary of protein secondary
structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:25772637
Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M (2012) KEGG
for integration and interpretation of large-scale molecular
datasets. Nucleic Acids Res 40:D109D114
Kim HS, Mittenthal JE, Caetano-Anolles G (2006) MANet: tracing
evolution of protein architecture in metabolic networks. BMC
Bioinform 7:351
Kyrpides NC, Ouzounis CA (1995) Nucleic acid-binding metabolic
enzymes: living fossils of stereochemical interactions? J Mol
Evol 40:564569
Kyrpides N, Overbeek R, Ouzounis C (1999) Universal protein
families and the functional content of the last universal common
ancestor. J Mol Evol 49:413423
Lane N, Allen JF, Martin W (2010) How did LUCA make a living?
Chemiosmosis in the origin of life. BioEssays 32:271280
Lang D, Thoma R, Henn-Sax M, Sterner R, Wilmanns M (2000)
Structural evidence for evolution of the beta/alpha barrel
scaffold by gene duplication and fusion. Science 289:15461550
Lazcano A, Miller SL (1999) On the origin of metabolic pathways.
J Mol Evol 49:424431
Lundin D, Berggren G, Logan DT, Sjoberg BM (2015) The origin and
evolution of ribonucleotide reduction. Life 5:604636
Mulkidjanian AY, Galperin MY (2009) On the origin of life in the Zinc
world. 2. Validation of the hypothesis on the photosynthesizing
zinc sulfide edifices as cradles of life on Earth. Biol Direct 4:27
Nagano N, Orengo CA, Thornton JM (2002) One fold with many
functions: The evolutionary relationships between TIM barrel
families based on their sequences, structures and functions.
J Mol Biol 321:741765
Nasvall J, Sun L, Roth JR, Andersson DI (2012) Real-time evolution
of new genes by innovation, amplification, and divergence.
Science 338:384387
Nicolet Y, Drennan CL (2004) AdoMet radical proteinsfrom
structure to evolutionalignment of divergent protein sequences
reveals strong secondary structure element conservation. Nucleic
Acids Res 32:40154025
Nordlund P, Reichard P (2006) Ribonucleotide reductases. Annu Rev
Biochem 75:681706
123