Professional Documents
Culture Documents
The central dogma of molecular biology is a theory stating that genetic information flows only in
one direction, from DNA, to RNA, to protein, or RNA directly to protein.
An RNA primer complementary to the parental strand is synthesized by RNA primase and is
elongated by DNA polymerase III through the addition of nucleotides to the 3′-OH ends. On the
leading strand, DNA is synthesized continuously, whereas on the lagging strand, DNA is
synthesized in short stretches called Okazaki fragments. RNA primers within the lagging strand
are removed by the exonuclease activity of DNA polymerase I, and the Okazaki fragments are
joined by DNA ligase.
The histone chaperones involved in these processes are associated with replisome proteins:
CAF-1/Rtt106 with PCNA and FACT/Asf1 with MCMs.
PIC contains, in addition to promoter DNA, the GTFs (TFIIA, B, D, E, F, and H),
and RNA Pol II. PIC assembly is thought to occur in a highly regulated, stepwise
fashion, as indicated. TFIID is among the first GTFs to bind the core promoter via
its TATA-box Binding Protein (TBP) subunit.
The rate of transcription elongation by E. coli RNAP is not uniform. RNA synthesis
is characterized by pauses, some of which may be brief and resolved spontaneously,
whereas others may lead to the transcription elongation complex
(TEC) backtracking.
Elongation rate and pausing are determined by template sequence and RNA
structure (e.g., stem-loops) and involve at least two components of the RNAP
catalytic center, the bridge helix (BH) and trigger loop (TL). Elongation is
proposed to occur in three steps (Fig. 10.14). First, the TL folds in response to NTP
binding. Mutational analyses indicate that this conformational change in the TL can
be rate-limiting, and reflects the ability of the incoming NTP to bind to TEC. The
second step is the incorporation of the NTP and the release of pyrophosphate. The
third step involves the translocation of the RNAP down the DNA Template such
that the next RNA nucleotide can be added to the nascent transcript.
Figure 10.14 A model for Transcriptional Elongation. The trigger loop hinges,
bridge helix hinges and bridge helix bending models are based on all atom molecular
dynamics simulations. At the top of the figure, diagrams of the closed TEC, the
closed product TEC (after chemistry) and the translocating TEC are shown. DNA is
grey; RNA is red; the NTP substrate (or incorporated NMP and pyrophosphate) is
blue; the trigger loop (TL) is purple; the bridge helix (BH) is yellow. Interpretations
of simulations are shown schematically below. Simulations indicate trigger loop
hinges H1 and H2, bridge helix hinges H3 and H4 and bridge helix bend modes B1
(straighter) and B2 (more sharply bent).
Backtracking of TEC may take place after a brief pause in transcription, caused by
the thermodynamic properties of nucleic acids sequences surrounding the elongation
complex. In addition, misincorporation events render elongation complexes prone
to backtracking by at least one bp. In this case, the rescue from backtracking through
the cleavage of the 3′ end of the erroneous transcript also may be seen as a
proofreading reaction. Any backtracking event causes a pause or arrest of
transcription elongation, which may limit its overall rate (the average speed of
RNAP along the template) or processivity (the fraction of RNAP molecules reaching
the end of the gene).
While the general structure of the elongation complex (the transcription bubble, the
RNA-DNA hybrid) remains unchanged during backtracking, extension of RNA
becomes impossible in this conformation. However, such complexes can be resolved
by the hydrolytic activity of RNAP, which cleaves the phosphodiester bond in the
active center of the backtracked complex, producing a new RNA 3′ end in the active
center. For single base backups, the hydrolytic reaction is catalyzed by a flexible
domain of RNAP located in the secondary channel called the Trigger Loop (TL;
Figure 10.15B) and the two metal ions of the active center.
Longer sequences of backtracked TEC can restart when acted upon by GreA/B
factors, which restore the 3′-end of the nascent transcript to the active center. GreA
and GreB are transcript cleavage factors that act on backtracked elongation
complexes. When Gre factors are bound in the secondary channel, Gre factors
displace the TL from the active center (Figure 10.15). The displacement switches
off the relatively slow TL-dependent intrinsic transcript hydrolysis, and imposes the
highly efficient Gre-assisted hydrolysis. This efficiency is thought to be due to
stabilization of the second catalytic Mg2+ ion and an attacking water molecule by
the Gre factors.
Figure 10.15 The Role of Gre Factors in Relieving Transcription Elongation
Complex Backtracking. (A) Ribbon diagram of the GreA and GreB proteins. (B)
The mode of functioning of Gre factors. The Gre factor is bound to the active
elongation complex but does not impose hydrolytic activity on it. Upon backtracking
or misincorporation, the Gre factor protrudes its coiled-coil domain through the
secondary channel of RNAP (shown in the lefthand diagram), where it substitutes
for the catalytic domain Trigger Loop (TL). This substitution switches off the slow
TL-dependent phosphodiester bond hydrolysis and, and instead, facilitates highly
efficient Gre-dependent hydrolysis. After resolution of the backtracked complex
through RNA cleavage, the elongation complex returns to the active conformation
and the Gre factor gives way to the TL, which can now continue catalysis of RNA
synthesis (shown in the righthand diagram). The controlled switching between Gre
and the TL eliminates possible interference of Gre with the RNA synthesis.
The bridge α-helix in the β’ subunit borders the active site and may have roles in
both catalysis and translocation. Mutations in the YFI motif (β’ 772-YFI-774) affect
intrinsic termination as well as pausing, fidelity and translocation of RNAP. One
mutation, F773V, abolishes the activity of the λ tR2 intrinsic terminator, although
neighboring mutations have little affect on termination. Modeling suggests that this
unique phenotype reflects the ability of F773 to interact with the fork domain in the
β subunit.
Figure 10.16 Model of Intrinsic Termination. (A) Shows the open conformation
of the RNAP during transcriptional elongation. RNAP is shown in yellow, DNA
template in blue, and nascent RNA in red. Key elements of the RNAP RNA exit
channel are shown in grey and labeled as indicated. (B) Shows the extension of the
nascent RNA through the RNAP exit channel and the potential for forming the RNA
hairpin structure when enough length has been achieved. (C) Shows the clamp
opening and disintegration of the the TEC when the RNA hairpin structure is
encountered at the transcriptional bubble.
Transcriptional termination can also be dependent upon accessory factors, such as
the Rho protein. Transcription termination factor Rho is an essential protein in E.
coli first identified for its role in transcription termination at Rho-dependent
terminators, and is estimated to terminate ~20% of E. coli transcripts. The rho gene
is highly conserved and nearly ubiquitous in bacteria. Rho is an RNA-dependent
ATPase with RNA:DNA helicase activity, and consists of a hexamer of six identical
monomers arranged in an open circle (Figure 10.17A).
Rho binds to single stranded RNA in a complex multi-step pathway that involves
two distinct sites on the hexamer. The primary binding site (PBS), distributed on the
N-terminal domains around the hexamer (Figure10.17B, cyan), ensures initial
anchoring of Rho to the transcript at a Rut (Rho utilization) site, a∼70 nucleotides
(nt) long, cytidine-rich and poorly-structured RNA sequence. Each Rho monomer
contains a subsite capable of binding specifically the base residues of a 5′-YC dimer
(Y being a pyrimidine). Biochemical and structural data suggest that Rho initially
binds to RNA in an open, ‘lock-washer’ conformation that closes into a planar ring
as RNA transfers to the central cavity. There, the ssRNA contacts an asymmetric
secondary binding site (SBS) (Fig. 10.17B, green), and this step, which presumably
is rate-limiting for the overall reaction, leads to motor activation. Upon hydrolysis
of ATP, the ssRNA is pulled upon conformational changes of the conserved Q and
R loops of the SBS, leading to Rho translocation, and ultimately promoting RNA
polymerase (RNAP) dissociation. The molecular mechanism of Rho translocation
based on single-molecule fluorescence methods appears to be tethered tracking. The
tethered tracking model postulates that Rho maintains its contacts between the PBS
and the loading (Rut) site upon translocation (Figure 10.17B). This mechanism
would allow Rho to maintain its high affinity interaction with Rut, and implies the
growing of an RNA loop between the PBS and the SBS upon translocation.
Figure 10.17 Schematic of Rho factor structure and mechanisms. (A) Molecular
structure of the Rho protein (PDB 1pv4) (B) Rho assembles as a homo-hexameric
ring (red spheres or tetragons), with RNA (black/yellow curve) binding to the
primary binding sites (PBS, cyan) and the secondary binding sites inside the ring
(SBS, green), where ATP-coupled translocation takes place. The Rut specific
binding site is depicted in yellow. The tethered-tracking model proposed that Rho
translocates RNA while maintaining interactions between PBS and Rut. This model
requires the formation of a loop that would shorten the extension of RNA upon
translocation.
mRNA Splicing
Eukaryotic genes that encode polypeptides are composed of coding sequences
called exons (ex-on signifies that they are expressed) and intervening sequences
called introns (int-ron denotes their intervening role). Transcribed RNA sequences
corresponding to introns do not encode regions of the functional polypeptide and are
removed from the pre-mRNA during processing. It is essential that all of the intron-
encoded RNA sequences are completely and precisely removed from a pre-mRNA
before protein synthesis so that the exon-encoded RNA sequences are properly
joined together to code for a functional polypeptide. If the process errs by even a
single nucleotide, the sequences of the rejoined exons would be shifted, and the
resulting polypeptide would be nonfunctional. The process of removing intron-
encoded RNA sequences and reconnecting those encoded by exons is called RNA
splicing. Intron-encoded RNA sequences are removed from the pre-RNA while it is
still in the nucleus. Although they are not translated, introns appear to have various
functions, including gene regulation and mRNA transport. On completion of these
modifications, the mature transcript, the mRNA that encodes a polypeptide, is
transported out of the nucleus, destined for the cytoplasm for translation. Introns can
be spliced out differently, resulting in various exons being included or excluded
from the final mRNA product. This process is known as alternative splicing. The
advantage of alternative splicing is that different types of mRNA transcripts can be
generated, all derived from the same DNA sequence. In recent years, it has been
shown that some archaea also have the ability to splice their pre-mRNA.
The splicing reaction is catalyzed by the spliceosome, a macromolecular complex
formed by five small nuclear ribonucleoproteins (snRNPs), termed U1, U2, U4, U5,
and U6, and approximately 200 proteins (Fig. 10.23). The assembly of the
spliceosome on pre-mRNA includes the binding of U1 snRNP, U2 snRNP, the pre-
formed U4/U6-U5 triple snRNP, and the Prp19 complex. This assembly occurs
through the recognition of several sequence elements on the pre-mRNA that define
the exon/intron boundaries, which include the 5′ and 3′ splice sites (SS), the
associated 3′ sequences for intron excision, the polypyrimidine (Py) tract, and the
branch point sequence (BPS). The assembly of the spliceosome during the process
is depicted in Figure 10.23.
In mammals, the first catalytic step of the splicing reaction begins when the U1
snRNP binds the 5′ SS of the intron (defined by the consensus sequence
AGGURAGU), and the splicing factors SF1 and U2AF cooperatively recognize the
BPS, Py, and 3′ SS to assembled complex E or the commitment complex (Figure
10.23). Subsequently, U2 snRNP and additional proteins are recruited to the pre-
mRNA BPS to form the pre-spliceosome or complex A. The binding of the U4/U6-
U5 tri-snRNP forms the pre-catalytic spliceosome or complex B. After RNA-RNA
and RNA-protein rearrangements at the heart of the spliceosome, U1 and U4 are
released to form the activated complex B or complex B* This complex is responsible
for executing the first catalytic step, through which the phosphodiester bond at the
5′ SS of the intron is modified by the 2′-hydroxyl of an adenosine of the BPS to form
a free 5′ exon and a branched intron (Fig. 10.24). The reaction of the 2′-hydroxyl
from the branchpoint adenosine nucleotide is known as a transesterification
reaction. During this process, additional rearrangements occur to generate the
catalytic spliceosome or complex C (Fig. 10.23), which is responsible for catalyzing
the second transesterification reaction leading to intron excision and exon–exon
ligation (Fig. 10.24). The resulting intron structure is referred to as a lariat structure.
After the second catalytic step, the U2, U5, and U6 snRNPs are released from the
post-spliceosomal complex and recycled for additional rounds of splicing.
Figure 10.24 Transesterification Reactions Involved in mRNA Splicing. (A)
Schematic diagram of the pre-mRNA with exons and introns indicated. Key
sequences are required for splicing at the 5′ and 3′ intron locations, and for the
recognition and positioning of the branchpoint Adenosine residue for the first
transesterification reaction. (B) Schematic of the two transesterification reactions
required for intron removal. The branchpoint 2′-OH residue mediates attack on the
5′-phosphate of the intron guanosine residue located at the 5′-splice site. This
releases the 3′ hydroxyl of Exon 1 which subsequently mediates attack of the 5′
phosphate of the first guanosine residue in Exon 2. The 3′ hydroxyl of the intron
guanine residue is released forming the Lariat structure and Exon 1 is ligated to Exon
2.