Professional Documents
Culture Documents
Supersecondary structures
Supersecondary structures - some simple combinations of secondary structure elements have
been found to occur frequently in proteins
◊ Also known as motifs
can be associated with a specific function
◊ Example: helix-loop-helix is associated with DNA binding
can form part of a larger functional and/or structural assembly (domain)
usually not stable by themselves
Helix-loop-helix
- 2 types:
1. EF hand
Occurs in a number of calcium-binding proteins (ex//calmodulin)
Glu and Asp residues in the loop of these proteins form part of the calcium binding
site
2. Helix-turn helix
Occurs in certain DNA-binding proteins and bind to DNA
Play important role in gene silencing and gene activation
β-hairpin motif
Biochem Page 1
○ Estimated amount of naturally occurring folds is around 8000
○ 1200 was observed
○ However, about half of the proteins with known structure belong to only 20 fold groups
- proteins are more stable in water when their hydrophobic side chains are sequestered from water
- Assembled into the core of a protein
- Polar side chains remain in contact with water on the surface of the protein - they are solvated
- If polar side chains are forced into interior of the protein they neutralize their polarity by forming H-bonds and forming
secondary structure
- Salt bridges
Biochem Page 2
- Salt bridges
side chains with complementary charges form ionic bonds
salt bridges are buried deep within the hydrophobic interior of the protein where it can’t be
disrupted by solvent
Quaternary Structure
- folded proteins interact to form dimers, trimers, or higher order structures
- each subunit is a separate polypeptide chain
- And they always assume exactly the same arrangement in the same protein molecules
- Can be identical chains or can be different chains
- multi-subunit protein is referred to as an oligomer
◊ Example: bacteria
Bacteria has many different structures which help it to move
(fimbriae/flagellum/pilus)
These structures are made up of repeating subunits polymerized together to form
long structures
A multi-subunit protein
- may consist of identical or non-identical polypeptide chains
- The interaction regions superficially resemble the interiors of single subunit proteins:
- they contain closely packed nonpolar side chains, hydrogen bonds, and in some cases, inter-chain disulfide bonds.
- They differ in several ways:
1. Hydrophobicity is midway between those of protein core and surface. In particular, the interfaces of proteins that
dissociate in vivo are less hydrophobic than permanent interfaces.
2. An average of ~77% of inter-subunit H-bonds are between side chains. Average within subunit is ~32%.
3. About 56% of protein interfaces contain salt bridges
- Few ion pairs buried in protein core.
Biochem Page 3
Lecture 7 (Protein purification)
October 6, 2020 7:39 PM
Purification step 1
- The first step is cell lysis
- The gel forms a matrix with pores of certain molecular dimensions which can be specified according to the protein you are
monitoring
○ gel slows the movement of the larger molecules
○ Therefore: proteins can run through the pores as a function of their size
– The process also involves SDS (Sodium Dodecyl-Sulfate) - very strong protein which
denatures molecules
Binds tightly to proteins, unfolds them and causes them to assume a long rod shape
The binding ration is 1 SDS molecule per 2 amino acids
Highly negative charge masks intrinsic charge of the protein
Therefore: proteins which are treated with SDS have identical charge to mass ratio,
as well as, similar shape
SDS treatment also disrupts non-covalent interactions between multi polypeptide
subunits
• Therefore: separation of protein is based on gel filtration and electrophoretic property of the proteins
Biochem Page 4
○ Migrate according to their size only since their similar in all the other aspects, thank to SDS
Steps:
1. SDS-coated proteins move through the gel when an electric potential is applied
2. After electrophoresis the proteins are visualized by staining with Coomassie Blue or Silver
3. The migration distance is proportional to the log10 [Mr ].
The mass of an unknown protein can be determined by interpolation using the migration
distances of proteins of known molecular weight.
There is a logarithmic relationship between protein molecular mass and its relative electrophoretic mobility
◊ Relationship between MM and relative mobility for 37 proteins ranging in size from 11 to 70 kDa
Absorbance at 280nm
– Tryptophan and tyrosine absorb light in UV region
– Use beer's law to calculate concentration of protein in the sample
Assay activity - ex// nuclease activity
– Monitors protein activity
– The greater the activity - the higher proportion of proteins is present in the sample
Solubility
Biochem Page 5
Solubility
- A protein’s multiple acid-base groups make its solubility properties dependent on the concentrations of dissolved salts
- As a result, proteins greatly vary in their solubility in different conditions
○ Some precipitate in certain conditions and other don't which forms the bases of protein purification
- Solubility purification technique depends on salting in and salting out procedures:
1. Salting in:
◊ Most proteins have low solubility in the absence of salt
◊ At low concentrations, added salt usually increases the solubility of charged macromolecules because
counterions shield the multiple ionic charges of the protein
◊ Neutralizing charges on protein surface reduces ordered H2O around proteins, increasing entropy of
the system, and thus the solubility
◊ so low salt concentrations prevent aggregation of the proteins, which can lead to precipitation or
“crashing”
1. Salting out:
◊ Most proteins have low solubility in very high concentrations of salt
◊ at high concentrations, added salt lowers the solubility of macromolecules because it competes for the
water molecules needed to solvate them
◊ As a result of counterions competing for solvation of solvent, the solvation around proteins is not
enough to solubilize them
◊ so high salt concentration removes the solvation sphere from the protein molecules and they come out
of solution as precipitation
- Protein solubility at a given ionic strength varies with the type of ions in solution
○ Protein solubility is related to ion size and hydration
○ Different ions have different effects
◊ The graph: shows the solubility of carboxy-hemoglobin at its pI as a function of ionic strength
S is solubility in salt solution
S' is solubility in pure water
Ionic strength = salt concentration
2 trends:
– Larger ions have more drastic effects on protein solubility
– solubility of proteins decreases with increasing ionic strength
◊ Salting out was traditionally one of the most commonly used purification steps
remove unwanted proteins from a solution by salting out
after removing the precipitate by centrifugation, the desired protein can be precipitated by
altering the salt concentration to the level at which the desired protein becomes insoluble – “salt
cuts”
- Chromatography - the separation of a mixture by passing it through a medium in which the components move at different rates
- Ion exchange chromatography - a purification method aimed at separating proteins based on charge
◊ Schematics:
The procedure requires a column filled with solid matrix
The matric contains pores of specific size
The sample is applied to the very top of the column - mixture of proteins and cell parts
Biochem Page 6
◊ More detailed mechanism:
We can use concentration gradient to titrate the protein of interest
– In this case we use just 2 step gradient
First apply low-salt solution buffer
– The first protein is eluted from the chromatography
Then apply high-salt solution
– Protein which was binding more tightly at low salt solution start to elute from the column
Then we are left with the protein of interest which is eluted from the column by adjusting the
salt concentration
To determine at which point the protein is being diluted use:
– protein can be detected by UV absorbance, SDS-PAGE, enzymatic activity
○ Therefore the proteins can bind to both anion and cation exchangers depending on their net charge
- This binding allows the separation of the proteins
- Different proteins can be bound at different points of chromatography by adjusting the pH of the solution
- Once bound, the protein is eluted by increasing the ion concentration
○ the affinity with which a particular protein binds to a given ion exchanger depends on the identities and concentrations of the
other ions in solution b/c of competition among these ions for the binding sites
Biochem Page 7
◊ molecules move from the more concentrated solution (inside) to the less concentrated solution (bulk
solvent)
◊ The procedure can be repeated multiple times, to ensure that most of the unwanted material left the
dialysis bag
This technique is limited by the assumption that known and unknown proteins have identical
shapes
– Globular proteins fall well into the line
– Elongated proteins not so much - therefore, this is a bad way to estimate their molecular
mass
◊ Before purification:
Cloning the strain with expression vector - plasmid from which we can express recombinant
protein of interest
– Contains protein sequence, linker sequence and affinity tag
Biochem Page 8
Review of protein purification
1. Thousands of different proteins in starting mixture (cell extract)
2. Exploit differences in:
- Solubilities
- charge, polarity
- Size
- binding specificities
3. Need the protein to retain its biological activity and high yield to analyze structure and function
Biochem Page 9
Lecture 8
October 13, 2020 2:30 PM
COLLAGEN
Collagen
□ most abundant protein in vertebrates
□ 30% of total protein in mammals
□ extracellular protein organized into insoluble fibers
□ found in connective tissues
bone, teeth, cartilage, tendon, fibrous matrices of blood vessels and skin
□ remarkably diverse forms and functions
in tendons it forms stiff, rope like fibers of tremendous tensile strength
in skin it forms loosely woven fibers that permit expansion in all directions
□ 28 different types which occur in different tissues
□ collagen consists of three left-handed helical chains coiled around each other to form a right-handed supercoil
□ each left-handed helix has 3 AA/turn
□ much more extended than α-helix, with a rise per residue of 2.9 Å (more elongated)
Biochem Page 10
Mutations in collagen
Mutations in collagen can cause disease
Example: Osteogenesis imperfecta (brittle bone disease)
◊ single point mutation
◊ central Gly changes to Ala and distorts the triple helix and fiber formation
◊ additional methyl groups cause the fibril melting temperature to drop from 63°C to 29°C
◊ Therefore: protein is not functional in human biological conditions
ELASTIN
Elastin
◊ highly elastic protein in connective tissue
◊ allows tissues in the body to stretch and “snap back” to their original shape
◊ primarily composed of small nonpolar amino acids (e.g. Gly, Ala, Val)
◊ also rich in Pro and Lys, but contain lidle hydroxyproline and hydroxylysine
◊ Usually around 800 amino acids long - single molecule or fiber?
◊ Molecules aggregate together and form tissue by becoming cross-linked
15-17 cross links per molecule
Wide spacing of cross-links allows extension of tissues and provides strength
□ the shape of globular proteins, with their indentations and interdomain interfaces allows them to function by binding
selectively and transiently to other molecules
□ best exemplified by interactions between enzymes and substrates
□ binding sites are typically positioned toward the interior of a protein, and are relatively free of water
□ when substrates bind, they fit so well that water molecules in the binding cleft are excluded
MYOGLOBIN
Myoglobin
2 parts of Myoglobin
- Myoglobin has two parts: the protein itself and the heme prosthetic group
◊ Protein itself:
myoglobin is composed of 153 amino acids
8 helices ranging in length from 7 to 26 residues (~75% helix)
– Named from A to H starting from N-terminus
heme is tightly wedged in a hydrophobic pocket formed mainly by helices E and F
the accessibility of the heme group to molecular oxygen depends on slight movement of nearby amino
acids
Myoglobin function:
- Myoglobin was first thought to carrying O
○ However, this feature is more important for aquatic animals rather than humans
○ Aquatic animals have 10 to 30 times higher myoglobin concentration in muscles
- Likely major function is to facilitate oxygen transport in rapidly respiring muscles
○ Respiration is limited by low solubility of oxygen in aqueous solution
○ Myoglobin increases solubility of oxygen in muscles as they are the most rapidly respiring tissue under conditions of high exertion
Myoglobin summary
- despite being one of the best studied proteins in biology, its physiological function is not yet conclusively established
- myoglobin increases the solubility of oxygen and facilitates its diffusion
- oxygen storage is also a function – concentrations are 10- fold higher in whales and seals than in land mammals
- hypothesized that myoglobin function relates to increased oxygen transport to muscle, oxygen storage and as a scavenger of reactive oxygen
species
Hemoglobin
Hemoglobin
Hemoglobin is another oxygen transport and storage molecule
composed of four polypeptide chains
◊ two α - 141 amino acids
◊ two β - 146 amino acids
contains two dimers of αβ subunits held together by noncovalent interactions
each chain is a subunit with a heme group in the center that carries oxygen
one hemoglobin molecule contains four heme groups and carries four O2 molecules
Biochem Page 11
one hemoglobin molecule contains four heme groups and carries four O2 molecules
□ The structure:
α-globin (blue)
β-globin (purple)
Myoglobin (green)
□ These protein subunits share only 18% primary sequence identity
◊ Sequence alignment
Light blue - shared amino acids between Hb alpha and beta
Pink and purple - shared amino acids between all three subunits
Overall, not that many conserved sequences + location of loops vary
Graph:
◊ y-axis - fraction saturation of each protein
◊ x-axis - concentration of Oxygen as partial pressure
◊ When y=0.5 - protein is half saturated with oxygen
Hemoglobin function
- myoglobin, an oxygen storage protein, has a greater affinity for oxygen at all oxygen pressures
- hemoglobin is different – it must bind oxygen in lungs and release it in capillaries
- hemoglobin becomes saturated with O2 in the lungs, where the partial pressure of O2 is about 100 torr
- In capillaries, pO2 is about 40 torr, and oxygen is released from hemoglobin
○ Once Oxygen is released, myoglobin binds to it
○ Makes up a system to deliver Oxygen from the lungs to the other parts of the body
- The binding of O2 to hemoglobin is cooperative – binding of oxygen to the first subunit makes binding to the other subunits more favorable
◊ The Fe iron is about 0.6 Å out of the heme plane in the deoxy state
◊ When oxygen binds it pulls the iron back into the heme plane
◊ Since the proximal His F8 is attached to the Fe this pulls the complete F helix
◊ The dimers become closer together
Diagram:
◊ Unbound state - blue
◊ Bound state - red
Biochem Page 12
- CO2 indirectly effects the affinity by lowering pH of the blood cells
○ Lower pH leads to protonation of several hemoglobin groups which can then formion pairs and stabilize T-form state
Overall:
◊ higher pH promotes tighter binding of oxygen to Hb
◊ lower pH permits easier release of oxygen from Hb
◊ this is a result of pK changes of several groups (N-terminal amino groups of α subunits and C-terminal His of β
subunits
The mutant Val can insert into hydrophobic pocket in neighbouring Hb molecules
◊ Oxy and deoxy -bound states of sickle cell Hb have a thing on the side
◊ Deoxy-bound state of sickle cell Hb has a pocket, perfect for binding the thing
◊ As a result, deoxy-Hb of sickle cell polymerizes into filaments, aggregating into fibers of 14-16 strands
◊ These fibers deform the cell
- Heterozygotes don't show the disease b/c formation of fibers is 1000 times slower than in homozygotes
Biochem Page 13
Lecture 9 (Protein Folding and Stability)
October 15, 2020 6:32 PM
- 2 folding criteria:
○ Peptide chain must satisfy the constraints inherent in its own sequence
○ Peptide chain must fold so as to "bury" the hydrophobic side chains, minimizing their contact with water
a) polypeptide collapses due to the hydrophobic effect and elements of secondary structure begin to form
◊ Formed intermediate is called "molten globe"
b) subsequent steps involve rearrangement of backbone chain to form characteristic structural motifs
c) finally reaches stable native conformation
Protein Unfolding
- Denaturation - the process by which a folded or native protein is converted to an unfolded form
- The free energy difference between the native and unfolded states of a protein is usually small (∆Gfolding ~5-10 kcal/mol), so mild treatments like heat
or a change in solvent will denature most proteins
- Once denatured, a few proteins can refold spontaneously, but this is rare (chaperones are usually required)
- Usually occurs with small proteins only
- Most denatured proteins tend to aggregate and precipitate
Biochem Page 14
- Most denatured proteins tend to aggregate and precipitate
Molecular Chaperones
- larger proteins are more likely to become temporarily trapped in a local energy well
- the presence of such metastable intermediates slows rate of protein folding and may contribute to aggregation
- to overcome these problems in the cell, the rate of correct protein folding is enhanced by molecular chaperones
◊ Functions of chaperons:
enhance rate of correct folding of proteins
– sometimes by binding newly synthesized polypeptides before they are completely folded
prevent formation of improperly folded intermediates that may trap polypeptide in aberrant form
bind unassembled protein subunits to prevent aggregation and precipitation before they are assembled into
complete multi-subunit complexes
Biochem Page 15
○ in bacteria it is called DnaK
- one role of Hsp70 is to bind to proteins while being synthesized to prevent aggregation or entrapment in local low-energy well
- high conservation of Hsp70/DnaK indicates that chaperone-assisted folding is an ancient and essential requirement for efficient synthesis of correctly
folded proteins
◊ Structure:
2 domains which are connected by a short interdomain linker
N-terminal domain: ATP binding domain
C-terminal domain: substrate binding pocket with affinity for peptides with hydrophobic amino acids
– Also contains alpha-helical structure which acts as a lid for substrate binding pocket
◊ Mechanism:
Binding of the peptide triggers ATP hydrolysis - causes lid to close and trap the substrate
When ADP is converted back to ATP - lid opens and substrate is released
◊ Mechanism:
HSP70 brings the unfolded protein
unfolded proteins bind the hydrophobic central cavity enclosed by the rings
When protein binds, ATP hydrolysis occurs which attracts GroES cap which closes the protein within the chamber
– At the same time, the GroES cap on the other side of the protein is released
when folding is complete the protein is released and ADP is converted back to ATP
– The GroES top cap is released and the bottom cap binds back
Biochem Page 16
Biochem lab manual 1
October 22, 2020 10:35 AM
Spectrophotometer
- A spectrophotometer is an instrument that is used to measure the transmission of light
through a sample
- The higher the concentration of the substance, the more light will be absorbed as it passes
through the solution
- Light that is not absorbed, strikes a photocell causing a flow of current, which is directly
proportional to the light intensity
○ The current is then measured by an ammeter
- The instruments are adjusted so that a maximum current flows when the light passes through
the blank solution (100% transmission or 0% absorbance)
○ A decrease in current is therefore proportional to the concentration of the substance.
The UV Method:
- Proteins absorb in the UV range (~280 nm) due mainly to the presence of aromatic side chains
in tyrosine and tryptophan residues
○ The amount of absorbance at 280 nm for a given protein will thus depend on its amino
acid composition
- a BSA standard curve is used to determine the concentration of an unknown
- can assume that absorbance is completely linear with respect to protein concentration and
values may be extrapolated from a linear standard curve
- may be performed quickly and easily on a small quantity of material (sensitivity is 0.05 – 2.0
mg/mL protein)
- One major disadvantage to the UV method is that interference may occur from other
compounds found in biological materials
Soret band
- heme-containing proteins such as hemoglobin (Hb) display absorbance at ~ 409 nm
- It is due to delocalization of electrons in the heme porphyrin ring
- In addition to the Soret band, hemoglobin displays characteristic absorption peaks in the
visible range
○ These absorption bands distinguish oxyhemoglobin from deoxyhemoglobin
Biochem Page 17
Lecture 1&2 Section 2
October 27, 2020 2:26 PM
- note that operators may lie upstream of the promoter (as diagramed here) or downstream
◊ Consider experiemnt:
E. coli cells are grown in media containing both glucose and lactose
Glucose is preferred as a carbon source over lactose so in the initial phase of
the experiment the cells use the glucose and ultimately deplete it from the
media at which point cell division levels off
After a lag cells begin to divide again and this coincides with the cells utilizing
the lactose
The lag represents the time bacteria requires to activate the lac operon
– This switch involves mechanisms that both positively and negatively
regulate the genes encoded within the lac operon
1. Galactoside transacetylase - whose function is still unclear (even though this work began some 70 years ago)
Biochem Page 18
the lacI gene - encodes the lac repressor
– expressed all the time
in the absence of lactose - tetramers of lac repressor bind to the lac operator
– this blocks expression from the Plac promoter
in the presence of lactose - an inducer is generated that binds to the lac
repressor preventing it from binding to DNA
– this permits expression from the Plac promoter resulting in expression of
the proteins required to utilize lactose
Lac repressor
- lac repressor is an allosteric protein
○ An allosteric protein is one where binding of a specific molecule to the protein changes the shape of a
remote site on that protein thereby altering its interaction with a second molecule
- Note:
○ much of the work on the lac operon used an allolactose analog called IPTG that induces the lac operon but is
not cleaved by beta-galactosidase
○ this is helpful as you don’t have to worry about the inducer being depleted - during the course of an
experiment which in turn would lead to shutting down of the operon, as would happen if you treated cells
with lactose because beta-gal also cleave allolactose
- Initial biochemical experiments uncovered the role of the beta-galactosidase protein in metabolizing lactose and
showed that its expression was induced by treatment of cells with inducer
- Genetic experiments identified several mutants that were defective in their response to lactose
- wild-type cells only expressed the permease after treatment with inducer
- Additional experiments identified another enzyme, the galactoside transacetylase, that was induced along with
beta-galactosidase and the permease
- Finally, the 3 genes encoding these enzymes appeared to be right next to one another in the E. coli genome
- To summarize:
- 3 enzymes that function in lactose metabolism that are all induced by lactose and map to the same region of
the E. coli genome
- Together these data suggest that the regulation of these 3 enzymes might occur through a similar
mechanism that might be related to their position in the genome
Biochem Page 19
Constitutive mutants of lac operon (case 1)
– Another class of mutants identified were called constitutive and they
produce all 3 enzymes in the absence of inducer
– To investigate this mutant further partial diploid bacteria were
generated that carry both wild-type and constitutive alleles
(heterozygotes)
– Result:
In heterozygote individuals - both sets of lac genes were turned off
in the absence of lactose
The lac repressor produced by the wild type gene was enough to
supress the transcription of both alleles
– In this model the repressor binds to the operator to turn off expression
of lac products in the absence of inducer
- These results indicate that the operator must be linked on the same molecule of DNA to regulate expression of the
lac operon
- thus, the operator is said to act in cis (which is latin for here)
- Based on these data it was suggested that the Oc mutant indicates the operator is in a noncoding regulatory region
as opposed to a gene that encodes a protein
- This is because a gene that encodes a protein should be complementable in trans (latin for across) as was the
case for the mutation in the lac repressor gene
Summary
- these results pointed to the existence of a repressor molecule that acts in trans to keep the operon off in the
absence of inducer and a cis-acting operator that is a non-coding region that is bound by the repressor
- Remember that this model is based largely on the results of genetic experiments
- The fact that this model was proposed based largely on genetic data meant that a key part of confirming it was
biochemical experiments that isolated the lac repressor and showed that it bound to the operator sequence
- Procedure:
- They fractionated lacIt cell extracts using standard biochemical approaches including ammonium sulphate
precipitation and gel filtration, and used an assay to look for an IPTG binding activity
- when you fractionate a cell extract you end up dividing your starting material into several (many) different
fractions and as such you need an assay to identify the fractions that contain the protein you are interested
in
Biochem Page 20
- researchers developed an assay to detect a protein that would bind IPTG
◊ How would you know which fractions contain the lac repressor?
In a fraction that doesn’t contain lac repressor - the IPTG will diffuse freely
across the dialysis bag membrane and at equilibrium the concentration of
IPTG will be the same
In a fraction that contains lac repressor - the IPTG will diffuse freely across the
dialysis bag membrane and at equilibrium the concentration of IPTG different
due to some of IPTG binding to the protein inside the bag
- How do you know that you have identified the right activity or put another way how do you know that the IPTG
binding activity you have identified in likely involved in regulation of the lac operon?
- Run some controls
Controls in Biochemistry
- There are typically two types of controls one uses in an experiment
- Negative control - a negative control should be designed to not give the desired outcome of the experiment
- Positive control - a positive control should give the desired outcome of the experiment, and is typically done
to make sure that all of the reagents in the experiment are working properly
▪ Positive controls are not always possible
- Typically, in each case you change only on thing to ensure that any difference you see is only due to that one
change
◊ Experimental procedure:
add BamHI, salts and a buffer that the enzyme will work in
incubate at 37oC to give the enzyme a chance to cut
run out the digest on a gel to separate any DNA fragments based on their
molecular weight and stain the gel to visualize the DNA
The DNA is negatively charged, thus it will run from negative to positive
electrode
- Possible results:
1. The fractions that contain IPGT binding activity when the purification is performed using lacI t cells DO NOT
HAVE any IPGT binding activity when the purification is performed using lacI - cells
▪ Such a result would suggest that you have purified the lac repressor
Biochem Page 21
▪ Such a result would suggest that you have purified the lac repressor
2. The fractions that contain the IPGT binding activity when the purification is performed using lacI t cells HAVE
IPGT binding activity when the purification is performed using lacI - cells
▪ This would suggest you have not purified the lac repressor and that you have purified another protein
that binds IPTG and is not likely to be involved in regulation of the lac operon
- Amount of DNA was kept constant and the amount of protein was varied
○ The amount of DNA bound to the filter as a percentage of input DNA was plotted on the y-axis versus the
amount of repressor protein added on the x-axis
- Results:
◊ Left panel
The wild-type lac operator (positive control), no operator (negative control) or
a mutant Oc operator was mixed with the repressor
The model predicts that the lac repressor should bind to the wild-type
operator with a higher affinity than the other 2 DNAs and this is what they see
The fact that the amount of DNA captured never reaches 100% results from
the fact that they were using a crude prep of DNA, a lot of which did not
actually contain operator sequences
◊ Right panel
wild-type lac operator was mixed with the repressor +/- IPTG
he model predicts that the repressor should bind the operator – IPTG but not
+IPTG and that is what they see
closed promoter complex - the DNA strands bound by the polymerase are coiled together
open promoter complex - the DNA strands bound by the polymerase are unwound
open complex is resistant to heparin while the closed complex is not
◊ Closed complex + heparin = RNAP will fall of RNA
◊ Open complex + heparin = RNAP will remain on RNA
Biochem Page 22
- In vitro experiment
○ experiment that is conducted in a test tube
○ 2 extreme cases:
1. in vitro experiment that consists of two highly purified components that you mix together and then
observe how they interact
2. in vitro experiment that involves taking a population of cells and disrupting their membranes (ie lyse
them) and then take the total soluble content of that cell and do an experiment in that complex
mixture
- In vivo experiment
○ conducted within a living organism
○ In vivo experiments can be conducted using living bacterial cells, yeast cells, whole worms (C. elegans),
whole fruit flies (D. melanogaster), or whole mice
- A grey area is experimentation related to cells in culture – some would consider these in vitro experiments while
others would call them in vivo
- work that use both in vivo and in vitro approaches, that complement and reinforce one another, tend to give the
most rigorous and complete picture of what’s going on
○ A good example of this is the genetic (in vivo) and biochemical (in vitro) experiments to understand the
function of the lac operon
Procedure:
◊ add the lac repressor and template DNA to the reaction and allow a complex
between them to form
◊ Next RNA polymerase is added and given a chance to bind the promoter
◊ Then heparin is added to the reaction which would displace any polymerase that is
not tightly associated with the template DNA
◊ Then radioactive nucleotides are added along with IPTG and the reaction is
incubated to allow transcription to proceed
◊ The reaction is then run out on a polyacrylamide gel so they can visualize the
radioactively labeled RNA product that might be produced
- Taken together these result suggest that the lac repressor doesn’t block recruitment of RNA polymerase,
suggesting instead that it blocks productive transcription of the lac operon
- Other experiments contradicted these results and suggested that lac repressor block RNA polymerase binding to
the promoter
○ To resolve this conflict additional experiments were done
Biochem Page 23
Chromatin immunoprecipitation (Chip)
- Procedure:
□ Treat cells with formaldehyde to covalently crosslink proteins to nucleic acid
□ Break open cells and shear DNA into small fragments
□ Use an antibody against specific protein to immunoprecipitate that protein and any linked
fragments of DNA
Involves beads which carry protein A or G attached to them
Protein A or G are able to bind to the FC region of IGG molecules such that the antibody
and protein that it recognizes become associated with the beads
○ Reverse the crosslink, purify DNA and assay the amounts of specific pieces of DNA using PCR
- No biochemical purification is perfect, so in this case not only will you get fragments of DNA that are bound to the
yellow protein but you will also get the other fragments
- But the idea is that you will recover greater quantities of the bound fragments than other fragments
- Compare the amount of the red fragment that comes down with an antibody against the yellow protein to another
antibody that doesn’t interact with the yellow protein
○ For example if you get 3 fold more red fragment if you run the experiment with an antibody against the
yellow protein compared to an antibody that doesn’t recognize the yellow protein this would suggest the
yellow protein does indeed interact with the red fragment of DNA
○ you would say the red fragment is 3 times enriched in the immunoprecipitates (IPs) with the antibody
against the yellow protein compared to a negative control IP
- On the other hand if you see similar amounts (i.e. no enrichment or 1 fold enrichment) of a fragment of DNA in
the IP with the yellow protein and a control IP where the yellow protein isn’t IPed would suggest the yellow
protein doesn’t interact with that region
Biochem Page 24
Lecture 3 Section 2
November 4, 2020 8:04 PM
Lac operons
– There are 3 lac repressor binding sites in the lac control region O1 O2 and O3
– O1 represents the original operator site where the Oc mutant mapped
○ Fold repression =
- Results:
◊ results show that full repression requires all three sites
◊ While the O1 site in combination with either one of the other two works very well
◊ also note that O2 and O3 only work well when combined with O1
– The X-ray crystal structure of the lac repressor tetramer shows that:
the two dimers within the tetramer can each bind to an operator sequence
And that each monomer of the dimer binds to one half of the operator
- This structure also highlights the fact that different regions of the protein are responsible for different properties
○ the DNA binding domain at the one end of the protein is separate from the regions that mediate tetramer formation
- Conclusion:
○ The ability of the tetramer to bind to two operators at the same time suggests that when a tetramer is bound to the lac control region
it would loop out the sequences between the two bound operators
○ The cooperativity observed between two operators is explained by the fact that the tetramer bound to two operators would be a
more stable complex than bound to only one
Biochem Page 25
more stable complex than bound to only one
▪ this increase stability would make repression more efficient as the repressor would be less likely to fall off the DNA
○ Consistent with this idea is the fact that a lac repressor that is mutant such that it can not form tetramers shows a substantial defect
in repression
◊ Ka for lac repressor in association with lac operon with +/- inducer
◊ Ka for lac repressor in association with non-special DNA with +/- inducer
- Observations:
○ Thus these numbers indicate that the affinity of lac repressor for the lac operator is reduced upon inducer binding, while its affinity for
other DNA sequences does not change
○ In addition, they indicate that lac repressor has a higher affinity for operator DNA compared to other DNA even in the presence of
inducer
- Consider that the high affinity operator sequence is competing with a large number of low affinity sites represented by the rest of the
genome
○ In the absence of inducer - the large difference between lac repressor’s affinity of operator and non-operator DNA ensures a high
level of binding to the operator
○ In presence of inducer - the reduction in affinity for the operator is sufficient enough that the large excess of non-specific binding sites
can efficiently compete for binding so the operator will no longer be bound
- Concluding points:
The model is that in the presence of inducer the repressor almost always bound to DNA and it is quickly
sliding along the E. coli genome
This arrangement explains why the repressor is able to find the operator sequences very quick when
inducer is removed
– much more quickly than if lac repressor was floating around the bulk of the E. coli cell
Last problem
- Recall this experiment where cells were grown simultaneously in both glucose and lactose
Biochem Page 26
the inducer allolactose
- Addressing problem 2
○ It turns out that the lac operon is expressed at a very low level in the absence of lactose such that a cell expresses about one molecule
each of permease and beta-gal
○ This results because the lac repressor/operator complex is not infinitely stable and as such every once in a while the repressor will fall
off the promoter allowing RNA polymerase the chance to transcribe the lac coding region
Biochem Page 27
Lecture 4 Section 2
November 5, 2020 1:43 PM
Why isn’t the lac operon on in the presence of glucose and lactose?
When glucose is present a small molecule, cyclic AMP (or cAMP for short), is not generated
by E. coli
As glucose is depleted, cAMP levels increase and this results in the activation of the lac
operon
This mechanism represents a positive form of regulation that controls the lac operon
◊ while the action of the lac repressor represents a negative form of regulation
- To explore this further a cell free (or in vitro) extract was developed that recapitulated this regulation
○ in vitro extracts are very useful because you can manipulate the factors present in them (i.e. remove
components and/or add things in) and see what effect this has on the process you are studying
Experiment 1:
- 2 conditions tested:
- *Note: that this strain is not completely wild-type as it is deleted for the entire lac operon, including the repressor
gene, so these gene products won’t interfere with your ability to assay the expression of the DNA you introduce
into the extract
- Conclusion: these extracts accurately replicate the effect of cAMP on beta-galactosidase expression
Experiment 2:
- Controls:
○ The - cAMP experiment is a negative control
○ the + cAMP experiment is the positive control
○ validates their system for the upcoming experiments
- Using this assay researchers asked if they could purify a protein that could stimulate beta -gal production
○ They used standard biochemical purification techniques such as ion exchange chromatography and assayed
the resulting fractions for their ability to mediate cAMP stimulation of beta-gal expression
- Conditions tested:
- Through this approach they were able to purify a protein whose addition to the extracted resulted in a further
stimulation of beta-gal production when cAMP is present
○ This protein is known CAP (catabolite activator protein)
- Note that these wild-type extracts already carry CAP protein in them, so the addition of more CAP results in just a
small increase in beta-gal expression
Experiment 3:
- Researchers also had a mutant that they thought might be in the CAP gene
- This mutant was identified in a screen for bacteria that were defective in their response to lactose and was shown
to not induce beta-gal expression
○ this mutation didn't map to the lac operon and still produced cAMP
- Conditions tested:
- Result:
Beta-gal activity is not stimulated by cAMP but if you add back purified CAP protein to these extracts you
Biochem Page 28
○ Beta-gal activity is not stimulated by cAMP but if you add back purified CAP protein to these extracts you
see that they become responsive to cAMP
○ Taken together these results provide strong evidence that the mutant carries a defective CAP gene
○ They also provide a correlation between the ability of a cell to use lactose and normal CAP function
Next using a dialysis type assay researchers showed that the purified CAP protein interacts
with cAMP
- Sequencing of these mutants mapped a putative binding site for CAP and subsequent foot printing experiments
confirmed that the above mutation maps to the CAP binding site
- Mechanism:
◊ To do DNase I foot printing you radioactively label one strand of your DNA of
interest at either the 5’ or 3’ end
◊ This labeled DNA is mixed with the DNA binding protein and the mixture is
subsequently treated with low levels of DNase I (an endonuclease) so as to induce
less than one cleavage per DNA molecule
◊ The reaction is then run on a polyacrylamide gel under denaturing conditions
the position of the bands in the gel are detected using x-ray film or
phosphoimaging
◊ The net result is that where-ever a protein is bound the DNA will be protected from
DNase digestion and this will show up as a gap in the gel
Diagram: as you add more amount of protein to the rxn, you can see that gap
becomes more obvious
◊ The fact that the DNA is labeled only on one strand and only at one end allows you
to map where the protein binding site is based on the size of the cleavage products
Experiment 2
- Experimental procedure:
◊ Allow RNA polymerase to bind to the lac promoter +/- CAP and cAMP
◊ Then nucleotides and rifampicin are added
rifampicin - a drug that blocks transcription initiation but not elongation
- Results:
○ In the presence of CAP and cAMP you see that transcription occurs while in their absence transcription does
not occur
- These results suggest that in the presence of CAP and cAMP an open promoter complex forms and when
nucleotides are added elongation proceeds before the rifampicin takes effect and blocks initiation
○ Thus, CAP+cAMP stimulates the ability of RNA polymerase to form an open promoter complex
- Additional work showed that CAP+cAMP stimulates the formation of the closed promoter complex if you consider
the formation of the open promoter complex as follows: R + P ---> RPclosed ---> RPopen
- In this case increasing the rate of formation of the closed promoter complex drives the formation of the open
Biochem Page 29
- In this case increasing the rate of formation of the closed promoter complex drives the formation of the open
complex
- Mechanism:
- Before X-ray crystallography - this fact had already been recognized using a gel mobility shift assay
○ Consider then that if you bend the DNA this will generate a hook in the DNA which will result in the DNA
getting hung up in the pores of the gel
○ Also, if the bend is in the middle of the DNA this is going to create a bigger hook and it will be hung up more
than a bend towards the end of the molecule
Biochem Page 30
Lecture 5 Section 2 (Trp operon)
November 18, 2020 9:14 AM
- Mechanism1:
○ When trp levels are high - cells don’t need to make any more trp so the operon is shut off
○ When trp levels are low - a non-functional trp repressor is produced that is called the aporepressor and as such the 5 genes of the operon are expressed so the
cells can make trp
- The aporepsressor
○ a dimer
○ is not functional because it can’t bind DNA
○ When the aporepressor dimer binds trp an allosteric transition occurs that changes the conformation of the repressor so it can now bind DNA and repress
transcription
- Mechanism2: The trp operon is also regulated by a mechanism that is known as attenuation
- Note:
○ Trp repressor is capable of repressing transcription of trp operon by about 70 fold
○ Attenuation represses transcription of trp operon by around 10 fold
- Mechanism:
◊ When trp levels are high and transcription does initiate at the trp promoter (i.e. the repressor has failed to block RNA polymerase
recruitment)
The polymerase proceeds through the 5’ end of the operon (through the leader sequence) but transcription terminates (i.e. RNA
polymerase falls off the DNA) in a sequence known as the terminator
Since the attenuator is upstream of all of the protein coding genes - the attenuator blocks production of the proteins involved in trp
synthesis
▪ In general transcript termination in prokaryotes results from the formation of a stem/loop sequence in the RNA transcript soon after it emerges from the
RNA polymerase
▪ This stem/loop is thought to destabilize the RNA:DNA hybrid which results in release of the transcript and RNA polymerase from the template DNA
The 5’ untranslated region (UTR), also known as the leader, can exist in two different secondary structures
1. Functional terminator (left) - sequences 1 and 2 form one stem/loop and sequences 3 and 4 make another stem/loop
followed by the stretches of Us
2. Non-functional terminator (right) - sequences 2 and 3 base pair with one another
The formation of these structures is regulated by the level of trp in the cell
- Attenuation requires translation of a small open reading frame in the trp leader
○ Trp’s ability to regulate the formation of these structures relates to the presence of a small open reading frame (protein coding sequence) within the 5‘UTR of
the trp mRNA
○ Open reading frame contains two trp codons
○ Translation of these two trp codons controls whether or not the terminator forms
- Attenuation relies on the fact that transcription and translation are coupled in bacteria
In bacteria soon after RNA polymerase initiates transcription a ribosome can start translating the mRNA
This simply requires an open reading frame with an upstream ribosome binding site (shine Dalgarno sequence) + start codon (Met)
Nascent RNA - refers to transcript that is still associated with the RNA polymerase
○ Mechanism:
If trp levels are low - the cells will contain low levels of tRNAtrp charged with the trp amino acid
– This causes the ribosome to pause at the trp codons which will, in turn, allow regions 2 and 3 to base pair with one another
thereby blocking formation of the terminator
– Therefore: in low trp levels termination is blocked and the genes that encode proteins which synthesize trp will be expressed
Biochem Page 31
If trp levels are low - the cells will contain low levels of tRNAtrp charged with the trp amino acid
– This causes the ribosome to pause at the trp codons which will, in turn, allow regions 2 and 3 to base pair with one another
thereby blocking formation of the terminator
– Therefore: in low trp levels termination is blocked and the genes that encode proteins which synthesize trp will be expressed
If trp levels are high - the cells will contain high levels of tRNAtrp charged with the trp amino acid
– Thus the ribosome will translate through the trp codons - no pause
– This will disrupt base pairing between regions 2 and 3, thus allowing region 3 to base pair with 4 creating the terminator
– Therefore: in high trp levels termination occurs and the genes that encode proteins which synthesize trp will not be expressed
- Evidence 1:
○ Trp repressor mutants could still respond to trp starvation by increasing the rate of synthesis of trp mRNA
▪ this already suggests that there is more going on than simple regulation by the trp repressor
- Evidence 2:
○ Researchers detected more RNA coming from the 5’ end (140 base) of the trp mRNA compared to more 3’ sequences, they then mapped the termination site and
showed that termination at this site was reduced in the absence of trp
- Examining the sequence where termination occurs showed the potential stem/loop and the run of Us consistent with a terminator signal
○ but how was it regulated by trp levels
- Note that many of the experiments to understand the mechanisms of attenuation were done in bacteria that were mutant for the trp repressor, simplifying the
interpretation of their results
- Evidence 3:
○ Using the purified leader sequence RNA it was shown that ribosomes bound to the leader and an AUG start codon was identified within the bound region
○ and other experiments showed that the trp leader functioned as a site for translation initiation
○ Translation initiated at this site would result in a 14 aa peptide that would contain 2 trp amino acids in a row towards the end of the peptide
○ The fact that the trp leader is predicted to encode for a peptide would make one wonder if the peptide itself might play a role in regulating the operon (i.e. in the
same way a protein might)
- To test the potential role of the peptide in regulation of the operon researches started with a mutant strain where the AUG start codon of the leader peptide was
mutated such that it would no longer support translation of the leader orf
○ Result: In low levels of trp amino acid this mutant failed to show an increase in the transcription of the regions downstream of the leader
○ this contrasted the behaviour of a strain with a wild-type start codon, where transcription of sequences downstream of the leader went up when trp levels were
low
- To test if the peptide encoded by the trp leader was important researchers tried to rescue the start codon mutant using a partial diploid approach
○ Result: They found no evidence for rescue in this case
○ Therefore: the mutant could not be rescued in trans, suggesting that the peptide itself did not play a role
○ Instead these results suggest that the leader functions as a cis-acting element
○ Note1: this is a negative result and researchers are always concerned about giving too much weight to them
▪ However, all of the other experiments done on the trp operon support the idea that the leader sequence functions in cis and not in trans
○ Note2: Also, note that if you were to do this sort of experiment today, you would likely just try to rescue the mutant using a plasmid that would express the
leader sequence peptide
- The potential importance of the trp codons was emphasized by the fact that other species of bacteria have trp codons within the leader sequence of their trp operons
○ this type of conservation suggests an evolutionary pressure to maintain them indicating they have an important functional role
- other amino acid synthetic operons that are regulated by attenuation contain small open reading frames within their leader sequences and these orfs are rich in codons
for the amino acids they are involved in synthesizing
○ suggests this mechanism might be a common form of regulation for these types of operons
- starvation of cells for Trp or Arg (note that a codon for Arg comes just after the Trp codons) reduces termination in the leader
○ In contrast, starving cells for other amino acids (encoded by the leader or not) doesn’t
○ Suggests that ribosome stalling must occur at the right place to regulate attenuation
Biochem Page 32
▪ Therefore: starving for Met will not decrease termination
Mutation experiments
What if you mutated the start codon for the leader sequence peptide?
– The ribosome will not translate the mRNA
– Therefore: stem 1+2 will base pair as the mRNA is transcribed and as such stem 3 will have no choice but to pair with 4
– Generates a terminator even when trp levels are low
Conclusion:
- The model would seem to suggest that of translation on the leader sequence would have to begin at just the right time for the mechanism to work properly, because if
the ribosome joined too late it would not work properly
○ But it turns out that there is another level of complexity built into the system that links transcription with translation
- Mechanism:
RNA polymerase initiates transcription and then pauses at the stem/loop 1/2 structure
The ribosome initiates translation of the leader sequence orf and as a result RNA polymerase resumes transcription
- This mechanism ensures that the polymerase doesn’t get ahead of the ribosome which would disrupt the attenuation mechanism
◊ Lane 1 - the top intense band represents termination of the template at the leader sequence terminator, while there is a faint band that
results from RNA pol pausing at the stem/loop 1/2 sequence
◊ Lanes 2 and 3 - increasing amounts of a translational inhibitor are added and you can see that the amount of pausing that is detected
goes up
◊ This suggests that normal translation relieves the RNA pol pause
◊ Lane 4 - shows the results without a translational inhibitor and consistent with the fact that this template can not be translated you see
the pause site band is stronger
◊ In addition, you see that increasing amounts of translational inhibitor have no effect on the level of pausing
◊ This again provides evidence that relief of RNA pausing is related to translation of the leader peptide orf
Biochem Page 33
Lecture 6 Section 2 (Riboswitches)
November 18, 2020 2:51 PM
- Riboswitches - structured RNA domains that usually reside in the noncoding regions of mRNAs where they bind
metabolites and control gene expression through a variety of mechanisms
- These RNA regulatory elements form highly specific binding pockets for their target metabolite
○ When metabolites bind to these cis-acting elements they influence the expression of the mRNA
- One of the first characterized riboswitches controls the expression of genes involved in making thiamine
(vitamin B1)
○ Thiamine is an essential cofactor for many enzymes
○ Bacterial cells can utilize thiamine in their environment or when there is none they can synthesize it
themselves
- When this sequence was first identified it was recognized that it had the potential to form a secondary
structure
- Predicting RNA secondary structure involves using computer programs that, in general, use thermodynamic
parameters to predict the most stable structure for RNA sequence
○ Problem: even random RNA sequences are likely to adopt stable structures
○ Solution: to assess whether the structure is biologically important - compare related sequences that
might contain the same structure
- The differences highlighted in red above and in the rectangles to the left conserve the secondary structure
○ These type of changes are known as compensatory changes
○ Many such compensatory changes in several related sequences suggest the element can tolerate some
changes in sequence as long as the structure is preserved
- Conclusion: this suggests that the structure is functional in some way thereby increasing your confidence the
structure actually exists in vivo
- RNAs are typically generated using one of 3 RNA polymerases isolated from different bacteriophages
○ T7 RNA polymerase
○ SP6 RNA polymerase
○ T3 RNA polymerase
- These polymerases recognize specific DNA promoter sequences and initiate transcription just downstream
i) Start with plasmid that contains sequence of interest + unique restriction site
downstream of sequence of interest
ii) Digest the plasmid with restriction enzyme to linearize it
iii) Linear RNA is mixed with 1 of the polymerases
iv) RNA polymerase will recognize the promoter & initiate transcription of RNA of
interest
Biochem Page 34
1. generate RNA via in vitro transcription
2. end label the RNA either 3' or 5' end
3. Allow the RNA to sit for a prolonged time at a room temperature
- under these conditions RNA will undergo limited spontaneous cleavage but residues that are part
of structured regions are resistant to this cleavage
- again you want to get on average less than one cleavage per RNA molecule
4. resolve RNA on a polyacrylamide gel
If structure 1 forms on this RNA you will see the cleavage pattern represented in lane
1 of the gel
if structure 2 forms you will see the pattern in lane 2
- These result suggest that binding of TPP to its binding site within the thiM 5‘UTR changes the structure of
sequences that are remote to the binding site indicating that the 5‘UTR is behaving as a allosteric molecule
- One key change they noticed in the structure was that in the absence of TPP the ribosome binding site (rbs)
was sensitive to cleavage while in the presence of TPP it was not
○ This suggests that in the absence of TPP that the rbs is single stranded, which would allow it to recruit
the ribosome to the mRNA
○ in the presence of TPP the rbs is double stranded, which would block mRNA translation
- In addition, they found that the sequence that was believed to base pair with the rbs was resistant to cleavage
+/- TPP in contrast to the behaviour of the rbs
○ This suggests that the sequence that can pair with the rbs is paired with another sequence in the
absence of TPP
- To develop a reporter system to study TPP-mediated regulation of gene expression two constructs were
initially made
1. Translational fusion where the thiM promoter drives the transcription of an mRNA carrying the thiM
5‘UTR and the lacZ gene
▪ in this case the start codon for translation comes from the thiM sequence and its fused in frame
with the lacZ open reading frame
▪ in addition the rbs is provided by the thiM 5‘UTR
2. Transcriptional fusion where the thiM promoter drives the expression of an mRNA carrying the thiM
5‘UTR and the lacZ open reading frame
▪ in this case both the ribosome binding site and the start codon for translation is provided by lacZ
sequences
- Note that in this construct the lacZ gene serves simply as a reporter so that you can readily assess the effect of
TPP on expression of these constructs
○ remember that lacZ encodes the beta-gal protein and that one can readily assay beta-gal protein levels
using a simple assay that measures beta-gal enzyme activity
◊ Mutants M1 and M3 are predicted to disrupt base pairing within the core 91 nt
region that carries the TPP binding site
These mutations block TPP binding and consistent with the model they
Biochem Page 35
aspects of the model
◊ Mutants M1 and M3 are predicted to disrupt base pairing within the core 91 nt
region that carries the TPP binding site
These mutations block TPP binding and consistent with the model they
also block regulation of these mRNAs by TPP
they also block the structural rearrangement that results in base pairing of
the rbs upon TPP binding
◊ Compensatory mutations (M2 and M4) - restore base pairing in these regions
but change the sequence and found that these mutations restored TPP binding,
TPP-mediated regulation and the base pairing of the rbs upon TPP binding
- Conclusion: these data provide a STRONG correlation between TPP binding to the thiM 5‘UTR, TPP-dependent
control of gene expression and base pairing at the rbs
- In addition, they provide evidence for the existence of the base pairing in these regions and the importance of
these structures to TPP binding
In the presence of TPP the rbs (again labeled SD) is base paired with another region of
the thiM 5‘UTR which would block ribosome recruitment
In the absence of TPP this region is base paired with another segment of the 5‘UTR
- As another test of this model the indicated base is changed from a U to a C (see the red arrow)
○ This is predicted to stabilize the right hand conformation while it should destabilize the left hand
conformation
○ Consistent with the model this mutation reduces expression even in the absence of TPP and the
presence of TPP has little effect on this low level of expression
○ This occurs despite the ability of TPP to still bind to the 5‘UTR
- In addition, in line probing shows that this mutation reduces cleavage at the rbs in the absence of TPP
consistent with it leading to increased base pairing at the rbs
TPP riboswitches
- Additional work in bacteria has shown that TPP riboswitches can also control gene expression through
regulation of transcription termination
- The ability of TPP riboswitches to regulate gene expression through different mechanisms relates to their
structure
- They consist of an aptamer or ligand binding region upstream of a sequence that is referred to as the
expression platform
○ The aptamer is solely responsible for ligand (small molecule) binding
○ The expression platform is the sequence that is acted upon by ligand binding that results in some change
in gene expression
- So when a TPP riboswitch controls termination in the absence of TPP no terminator is formed
- but when TPP is present you see a rearrangement of sequences such that a terminator does form
- TPP riboswitches are widespread in prokaryotes and have even been found in some eukaryotes - namely fungi
and plants
- In fungi and plants these riboswitches regulate the splicing of transcripts involved in thiamine synthesis
- In addition, bacterial riboswitches have been identified that bind to other small molecules to regulate gene
expression in much the same way that the TPP riboswitches function
- In general these other riboswitches also consist of an aptamer sequence involved in ligand binding and an
expression platform that is the site that mediates regulation
- Some of these riboswitches were identified by simply looking for conserved structures/sequences upstream of
genes that encode proteins involved in the same metabolic process and knowing what that process is allows
one to make guesses as to what the ligand might be for that potential riboswitch
- In some cases ~2% of the genes in a bacterial genome could be regulated by a riboswitch
Mechanism:
◊ At low temperature the ribosome binding site (SD) is found in a stem structure
Biochem Page 36
- Similar RNA structures have been shown to function as thermosensors
- Example: in pathagenic human bacteria the expression of a transcription factor is regulated by temperature
Mechanism:
◊ At low temperature the ribosome binding site (SD) is found in a stem structure
and therefore its non-functional
◊ At higher temperature this stem is unstable and thus translation can occur
- This switch occurs at 37’C (ie when the bacteria enters its human host) and the transcription factor whose
expression is activated in turn activates the expression of a number of genes required for growth of the
bacteria in the human host
Biochem Page 37
Lecture 7 Section 2 (Alternative splicing)
November 25, 2020 7:04 PM
Splicing basics
- Steps of removing an intron:
1. 2’-OH group of adenosine nucleotide in middle of intron attacks the phosphodiester bond between the 1st exon and G
residue at the beginning of intron
Results in the formation of the lariat and separating the first exon from intron
2. 3’-OH left end of 1st exon attacks phosphodiester bond linking the intron to 2nd exon
Results in the formation of the exon-exon phosphodiester bond and the release of the intron in the lariat form
- Precision is key to ensure the mRNA will generate the correct protein
Alternative Splicing
- Many genes exhibit alternative splicing - the process whereby the same pre-mRNA can give rise to more than one mRNA through changes
in the pattern of splicing
- As can been seen in this diagram there are several ways that splicing can differ from one transcript to another
○ At the mRNA level inserting or deleting cis-acting sequences that control processes such as mRNA stability, translation and the localization of
transcript
- these signals often residue in the transcripts 5’ and 3‘UTRs
- these changes do not have to be accompanied by changes in the sequence of the protein
○ Changes in the mRNA that lead to insertion or deletion of portions of the protein
- can have subtle or profound effects on protein function including complete loss of protein function
- alternative splicing is a much more versatile control mechanism than transcriptional control where in general the only regulation that can be achieved
is to affect changes in the amount of the gene product
- ~95% of human genes show evidence that they are alternatively spliced
○ This level of alternative splicing is thought to help explain the apparent discrepancy between the number of genes in some complex multi-
celled organism (ie humans) and some simpler single cell organisms such as yeast
- The human genome encodes ~25,000 genes while the genome of budding yeast encodes ~6,000 genes
○ since people assume humans are more than 4 times more complex than a yeast cell some have suggested that the high level of alternative
splicing in humans plays a big role in generating this higher level of complexity
Biochem Page 38
○ 12 different versions of exon 4 (red)
○ 48 versions of exon 6 (blue)
○ 33 versions of exon 9 (green)
○ 2 versions of exon 17 (yellow)
- Therefore: there are 38,016 (12x48x33x2) versions of the mature mRNA possible
○ This represent more genes than are actually encoded within the Drosophila genome (~14,500)
Splicing basics
- Three signals in the pre-mRNA are involved in splicing
○ the 5’ splice site, the 3’ splice site, and the branchpoint
○ note that these signals are somewhat degenerate, especially the branchpoint and 3’ splice site
- These signals are recognized by small nuclear ribonucleoproteins (snRNPs, U1, U2, U5 and U6)
○ snRNPs - complexes of RNA and proteins
○ recognition of the splicing signals occurs through base pairing interactions between the small nuclear RNAs (snRNAs) within the snRNPs and
the pre-mRNA
Exception: recognition of the part of the 3’ splice site by the U2AF which consists of two proteins
– a 65kDa subunit which binds the polypyrimidine tract (Yn) and
– a 35kDa subunit which binds the AG sequence in the 3’ splice site
- One of the best characterized examples of alternative splicing comes from Drosophila where a cascade of differential splicing plays an important role
in sex determination
◊ Note : exon 2 in the early transcript is the same as exon 4 on the late transcript
Both encode for the protein RNA binding domain
◊ In males (late sxl protein)
translation starts in exon 2 (red arrow) and stops in exon 3 (red line)
Therefore this protein lacks its RNA binding domain and is therefore not functional
◊ In females
early Sxl protein blocks splicing of exon 2 to exon 3 which excludes exon 3 with a pre-mature stop codon
resulting in an mRNA that encodes functional late Sxl protein which also blocks splicing of exon 2 to exon 3
Therefore: late Sxl protein autoregulates its own expression ensuring continuous expression of functional Sxl
protein in females after PE is turned off
- Tra mRNA
○ In females - tra mRNA makes tra protein (includes only exon 1 and 3)
○ In males - a stop codon in exon 2 results in a non functional protein (includes exons 1, 2, 3)
Biochem Page 39
Experiments: How does Sxl regulate tra splicing?
- To investigate this, researches developed an in vitro splicing assay that recapitulated Sxl lethal mediated regulation of tra splicing
- After incubating the reaction they assayed the level of splicing using a primer extension assay
- Other work suggested that Sxl acts through binding to the poly-pyrimidine tract (Y n) located close to the male 3’ splice site
◊ Since U2AF also binds here this suggested that Sxl might be regulating U2AF function
◊ To explore this further researchers assessed the the affinity of Sxl and U2AF for the 3’ splice site region of both the
male and the female splice events using a RNA gel shift assay which works the same way as a DNA gel shift
Gel shift assays are used to assay the ability of proteins to bind DNA or RNA
In a gel shift assay you use a radiolabeled DNA or RNA probe and mix it with a protein of interest
You then run the mixture on a polyacrylamide gel and you will see that the binding of the protein to the DNA will reduce the
mobility of the DNA in the gel giving you a shifted band
Sxl protein, purified from E. coli, bound to the male 3’ splice site
- These experiments also allow you assess the stability of this interaction by measuring an apparent K d
○ Kd is defined as the concentration of protein where 50% of the probe has shifted
- Result:
○ Sxl had an apparent Kd of 10-9M for the male 3’ splice site
○ Sxl did not bind at all to the female 3’ splice site
Biochem Page 40
○ U2AF had an apparent Kd of 10-8M for the male 3’ splice site
○ U2AF had an apparent Kd of 10-6M for the female 3’ splice site
○ Based on these results you would rank the affinities of these sites from highest to lowest as: Sxl (M3’SS) > U2AF (M3’SS) > U2AF (F3’SS)
- So given that both Sxl and U2AF bind the male 3‘ SS the next question is:
○ do they compete for binding to the same site or put another way would one protein block the ability of the other to bind to the male 3‘ SS?
How does Sxl regulate tra splicing: competition between Sxl and U2AF
- To test this researchers made use of a UV-crosslinking assay
- Result:
Overall model
- Based on these data one can propose a model for the regulation of tra splicing
◊ In males where functional Sxl is not expressed U2AF binds preferentially to the male 3’ splice site as it has a higher
affinity for this region than it does for the female 3’ splice site and thus splicing occurs at that site
◊ In females functional Sxl protein binds to the male 3’ splice site and blocks binding of U2AF as Sxl has a higher affinity
for this site than does U2AF
◊ Thus, U2AF will bind to the female 3’ splice site so splicing occurs at this site
- This model also explains why addition of GST-Sxl to the in vitro splicing reaction reduced the overall efficiency of splicing
○ Recall that Sxl forces U2AF to bind to the female splice site and that U2AF has a lower affinity for the female splice site compared to the male
splice site
○ This reduce affinity means that splicing at this site is in general going to be less efficient, thereby explaining the presence of unspliced pre-
mRNA with Sxl is added
◊ In females a complex of 3 proteins tra2, tra and RBP1 binds to this element thereby recruiting the splicing machinery to
the weak exon 4 3‘ splice site
this only happens in females as functional tra is required for the ESEs to work
◊ Other RNAs carry exonic splicing enhancers, exonic splicing silencers and intronic splicing enhancers and silencers that
bind to a variety of trans-acting factors that control splicing
Biochem Page 41
Lecture 8 Section 2 ( Subcellular localization of mRNAs)
November 26, 2020 1:16 PM
□ Virtually all mRNAs are transported from the nucleus to the cytoplasm
□ Once in the cytoplasm some mRNAs are localized to various locations within the cytoplasm
□ Purpose: Localization of an mRNA will serve to localize the encoded protein
□ While most of you are used to thinking about protein localization mediated via sequences within proteins a genome wide study of mRNA localization have
suggested that mRNA localization likely plays a major role in the localization of many proteins
Example 1: Xenopus oocyte - VgI mRNA (red) found in vegetable pole of the cell
- Critical components of the mRNA localization machinery include cis-elements within target mRNAs that function as binding sites for RNA-binding proteins
- During these divisions the mother cell undergoes some specific DNA rearrangements that don’t happen in the daughter cell
○ These rearrangements allow the mother cell to undergo a process known as mating type switching
- Suppressing these rearrangements in the daughter cell involves the localization of ASH1 mRNA to the daughter cell
○ specifically to the distal tip
- This mechanism of Ash1p protein localization was uncovered in work aimed at understanding the mating type switching process
- A genetic screen for factors involved in mating type switching was conducted
○ The ASH1 gene came out of this screen and it was shown that accumulation of this protein in the daughter cell nucleus was required to preventing switching in the
daughter cell
○ Other factors, including Myo4p, She2p, and She3p, were identified in the screen and were found to be required for the accumulation of the Ash1p protein to the
daughter cell nucleus
- Subsequent experiments showed that ASH1 mRNA also localized to the daughter cell and this localization also required Myo4p, She2p, and She3p
- So based on these data it was proposed that Ash1p protein localizes to the daughter cell through localization of its mRNA
- The fact that Myo4p was required for ASH1 mRNA localization began to suggest a mechanism
- So this suggested that ASH1 mRNA localization would require the actin cytoskeleton
○ Researchers confirmed this by showing that latrunculin A, a drug that causes the depolymerization of the actin cytoskeleton, disrupted ASH1 mRNA localization
○ They also showed that ASH1 mRNA was not localized in an actin mutant
- So taken together these data suggest that ASH1 mRNA is transported to the daughter cell along actin cable by the motor protein Myo4p
○ So how does Myo4p interact with ASH1 mRNA?
Mapping the cis-acting element in the ASH1 mRNA that function in mRNA localization
- Researchers assumed there must be a cis-acting element within the ASH1 mRNA that was responsible for localizing the transcript
- To search for this element they fused various pieces of the ASH1 mRNA to an mRNA that doesn’t localize to the daughter cell and then asked which of these hybrid mRNAs
were localized
○ The goal was to identify the smallest fragments that would function in this assay
- They identified 4 elements: E1, E2A, E2B and E3 that could direct the localization of an mRNA to the daughter cell
Biochem Page 42
were localized
○ The goal was to identify the smallest fragments that would function in this assay
- They identified 4 elements: E1, E2A, E2B and E3 that could direct the localization of an mRNA to the daughter cell
So presumably these localization signals must be responsible for recruiting Myo4p to ASH1 mRNA
They are redundant to each other b/c each element was capable of localizing the mRNA on its own
- For example, in one series of experiments She2p was purified from E. coli as a GST fusion protein and using a UV-crosslinking assay it was shown to interact with the E3
localization element
- To assess the specificity of these interactions competition experiments were performed which involves the inclusion of large amount of various unlabeled RNAs
○ In lanes 2-4 they added increasing amounts of an unrelated RNA
○ In lanes 6-8 they added increasing amounts of unlabeled E3 RNA
○ Control:
▪ Contaminating protein which is represented by upper band n the gel
○ Results:
As you add more E3, She2p signal is getting weaker
– Indicates that She2p is binding to the unlabelled E3 RNA
As you add more pGEM, RNA has no effect on She2p binding
– Indicating it doesn’t bind to She2p
Conclusion: She2p appears to bind RNA in a sequence specific matter
▪ The contaminating protein is competed by both unlabeled RNAs suggesting that it binds RNA non-specifically
□ The amount of protein bound reduces in similar pattern for both unlabelled RNAs
□ Therefore: protein is unable to distinguish between 2 RNAs
- Using competition experiments researchers tested if She2p could interact with the other localization signals
◊ Excess unlabelled RNA corresponding to the E,1 E2A E2B or E3 elements was included in a crosslinking assay again using labeled E3 RNA as the
probe
◊ Therefore: E3 competes with E1, E2A, E2B, and E3 in lanes 3-6
◊ Note that all 3 elements compete but to different extends suggesting that each element has a different affinity for She2p
- In another series of experiments it was found that She3p is required for ASH1 mRNA localization
○ It binds to both She2p and Myo4p
- To do these experiments both Myo4p and She2p were purified from E. coli as GST fusions
○ This purification exploits the fact that the GST fusion protein interacts with a small molecule called glutathione
○ Thus GST fusions proteins are captured on beads that carry covalently coupled glutathione and eluted from the resin by the addition of excess soluble glutathione
○ This also allows you to create resins that allow you to ask if two protein interact with one another
- In this particular experiments GST tagged Myo4p and GST tagged She2p were captured on glutathione beads and radioactively labeled
- Results:
○ both Myo4p beads and She2p beads captured She3p protein while the GST alone beads did not
○ additional experiments showed that different parts of She3p interact with She2p and Myo4p
- This suggests that She3 can bind to both proteins and mediate interaction between them
- This model makes the predication that if She3p could recognize an mRNA on its own it would localize that mRNA in the absence of She2p
Biochem Page 43
- This model makes the predication that if She3p could recognize an mRNA on its own it would localize that mRNA in the absence of She2p
- Results
○ When the She3p:MS2 fusion was expressed in cells it directed the localization of the mRNA to the daughter cell and this localization was independent of She2p
○ These results highlight that localization of mRNAs using the cytoskeleton involves 3 basic components
▪ a protein to bind the RNA
▪ a motor protein to transport the mRNA on the cytoskeleton and
▪ adaptor protein that links the RNA binding 12 protein to the motor protein
- The idea is that the GFP/MS2 fusion protein will bind to the RNA carrying the MS2 binding sites allowing you to visualize the RNA as it moves in a living cell
- Imaging of cells expressing these constructs showed in general one spot and several controls were done to show that this spot represented the reporter mRNA
- This film shows that in wild-type cells this mRNA is transported into the daughter cell but the path it takes is not very direct
○ This is likely because the actin fibers are not single long chains that go from the mother to the daughter cell
○ Instead the fibers are shorter and instead of all pointing in exactly the same direction, they are just generally oriented in the same direction
○ Thus the mRNA eventually makes it into the daughter cell but the route taken is not a straight line
- Note that:
○ in the she1 mutant (she1 is another name for myo4) the particle still forms but it doesn’t get transported to the daughter cell
○ the RNA in the wild-type cells is not anchored to the distal tip of the daughter cell
▪ This contrasts the behaviour of a another mRNA carrying the ASH1 open reading frame which did become anchored
- These results thus suggest that transport and anchoring appear to function through somewhat different mechanisms
Biochem Page 44
Lecture 9 Section 2 (mRNA stability)
November 26, 2020 4:49 PM
i) Mechanism 1:
Generic mRNA undergoes deadenylation
mRNA is then subjected to Decapping
– Decapping - removal of 5' cap
Exonucleolytic decay of the RNA in 5' to 3' direction
ii) Mechanism 2:
Generic mRNA undergoes deadenylation
Exonucleolytic decay of the RNA in 3' to 5' direction
No decapping
iii) Mechanism 3: Deadenylation-independent decapping
Generic mRNA undergoes decapping
Exonucleolytic decay of the RNA in 5' to 3' direction
No deadenylation
iv) Mechanism 4: endonuclease cleavage
Cleavage somewhere in the body of the generic mRNA
Generates 2 pieces
– 1 with free 3' end - Exonucleolytic decay of the RNA in 3' to 5' direction
– 1 with free 5' end - Exonucleolytic decay of the RNA in 5' to 3' direction
- The key consideration here is that the 5’ cap and the 3’ poly(A) tail protect the mRNA from degradation and as such each decay pathway
must somehow over come the protection that these two features of the mRNA provide
- In mammals iron is transported through the blood bound to a glycoprotein known as transferrin
○ when cells need iron they transport transferrin bound to iron into cells via the transferrin receptor (TfR)
- Overall mechanism:
○ When intercellular iron levels are high - cells down regulate the expression of TfR on the cell surface
○ When intercellular iron levels are low - cells up regulate TfR on the cell surface
- To begin to figure out how this regulation is accomplished initial experiments asked if TfR transcription was regulated by iron levels
using a nuclear run-on assay
- Using this approach it was found that TfR transcription was not regulated by iron
- Next researchers disrupted cells and separated them into a cytoplasmic fraction and a nuclear fraction and then assayed levels of TfR
mRNA via Northern blot
Northern blot
◊ Mechanism (initial)
Denature RNA and separate on gel
Biochem Page 45
Northern blot
◊ Mechanism (initial)
Denature RNA and separate on gel
– For big fragments use agarose gel
– For small fragments use acrylamide
Denaturation of the RNA disrupts any secondary structure in the RNA which would otherwise cause
the RNA to run aberrantly
Denaturation therefore ensures the RNA’s mobility in the gel will be a true reflection of its size
- Mechanism:
□ transcription is inhibited
□ If you, then check the RNA levels as a function of time
If RNA is stable - RNA levels will persist after addition of the transcriptional inhibitor
If RNA is unstable - RNA levels will decrease with time
- Suggest that changes in levels of transferrin receptors at the surface of the cell +/- iron might be controlled by stability of mRNA of
transferrin
- Initial efforts:
○ deleted and/or replaced different parts of the TfR gene and found that the 3’ UTR was required for iron regulation
- In addition:
○ swapping the TfR promoter for another promoter had no effect on iron regulation
○ consistent with regulation not occurring at the level of transcription
Additional mapping experiments identified a ~700 nucleotide region of the 3‘UTR that was sufficient for iron
regulation
◊ this region that carried 5 stem/loops
◊ These step-loops were similar to a stem/loop in another mRNA (ferritin heavy chain mRNA) that was also
regulated by iron
This stem/loop within ferritin mRNA was shown to be responsible for iron dependent regulation of ferritin
translation and it was designated an iron response element or IRE
Biochem Page 46
Other work has identified an RNA binding protein
○ the iron response element binding protein or IRE-BP for short
○ This IRE-BP could bound to the ferritin IRE
- Using a gel shift and a competition assay it was shown that the same protein that binds to the ferritin IRE also binds the stem/loops in
the TfR 3‘UTR
- Conclusion:
○ Taken together these data would suggest that in low iron
▪ IRE-BP will be bound to the TfR 3‘UTR
▪ this binding would stabilize the mRNA
○ Or put another way the TfR mRNA is normally unstable and this instability is reversed by IRE-BP binding to the 3‘UTR
○ Conclusion: The ability to make such mutants in consistent with the model
- A clue to the mechanism of TfR degradation came from Northern blot analysis of TfR mRNA levels after treating cells in high iron
conditions
○ In this case the signal was detected by exposing the blot to Xray film
- Results:
The short exposure showed the full length TfR mRNA going away with time consistent with its degradation
under these conditions
A long exposure showed a less abundant smaller mRNA that isn’t initially present but becomes detectable
sometime after treatment with high iron levels
Biochem Page 47
length
◊ Run RNA on polyacrylamide gel and perform northern analysis using 3'end probe
Allows to measure the fragments length and the length of poly A tail
- This assay can be done on total RNA harvested from cells and since you use a probe specific to your mRNA of interest you will only
detect that RNA on your northern blot
- The size of the resulting fragment on the Northern blot will be an indication of the length of the poly(A) tail
- Can generate a marker for a completely deadenylated mRNA by including an DNA oligo consisting of a run of Ts
○ Oligonucleotide hybridizes to transcript's Poly A tail
○ Therefore: when treated with RNase H - will degrade Poly A tail and generate a marker for deadenylated mRNA
○ at the second time point you see a decrease in the signal intensity of the long tail version and an increase in the signal of the
short tail version
▪ This suggests conversion from one form to the other
▪ Also note in the deadenylase mutant cell that the mRNA has a uniformly long tail and is also stable providing a correlation
between the deadenylation of this transcript and its degradation
Mechanism:
- Sequence specific RNA binding proteins recruit deadenylase to target mRNAs to induce there degradation
Biochem Page 48