Professional Documents
Culture Documents
of proteins in particular.
That's the one that varies between the 20 different common amino acids
and proteins.
or negatively charged.
because there are four different groups coming off of the C-alpha carbon.
And the natural amino acids that are used by the ribosome
So if you take the hydrogen that's coming off of the C-alpha carbon,
and then you put the nitrogen on the left and the carboxylate on the right
To speak the language of proteins, it's important to first learn its alphabet.
because their polarity affects which noncovalent interactions they can form.
And these interactions are largely what gives proteins their shape.
over short distances, although they are present between any pair of atoms
in close proximity.
a salt bridge can provide more than 10 times the binding energy
But they only occur between polar groups with permanent dipoles.
Here you can see a hydrogen bond within the backbone of a protein,
In contrast, both the van der Waals interactions and salt bridge
So let's start with the two acidic amino acids, aspartate and glutamate.
They both have a carboxylic acid group at the end of their side chain.
And you might think that that's not very much-- that's not a big difference.
And that means that it might be better able to position itself exactly
its entropy.
So now let's turn to histidine, which is another interesting amino acid that's
one of them on the extreme left, and one on the right of the screen.
These two neutral states have different hydrogen bonding properties,
So most amino acids are formed of carbon, nitrogen, oxygen, and hydrogen.
And it's very similar in size and shape to leucine, shown here.
And those are conditions that are often found on the extracellular sides.
more on the reducing side, which means that the cysteines will
by the ribosome, the natural amino acids, are chiral and L-amino acids.
Well, there are actually two exceptions to this.
One is glycine.
means that it now has two hydrogens coming off of its alpha carbon.
This Venn diagram shows some of the many ways to classify amino acids.
It can also readily form hydrogen bonds, and therefore, it's also polar.
Good question.
But take a look at the structure again, and I'll give you a hint.
Now, aside from the charged amino group at the end of its side-chain,
So that means that lysine can sometimes act as nonpolar in certain situations.
So the sequence of amino acids that make up a protein
2. RACHELLE GAUDET: In this video, we'll move to the next level of structure
5. that puts constraints on the shapes that it can take in three dimensions.
10. But we'll first see how the constraints on the polypeptide
17. And in proteins, these amide bonds are called peptide bonds.
22. and the carbonyl carbon has angles corresponding to a trigonal geometry,
23. suggesting that the peptide bond is a double bond.
24. But the peptide bond, the bond between the carbonyl carbon
28. This unusual bond length is due to the partial double-bonded character
33. and the carbon and oxygen of the carbonyl all lie in the same plane.
34. A second important consequence is that the peptide bond cannot rotate freely,
36. So the peptide bond can be in only one of two conformations, cis or trans.
41. the two C alpha atoms are on opposite sides of the peptide bond.
42. Whereas in the cis conformation, they're on the same side of the peptide bond.
43. But as you see here, when they're on the same side of the peptide bond,
50. and that means that the steric clash is similar in both
51. the cis and trans conformations.
52. And since the trans confirmation is now as bad as the cis one,
53. it is much more likely for a cis peptide bond to occur before a proline.
54. And note that it is the peptide bond on the end terminal
57. But the other two dihedral angles of the backbone on either side of each C alpha
60. One named phi, between N and the alpha carbon, and the other named psi,
63. In proteins, N always comes before C, and P-H-I, or phi, comes before P-S-I,
64. or psi.
67. Omega is either zero or 180 degrees for cis or trans bonds respectively.
68. Let's see how the phi and psi angles are measured.
69. To measure the phi angle, you look down the length
70. of the nitrogen to C alpha bond from its most end terminal end.
71. And then you measure the angle between the two carbonyl carbons.
72. Similarly, to measure the psi angle, you look down the length of the C alpha
74. and then you measure the angle between the two nitrogens.
76. So this isoleucine has dihedral angles phi of minus 138 degrees
81. realized that not all dihedral angle values are allowed,
83. So using molecular models, he tested all possible values of dihedral angles.
84. And then he realized that there are only three regions that
85. contain allowed values of psi and phi angles for normal natural proteins.
86. And this plot that you now see is called the Ramachandran plot.
87. Let's take a closer look at how the plot was built.
90. The corresponding dihedral angles are indicated in the Ramachandran plot
94. Note how the allowed angles, here within the blue region of the plot,
96. And we can do the same process doing a full rotation around the psi bond.
97. Ramachandran looked at every single possible positions to build his plot.
98. We can see now again the three regions of allowable phi and psi angle values
100. Now, to form the two familiar regular secondary structures, alpha
105. in a row to lie within the phi and psi region shown here in green.
112. The railing that's on the outside should be in your right hand
114. Now let's look underneath this ribbon at the backbone atoms.
120. Side chains, which are illustrated here as spheres, will then point outwards
124. And that will lead to important patterns in the primary structure
126. And I've illustrated it here with an example from ATP synthase.
127. So on this helix you can see the yellow residues are the hydrophobic residues,
131. And that allows this particular helix to form a hydrophobic core
132. by packing against another helix, which I'm now showing here in blue.
138. Now, multiple residues in a series that fall within the blue region,
141. And then you need multiple beta strands to hydrogen bond
144. Each strand is illustrated by an arrow going from the N to the C terminus.
148. of the primary structure, and then they would be linked by a short loop.
149. But adjacent strands can also come from a whole other part
151. Again, let's look at the backbone structure behind these cartoons.
154. and carbonyl groups between adjacent strands is what forms a beta sheet.
156. see how the side chains extend perpendicularly to the plane containing
158. The side chains within one beta strand alternate above and below the plane.
162. could form a sheet with one hydrophobic side and one polar side.
163. STUDENT: What about the last region on the Ramachandran plot?
165. allowed, but it cannot be repeated for many residues, unlike the other ones.
167. But be careful, for loops, any allowable phi-psi angles are OK.
168. So you'll find loop residues in all three regions of the Ramachandran plot.
171. that all fall within the green region of the Ramachandran plot,
172. and it would be the same for the beta sheets in the blue region
174. So a residue that's in the alpha region, that doesn't mean it's in the helix.
178. Loops and turns that will connect adjacent secondary structure elements.
179. Our first example here is two strands of an anti-parallel beta sheet
188. Remember, we talked about two special amino acids-- proline and glycine,
190. glycine being very flexible, and proline being restricted, but being
191. able to form cis peptide bonds.
192. And so proline and glycine are often found within loop regions of proteins.
196. You can see that accordingly its residues are distributed in all three
199. can be arranged into defined elements-- the alpha helices, beta sheets,
202.
10. the van der Waals, salt bridges and hydrogen bonds.
11. And that's why tertiary and quaternary structure are often described together.
20. When we described how the secondary structure elements are arranged
23. Alcohol dehydrogenase is an enzyme that we'll discuss again in a future video.
25. as a cartoon, with the helices in green, the sheets in blue, loops in gray.
29. Like their names suggest, motifs recur in many different proteins
32. is the beta alpha beta motif, where two adjacent beta strands
34. Notice how this allows the formation of a parallel beta sheet.
35. One or more motifs can assemble to form a compact globular structure, a domain.
36. Here, the C terminal beta alpha beta motif, highlighted in yellow,
37. interacts with additional beta strands and several other helices
40. That is, an isolated domain can usually fold on its own
41. and maintain its tertiary structure independent of the rest of the protein.
42. Proteins often have multiple domains, which may then interact with each other
43. to form the functional unit.
44. In alcohol dehydrogenase, we can see how the domain we were examining,
47. The active sites of enzymes are often found in loop regions,
49. And these loop regions can be shaped both chemically and structurally
51. Often the active sites are in crevasses at the interface between two domains.
53. The active site of alcohol dehydrogenase lies between these two domains.
55. NADP and ethanol, bound at the active site in the interface.
59. with each one containing one active site at the interface of their two domains.
61. interacting anti-parallel beta strands to generate one large mixed beta
62. sheet from the two parallel beta sheets in each Rossmann fold domain.
69. OFF CAMERA SPEAKER: What's the difference between a domain and a fold?
80. are three different big types of folds-- the alpha folds,
82. Within alcohol dehydrogenase, we already saw an example of a mixed alpha beta
85. of beta strands in alpha helices, including several beta alpha beta
86. motifs.
100. associated with binding of the heme cofactor, shown here in orange.
105. But unlike myoglobin, hemoglobin is made of four separate polypeptide chains,
113. As we'll explore later in the course, the quaternary structure of hemoglobin
115. that are not possible from monomeric proteins like myoglobin.
119. has two identical subunits, which are highlighted here in red and yellow.
122. But more remarkably, proteins with highly divergent sequences often
123. still have the same structure, as we saw for hemoglobin and myoglobin.
128. Shown here are just a few examples of roughly 1,500 folds
130. This graph here shows the progress of structural biology over the last 40
131. years.
132. The number of folds that have been determined in using crystallography
134. has risen, but has actually plateaued in the last few years,
136. And this is in contrast to the all-time high number of protein structures
141. and specific side chains into how the protein functions.
148. OFF CAMERA SPEAKER: But what holds these folds together?
151.
1. Start of transcript. Skip to the end.
2. RACHELLE GAUDET: In this video, we're going to talk about the forces that drive
protein folding.
3. And we're going to see that the primary structure, or the sequence of a protein, will
dictate
11. system that's used by protein chemists to study the process of protein folding.
14. It's surrounded by water molecules and also a lot of other ions,
15. like sodium and chloride for example, that are illustrated here.
18. But these thousands of transient interactions are critical to the process of protein
folding.
19. Let's consider the forces that dictate how proteins fold.
20. And so now I've gone to a schematic diagram that illustrates an unfolded protein with
21. all the little colored spheres as the side chains of the amino acids.
23. it will have many, many different conformations that it can take.
26. Now once it folds, it will achieve one single native conformation.
27. And that greatly reduces the conformational entropy of the protein.
28. And it turns out that this is the major force against protein folding.
29. So that opposes protein folding because you have a loss in entropy.
32. And in all the different orientations, they can see other molecules
35. Now we can contrast this to a water molecule that's on the hydrophobic surface.
39. that will enable it to have the same number of hydrogen bonds.
43. we can see that upon folding, a lot of the hydrophobic side chains-- which
44. are the little yellow spheres in our diagram-- are buried in the hydrophobic core of
proteins
46. And that means that now a lot of the water molecules
47. that were interacting with those side chains, are now free to interact with other water
48. molecules.
59. And these residues are often important for interactions with other proteins
61. Another situation where we find a lot of hydrophobic residues also on the surface
65. Another reason that the hydrophobic effect is really important to protein folding is
68. will start contacting each other, they constrain the backbone
69. that surrounds them and will lead to other hydrophobic side chains now
70. being in close proximity and also interacting with each other.
72. And the hydrophobic core will zip up rapidly to hide all these hydrophobic side chains
76. How are these groups buried inside the hydrophobic core of proteins?
77. As we learned in earlier videos, hydrogen bonding within the main chain
78. stabilizes secondary structures like the alpha helices and beta sheets.
79. And these hydrogen bonding patterns also serve to balance those partial charges on
the polar
86. These non covalent interactions that form within a protein-- including
87. also the salt bridges and the van der Waals interactions--
88. are the final major contributor to the energetics of protein folding.
89. They contribute to a favorable and net decrease in enthalpy of the protein
90. as it folds.
91. So now that we know about the three major forces that drive protein
93. So I'm going to show you some estimates of the energy contributions from each
95. It turns out that the average size of a domain--the independently folding unit
97. So the hydrophobic effect due to the burial of the hydrophobic amino acids
98. into the core of proteins, will yield minus 140 kilojoules
101. The second favorable energetic component are the non covalent interactions,
102. which provide again about minus 150 kilojoules per mole of stabilization energy.
103. And then the forces that opposes these two is the huge reduction in entropy
104. going from many, many conformations accessible to the unfolded protein
107. energy, with about minus 70 kilojoules per mole of stabilization energy upon
folding of 100 amino acid protein.
111. But it turns out that they agree quite well with experimental results.
114. Still, you can see that the net effect is rather small because we have a few large
forces
118. But I mentioned earlier that there's one native conformation to proteins.
119. How do the proteins reach that one single native conformation?
120.
3. energetically favorable.
4. But we also know that the unfolded state can take on many, many conformations.
5. So it could be reasonable to assume that the folded state could also take
7. But we know that a native protein will have typically just one native, folded
confirmation.
8. So this raises another fundamental question--how do proteins achieve that one single
native fold?
12. And there's a story behind why he used RNase A. It turns out
14. had purified a whole kilogram of this protein, and was distributing small samples to
interested scientists.
17. So another reason that RNase A is very useful is that it's a nuclease, which means that
it's actually relatively easy to assay its activity.
18. So RNase A is a 124 residue protein that contains a mixture of alpha helices and beta
sheets,
24. So after measuring the baseline activity, Anfinsen added 8 molar urea to his sample.
26. And when he measured the activity in the presence of urea, he found 0% activity.
27. And that's because urea unfolds the protein into sort
30. And once the urea was completely removed, he measured the activity again, and found
32. So the activity had returned, suggesting that the protein had once again
33. refolded into its native conformation.
37. So when he added the 8 molar urea, he also added a reducing agent, called beta
mercaptoethanol.
38. So what beta mercaptoethanol does is that it reduces that disulfide bonds to release
the two free cysteines.
39. So now there's going to be eight free cysteines on each RNase A chain.
40. And then he performed the dialysis again, which removed both the urea and the
mercaptoethanol.
41. And in the oxidizing environment of the test tube that he had that was just open to the
43. So after all the urea was removed, he measured the activity again
44. and found about 90% activity, which meant that the protein had again
50. Again, he denatured the protein, and reduced all of its disulfide bonds.
51. But then, before dialyzing the protein, he added an oxidizing agent.
52. So this neutralized the mercaptoethanol and led to reforming of the disulfide bonds in
the denatured protein.
53. Then, when Anfinsen removed the urea by dialysis, he found that the protein was only
1% active.
55. He hypothesized that disulfide bonds formed at random within the protein
58. When the urea was removed, then the protein could no longer fold back into the native
61. Using some back of the envelope math, we can calculate that the correct combination
of
62. four disulfide bonds needed to form active RNase A would form, by chance, about 1%
of the time.
63. And that matches quite well the results that Anfinsen
67. So he took this RNase A that now had the mismatched disulfides-- or he
69. And then he added to this solution just a small amount of beta mercaptoethanol.
72. And so they can exchange to different cysteines, and there's a chance of the disulfide
bonds
75. I mean about 10 hours-- he found that he could regain a lot of the RNase
76. activity, so that the sample regained as much as 90% of the original activity.
77. And that suggests that the protein, with time, can again take on its native conformation,
80. the RNase A protein could not reach its native fold after the disulfide bonds were
allowed
81. to form at random in the unfolded peptide chain in the presence of urea.
82. However, when the urea and the reducing agent were removed at the same time,
84. And this suggested that the native fold of the protein
86. And then the correct disulfide bonds then formed, and stabilized the structure and
activity of the
87. RNase A. AUDIENCE: Can you explain how urea denatures proteins?
88. RACHELLE GAUDET: So the answer actually goes back to the previous video.
92. and that will decrease the impact of the hydrophobic effect we talked about,
95. the equilibrium of protein folding back towards the unfolded state.
97. From the results of his series of experiments, Anfinsen formulated an important
concept called
99. And the thermodynamic hypothesis states that the most stable
101. And that basically implies that folding is a spontaneous process in cells.
102. Another important lesson from his experiments is that the information contained
in the primary
105. So the protein doesn't need any templates or any other instructions,
109. are sufficiently sophisticated and precise, we could actually perhaps predict the
structure of proteins, just from knowing their sequence, or primary structure.
110. And actually, for some simple small proteins, that is the case.
113. And the structure was predicted using a powerful computational algorithm that
115. And after those predictions were finished, the structure was determined using x-
ray crystallography.
116. So here in gray is the experimental structure, superimposed onto the predicted
structure, in yellow.
117. And you can see that there's actually a pretty impressive agreement
118. between the predicted and experimental structures, because they look very
similar, and they superimpose nicely.
119. And that actually says that there's strong, independent support
121. because the fact that we can predict it means that we actually
124. we often illustrate it using a protein folding funnel, like the one shown here.
125. And the protein folding funnel shows the thermodynamic landscape-- so
128. And that sort of illustrates all the many different conformations
130. In some sense, you could illustrate that with a large, multi-dimensional space--
136. So then if you imagine that the protein starts at a high energy
137. level of that unfolded state, and then it will slowly travel down the funnel,
139. And the native structure will be at the minimum of that energy landscape,
145. on the way towards the native state, the most stable state.
146. The protein folding funnels can actually take on many different shapes
148. So each primary structure or sequence will have its own folding funnel.
149. So in this smooth funnel case, the protein will simply start in an unfolded state,
and
153. So that means that there are many conformations in the unfolded state that
have about the
156. before it reaches the point where the funnel suddenly drops.
157. And so the protein will then fold rapidly in its native state,
161. And that's indicated here by the shaded regions on this blue plot.
163. like the first funnel we discussed, which is shown here in yellow again.
168. As you might imagine, the shape of the funnel influences the kinetics of the
process, or
170. So now we know that protein structure is determined by the primary sequence.
171. And that's a good thing for cells, because life is complicated enough.
172. So because the protein sequence dictates the three dimensional shape,
174. almost every time, without any other templates or information coming
176. But those protein folding funnels that we just looked at also hint at another
important concept in protein folding,
178.
1. Start of transcript. Skip to the end.
3. active field of research. So in this video, we'll show you some of the basic
4. concepts that dictate some of the kinetics of protein folding just to get
5. you sense of how protein folding happens. Remember that we already have a tool to
6. visualize the process of protein folding, and that's the folding funnel. When we
7. look at the protein funnel, we think of the protein starting at a high energy
8. level when it's unfolded and then moving down in trajectory towards the minimum,
9. and the minimum is the native fold of the protein. But how does a protein
10. actually change its shape as it's undergoing this folding process? Remember
11. the hydrophobic effect? We talked about how the hydrophobic side chains will
12. bury themselves into the hydrophobic core of protein. Well this hydrophobic
13. collapse is actually often one of the first steps in the folding process. When
14. looking at these folding funnels, remember that the fraction of residues
15. in the native conformation increases as the proteins move towards its native
16. state, and at the same time the number of possible conformations decreases and
17. that's the funnel effect here, as the side chain becomes more and more
18. constrained. So let's dig deeper into these conformations as the protein is
19. undergoing the process of folding to better understand the kinetics. Remember
20. that these phi and psi bonds can rotate, but not quite freely all the way around
21. the 360 degrees of rotation. Some of the angles are disallowed because there are
22. some steric clashes, and we saw that the Ramachandran plot is a great way to
23. visualize the regions that are allowed, the regions that prevent a lot of the
24. steric clashes in proteins. And so we already know now that there's some
25. constraints on the conformations of proteins, but the protein still needs to
26. take on very precise dihedral angles to maintain its native fold, because just
27. one or two errant dihedrals can actually disrupt the fold
28. of the protein, and we can illustrate this with a simple example of an alpha
29. helix. Remember that a series of residues within the same region of the phi-psi
30. angles in the Ramachandran plot, the green region here, can generate an alpha
31. helix. So say if just two residues within this alpha helix change their dihedral
33. Well that would actually disrupt the whole helix, so therefore that tells us
34. that protein folding relies on very precise conformations of the backbone
35. through their phi-psi angles. So now let's get an idea of the magnitude of the
36. problem, the magnitude of the task that the protein has to go through to get to
37. its native fold. So we're going to do a thought experiment making some estimate
38. about the number of conformations that the protein might be able to undertake.
39. So we're going to take an example of a protein of 100 residues, so that's about
40. the average size of a protein domain. So now we know that it's phi-psi angles are
41. restricted, so that we can approximate that there's roughly three different
43. regions of the Ramachandran plot. So that would yield three to the 100
44. conformations available to the whole protein, assuming that each of these
45. dihedral angles can behave independently. Three to the 100 is actually an enormous
46. number, and to get an idea of the magnitude of this number we're going to
47. estimate that if the protein say samples each of these conformations for a very
49. yield 10 to the 27 years to sample every single conformation, and that's orders of
50. magnitudes longer than the lifetime of the universe. So that brings us to
51. a paradox, right, that the protein has no time to explore all the possible
52. conformations on the way to its native fold. This is called Levinthal's paradox
53. after Cyrus Levinthal, who's the first one to state this
54. concept in this paradox, and what he hypothesized is that this paradox means
55. that there must be some conformations that are just never explored. There must
56. be some paths that are more common as proteins start to fold. So how do proteins
57. then figure out which conformations might be the best ones to lead them
58. towards their native fold? That's where cooperativity comes into
59. play, because this cooperativity brings some residues in proximity, and so it
60. makes choices for the protein. So let's see a more specific example. Remember
61. when the hydrophobic core forms you might have at first a contact between
62. two residues, that's labeled c1 here, but that first contact will then bring other
63. residues in close proximity. So then the second contact that's labeled c2 here
64. now has a higher probability of forming. So that's the cooperativity being
65. illustrated for the hydrophobic effect that leads to the collapse of the
66. hydrophobic core, but this cooperativity is also important in forming the
67. secondary structure elements. So let's see this example now for a secondary
68. structure element, looking at the folding of a helix. So the largest kinetic
69. barrier for the folding of a alpha helix is the formation of the first hydrogen
70. bond, and that's called the nucleation event. It's analogous to when you
71. crystallize something, the first seeds of a crystal might take a long time, but
72. then the growth of the crystal is a lot faster. So here in an alpha helix, the
73. first hydrogen bond might take a long time to form, but once it's formed it
74. brings into proximity the other residues that want to form hydrogen bonds, and
75. then they will extend the helix by forming these hydrogen bonds on either
76. side of that first hydrogen bond to form the complete helix.
78. as well. So protein scientists who study folding have generated sort of two
80. In the first one, the secondary structure forms first, and then the secondary
81. structure elements will then form together the tertiary structure by
82. moving until they achieve the correct fold. And this is illustrated here as a
83. time course with the energy on the y axis, and you can see that first the
84. secondary structure elements are formed and then the full fold of the protein.
85. And this process or this model is called the diffusion collision model. In the
86. other model, the hydrophobic core of the protein that's indicated here in yellow
87. is what first collapses in a relatively random way, and once the core has
88. collapsed, this allows the secondary structure elements to form, because now
89. the residues are in close proximity, and the protein will adopt its native
90. conformation. And this second model is called the nucleation condensation
92. indicate that both models can operate in protein folding, and which model is
93. favored depends on the protein, depends on the primary sequence. So as with the
95. that our understanding of it is advancing more and more rapidly, and a
96. lot of that is due to computer simulations. Now one type of computer
97. simulations are molecular dynamic simulations which simulate the motion of
98. atoms, and it's powerful enough to simulate the folding of some of the
99. small proteins in the aqueous solution, because remember that the aqueous
100. solution is key to the folding process. So now we're going to look at a
101. simulation of the folding of a small protein, a 35 residue protein. It's the
102. headpiece domain of villin, which is a cytoskeletal protein, and this protein is
103. actually experimentally one of the classic models of protein folding and
104. it's one of the fastest folding proteins that's been determined experimentally. So
105. in the simulations, there are actually thousands of water molecules that are in
106. included in the aqueous solution, but they're not going to be shown here just
107. for clarity. And the simulation lasts in the real time just six microseconds,
108. that's how long it takes for this villin headpiece to fold. You can see
109. that the villin headpiece is a small fold that contains three helices, and
110. they're colored here from a blue N-terminus to a red C-terminus. So this
111. simulation starts with an extended peptide, and the peptide chain rapidly
112. collapses onto itself, and that's similar to the nucleation condensation model of
113. protein folding that we just saw. Then two helices form soon after, while that
114. middle helix takes a lot longer to form to yield the final fold. Remarkably the
116. the headpiece. But it's important to note that this is just one sample trajectory
117. that shows us one example of how the villin protein might fold, and actually
119. results, although there are themes that emerge. So protein folding kinetics are
120. quite complicated, but we've already reached a few take-home messages,
some
121. key ideas. And one of them is that there's going to be a limited number of
122. paths that the protein is actually going to take to reach its fold. That we've
123. come about through looking at the Levinthal paradox and how a lot of these
125. that the landscape of the energetics of protein folding is what's going to
126. dictate the path towards folding. And the other important idea is that the
128. will sort of dictate which steps to take along the way, one after the other, and so
129. that will ensure that the protein folds in a timely manner.
130.
1. transcript. Skip to the end.
2. RACHELLE GAUDET: So we've learned quite a bit about how proteins fold,
3. but we've done so under assumptions that are based on a homogeneous solution of
dilute
8. We often think that the cytosol is this big wide open space for the molecules
9. to float around.
10. But, in fact, it's actually jam-packed with molecules-- particularly proteins and RNA.
12. in a bacteria.
16. So the challenge here is that the protein has to fold onto itself
18. And that's in contrast to it might very well interact with other proteins or other molecules
21. Here's our model of the protein folding funnel, where the protein can achieve its native
fold--
23. And we've seen that this is true in dilute solutions-- for example,
25. But in the cytosol, there are many other proteins that are in close proximity to the
nascently
26. folding proteins-- the proteins that are just synthesized by the ribosomes.
27. So if the protein chain actually interacts with other proteins that are also nascent
30. And this potential problem is illustrated here by extending the conformational space
available to the protein in the
32. So we've added a whole other side to this graph showing the potential for aggregates.
33. The same energetic forces that dictate native protein folding
35. Namely, the benefit of burying hydrophobic residues through the hydrophobic effect.
36. So this suggests that in the cytosol there's a risk that proteins aggregate rather than
44. So you can see that aggregates can be really deleterious to cellular functions.
46. So cells can prevent the aggregation using proteins called chaperones.
47. And these chaperone proteins will actually aid protein folding.
48. But let's not assume that each protein has its own chaperone
52. But what chaperones can do is prevent the newly forming proteins
53. from aggregating with other proteins.
56. as they are being translated-- right as they're coming off the ribosomes.
59. And the polypeptide that is synthesized is at risk of forming some aggregates.
60. And that's especially true with large proteins because they have complicated folds
often.
62. between distant elements in the sequence in the primary structure-- for example,
63. between the N-terminus and the C-terminus-- the N-terminus is translated first.
69. So the protein will then be more likely to reach its proper fold.
74. But the key word here is transient because HSP70 also
76. If it was stably bound, it would effectively be just another kind of aggregate instead
78. So how does HSP70 interact only transiently with its substrates?
82. The substrate binding, in turn, will promote the exchange of ADP
84. And now HSP70 changes its conformation back to a state that is not permissive for
substrate binding.
86. The released protein then has another chance to fold on its own
88. This schematic here illustrates that HSP70 can exist in either a substrate-bound or a
90. And the structures of these two states have actually been determined.
91. So let's take a closer look at the structures to see how ATP hydrolysis and exchange
affect substrate affinity.
93. It has an ATPase domain, that's shown in blue, and a substrate-binding domain, that's
shown
95. Now with a transparent ATPase domain surface, you can see that the ATP is actually
quite
98. And upon ATP hydrolysis, the yellow lid actually moves into close contact with the rest
of
99. the substrate-binding domain-- the orange part of the protein here.
100. And that forms a substrate-binding pocket that sequesters some of the
substrate's hydrophobic surfaces.
101. So this is the part where HSP70 is sequestering the nascent protein.
102. Now if we return to the ATP-bound structure, notice how the exchange of ADP
for ATP moves
103. the lid domain away and abolishes this pocket.
104. It is remarkable how the state of the nucleotide affects very distal part of the
structure.
106. you can see just how massive that conformational change
107. is between the ADP-bound state, which is the solid surface here,
109. The movement of the lid domain is what creates a binding pocket that is crucial
for that
113. There's actually a whole class of heat shock proteins, and most of them
115. But where does the name heat shock protein come from?
116. It's because the expression of these heat shock proteins, or HSP proteins,
120. So the cells were heat shocked at time 0, and then the expression of HSP70
122. And you can see that, especially at two and four hours,
123. there's more HSP70-- there's a darker band--than there was at time 0.
126. But it's important to see that HSP70 is also present at pretty high levels in
unstressed
132. This plot shows the fraction of protein that's folded versus temperature for a
representative
139. So the cells will upregulate chaperone expression to prevent this aggregation
145. So we've already seen that under basal conditions, they're important to
maintain the homeostasis
151. and you'll see increased chaperone expression in these situations too.
153. When they're very active, when they're generating a lot of secreted proteins,
154. that can cause some protein folding stress.
159. Actually, mammalian genomes have on the order of 200 chaperone genes.
164. And this view here is a cutaway view that shows the inside of that cage
structure.
165. A whole protein can go inside this cage to fold into its native shape
169. And that's the hypothesis that the most thermodynamically stable state
171. And for most proteins that are folding in dilute solutions, this is true.
172. In the crowded environment of the cell, chaperones assist folding in several
ways.
173. One of them is that they help proteins-- as they fold-- overcoming the kinetic
traps that are
175. But even more important is that chaperones prevent the thermodynamically
favorable aggregates
177. And that's why many of the chaperone genes are actually essential genes.
182. from getting too tangled up before they've grown into full, mature adults.