Professional Documents
Culture Documents
Edited by Ian A. Wilson, The Scripps Research Institute, La Jolla, CA, and approved December 5, 2016 (received for review August 10, 2016)
Somatic mutations within the antibody variable domains are critical depend on somatic mutations to achieve high-affinity antigen
to the immense capacity of the immune repertoire. Here, via a deep binding, the approach is limited to the small set of mutations
mutational scan, we dissect how mutations at all positions of the present in a given antibody.
variable domains of a high-affinity anti-VEGF antibody G6.31 impact Here we took a systematic approach to assessing the muta-
its antigen-binding function. The resulting mutational landscape bility of the whole variable domain for maintaining or im-
demonstrates that large portions of antibody variable domain proving folding stability and antigen binding. We used a well-
positions are open to mutation, and that beneficial mutations can optimized anti-vascular endothelial cell growth factor (VEGF),
be found throughout the variable domains. We determine the role G6.31, with high affinity (Kd = 0.4 nM) and excellent thermo-
of one antigen-distal light chain position 83, demonstrating that stability [fragment antigen-binding (Fab) melting temperature
mutation at this site optimizes both antigen affinity and thermo-
(Tm) = 84 °C]. G6.31 derives its HC and LC variable domains
stability by modulating the interdomain conformational dynamics
from the frequently used human germline V families VH3 and
of the antigen-binding fragment. Furthermore, by analyzing a large
VK1, respectively, and exhibits the canonical variable domain
number of human antibody sequences and structures, we demon-
strate that somatic mutations occur frequently at position 83,
backbone structure (22–24), and thus is a good representative
with corresponding domain conformations observed for G6.31. of human antibodies.
Therefore, the modulation of interdomain dynamics represents an The present study had two aims: first, to map the mutational
important mechanism during antibody maturation in vivo. landscape of the two variable domains to delineate the potential
effects of all possible single mutations, and second, to identify
|
antibody affinity maturation | immunology | deep mutational scanning | novel molecular mechanisms of antigen distal framework muta-
conformational dynamics tions for improving antibody function. A high-affinity antibody
such as G6.31 likely has well-optimized CDRs, thus providing an
opportunity to identify beneficial antigen-distal mutations.
U nlike T-cell receptors, B-cell–derived antibodies accumulate
somatic mutations in the antigen-binding variable domains
of heavy chain (HC) and light chain (LC), in both the comple- Significance
mentarity-determining regions (CDRs) and the framework re-
gions (FWRs) that scaffold the CDRs (1, 2). Together with the On encountering antigens, antibody’s variable domains evolve
V(D)J gene recombination, somatic hypermutation (SHM) adds and mature through multiple rounds of somatic mutation. By
to the gene diversity of antibody variable domains (VH and VL) surveying the effects of all possible single mutations of an anti-
necessary for efficient immune recognition (3). body’s variable domains on folding stability and antigen binding,
Deciphering the molecular mechanism of SHMs in improving we showed that even for a high-affinity antibody such as the one
antibodies is critical for understanding the evolution of the im- used in our study, beneficial mutations can be found both near
mune repertoire. It is well recognized that mutations at the and far from the antigen-binding site. Furthermore, our muta-
CDRs or FWR structurally adjacent to the CDRs can modulate tional scan revealed an antigen-distal framework position work-
and optimize the antigen-binding interface (4–7), and several ing as a structural switch whereby a mutation can allow the
studies have used saturated mutagenesis of CDR position to variable domains to sample different conformational spaces and
explore the effect of various mutations on affinity and specificity thus potentially different binding functions. This understanding of
(8–11). the mechanism of somatic mutation in human antibodies illus-
The role of SHM in antigen-distal FWR is less well understood, trates how antibodies efficiently use somatic mutation for evolv-
however. Previous studies have suggested that the antigen-distal ing diversity in immune recognition.
FWR contributes primarily to the variable domain structural in-
tegrity; thus, mutations at these positions would be either detri- Author contributions: P.K., C.V.L., B.T.W., V.J., J.S., T.W.P., and G.F. designed research;
P.K., C.V.L., B.T.W., V.J., and T.W.P. performed research; P.K., C.V.L., B.T.W., V.J., J.S., T.W.P.,
mental to or neutral for antigen-binding function (12–14). Other and G.F. analyzed data; and P.K., B.T.W., and G.F. wrote the paper.
authors have characterized the potential of antigen-distal framework Conflict of interest statement: All authors are or were once paid employees of Genentech
mutation as beneficial for antigen-binding function. For example, Inc., a member of the Roche Group, and are inventors of a patent application based on
framework mutations can compensate for the destabilizing effect of the work described herein.
mutations at CDRs needed for antigen binding (15). In other cases, This article is a PNAS Direct Submission.
non-CDR SHMs have been shown to be required for the neutrali- 1
Present address: AbbVie Stemcentrx, South San Francisco, CA 94080.
zation activity of broadly neutralizing anti-HIV antibodies (16). 2
To whom correspondence may be addressed. Email: koenig.patrick@gmail.com or
Thus far, the roles of SHM have been studied primarily by ex- germainefuh@gmail.com.
amining the functional consequences of reverting the somatic
Downloaded by guest on November 7, 2019
3
Present address: Affinita Biotech, South San Francisco, CA 94080.
mutation of selected antibodies back to the germline sequences This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.
(15–21). Although these studies have demonstrated that antibodies 1073/pnas.1613231114/-/DCSupplemental.
COMPUTATIONAL BIOLOGY
BIOPHYSICS AND
Fig. 1. Mutational landscape of VH and VL. The change of the abundance of each mutation in the VH (A) and VL (B) during selection for stable expression
using anti-gD tag or for binding antigen VEGF (Fig. S1A) is calculated as log2ER and depicted in the heat maps with a scale of red (enriched) to blue (depleted).
All mutations were accounted for in the preselection libraries. Mutations in black are those that are absent after selection indicating strong depletion. Highly
Downloaded by guest on November 7, 2019
conserved positions of VH or VL are colored based on their structural role, including hydrophobic core (colored green), extended core (orange), VH–VL in-
terface (purple), or other (pink; hydrogen-bonded cluster) (Figs. S1–S4). CDR positions are boxed. Residues that contact VEGF (≤5 Å in distance based on the
G6/VEGF complex structure; PDB ID code 2FJG) are marked with arrowheads.
600
number of mutations
the fold as well as the binding affinity of the ligands in selection.
number of mutations
conserved
The ERs of all single mutations in VH library and VL library
400
framework
from panning with three different ligands (anti-gD, protein A, frame
400
work
and protein L) yielded similar and consistent results demon- VL
200
strating the validity of the approach (Figs. S1–S3). The ERs of VH 200
mutations in VH library from anti-gD and protein L panning
were closely correlated (r2 = 0.98). The ERs from the protein A
and protein L-panned VH were also correlated (r2 = 0.611) with
0
0
-4 0 4 -5 0 5
the differential enrichment attributed to the subset of VH posi- log2 enrichment ratio log2 enrichment ratio
tions binding to protein A (28). A similar pattern was seen for VEGF panning VEGF panning
the VL library panning data. C D
We chose the ER dataset from anti-gD panning to represent A53R
ER: 2.1 VEGF
folding selection, and used a heat map to display the patterns of +2°C D28R
4x T57E ER: 2.8
enrichment (red) or depletion (blue) for each mutation (Fig. 1 ER: 1.3 CDRs S50M
ER: 2.8
-2°C
+1°C 3x
and Figs. S1 and S2). Enrichment indicates that the mutation is Y58R 7x
conserved +2°C
ER: 2.5 framework 1.5x
likely beneficial for the selected function (stable expression), 0°C frame Y36G
10x
whereas depletion indicates unfavorable mutations. Overall, the work ER: -5.2
Y59T Q81S -14°C
vast majority of positions appear to be tolerant to either the full ER:6.4 ER:3.8
V2R
ER: 3.9
0.5x
+1°C +1°C
range of mutations or a set of conserved mutations, because 5x 3x 0°C
5x
L73G
ER: -7.1
Q89T
ER: 2.9
mutations at these sites do not exhibit overt depletion (Fig. 1 and N82aR
S7L
ER: 0.5
-11°C -4°C
A40E 2x 4x
Figs. S1, S2, and S4). ER: 3.9 ER: 5.4 VH +2°C
4x
+1°C +1°C F83A S12M VL
Consistent with what is known about the folding of the anti- 4x 5x ER: 3.4 ER: 3.1
body variable domain (7, 13, 14, 29, 30), we found a set of mainly +5°C +1°C
3x 2x
structurally buried framework positions resisting mutation (Fig.
S4). These “conserved” framework positions include the hydro- Fig. 2. Identification of VEGF affinity-improving mutations in G6.31. Shown
phobic core and VH–VL interface residues, as well as the resi- is the distribution of log2ERs obtained from VEGF panning of the VH library
dues involved in a hydrogen bond network at the bottom half of (A) and the VL library (B). Mutations that are depleted beyond the detection
limit are set to the maximum observed depletion. Mutations are color-coded
the variable domains. Mutations at some of the hydrogen bond
according to their location in the CDRs (gray), in the conserved framework as
network positions, like LC-R61 and LC-D82, have been shown to identified in Fig. 1 and Fig. S4 (yellow), or in the rest of the framework
induce aggregation and fibril formation in LCs (31). Further- (blue). (C and D) Using the same color scheme, selected mutations are
Downloaded by guest on November 7, 2019
more, a few CDR positions (e.g., R94, P100, D101 in the CDR- mapped on the G6/VEGF complex structure (PDB ID code 2FJG), and the
H3) also exhibited low tolerance to mutation, suggesting their change in Tm (°C) and the fold improvement in antigen-binding affinity (Kd)
role in overall stability. R94 and D101 are known to be important compared with G6.31 Fab are shown.
CH1 VL
CL determine whether the coupling of the side chain conformation
BIOPHYSICS AND
large Fab
F83
elbow angle of LC-83 with the Fab elbow conformation is a common feature
'In'
in human antibodies. The most frequently used human kappa
G6-L I106
germline families, IGKV1 and IGKV3 (IMGT database) (37),
CL carry phenylalanine at position 83, whereas the other, less fre-
DE-loop CL quently used germlines carry mostly valine (IGKV2 and IGKV4)
or alanine (IGKV5 and IGKV6). The isoleucine at position LC-106
Fig. 3. Association of G6 Fab elbow angles with LC-F83 conformation. is conserved in all five human IGKJ germline genes. We examined
(A) Fab elbow angles of the12 molecules present in the G6 Fab crystal structure
the conformation of LC-F83 of 319 Vkappa-containing human Fab
asymmetric unit (PDB ID code 2FJF). LCs of the molecules exhibiting a small
elbow angle (G6-S) are colored in green, and those with a large elbow angle
crystal structures in the PDB and divided these structures into two
(G6-L) are in red. HCs are in gray. (B and C) Detailed views of the VL–CL interface groups: LC-F83 in an “in” conformation (dihedral χ1 angle between
−50° and −100°) and LC-F83 in an “out” conformation (dihedral χ1
Downloaded by guest on November 7, 2019
and the side chain conformation of LC-F83 and LC-I106 in G6-S molecules
(green) or G6-L (red). The β-strand E, the helix α-1, and the loop connecting both angle mostly between 50° and 180°) (Fig. 5A). Indeed, the two
elements are removed for clarity. groups of LC-F83 rotamers correlate with changes in the Fab elbow
A B
C D
E F
Fig. 4. Mutations at LC-F83 regulate G6 conformation and domain dynamics. (A) Distribution of the F83 side chain conformation (χ1 angle) of G6-S (small
elbow angle) or G6-L (large elbow angle) of the G6free (PDB ID code 2FJF) during a 100-ns MD simulation, showing little transition. The χ1 angle of the G6-S
F83 was artificially split because it was plotted in a range of −180° to +180°. (B, D, and E) Data from the last 75 ns of the 100-ns MD simulations. (B) Dis-
tribution of the Fab elbow angle for the molecules G6-S, G6-S-83A, G6-L, and G6-L-83A. All samples are significantly different (P < 0.001) except G6-S and
G6-S-83A, as determined by ANOVA with Tukey’s honest significance difference (HSD) test. (C) VH–VL torsion angle for the six molecules of G6free structures
with F83 in an “out” conformation (orange) and for the six molecules with G6free with F83 in an “in” conformation (green), as well as the VH–VL torsion angle
of the VEGF-bound G6 structures (yellow). (D) VH–VL torsion angle distribution of the same molecules obtained from the same MD simulation. All samples
except G6-S-83A and G6-L-83A are significantly different (P < 0.001) by the HSD test. (E) Contour plot showing the overall Fab interdomain dynamics. The
elbow angle vs. the VH–VL torsion angle is plotted for selected time points of the MD simulation using a G6 structure with LC-F83 in an “in” conformation
Downloaded by guest on November 7, 2019
(G6-L; red) and the same structure with an LC-F83A mutation (G6-L-83A; green) as starting points. Two structures occupy distinct regions in the conforma-
tional landscape. (F) In HX-MS experiments comparing G6.31 F83A and G6.31 Fab, the positions that exchange faster (red) or slower (blue) are depicted on the
cartoon representation of the G6free Fab structure (PDB ID code 2FJF) (Fig. S6).
indeed influences the range of the elbow angle and, consequently, The results obtained by the MD simulation and HX-MS are in
BIOPHYSICS AND
the VL–CL interface area. good agreement, and thus both methods independently confirm
Therefore, our meta-analysis of human antibody structures, the impact of this antigen-distal mutation at LC-83 on Fab do-
along with the MD simulations, strongly suggest that the greater main orientation and dynamics.
thermostability of G6.31LC-F83A compared with G6.31 can be
explained by an increase in the VL–CL interface interaction area LC-83 and LC-106 Are Frequently Mutated in Human Antibodies. We
through the decrease in the Fab elbow angle, which likely further next investigated whether mutation at the position LC-83 occurs
stabilizes the fold of the four domains composing the Fab. frequently during affinity maturation in vivo among human an-
tibodies of the same germline family. We used two approaches to
Elbow Angle Dynamics of the G6 Fab Influence the Antigen-Binding examine the distribution of SHMs in LC sequences originating
Interface by Modulating the VH–VL Torsion Angle. Whereas the from IGKV1 (Vkappa1) germlines. In the first approach, we
change in Fab elbow angle explains the increase in thermostability of collated the SHMs of 4,623 unique human IGKV1 LC sequences
Downloaded by guest on November 7, 2019
the LC-F83A variant, the effect of the mutation on antigen-binding available in several publicly available databases (Fig. 6A, Upper).
affinity cannot be explained. We hypothesize that the LC-F83A In the second approach, we amplified IGKV1 sequences from
B C
Fig. 6. Somatic mutations in human antibody sequences in IGKV1 germline-derived antibodies. (A) Histogram showing the fraction somatically mutated for
the indicated VL positions of antibody sequences originating from IGKV1 germlines. The mutational distribution in the upper panel was obtained using
publicly available human antibody sequences from the GenBank, PDB, Kabat, Abysis, and IMGT databases (Public). The lower panel was obtained using SMRT
sequencing (PacBio) of cDNA obtained from more than 1,000 individual human lymphoid tissues. The high mutation rates at the N terminus of the public
sequences likely come from cloning artifacts. (B) The most common mutations at position LC-83 for IGKV1-39 sequences (Left) or at position LC-106 (Right)
Downloaded by guest on November 7, 2019
from the Public dataset or the PacBio dataset. IGKV1-39 carries a phenylalanine at position LC-83. Points are color-coded by the respective amino acid size as
noted. (C) The affinity (Kd) (Left) and Tm (Right) of selected mutations of G6.31 as measured by Biacore and DSF, respectively. Values represent the mean from
three replicates, and error bars show the SD.
positions in modulating antibody affinity and stability. The CDRs were used in all of the selections for folding and expression (anti-gD anti-
typically provide the main direct contacts with antigens, but—as body, protein A, and protein L). The VEGF selection data of the corrected
BIOPHYSICS AND
highlighted by our analysis—affinity also depends on framework segment from the second library were merged with the sequence data of
residues, even when the FWR positions are structurally far away the original library after calculation of ERs. To adjust for different se-
quencing depths, the dataset of the second VH library was adjusted by
from the antigen. Our investigation into the role of mutations at
obtaining a random sample of sequences of sizes matched to the dataset
LC position 83 has unraveled the complex relationship between a obtained from the first VH library (Table S1).
framework mutation and the conformational dynamics of a Fab
molecule, emphasizing the importance of conformational dynamics Antibody Expression and Characterization. VH and VL sequences of selected
in antibody–antigen interaction. variants were cloned into a mammalian Fab vector for expression and pu-
rification with a protein G column. Surface plasmon resonance measure-
Materials and Methods ments with a BIAcore T200 instrument was used for affinity determination as
Full Variable Domain NNK-Walk Library Design, Generation, and Phage Panning. described previously (24, 26). In brief, single cycle injection was used for
Downloaded by guest on November 7, 2019
VH residues 2–113 and VL residues 2–107 of G6.31 Fab in a phagemid con- kinetic measurement. VEGF (8–109) was coupled to Series S sensor chip CM5
struct, as described previously (26) (Fig. S1), were subjected to randomization (GE Healthcare) to achieve an Rmax value of ∼50 response units. Serial dilu-
by oligonucleotide-directed mutagenesis (52). Between 10 and 12 sequential tions of Fab in HBS-P buffer (0.01 M Hepes pH 7.4, 0.15 M NaCl, and 0.005%
1. Kim S, Davis M, Sinn E, Patten P, Hood L (1981) Antibody diversity: Somatic hyper- 13. Hsu HJ, et al. (2014) Antibody variable domain interface and framework sequence
mutation of rearranged VH genes. Cell 27(3 Pt 2):573–581. requirements for stability and function by high-throughput experiments. Structure
2. Neuberger MS, Milstein C (1995) Somatic hypermutation. Curr Opin Immunol 7(2): 22(1):22–34.
248–254. 14. Chang HJ, et al. (2014) Loop-sequence features and stability determinants in antibody
3. Tonegawa S (1983) Somatic generation of antibody diversity. Nature 302(5909): variable domains by high-throughput experiments. Structure 22(1):9–21.
575–581. 15. Wang F, et al. (2013) Somatic hypermutation maintains antibody thermodynamic
4. Foote J, Winter G (1992) Antibody framework residues affecting the conformation of stability during affinity maturation. Proc Natl Acad Sci USA 110(11):4261–4266.
the hypervariable loops. J Mol Biol 224(2):487–499. 16. Klein F, et al. (2013) Somatic mutations of the immunoglobulin framework are
5. Tomlinson IM, et al. (1996) The imprint of somatic hypermutation on the repertoire of generally required for broad and potent HIV-1 neutralization. Cell 153(1):126–138.
human germline V genes. J Mol Biol 256(5):813–817. 17. Wu X, et al.; NISC Comparative Sequencing Program (2011) Focused evolution of HIV-
6. Burkovitz A, Sela-Culang I, Ofran Y (2014) Large-scale analysis of somatic hyper- 1–neutralizing antibodies revealed by structures and deep sequencing. Science
mutations in antibodies reveals which structural regions, positions and amino acids 333(6049):1593–1602.
are modified to improve affinity. FEBS J 281(1):306–319. 18. Doria-Rose NA, et al.; NISC Comparative Sequencing Program (2014) Developmental
7. Padlan EA (1994) Anatomy of the antibody molecule. Mol Immunol 31(3):169–217. pathway for potent V1V2-directed HIV-neutralizing antibodies. Nature 509(7498):55–62.
8. Brummell DA, et al. (1993) Probing the combining site of an anti-carbohydrate antibody 19. Banfield MJ, King DJ, Mountain A, Brady RL (1997) VL:VH domain rotations in en-
by saturation-mutagenesis: Role of the heavy-chain CDR3 residues. Biochemistry 32(4): gineered antibodies: Crystal structures of the Fab fragments from two murine anti-
1180–1187. tumor antibodies and their engineered human constructs. Proteins 29(2):161–171.
9. Burks EA, Chen G, Georgiou G, Iverson BL (1997) In vitro scanning saturation muta- 20. Wedemayer GJ, Patten PA, Wang LH, Schultz PG, Stevens RC (1997) Structural insights
genesis of an antibody binding pocket. Proc Natl Acad Sci USA 94(2):412–417. into the evolution of an antibody combining site. Science 276(5319):1665–1669.
10. Chowdhury PS, Pastan I (1999) Improving antibody affinity by mimicking somatic 21. Xiao X, et al. (2009) Germline-like predecessors of broadly neutralizing antibodies lack
hypermutation in vitro. Nat Biotechnol 17(6):568–572. measurable binding to HIV-1 envelope glycoproteins: Implications for evasion of immune
11. Yang WP, et al. (1995) CDR walking mutagenesis for the affinity maturation of a responses and design of vaccine immunogens. Biochem Biophys Res Commun 390(3):
potent human anti–HIV-1 antibody into the picomolar range. J Mol Biol 254(3): 404–409.
Downloaded by guest on November 7, 2019
392–403. 22. Fuh G, et al. (2006) Structure-function studies of two synthetic anti-vascular endo-
12. Wagner SD, Milstein C, Neuberger MS (1995) Codon bias targets mutation. Nature thelial growth factor Fabs and comparison with the Avastin Fab. J Biol Chem 281(10):
376(6543):732. 6625–6631.
COMPUTATIONAL BIOLOGY
BIOPHYSICS AND
Downloaded by guest on November 7, 2019