Professional Documents
Culture Documents
Jesus V. Jorrin-Novo
Luis Valledor
Mari Angeles Castillejo
Maria-Dolores Rey Editors
Plant
Proteomics
Methods and Protocols
Third Edition
METHODS IN MOLECULAR BIOLOGY
Series Editor
John M. Walker
School of Life and Medical Sciences
University of Hertfordshire
Hatfield, Hertfordshire, UK
Third Edition
Edited by
Jesus V. Jorrin-Novo
Agroforestry and Plant Biochemistry, Proteomics and Systems Biology, Department of Biochemistry
and Molecular Biology, University of Cordoba, Cordoba, Spain
Luis Valledor
Department of Organisms and Systems Biology, Institute of Biotechnology of Asturias, University
of Oviedo, Oviedo, Asturias, Spain
Maria-Dolores Rey
Agroforestry and Plant Biochemistry, Proteomics and Systems Biology, Department of Biochemistry
and Molecular Biology, University of Cordoba, Cordoba, Spain
Editors
Jesus V. Jorrin-Novo Luis Valledor
Agroforestry and Plant Biochemistry Department of Organisms and Systems Biology
Proteomics and Systems Biology Institute of Biotechnology of Asturias
Department of Biochemistry University of Oviedo
and Molecular Biology Oviedo, Asturias, Spain
University of Cordoba
Cordoba, Spain Maria-Dolores Rey
Agroforestry and Plant Biochemistry
Mari Angeles Castillejo Proteomics and Systems Biology
Agroforestry and Plant Biochemistry Department of Biochemistry and Molecular Biology
Proteomics and Systems Biology University of Cordoba
Department of Biochemistry Cordoba, Spain
and Molecular Biology
University of Cordoba UCO-CeiA3
Cordoba, Cordoba, Spain
This Humana imprint is published by the registered company Springer Science+Business Media, LLC, part of Springer
Nature.
The registered company address is: 1 New York Plaza, New York, NY 10004, U.S.A.
Preface
You now have in your hands the third edition Plant Proteomics: Methods and Protocols,
preceded by the first edition in 2007 (M. Zivy, C. Damerval, and V. Mechin, eds.) and the
second one in 2014 (J. V. Jorrin Novo, S. Komatsu, W. Weckwerth, and S. Wienkoop, eds.).
The success of the previous editions and the continuous advances and improvements in
proteomic techniques, equipment, and bioinformatics tools, and their uses in basic and
translational plant biology research that has occurred in the past 5 years encouraged Humana
Press to prepare a new updated version. Under the title Advances in Proteomics Techniques,
Data Validation, and Integration with Other Classic and -Omics Approaches in the Systems
Biology Direction, it contains 29 chapters written by worldwide recognized scientists.
The monograph, which starts with an introductory chapter (Chapter 1), is a compilation
of protocols commonly employed in plant biology research. They show recent advances at all
workflow stages, starting from the laboratory (tissue and cell fractionation, protein extrac-
tion, depletion, purification, separation, MS analysis, quantification) and ending on the
computer (algorithms for protein identification and quantification, bioinformatics tools for
data analysis, databases and repositories).
Out of the 29 chapters, 6 are devoted to descriptive proteomics, with a special emphasis
on subcellular protein profiling (Chapters 5–10), 6 to PTMs (Chapter 11 and 14–18), 3 to
protein interactions (Chapters 19–21), and 2 to specific proteins, peroxidases (Chapter 24)
and proteases and proteases inhibitors (Chapter 26). The book reflects the new trajectory in
MS-based protein identification and quantification, moving from the classic gel-based
approaches to the most recent labeling (Chapters 10, 11, 29), shotgun (Chapters 5, 7,
12, 15), parallel reaction monitoring (Chapter 16), and targeted data acquisition
(Chapter 13). MS-imaging (Chapter 25), the only in vivo MS-based proteomics strategy,
is far from being fully optimized and exploited in plant biology research. A confident protein
identification and quantitation, especially in orphan species, and on low-abundant proteins,
is still a challenging topic (Chapters 4, 28).
This edition also gives a novel point of view to the proteomics approach with the
description of different protocols for proteomics data validation and integration with
other classic and -omics approaches in the systems biology direction. Chapter 2 reports on
multiple extractions in a single experiment of the different biomolecules, nucleic acids,
proteins, and metabolites. Chapter 27 describes how metabolic pathways can be recon-
structed from multiple -omics data, and Chapter 3 is on network building. Finally, Chapters
22 and 23 deal with, respectively, the search for allele-specific proteins and proteogenomics.
Keeping in mind the history and evolution of proteomics, it is quite probable that the
fourth edition will be published in few years, as we are still at the beginning of deciphering
the plant proteome to understand the central dogma of the molecular biology in terms of
proteins and to exploit the potential of the technique for translational purposes.
v
Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Contributors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
vii
viii Contents
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415
Contributors
NAGIB AHSAN • Division of Biology and Medicine, COBRE Center for Cancer Research
Development, Proteomics Core Facility, Rhode Island, USA Hospital, Providence, Brown
University, Providence, RI, USA; Division of Biology and Medicine, Brown University,
Providence, RI, USA
THUALFEQAR AL-MOHANNA • Department of Biochemistry, Molecular Biology, Entomology,
and Plant Pathology, Mississippi State University, Mississippi State, MS, USA
ANA ÁLVAREZ • Plant Physiology, Department of Organisms and Systems Biology and
University Institute of Biotechnology (IUBA), University of Oviedo, Oviedo, Spain
MAROUANE BASLAM • Department of Biochemistry, Faculty of Agriculture, Niigata
University, Niigata, Japan
NORBERT T. BOKROS • Department of Biochemistry, Molecular Biology, Entomology, and
Plant Pathology, Mississippi State University, Mississippi State, MS, USA
EGISTO BOSCHETTI • Scientific Consultant, JAM Conseil, Neuilly-sur-Seine, France
ROQUE BRU-MARTINEZ • Plant Proteomics and Functional Genomics Group, Department of
Agrochemistry and Biochemistry. Faculty of Sciences, University of Alicante, Alicante,
Spain
MARÍA JESÚS CAÑAL • Plant Physiology, Department of Organisms and Systems Biology and
University Institute of Biotechnology (IUBA), University of Oviedo, Oviedo, Spain
MARÍA CARBÓ • Plant Physiology, Department of Organisms and Systems Biology and
University Institute of Biotechnology (IUBA), University of Oviedo, Oviedo, Spain
SEBASTIEN CHRISTIAN CARPENTIER • SYBIOMA: Facility for Systems Biology-Based Mass
Spectrometry, KULeuven, Leuven, Belgium; Bioversity International, Genetic Resources,
Leuven, Belgium
MARI ANGELES CASTILLEJO • Agroforestry and Plant Biochemistry, Proteomics and Systems
Biology, Department of Biochemistry and Molecular Biology, University of Cordoba, UCO-
CeiA3, Cordoba, Spain
FRANCISCO COLINA • Plant Physiology, Department of Organisms and Systems Biology and
University Institute of Biotechnology (IUBA), University of Oviedo, Oviedo, Spain
LUIS ENRIQUE RODRÍGUEZ DE FRANCISCO • Laboratorio de Biologı́a, Instituto Tecnologico de
Santo Domingo, Santo Domingo, República Dominicana
JOSÉ M. ELIZALDE-CONTRERAS • Red de Estudios Moleculares Avanzados, Clúster Cientı́fico y
Tecnologico BioMimic®, Instituto de Ecologı́a A.C. (INECOL), Veracruz, Mexico
MÓNICA ESCANDÓN • Agroforestry and Plant Biochemistry, Proteomics and Systems Biology,
Department of Biochemistry and Molecular Biology, University of Cordoba, UCO-CeiA3,
Cordoba, Spain
YOEL ESTEVE-SÁNCHEZ • Plant Proteomics and Functional Genomics Group, Department of
Agrochemistry and Biochemistry. Faculty of Sciences, University of Alicante, Alicante,
Spain
IRIS FINKEMEIER • Plant Physiology, Institute of Plant Biology and Biotechnology, University
of Münster, Münster, Germany
MEGAN M. FORD • Department of Chemistry, University of North Carolina at Chapel Hill,
Chapel Hill, NC, USA
xi
xii Contributors
Abstract
The third edition of “Plant Proteomics Methods and Protocols,” with the title “Advances in Proteomics
Techniques, Data Validation, and Integration with Other Classic and -Omics Approaches in the Systems
Biology Direction,” was conceived as being based on the success of the previous editions, and the continuous
advances and improvements in proteomic techniques, equipment, and bioinformatics tools, and their uses
in basic and translational plant biology research that has occurred in the past 5 years (in round figures, of
around 22,000 publications referenced in WoS, 2000 were devoted to plants).
The monograph contains 29 chapters with detailed proteomics protocols commonly employed in plant
biology research. They present recent advances at all workflow stages, starting from the laboratory (tissue
and cell fractionation, protein extraction, depletion, purification, separation, MS analysis, quantification)
and ending on the computer (algorithms for protein identification and quantification, bioinformatics tools
for data analysis, databases and repositories). At the end of each chapter there are enough explanatory notes
and comments to make the protocols easily applicable to other biological systems and/or studies, discussing
limitations, artifacts, or pitfalls. For that reason, as with the previous editions, it would be especially useful
for beginners or novices.
Out of the 29 chapters, six are devoted to descriptive proteomics, with a special emphasis on subcellular
protein profiling (Chapters 5–10), six to PTMs (Chapters 11, and 14–18), three to protein interactions
(Chapters 19–21), and two to specific proteins, peroxidases (Chapter 24) and proteases and protease
inhibitors (Chapter 26). The book reflects the new trajectory in MS-based protein identification and
quantification, moving from the classic gel-based approaches to the most recent labeling (Chapters 10,
11, 29), shotgun (Chapters 5, 7, 12, 15), parallel reaction monitoring (Chapter 16), and targeted data
acquisition (Chapter 13). MS imaging (Chapter 25), the only in vivo MS-based proteomics strategy, is far
from being fully optimized and exploited in plant biology research. A confident protein identification and
quantitation, especially in orphan species, of low-abundance proteins, is still a challenging task (Chapters 4,
28).
What is really new is the use of different techniques for proteomics data validation and their integration
into other classic and -omics approaches in the systems biology direction. Chapter 2 reports on multiple
extractions in a single experiment of the different biomolecules, nucleic acids, proteins, and metabolites.
Chapter 27 describes how metabolic pathways can be reconstructed from multiple -omics data, and
Chapter 3 network building. Finally, Chapters 22 and 23 deal with, respectively, the search for allele-
specific proteins and proteogenomics.
Around 200 groups were, almost 1 year ago, invited to take part in this edition. Unfortunately, only 10%
of them kindly accepted. My gratitude to those who accepted our invitation but also to those who did not,
as all of them have contributed to the plant proteomics field. I will enlist, in this introductory chapter,
following my own judgment, some of the relevant papers published in the past 5 years, those that have
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_1, © Springer Science+Business Media, LLC, part of Springer Nature 2020
1
2 Jesus V. Jorrin-Novo
shown us how to enhance and exploit the potential of proteomics in plant biology research, without aiming
at giving a too exhaustive list.
Key words Omics approaches, Plant proteomics, Protein interactions, PTMs, Proteogenomics,
Quantitative proteomics, Shotgun proteomics, Systems biology, Targeted proteomics
1 Introduction
3 Proteomics Data Validation, and Integration into Other Classic and -Omics
Approaches in the Systems Biology Direction
References
1. Thiellement H, Zivy M, Damerval C et al (eds) 6. Picotti P, Bodenmiller B, Aebersold R (2013)
(2007) Plant proteomics methods and proto- Proteomics meets the scientific method. Nat
cols. Methods Mol Biol 355:1–8 Methods 10:24–27
2. Jorrin-Novo JV, Komatsu S, Weckwerth W et al 7. Gillet LC, Navarro P, Tate S et al (2012) Tar-
(2014) Plant proteomics methods and proto- geted data extraction of the MS/MS spectra
cols. In: Methods molecular biology, vol 1072, generated by data independent acquisition: a
2nd edn. Humana Press, Totowa new concept for consistent and accurate prote-
3. Jorrin Novo JV (2014) Plant proteomics meth- ome analysis. Mol Cell Proteomics 11:
ods and protocols. In: Novo J et al (eds) O111.016717
Chapter 1, plant proteomics methods and pro- 8. Jorrin-Novo JV, Komatsu S, Sanchez-Lucas R
tocols, Methods molecular biology, vol 1072, et al (2018) Gel electrophoresis-based plant
2nd edn. Humana Press, Totowa, pp 3–13 proteomics: past, present, and future. Happy
4. Rey MD, Valledor L, Castillejo MA et al 10th anniversary journal of proteomics. J Pro-
(2019) Recent advances in MS-based plant teome 198:1–10
proteomics: proteomics data validation 9. Luthria DL, Maria John KM, Marupaka R et al
through integration with other classic –omics (2018) Recent update on methodologies for
approaches. In: Progress in botany. Springer, extraction and analysis of soybean seed pro-
Berlin, Heidelberg teins. J Sci Food Agric 98:5572–5580
5. Neilson KA, Ali NA, Muralidharan S et al 10. Fesmire JD (2019) A brief review of other
(2011) Less label, more free: approaches in notable electrophoretic methods. Methods
label-free quantitative mass spectrometry. Pro- Mol Biol 1855:495–499
teomics 11:535–553 11. Minic Z, Dahms TES, Babu M (2018) Chro-
matographic separation strategies for precision
2019 Plant Proteomics Methods and Protocols 9
mass spectrometry to study protein-protein 25. Lankinen A, Abreha KB, Masini L et al (2018)
interactions and protein phosphorylation. J Plant immunity in natural populations and
Chromatogr B Analyt Technol Biomed Life agricultural fields: Low presence of
Sci 1102-1103:96–108 pathogenesis-related proteins in Solanum
12. Ankney JA, Muneer A, Chen X (2018) Relative leaves. PLoS One 13:e0207253
and absolute quantitation in mass 26. Ghatak A, Chaturvedi P, Weckwerth W (2017)
spectrometry-based proteomics. Annu Rev Cereal crop proteomics: systemic analysis of
Anal Chem 11:49–77 crop drought stress responses towards marker-
13. Eliuk S, Makarov A (2015) Evolution of Orbi- assisted selection breeding. Front Plant Sci
trap mass spectrometry instrumentation. Annu 8:757
Rev Anal Chem 8:61–80 27. Schaffer LV, Millikin RJ, Miller RM et al
14. Jung H, Winefield C, Bombarely A et al (2019) (2019) Identification and quantification of
Tools and strategies for long-read sequencing proteoforms by mass spectrometry. Proteomics
and de novo assembly of plant genomes. 19:SI 1800361
Trends Plant Sci 24(8):P700–P724. (in press) 28. Naryzhny S (2019) Inventory of proteoforms
15. Guerrero-Sanchez VM, Maldonado-Alconada- as a current challenge of proteomics: some
A, Amil-Ruiz et al (2019) Ion torrent and lllu- technical aspects. J Proteome 191:22–28
mina, two complementary RNA-seq platforms 29. Toby TK, Fornelli L, Kelleher NL (2016)
for constructing the holm oak (Quercus ilex) Progress in top-down proteomics and the anal-
transcriptome. PLoS One 14:e0210356 ysis of proteoforms. Annu Rev Anal Chem
16. Misra BB (2018) Updates on resources, soft- (Palo Alto, Calif) 9:499–519
ware tools, and databases for plant proteomics 30. Hashiguchi A, Komatsu S (2017) Postransla-
in 2016–2017. Electrophoresis 39:1543–1557 tional modifications and plant-environment
17. Subba P, Narayana Kotimoole C et al (2019) interaction. Methods Enzymol 586:97–113
Plant proteome databases and bioinformatic 31. Wu XL, Gong FP, Cao D et al (2016) Advances
tools: an expert review and comparative in crop proteomics: PTMs of proteins under
insights. OMICS 23:190–206 abiotic stress. Proteomics 16:847–865
18. Martens L, Vizcaı́no JA (2017) A golden age 32. Friso G, van Wijk KJ (2015) Posttranslational
for working with public proteomics data. protein modification in plant metabolism.
Trends Biochem Sci 42:333–341 Plant Physiol 3:1469–1487
19. Duncan O, Trosch J, Fenske R et al (2017) 33. Vu LD, Gevaert K, De Smet I (2018) Protein
Resource: mapping the Triticum aestivum pro- language: post-translational modifications talk-
teome. Plant J 89:601–616 ing to each other. Trends Plant Sci
20. Katam K, Jones KA, Sakata K (2015) Advances 12:1068–1080
in proteomics and bioinformatics in agriculture 34. Zhu XL, Yu FC, Yang Z et al (2016) In planta
research and crop improvement. J Proteomics chemical cross-linking and mass spectrometry
Bioinform 8:3 analysis of protein structure and interaction in
21. Hu J, Rampitsch C, Bykova NV (2015) Arabidopsis. Proteomics 16:1915–1927
Advances in plant proteomics toward improve- 35. Li GXH, Vogel C, Choi H (2018) PTMscape:
ment of crop productivity and stress resistance. an open source tool to predict generic post-
Front Plant Sci 6:209 translational modifications and map modifica-
22. Carrera DA, Oddsson S, Grossmann J et al tion crosstalk in protein domains and biological
(2018) Comparative proteomic analysis of processes. Mol Omics 14:197–209
plant acclimation to six different long-term 36. Willems P, Horne A, Van Parys T, et al (2019)
environmental changes. Plant Cell Physiol The Plant PTM Viewer, a central resource for
59:510–526 exploring plant protein modifications. Plant J
23. Schneider S, Harant D, Bachmann G et al doi: https://doi.org/10.1111/tpj.14345.
(2019) Subcellular phenotyping: using proteo- [Epub ahead of print]
mics to quantitatively link subcellular leaf pro- 37. Yao H, Wang X, Chen P et al (2018) Predicted
tein and organelle distribution analyses of Arabidopsis interactome resource and gene set
Pisum sativum cultivars. Front Plant Sci 10:638 linkage analysis: a transcriptomic analysis
24. de Lamo FJ, Constantin ME, Fresno DH et al resource. Plant Physiol 177:422–433
(2018) Xylem sap proteomics reveals distinct 38. Rödiger A, Baginsky S (2018) Tailored use of
differences between R gene- and endophyte- targeted proteomics in plant-specific applica-
mediated resistance against Fusarium wilt dis- tions. Front Plant Sci 9:1204
ease in tomato. Front Microbiol 9:2977 39. Chawade A, Alexandersson E, Bengtsson T
et al (2016) Targeted proteomics approach
10 Jesus V. Jorrin-Novo
for precision plant breeding. J Proteome Res 45. Corujo M, Pla M, van Dijk J et al (2019) Use of
15:638–646 omics analytical methods in the study of genet-
40. Jaffe J, Berg HC, Church GM (2004) Proteo- ically modified maize varieties tested in 90 days
genomic mapping as a complementary method feeding trials. Food Chem 292:359–371
to perform genome annotation. Proteomics 46. Ponnaiah M, Gilard F, Gakiere B et al (2019)
4:59–77 Regulatory actors and alternative routes for
41. Castellana NE, Payne SH, Shen Z (2008) Dis- Arabidopsis seed germination are revealed
covery and revision of Arabidopsis genes by using a pathway-based analysis of transcrip-
proteogenomics. Proc Natl Acad Sci U S A tomic datasets. Plant J 99:163–175
105:21034–21038 47. Pais MS (2019) Somatic embryogenesis induc-
42. Low TY, Mohtar MA, Ang MY et al (2019) tion in woody species: the future after omics
Connecting proteomics to next-generation data assessment. Front Plant Sci 10:240
sequencing: Proteogenomics and its current 48. Li T, Wang YH, Liu JX et al (2019) Advances in
applications in biology. Proteomics 19: genomic, transcriptomic, proteomic, and
e1800235 metabolomic approaches to study biotic stress
43. Hong WJ, Kim YJ, Chandran AKN et al (2019) in fruit crops. Crit Rev Biotechnol 39:680–692
Infrastructures of systems biology that facilitate 49. Proust H, Hartmann C, Crespi M et al (2018)
functional genomic study in rice. Rice 12:15 Root development in Medicago truncatula: les-
44. Xiong J, Yang Q, Kang J et al (2011) Simulta- sons from genetics to functional genomics.
neous isolation of DNA, RNA, and protein Methods Mol Biol 1822:205–239
from Medicago truncatula L. Electrophoresis
32:321–330
Chapter 2
Abstract
Microalgae are gaining attention in industry for their high value–added biomolecules and biomass produc-
tion and for studying fundamental processes in biology. The introduction of novel approaches for under-
standing and modeling molecular networks at different omic levels is paramount for increasing the
productivity of these organisms. However, the construction of these networks requires high quality datasets
with, if possible, perfectly overlapping datasets. The employ of different materials for different biomolecule
isolation protocols, even if they come from the same homogenate, is one of the commonest issues affecting
quality. Hence, a new method has been developed, allowing for the combined extraction of different levels
including total metabolites, or their pigments or lipid fractions along nucleic acids (DNA and RNA) and/or
proteins from the same sample reducing biological and time variation between levels data.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_2, © Springer Science+Business Media, LLC, part of Springer Nature 2020
11
12 Francisco Colina et al.
700 µL ddH2O
TUBE L TUBE NAP
1900 g, 5 min
TUBE S
Protein fractionation
Fig. 1 Workflow of microalgae metabolite, lipid, or pigment fraction extraction combined with nucleic acid
and/or protein extraction from the same sample
2 Materials
2.1 Cell Culture 1. Chlamydomonas reinhardtii CC-503 cw92 mt+ agg1+ nit1
Materials nit2 (available at the Chlamydomonas Culture Collection,
Duke University).
Multiple Biomolecule Isolation in Microalgae 13
2.2 Sampling and 1. 50 mL conical tubes, 1.5 mL tubes, 2 mL tubes, and 1.5 screw-
Extraction Materials cap tubes.
2. Refrigerated centrifuge.
3. Regimill/Fastprep (beads beating system).
4. Vortex.
5. Vacuum concentrator (speedvac).
6. Heat block.
7. Ultrasound sonicator.
8. Freezers (20 and 80 C).
3 Methods
3.3 Pigment Following steps must be at 4 C unless other conditions are speci-
Extraction Method fied. All materials used must be acetone resistant. Pigment extrac-
tion is not compatible with metabolite and lipid extractions.
1. Add 500 μL of PEB to tube S for pellet resuspension. Transfer
to the glass beads screw cap tubes (tube SB) (see Note 8).
2. Add 500 μL of PEB to tube S and be sure the pellet is
completely resuspended. Mix with previous PEB (step 1) in
the tube SB.
3. Vortex vigorously for 30 s or Regimill/Fastprep for 30 s.
Transfer to a new 1.5 mL tube (tube NAP).
4. Centrifuge for 5 min at 21,100 g. Transfer supernatant to a
new tube (tubePi). The pellet containing nucleic acids and
proteins (tube NAP) should be whitish-brownish (see
Note 13).
5. Read the absorbance of tube Pi (dilute Pi contents if necessary)
immediately, since the acetone is highly volatile.
Absorbance to be read: 470 nm, 537 nm, 647 nm, 663 nm.
Take the background-subtracted mean absorbance of the three
replicates (see Note 14).
16 Francisco Colina et al.
3.4 Lipid Extraction Lipid extraction is not compatible with pigment and metabolite
Method extractions.
1. Add 200 μL of LBE1 to cell pellet (tube S) and transfer to a
glass beads containing screw-cap tube (tube SB).
2. Homogenize using beads beating until total homogenization.
Weight a 1.5 mL tube (tube L).
3. Centrifuge at 14,000 g for 5 min at room temperature and
transfer supernatant to the tube L.
4. Repeat steps 1 and 2, mixing both fractions in the tube same L.
5. Reextract the pellet with 400 μL of LBE2 and vigorously
vortex for 3 min.
6. Centrifuge at 14,000 g for 5 min at room temperature and
transfer supernatant to tube L. The pellet contains proteins and
nucleic acids (tube NAP) (see Notes 10 and 16).
7. Dry tube L in a speedvac or oven.
8. Determine lipid weight gravimetrically.
3.5 Nucleic Acid The following steps must be carried out at 4 C, unless other
Purification Method conditions are specified.
1. Add 500 μL of WB1 to tube NAP and mix by vortex until the
pellet is mostly disaggregated (see Note 17). Centrifuge at
20,000 g for 10 min. Discard supernatant without disturbing
the pellet (see Note 18).
2. Resuspend the pellet in 400 μL of PSB and centrifuge at
14,000 g for 3 min.
3. Transfer supernatant to a new silica column (SC1) placed in a
nuclease- and protease-free 2 mL tube (see Note 18). Centri-
fuge at 10,000 g for 1 min.
4. Transfer the flow through to a new tube (tube RP) containing
RNA and proteins. Reserve the SC1 containing DNA for later
washing steps.
Multiple Biomolecule Isolation in Microalgae 17
8. Fill the tube B with methanol and disaggregate the pellet using
an ultrasound sonicator.
9. Centrifuge at 10,000 g for 10 min and discard the superna-
tant without disturbing the pellet.
10. Wash the pellet with 600 μL of PPWB. Mix until the pellet is
completely disaggregated (see Note 24).
11. Centrifuge at 10,000 g for 10 min and discard the superna-
tant without disturbing the pellet.
12. Air-dry pellets and redissolve in an adequate buffer (see Notes
25 and 26).
13. Resolubilize and quantify proteins (see Note 26).
Proceed with protein fractionation, digestion, desalting, and
concentration according to [12].
4 Notes
Acknowledgments
References
1. Sasso S, Stibor H, Mittag M et al (2018) From oxidation substrates and antioxidants by means
molecular manipulation of domesticated Chla- of Folin-Ciocalteu reagent. In: Methods in
mydomonas reinhardtii to survival in nature. enzymology. Academic Press, Cambridge, pp
elife 7:e39233 152–178
2. Valledor L, Escandón M, Meijón M et al 8. Gregor J, Maršálek B (2004) Freshwater phy-
(2014) A universal protocol for the combined toplankton quantification by chlorophyll a: a
isolation of metabolites, DNA, long RNAs, comparative study of in vitro, in vivo and in
small RNAs, and proteins from plants and situ methods. Water Res 38:517–522
microorganisms. Plant J 79:173–180 9. Chiu L, Ho S-H, Shimada R et al (2017) Rapid
3. Nakayasu ES, Nicora CD, Sims AC et al (2016) in vivo lipid/carbohydrate quantification of
MPLEx: a robust and universal protocol for single microalgal cell by Raman spectral imag-
single-sample integrative proteomic, metabo- ing to reveal salinity-induced starch-to-lipid
lomic, and lipidomic analyses. MSystems 1: shift. Biotechnol Biofuels 10:9
e00043–e00016 10. Strenkert D, Schmollinger S, Gallaher SD et al
4. Salem MA, Jüppner J, Bajdzienko K et al (2019) Multiomics resolution of molecular
(2016) Protocol: a fast, comprehensive and events during a day in the life of Chlamydomo-
reproducible one-step extraction method for nas. PNAS 116:2374–2383
the rapid preparation of polar and semi-polar 11. Sims DA, Gamon JA (2002) Relationships
metabolites, lipids, proteins, starch and cell between leaf pigment content and spectral
wall polymers from a single sample. Plant reflectance across a wide range of species, leaf
Methods 12:45 structures and developmental stages. Remote
5. Morschett H, Wiechert W, Oldiges M (2016) Sens Environ 81:337–354
Automation of a Nile red staining assay enables 12. Valledor L, Weckwerth W (2014) An improved
high throughput quantification of microalgal detergent-compatible gel-fractionation
lipid production. Microb Cell Factories 15:34 LC-LTQ-Orbitrap-MS workflow for plant
6. Smith AM, Zeeman SC (2006) Quantification and microbial proteomics. In: Jorrin-Novo JV,
of starch in plant tissues. Nat Protoc 1:1342 Komatsu S, Weckwerth W, Wienkoop S (eds)
7. Singleton VL, Orthofer R, Lamuela-Raventós Plant proteomics: methods and protocols.
RM (1999) Analysis of total phenols and other Humana Press, Totowa, NJ, pp 347–358
Chapter 3
Abstract
The evolution of next-generation sequencing and high-throughput technologies has created new oppor-
tunities and challenges in data science. Currently, a classic proteomics analysis can be complemented by
going a step beyond the individual analysis of the proteome by using integrative approaches. These
integrations can be focused either on inferring relationships among proteins themselves, with other
molecular levels, phenotype, or even environmental data, giving the researcher new tools to extract and
determine the most relevant information in biological terms. Furthermore, it is also important the employ
of visualization methods that allow a correct and deep interpretation of data.
To carry out these analyses, several bioinformatics and biostatistical tools are required. In this chapter,
different workflows that enable the creation of interaction networks are proposed. Resulting networks
reduce the complexity of original datasets, depicting complex statistical relationships (through PLS analysis
and variants), functional networks (STRING, shinyGO), and a combination of both approaches. Recently
developed methods for integrating different omics levels, such as coinertial analyses or DIABLO, are also
described. Finally, the use of Cytoscape or Gephi was described for the representation and mining of the
different networks.
This approach constitutes a new way of acquiring a deeper knowledge of the function of proteins, such as
the search for specific connections of each group to identify differentially connected modules, which may
reflect involved protein complexes and key pathways.
Key words Protein networks, String, Omics levels, sPLS, DIABLO, Cytoscape
1 Introduction
The classic workflow for proteome analysis, mainly based on the use
of univariate statistics and PCAs, is being quietly displaced in favor
of new approaches that take advantage of protein interaction
knowledge and advanced statistical tools. These novel
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_3, © Springer Science+Business Media, LLC, part of Springer Nature 2020
21
22 Mónica Escandón et al.
methodologies allow for the study of the proteome and its interac-
tion with other biomolecules, the environment, and even with
itself, providing a holistic perspective. This kind of workflows
gives the researcher the possibility of having a deeper understand-
ing of the biological responses behind the observed differences in
the experimental systems.
Integrative studies heavily rely on computational biology and
require the use of specific algorithms, methods, and models to
extract and determine the most relevant information in biological
terms [1, 2]. Common classification methods (including discrimi-
nant analysis; neural networks; decision trees; support vector
machine, SVM; and random forest, RF) are suited to single dataset
analyses [2], whereas the methods that build predictive models
require multiple sources of those which act as predictors and
those which are predicted.
The most employed methods for the characterization of multi-
ple omics dataset is the combination of unsupervised multivariate
statistics, like principal component analysis (PCA), and supervised,
like partial least square (PLS) and discriminant analysis (PLS-DA)
and its variants [3–5]. PLS methods are suitable to integrate two
datasets considering one omic level a predictor of a second omic
level, the response. With these methods it is possible to get an
overview about the most important variables (proteins, metabolites
or transcripts) determining which variables of the predictor explain
the maximum variance of the responses [6]. In addition, there are
innovative multiple integration tools (for more than two different
data inputs) that allow for the construction of these relationship
based models, such as DIABLO (Data Integration Analysis for
Biomarker discovery using a latent component method for Omics
studies) [2], multiple coinertia analysis (MCIA) [7], and
xMWAS [8].
Integrative analysis can be pushed beyond sample and variable
biplotting or variable filtering, since determined interaction can be
depicted as networks, where the variables are the nodes and the
relations among them, the edges. As a result, this simple represen-
tation collects the complexity of the original data as retrieved by
previous analyses. These networks can be topologically evaluated to
determine the most connected nodes or hubs within the data as
well as subnetworks or clusters with same (or opposite) behavior.
Interaction networks described above, and its inferred relation-
ships are based on statistical analyses. However, there is also possi-
ble to create or enrich those networks with functional or
biologically relevant annotations. This new information layer is
obtained from specific tools and databases (STRING, ShinyGO)
which gather known functional relations (protein–protein, protein–
metabolite associations). In addition, we can create functional net-
works [9] even for species not included in these databases through
BLAST and protein domain analyses.
Protein Interaction Networks 23
2 Materials
3 Methods
Fig. 1 Example matrices for data entry in R in csv format. (a) Protein matrix, (b) Metabolite matrix, and (c)
Transcript matrix (RNA-seq data in this case). Each dataset with samples in columns and variables in rows
3.1 Selection For the analysis of protein–protein interactions, besides the global
of Differential analysis of the proteome, it is possible to analyze in particular the
Expression Proteins interactions of proteins with differential expression within our
(for Targeted experiment. Quantitative analysis of shotgun proteomic data can
Networks) be performed through statistical tools commonly used to measure
the differential expression of genes (proteins in our case) such as
EdgeR [15]. This package implements a range of statistical meth-
odology based on the negative binomial distributions, including
empirical Bayes estimation, exact tests, generalized linear models
and quasilikelihood tests. This analysis makes it possible to better
group proteins according to their function under certain condi-
tions, reducing network complexity and keeping only the proteins
significantly altered for a specific treatment.
Then, we will explain the workflow to obtain a selection of
differential proteins through which we will obtain the network
functionally enriched with programs such as STRING or ShinyGO
(Subheading 3.2).
Workflow 1. Install and load the required packages. These are collections of
functions, data, and R code that are stored in a folder according
to a well-defined structure, easily accessible for R (see Note 1).
In an R console or GUI (we recommend R Studio) type:
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocInstaller::install("edgeR")
library(edgeR)
26 Mónica Escandón et al.
2. Load your data (Protein matrix, Fig. 1a), indicating the path of
the file that contains them and its format. In addition, we must
assign a name to the columns of data, indicating the controls
and the corresponding treatments.
proteins <- read.table("proteins.csv", header = T,
row.names=1, sep=";")
3.2 Integration Tools While most approaches are focused on constant interactions and
assume that biological systems are static, nonhomogeneous
3.2.1 Statistical
dynamic Bayesian statistics with ARTIVA (autoregressive time vary-
Integration Networks:
ing) algorithm can reveal causal interactions considering temporal
Dynamic Protein–Protein
information. This method is able to determine indirect associations
Interaction Networks
and regulation loops of high biological interest, thus giving a more
complete picture of the system. ARTIVA model divides global
dataset dynamics (heterogeneity) in several uniform (homoge-
neous) phases called changepoints (CPs). In each CP, it searches
for relations between two types of variables (regulators and targets)
while taking into account a user-defined time delay. As a result, it
reveals dynamic interactions across time [17]. This type of model
has three main limitations: (1) Interactions that occur at a time scale
shorter than the sampling points cannot be detected and may result
in wrong conclusions. If the time between two consecutive sam-
pling points is too large, it is recommended to try other approaches.
(2) Datasets with low number of samples and high number of
variables can lead to erroneous inference [18]. It would be desirable
to find a balance by filtering the variables with differential expres-
sion tests or functional categories. (3) Time-consuming process.
An example of the ARTIVA networks is available in Fig. 2.
Protein Interaction Networks 29
Prot 46
Prot 2 Prot 3
Prot 4 Prot 5
Prot 45 Prot 62
Prot 50
Prot 47
Prot 11
Prot 6 Prot 49
Prot 9
Prot 7
Prot 10
Prot 14 Prot 8 Prot 48
Prot 61 Prot 51
Prot 13
Prot 15
Prot 12
Prot 52
Prot 55 Prot 56
Prot 16 Prot 18
Prot 20 Prot 53
Prot 19 Prot 54
Prot 17 Prot 59
Prot 23
Prot 64
Prot 58
Prot 25
Prot 26
Prot 24
Prot 60
Prot 27 Prot 28
Prot 34
Prot 33
Prot 31
Prot 32
Prot 1
Prot 32
Prot 31
Changepoint 4; Sampling points 4-5-6
Prot 38 Prot 24
Prot 27
Prot 35 Prot 36
Prot 38 Prot 37
Prot 28
Prot 30
Prot 44 Prot 90
Prot 72 Prot 39 Prot 81
Prot 3 Prot 5
Prot 29 Prot 2
Prot 33
Prot 43 Prot 37 Prot 43
Prot 91
Prot 41 Prot 80
Prot 40
Prot 34 Prot 5
Prot 82
Prot 39
Prot 35
Prot 65
Prot 44 Prot 84
Prot 4 Prot 42
Prot 36 Prot 24 Prot 33 Prot 57
Prot 62 Prot 83 Prot 25 Prot 34
Prot 66
Prot 46
Prot 66
Prot 11
Prot 59
Prot 36
Prot 45 Prot 8 Prot 73 Prot 6 Prot 10 Prot 9 Prot 7
Prot 67 Prot 1
Prot 50 Prot 68
Prot 67 Prot 74 Prot 4
Prot 49
Prot 18
Prot 41 Prot 17 Prot 58 Prot 32
Prot 87
Prot 63
Prot 51
Prot 12
Prot 18
Prot 58 Prot 2
Prot 53
Prot 52 Prot 40 Prot 88 Prot 89 Prot 79
Prot 59 Prot 71 Prot 75 Prot 49
Prot 71
Prot 54
Prot 29
Prot 13 Prot 37
Prot 20 Prot 14 Prot 70 Prot 42
Prot 76 Prot 53 Prot 90 Prot 50
Prot 77 Prot 85
Prot 65
Prot 19
Prot 1
Prot 61
Prot 78
Prot 38
Prot 4
Prot 72 Prot 31
Prot 71
Prot 37
Prot 69 Prot 29
Prot 30 Prot 36
Prot 79
Prot 35
Prot 65
Prot 62
Prot 74 Prot 33
Prot 81
Prot 2
Prot 45
Prot 75 Prot 46
Prot 80
Prot 48
Prot 49
Prot 76 Prot 5
Prot 12
Prot 43 Prot 53
Prot 25
Prot 77
Prot 24
Prot 44 Prot 66
Prot 59 Prot 55
Prot 54
Prot 58
Prot 68
Prot 42
Prot 40
Prot 67
Fig. 2 ARTIVA networks representing the interactions across time between three different groups of regulators
(blue, red, and green nodes) and targets (gray nodes). The interactions of each changepoint are represented
independently. The orange circle shows nodes that have steadily gained interactions over time and therefore
greater importance. This network was made using Cytoscape workflow. Changepoint 1 ¼ interactions that
only occur on sampling point 1; Changepoint 2 ¼ interactions that only occur on sampling point 2;
Changepoint 3 ¼ interactions that only occur on sampling point 3; Changepoint 4 ¼ interactions that only
occur on sampling points 4-5-6
3.2.2 Statistical The statistical interaction networks between proteins with other
Interaction Networks omic datasets (metabolomics, transcriptomics, ...) is obtained
between Proteins through the use of the Partial Least Square Regression (PLS), or
with Other Omics Datasets its variant regression of Sparse Partial Least Squares regression
(sPLS). The PLS is a multivariable analysis which integrates two
Partial Least Square matrices: X a predictive matrix (e.g., Proteins) and Y a response
Regression (PLS) matrix (e.g., Metabolites) [19, 20]. This method is used when the
and Variates number of predictors are more than the number of observations or
cases and where the variables considered for the study are correlated
[21], a common situation in data obtained from omic techniques.
The advantages of PLS is that it can handle many noisy and collinear
variables, while sPLS [22] also has the advantage that a
Protein Interaction Networks 31
IMPORTIN glucose-6-phosphate
SUBUNIT 1-dehydrogenase
ALPHA-2
Malate Proteins
dehydrogenase Importin subunit
Malate
dehydrogenase
(oxaloacetate-decarboxylating)) beta-1
(NADP(+)) Universal stress Transcripts
PEROXIDASE protein family Secogolanin
(decarboxylating)
ASPARTIC 35-RELATED
(Usp) synthase
BAND 7 PROTEINASE
hypothetical
PHOSPHOLIPASE PROTEIN-RELATED A1-RELATED solute carrier GLYCEROL-3-PHOSPHATE
protein (K09955)
family 25
D ALPHA
(mitochondrial
Farnesyl DEHYDROGENASE
1-RELATED
phosphate
diphosphate [NAD(+)]
Caffeate
transporter),member synthase NAD
O-methyltransferase
3 Diphosphomevalonate DEPENDENT
decarboxylase EPIMERASE
Ubiquitin PHOSPHOLIPASE Acyl-[acyl-carrier-protein]
carboxyl-terminal D ALPHA alpha,alpha-trehalase desaturase 1 4-coumarate--CoA
hydrolase (UCH) EARLY-RESPONSIVE 1-RELATED (E3.2.1.28, treA,
MAP ligase (4CL)
stress-induced-phosphoprotein 1-deoxy-D-xylulose-5-phosphate TO Limonene treF)
Serine protease
1 synthase DEHYDRATION
family S10 serine Chalcone-Flavonone
large subunit
FAD2.1 synthase
Apolipoprotein D
STRESS
Carboxypeptidase Isomerase ribosomal protein
PROTEIN L28e PECTINESTERASE
and lipocalin 3-Related coniferyl-alcohol Naringenin
family protein TIP120 SKP1 FAD7.2 glucosyltransferase 3-dioxygenase
(APOD) MYC2 acetyl-CoA
raffinose synthase Anthocyanidin GIP1 Betaine-aldehyde
Leucoanthocyanidin acetyl-CoA Beta-ketoacyl-[acyl-carrier-protein] carboxylase dehydrogenase
dioxygenase MEMBER OF Annexin A13 carboxyl 3-O-glucoside
carboxylase 12-oxophytodienoic synthase II N-MYC 2''-O-glucosyltransferase
'GDXG' FAMILY (ANXA13) carboxyl acid reductase DOWNSTREAM transferase
Long-chain
OF LIPOLYTIC transferase REGULATED subunit alpha Prohibitin 1
acyl-CoA
ENZYMES PECTATE subunit alpha Linoleate GE10H (PHB1)
small subunit LYASE acetyl-CoA 13S-lipoxygenase Solute carrier synthetase METALLOPROTEASE
ribosomal protein Phosphoglycerate
1-RELATED acyltransferase 1 ATP SYNTHASE PROTEIN family 25 M41 FTSH
S27Ae kinase (PGK)
CHAPERONIN (ACAA1) PEPTIDYL-PROLYL DELTA CHAIN (PAC:37713294) (mitochondrial
Oxalyl-CoA Polyneuridine-aldehyde calcium-binding
60 SUBUNIT CIS-TRANS Linoleate oxoglutarate
MEMBER OF decarboxylase SERINE esterase EF hand family
ALPHA 1 ISOMERASE 9S-lipoxygenase transporter),
Solute Carrier 'GDXG' FAMILY member 11 Cytochrome P450 protein
family 25 OF LIPOLYTICCHLOROPLASTIC CYP18-3-RELATED
L-ascorbate 1-hydroxy-2-methyl-2-(E)-butenyl (p450)
(mitochondrial ENZYMES HALOACID
SERINE 9-cis-epoxycarotenoid peroxidase 4-diphosphate
oxoglutarate DEHALOGENASE-LIKE
CARBOXYPEPTIDASE-LIKE DIHYDROLIPOAMIDE dioxygenase Beta-ketoacyl-[acyl-carrier-protein] synthase aminocyclopropanecarboxylateHYDROLASE
transporter), 36-RELATED ACETYL
member 11
GDPmannose synthase III TRIOSEPHOSPHATE oxidase
Aspartate 4,6-dehydratase ISOMERASE Inositol-3-phosphate enoyl- (fabI)
GLUCOSYL Arsenical
Aminotransferase, synthase
Trans-cinnamate pump-driving SUGAR
cytoplasmic 4-monooxygenase 3-oxo-5-beta-steroid
atpase plastocyanin UTILIZATION
(GOT1) cis-zeatin 4-dehydrogenase
arsenite-translocating
g (petE) REGULATORY Cathepsin F
O-glucosyltransferase
atpase Cell cycle arrest PROTEIN IMP2
Geraniol
dehydrogenase protein BUB3
(BUB3)
(NADP(+)) Villin 1 (VIL1)
T-complex protein 3-oxoacyl- (fabG)
FK506-binding
protein
1 subunit gamma Dormancy Edge
(CCT3, TRIC5)
Cysteine synthase
3-hydroxyacyl-CoA
dehydrogenase
-0.97 0.99
Fig. 3 sPLS network using the transcript matrix as the predictor matrix and the protein matrix as the response
matrix. Edge legend represents the weight of the links between proteins and transcripts. Network represen-
tation was drawn employing Cytoscape
pdf(file="networksPLS_0.7.pdf")
net <- network(spls.analysis, comp = 1:2, color.node = c("white","pink"),
cutoff=0.7,shape.node = c("rectangle", "rectangle"),show.color.key = TRUE)
dev.off()
Protein Interaction Networks 33
Data-Driven Integration PLS algorithm and its variants have many advantages, some of them
and Differential Network especially useful for omics dataset research as stated in Subheading
Analysis (xMWAS) 3.2.2.1, but they are restricted to two different omics layers. Statis-
tical based integrative networks aim to collect the relations among
all the studied omics layers of molecular information, “the more,
the better,” to get the more accurate and complete network. To this
end, other tools have been developed, expanding sPLS algorithm,
which allow for the integration of more levels (up to four). xMWAS
uses this method of integration and performs network analysis to
allow for visualization of positive or negative associations between
different datasets [8].
In this section, xMWAS package has been used to obtain net-
works of interaction of three omics data: metabolites, proteins, and
transcripts. An example of the xMWAS network is represented in
Fig. 4.
protein 16
protein 23 protein 20 metabolite 20
transcript 8 protein 50 metabolite 140
metabolite 119 metabolite 1 metabolite 12 metabolite 28 metabolite 147
protein 22 transcript 6 metabolite 4
protein 36 metabolite 261 transcript 12 metabolite 115 metabolite 144
metabolite 113 metabolite 7 protein 63
protein 21
transcript 5
metabolite 114
metabolite 98 protein 48 metabolite 17
metabolite 255 metabolite 37 metabolite 5
metabolite 112 protein 14
metabolite 171 transcript 7 protein 1 protein 6 protein 8protein 10 protein 12
metabolite 229 metabolite 104 metabolite 2
transcript 9 metabolite 9 protein 13
metabolite 191 protein 56 metabolite 35 metabolite 106 metabolite 10 protein 39
transcript 1
protein 52 protein 26 protein 58
transcript 10 metabolite 34 metabolite 110 protein 54 metabolite 18
protein 27 protein 37 metabolite 15
transcript 4 metabolite 100 metabolite 16
protein 31
transcript 2 metabolite 96
metabolite 131 transcript 3 metabolite 105
metabolite 99 metabolite 223
metabolite 135 metabolite 118 protein 64 protein 4 metabolite 200
metabolite 117 metabolite 111 protein 38
metabolite 149 metabolite 150 protein 33
metabolite 120 metabolite 177 protein 30
protein 65 protein 32 metabolite 252 metabolite 107
protein 29
protein 15 metabolite 95
protein 55 metabolite 161 metabolite 154 metabolite 245
protein 5
metabolite 250
metabolite 212
protein 3
metabolite 60
metabolite 8 metabolite 146
metabolite
metabolite 32 31 metabolite 256 metabolite 23
metabolite 101 metabolite 13 metabolite 6 protein 25 metabolite 176 metabolite 196
protein 59 metabolite 33 metabolite 26
Proteins
protein 60 metabolite 24 protein 2 protein 24 metabolite 227 metabolite 19
protein 53
protein 9 protein 62
protein 35 protein 11 protein 19protein 45 Transcripts
metabolite 29
metabolite 109 metabolite 25 metabolite 178
protein 46 metabolite 97 metabolite 22
metabolite 3 metabolite 14
protein 57 metabolite 27
metabolite 21
metabolite 155 metabolite 156
Metabolites
metabolite 30 metabolite 11
metabolite 108 metabolite 102
Fig. 4 xMWAS, multi sPLS network using transcripts, proteins, and metabolites. The clusters are formed by the
500 most relevant variables of each omics dataset, and its presence is greater than 20% along the treatments.
Negative connections are shown in red and positive connections in blue. Network representation was drawn
employing Cytoscape
34 Mónica Escandón et al.
source("https://bioconductor.org/biocLite.R");
biocLite(c("GO.db","graph","RBGL","impute","preprocessCore"),dependencies=TRUE);
install.packages(c("devtools","WGCNA","mixOmics","snow","igraph","plyr","pl
sgenomics"),dependencies=TRUE,type="binary", repos="http://cran.r-project.org")
6. Run the function (it may take a while). For further argument
details (see Note 10). In the following example regression
mode of sPLS has been used, selecting the 500 most relevant
variables of each omics dataset and ten components. Tran-
scripts, proteins, and metabolites have been depicted in RStu-
dio Viewer Window as gold rectangles, green circles, and cyan
triangles, respectively. All those variables with a presence less
than 20% along the treatments have been discarded.
Protein Interaction Networks 35
Edge
Contig 72015
Contig 42439
1,2,4-trithiolane
Contig 46910 n540
p635 n575
-1.00 1.00
Serine/threonine-protein Contig 50072
p661 p634
kinase p598 p532
Aurora-3 n229 Contig 35628
p599 Gibberellin
n908 GA7
Glycine-rich n1239 n974
Castasterone RNA-binding n75 n1082
protein n3283/n1295 Contig 49804 Pyridoxine
p1502
Isopentenyladenosine n143
n10
p610 3-Caffeoylpelargonidin
Contig 50114
5-glucoside
Salicilic acid
Selenoprotein p275 Contig 61535
O n34
E3 O-glycosyl
Delta-1-pyrroline-5-carboxylate ubiquitin-protein p224
Contig 39406 hydrolases
synthetase ligase BAH1
Anthocyanidin Putative NAD-dependent family 17
Contig 14852 protein
Contig 48980 synthase sugar malic enzyme
Cytochrome Dihydrokaempferol
phosphate/phosphate 2
b5 Contig 32263 translocator
Benzyl Contig 28711 n962
alcohol Contig 72233 Peroxidase
Predicted
O-benzoyltransferase Contig 58633 Cellulose
Os10g0434900 Contig 44151 protein
n1142 synthase-like
W.nobilis RmlC-like protein Contig 14494 H2 Transmembrane
(R.T.26054_591) cupins
Contig 60614 protein,
RNA superfamily Caffeoyl-CoA Protein putative
sequence protein O-methyltransferase Proteins DETOXIFICATION
Disease Contig 60576
resistance Contig 54211 Transcripts
response Metabolites
protein Contig 28611
Fig. 5 DIABLO network showing interactions between different omic levels. Edge legend represents the weight
of the links between proteins, transcripts, and metabolites. Network representation was drawn employing
Cytoscape
36 Mónica Escandón et al.
3. Introduce all dataset (in this case, the three matrixes) individu-
ally. These matrixes are named as proteins as protein matrix
(Fig. 1a), metabolites for metabolites matrix (Fig. 1b) and
contigs as transcripts matrix (Fig. 1c) (see Note 11). In the
session tab of R, set working directory where are all matrixes of
the datasets to use. The csv matrix files are the individual
matrices of each data set stored in Excel as csv (see Fig. 1 and
Supplementary dataset S1).
metabolites <- read.table("metabolites.csv", header = T, row.names=1, sep=";")
proteins <- read.table("proteins.csv", header = T, row.names=1, sep=";")
contigs <- read.table("contigs.csv", header = T, row.names=1, sep=";")
metabolites <- as.data.frame(t(metabolites))
proteins <- as.data.frame(t(proteins))
contigs <- as.data.frame(t(contigs))
You get a list of the variables selected for each dataset for
the ncomp (e.g., $met [1] 50 [2] 10 $prot [1] 8 [2] 10 $mrna
[1] 10 [2] 12). This means that the optimal variables for
metabolites are in component [1] 50 metabolites, component
[2] 10 metabolites; those for proteins are in component [1] 8
proteins, component [2] 10 proteins; and those for mRNA are
in component [1] 10 transcripts, component [2] 12 tran-
scripts. Alternatively, you can manually input those parameters
(see Note 15).
7. Make the DIABLO with the selection of variables made previ-
ously (datasetsdiablo, diablovectorY, list.keepX, and design).
38 Mónica Escandón et al.
3.3 Biological The functional interaction network is very useful for the full under-
Interaction Network standing of biological phenomena [25]. For this purpose, there are
Enrichment different databases and programs that help to compile and integrate
all protein–protein interactions, including both direct (physical)
and indirect (functional) relationships. In this section we will
develop two workflows using two of the most used databases
(STRING and ShinyGO) to obtain networks in species included
and not included in these databases. Initial input data could be the
complete protein dataset or the differentially expressed proteins
obtained in Subheading. 3.1.
Fig. 6 STRING-based interaction network represented in the webtool (a) and in Cytoscape (b). This network
employed Vitis vinifera as reference organism and proteins sequences (Supplementary S2). Edge represents
confidence in STRING web network, and the score in Cytoscape-represented network
A) B)
>protein000006.2
E0CQ31 MAKVPPKHARDQFQDFEGLLNNLQDWELSFKDKDKRLKSQFVGKDKLDLPAQRHSMNSASQHSNGTGVNEKPPMGKTTALDNLGSGRQYDYMKDYDAIHR
LSDGLMEEEAVDANSEKELGNEFFKQKKFNEAIDCYSRSIAFSPTAVAYANRAMAYIKIKRFQEAENDCTEALNLDDRYIKAYSRRSTARKELGKLKESI
D7SKD8 DDTSFALRLDPHNQEIKKQYAELKSLLEKEILKKASGVAGGSSQGVQREGKLKVEKSKSIHKVQSVSPSSPAGVAEVLKDNSKDREGGAETSMEVESSRL
D7THC1 RTHRADMNTSFGNVKIEHKNGEQELKASVQELAARAANLAKAEAAKNISPPNSAYQFEVSWRGLSGDRTLQAHLLKVTPATALPGIFKNALSAPMLVDVI
RCIATFFTEDMDLGVKYLENLTKVPRFDMVIMCLSPSDKADLWKIWDEVFSKGTSEYAENLGNLRLKYGVKQ*
Q6B4V4 >protein000016.1
MWFSLFVLLIYICYVNSKDGWENRWVKSDWKKDENMAGEWNYTSGKWNGDANDKGIQTSEDYRFYAISAEFPEFSNKGKTLVFQFSVKHEQKLDCGGGYM
A5AWT3 KLLSGEVDQKKFGGDTPYSIMFGPDICGYSTKKVHAILTYNGTNHLIKKEVPCETDQLSHVYTFILRPDATYSILIDNVEKQSGSLYSDWDLLPPKEIKD
D7U5G1 PEAKKPEDWDDKEYIPDPEDKKPEGYDDIPKEIPDPDAKKPEDWDDEEDGEWTAPTIPNPEYKGPWKPKKIKNPNYKGKWKAPMIDNPDFKDDPDLYVYP
KLKYVGVELWQVKSGTLFDNVLVCDDPEYAKQLAEETWGKQKDAEKAAFDEAEKKREEEESKDDPIDSDAEDGDDDAEDNDTDDDSKSDSTEDEATSVDD
F6HQP9 DAHDEL*
>protein000017.1
D7U044 MFLVDWFYGVLASLGLWQKEAKILFLGLDNAGKTTLLHMLKDERLVQHQPTQYPTSEELSIGKIKFKAFDLGGHQIARRVWKDYYAKVDAVIYLVDAYDK
ERFAESKKELDALLSDESLATVPFLILGNKIDIPYAASEDELRYHMGLTGITTGKGKVNLADSNVRPLEVFMCSIVRKMGYGDGFKWVSQYIK*
A5BD13 >protein000019.1
A5ACP0 MSNSELLQIEPLELQFPFELKKQISCSLQLTNKSDNYVAFKVKTTNPKKYCVRPNTGVVLPHSTCDVTVTMQAQKEAPPDLQCKDKFLLQSVVVGPGVTT
ENIKPDVFNKESGNRVEECKLRVSYVPPPQPPSPVREGSEEGSSPRASLSDNGTVNQIPDYNSMSRAYVDSLENTPEIDPC*
F6HEM8 >protein000041.1
MAITSRTPDISGERQSGQDVRTQNVVACQAVANIVKSSLGPVGLDKMLVDDIGDVTITNDGATILKMLEVEHPAAKVLVELAELQDREVGDGTTSVVIIA
E0CR38 AELLKRANDLVRNKIHPTSIISGYRLAMREACKYVDEKLAVKVEKLGKDSLVNCAKTSMSSKLIGGDSDFFANLVVEAVQTVKMTNGRGEVKYPIKGINI
F6HAX5 LKAHGKSARDSYLLKGYALNTGRAAQGMPMRVAPARIACLDFNLQKAKMQMGVQVLVTDPRELEKIRQREADMTKERIDKLLKAGANVVLTTKGIDDMAL
KYFVEAGAIAVRRVRKEDLRHVAKATGATVVSTFADMEGEETFDSSLLGYADEVVEERIADDDVIMIKGTKTTSAVSLILRGANDFMLDEMDRALHDALC
D7TGC8 IVKRTLESNTVVAGGGAVEAALSVYLENLATTLGSREQLAIAEFAESFLIIPKVLAVNAAKDATELVAKLRAYHHTAQTKADKKHLSSMGLDLGKGTVRN
NLEAGVIEPAMSKVKIIQFATEAAITILRIDDMIKLVKDESQNEE*
D7U1Z1 >protein000053.1
MNPLTFLRVLGPEPWNVAYVEPSIRPDDSRYGENPNRLQRHTQFQVILKPDPGNSQDLFIRSLSALGINVHDHDIRFVEDNWESPVLGAWGLGWEIWMDG
F6H1H4 MEITQFTYFQQAGSLQLTPVSVEITYGLERILMLLQGVDHFKKIQYADGITYGELFLENEKEMSAYYLKHASVDNIHKHFDLFEAEARCLLDSGLAIPAY
D7TDE2 DQLLKTSHAFNILDSRGFVGVTERARYFGRMRSLARQCAQLWLKTRESLGYPLGVTSQSDHIVFPKEVLEEAAGKVSTDPRLFVLEIGTEELPPNEVVNA
CKQLKDLIEQLLEKQRLSHGKVLTFGTPRRLVVHVHNLYAKQVANEIDVRGPPASKAFDQGGNPTKAAEGFCRRNGVPLGSLFRRVEGKTEYVYVRAVEP
D7TCD0 SRLALEVLSEELPGTIGKILFPKSMRWNSEVMFSRPIRWILALHGDVVVPFIGNLSHGLRNTPSATVKVASAESYTDVMQRAGIAISMEQRKQTILDSSN
ALAKSVGGIIILQNDLLDEVANLVEKPVPVLGKFNESFLVLPKDLLIMVMQKHQKYFAITDQGGNLLPYFISVANGAINEMVVRKGNEAVLRARYEDAKF
A7NVX9 FYEVDTSKRFSEFRSQLNGILFHEKLGTMLDKMTRVQHLVTEVGSSLRVSGDTLQIIKGAASLAMIDLATAVVTEFTSLSGIMARHYALRDGYSEQIAEA
A5B3K2 LFEITLPRFSGDIVPKTDAGTVLAITDRLESLVGLFAAGCQPSSSNDPFGLRRISYCLVQLLVETNRDLDLRHGLELAAAVQPINVAAETIDTVHQFVTR
RLEQLLMDQGISPEVVRSVLAERANQPCLATKSAYKMEALSRGELLPKIVEVYSRPTRIVRGKDINDDLEVDEGAFETKEEKALWCTFTSLRTKIRPDME
D7UC14 VDDFVEASSDLLQPLEDFFNNVFVMVEDERIRKNRLALLKKISDLPKGIADLSILPGF*
>protein000056.1
A5C3G7 MEQTFIMIKPDGVHRGLVGEIIGRFEKKGFTLKGLKLITVDRHFAEQHYADLSAKPFFNGLVEYIISGPVTAMVWEGKNVVTTGRKIIGATNPADSAPGT
IRGDYAIDIGRNVIHGSDSVESAKKEIALWFPEGIAEWRSSVHQWIYE*
etc. etc.
Fig. 7 Dataset Input. (a) Multiple proteins in gene names. (b) Extract of a fasta file with sequences of proteins
6. The STRING will give you a list of the proteins with which it
made target, being able to have more than one identification by
protein. Review all annotations, according to identity and bit-
score that gives you the STRING. Choose the optimum or the
one that matches your identification. STRING will take by
default all identifications with the highest bitscore.
7. Once selected the protein identification in the new database of
the organism, download the document (option MAPPING) to
save which protein corresponds each STRING identification in
the organism database (Supplementary material S3). It will be
used later in Cytoscape.
8. There are two possibilities for viewing the network: continue
from the web tool or go to the Cytoscape and work with the
STRING app available in it.
(a) STRING web tool:
l Click the continue button and STRING makes us an
interaction network for proteins on the web.
l Improve the visualization of your network using
the tabs below. Here are some examples, but for more
see [26]:
Protein Interaction Networks 41
Fig. 8 ShinyGO networks using the Website application. (a) Protein Network with enrichment of Go Biological
Process (all proteins dataset) and (b) Protein response to stress Network with enrichment of a selected Go
Category (Go category: response to stress) (only proteins in category were introduced)
3.3.2 ShinyGO ShinyGO [28] is an intuitive and graphical web platform with more
than 200 species of plants and animals, annotated from GO and
other databases, which allows for the graphical visualization of
enrichment results and with access to a program interface (API)
to STRING to make protein–protein interaction networks [28]. An
example of the ShinyGO networks is represented in Fig. 8.
3.4 Merged Once experimental statistical and biological based networks are
Functional already depicted separately, they can be merged to deeper under-
and Statistical stand our system. Following this approach, we will be able to
Interaction Networks visualize in one graph already known biological information and
expand it with our experimental results. This depiction can be also
used to check and validate our experimental network reliability.
These networks can be merged either by using Cytoscape built-in
function or manually. An example of the Merged networks is repre-
sented in Fig. 9.
Create a STRING Network 1. Go to STRING web page (version 11.0) to “Proteins with
for Merged Networks values/Ranks”.
2. Introduce a list containing our experimental proteins and one
numeric value as fold change as calculated in Subheading 3.1.1.
The matrix with protein identifiers must be compatible with
STRING database (see Subheading 3.3.1).
3. Select the organism you are working with.
4. Click Search.
5. Download the mapping document and continue.
6. Download the depicted network.
44 Mónica Escandón et al.
Prot ei n 196
Prot ei n 217
Prot ei n 105
Prot ei n 194
Prot ei n 160
Prot ei n 235
Fig. 9 Merged Network combining two different mathematical methods (sPLS, ARTIVA) and STRING network.
sPLS edges are shown in blue, ARTIVA in red, and STRING in green. Network representation was drawn
employing Cytoscape
3.4.1 Cytoscape Merged 1. Open already built networks in the same Cytoscape session.
Functional and Statistical Example: STRING network and sPLS network.
Interaction Network 2. Cytoscape > Tools > Merge > Networks and select the net-
Workflow works to combine.
3. Choose between union, intersection, and difference.
4. Mark Enable merge nodes/edges in the same network.
5. Select the matching columns.
6. See Cytoscape workflow (Subheading 3.5.1) to improve the
network visualization.
Protein Interaction Networks 45
3.4.2 Manually Merged Another choice, more flexible, is pasting both table networks, just
Functional and Statistical concatenating data. The only requirement for the subsequent visu-
Interaction Network alization to work is to verify that node names are the same in both
networks. In this way, we will able to create a consensus network of
different statistical analyses as those mentioned above or comple-
mentary annotation tools. This network table will be compatible
with both visualization tools described below. An example of the
resulting network is shown in the Fig. 9.
Workflow 1. Check all the nodes have coherent identifiers across the differ-
ent networks.
2. Paste the network tables in the spreadsheet maintaining the
source origin. Please note that whether directed (sPLS) and
undirected (STRING) edges are mixed together all the result-
ing network should be treated as undirected.
3. Add a column “Method” to distinguish the provenance of each
type of relations (sPLS, STRING, ARTIVA, . . .).
4. Save the table document (Table 1) in xlsx format for opening it
with a network visualization tool (Subheading 3.5.1).
3.5 Network Two workflows are presented for two most used and freely accessi-
Visualization Tools ble platforms to visualize networks: Cytoscape and Gephi. Net-
works can be introduced to the visualization programs in a
multiple variety of formats, the two most widely used are as a
table (.csv, .xlsx) and as a gml file.
Table 1
Example of input table combining two different mathematical methods (sPLS, ARTIVA) and STRING
network
3.5.1 Cytoscape Cytoscape is an open source software project for integrating bio-
molecular interaction networks with high-throughput expression
data and other molecular states into a unified conceptual frame-
work [29]. Some examples of resulting networks are shown in the
Figs. 2, 3, 4, 5, 6b, and 9.
3.5.2 Gephi Gephi is an open source software that allows you to visualize,
manipulate, and explore all types of graphics and networks for all
types of data (a general-use platform), while Cytoscape is com-
monly used in the biology domain with specific apps and pluggings.
Gephi have some advantages over Cytoscape like have a good preset
and an integrated statistical analysis module. Furthermore, Gephi is
recommended for visualize large networks (up to 100,000 nodes).
An example of network created with Gephi is shown in Fig.10.
Protein 232
Protein 210
Protein 235
Protein 202
Protein 224
Protein 129
Protein 182 Protein 176
Protein 221 Protein 218 Protein 39
Protein 130 Protein
Protein 11 20 Protein 7 Protein 228
Protein 201 200
Protein Protein 189 Protein 205
Protein 204
Protein
Protein 227 128
Protein 170
Protein 133 Protein Protein
5 21
Protein 35 Protein
Protein 15 26 Protein 206
Protein 19 Protein 36 Protein 56
ProteinProtein
10 17 Protein 8Protein 9
Protein 12
Protein 23 Protein 187
Protein 34 Protein 31 Protein 1
ProteinProtein
29 32 Protein 38 13 Protein 171
Protein 131 Protein 6 ProteinProtein 24 Protein 18
Protein 132 Protein 157 Protein 108Protein 190
Protein 109
Protein 28 Protein 168
Protein 4 Protein
Protein 25
153 Protein 52
Protein Protein
21414
Protein 22 Protein 50 Protein 45
Protein 44 Protein 146 Protein 192
Protein 37Protein 16 Protein 30 Protein 155 Protein 110
Protein 3 Protein 158 Protein 186 Protein 47Protein
Protein 148 98
Protein 33 Protein Protein
55 53
Protein
Protein 212 150 Protein 27
Protein 152 Protein 54 Protein 80
Protein 156 Protein 216 Protein 207 Protein 46 Protein 217
Protein 167 Protein 48 Protein 93 Protein
Protein 94 81
ProteinProtein
219 231 Protein 149 Protein 104
Protein 102
Protein 154 Protein 105
Protein 220 191
Protein Protein 107 Protein 89Protein Protein
85 161
Protein
Protein 49 147
Protein 100 Protein 183Protein 87
Protein 151Protein 112 Protein 111 Protein 79 Protein
Protein 196 169
Protein 181 Protein 77 Protein 101
Protein 166
Protein 209
Protein 51Protein 95
Protein 2
Protein 163 Protein 84
Protein 195
Protein 83 145 Protein 90 Protein 97
Protein
Protein 162
Protein 173Protein 92
Protein 188 Protein 230 Protein
Protein 99106 Protein 194
Protein 213 Protein 164
Protein 199 Protein 172 86
Protein
Protein Protein 82
Protein 211 165 Protein 223
Protein 136
ProteinProtein
88 78
Protein 103
Protein 91
Protein 159 Protein 62
Protein 66
Protein 184 Protein 113 Protein 96
Protein 73 197
Protein
Protein 215
Protein 208
Protein 67
Protein 160 Protein 42
Protein 122
Protein 65
Protein 138 Protein 225Protein 72
Protein 142
Protein 193 Protein 61
Protein 121Protein 143 Protein 74
Protein 76 Protein 75
Protein 127 Protein 71
Protein 115 Protein 57
Protein 134
Protein 119 Protein 59 Protein 58
Protein 126 Protein 63
Protein 116 139
Protein Protein 70
Protein 117
Protein 64 Protein 144
ProteinProtein 120 Protein
114 Protein 69
Protein
60 141
Protein 118
Protein 123 Protein 198
Protein 135Protein 137 ProteinProtein
40
203
Protein 125 Protein 43
Protein 140 Protein 177
Protein 68
Protein 185
Protein 124
Protein 175
Protein 174
Protein
Protein 226 222
Protein 41
Protein 233
Protein 229
Fig. 10 Protein–protein interaction network depicted using Gephi. Node colors indicate the different subnet-
work groups based on the cluster betweenness graphical analysis. Arrows between nodes and labels are
colored as the parental node
(a) Node color: Change the color of the nodes in the Appear-
ance>Nodes menu, click on the palette icon and select
among “unique” for all nodes in the same color, “Parti-
tion” for make color clusters of nodes (attending to for
example a statistical network parameter (Choose and attri-
bute drop-down) as degree, or cluster coefficient) or
“Ranking” for a continue color scale. Click Apply to
save the changes and see the result.
(b) Node size: change node size in the next right icon; possi-
bilities are a unique value for all nodes and a Ranking, as a
continuous sizing scale. Click Apply to save the changes
and see the result.
(c) Change edge color following the first part of point
10 “Node color” in the Appearance > Edges menu.
50 Mónica Escandón et al.
4 Notes
13. Note that the function has been set to favor the small-ish
signature while allowing to obtain a sufficient number of vari-
ables for downstream validation/interpretation. See Singh
et al. [34] for further information.
14. An example of manual entry of parameters:
test.keepX <- list ("met" = c(5:9,
seq(10, 18, 2), seq(20,50,5)),"prot" = c(5:9, seq(10, 18, 2),
seq(20,50,5)),"mrna" = c(5:9, seq(10, 18, 2), seq(20,50,5)))
16. If there are several species with a good number of nr, make for
each species the network STRING and choose the one that
best suits your experiment.
17. For each interaction, the edge attributes include the overall
confidence score and the subscores from seven individual evi-
dence channels. These channels are as follows:
(a) The experiments channel: Evidence comes from actual
experiments in the lab.
(b) The database channel: Evidence that has been asserted by
a human expert curator.
(c) The textmining channel: Pairs of proteins are given an
association score when they are frequently mentioned
together in the same paper, abstract, or even sentence.
(d) The coexpression channel: Pairs of proteins that are con-
sistently similar in their expression patterns, under a vari-
ety of conditions, will receive a high association score.
(e) The neighborhood channel: Genes are given an associa-
tion score where they are consistently observed in each
other’s genome neighborhood (such as in the case of
conserved, cotranscribed “operons”).
(f) The fusion channel: Pairs of proteins are given an associa-
tion score when there is at least one organism where their
respective orthologs have fused into a single, protein-
coding gene.
(g) The co-occurrence channel: STRING evaluates the phy-
logenetic distribution of orthologs of all proteins in a
given organism.
Therefore, the three possible display options will be
displayed:
Evidence: Included Known interaction from curated
databases and experimentally determined), predicted
interaction (gene neighborhood, fusions, and
54 Mónica Escandón et al.
Acknowledgments
References
1. Valledor L, Jorrı́n J (2011) Back to the basics: 4. Steuer R, Morgenthal K, Weckwerth W et al
maximizing the information obtained by (2007) A gentle guide to the analysis of meta-
quantitative two dimensional gel electrophore- bolomic data. Methods Mol Biol 358:105–126
sis analyses by an appropriate experimental 5. Lê Cao K-A, Boitard S, Besse P (2011) Sparse
design and statistical analyses. J Proteome PLS discriminant analysis: biologically relevant
74:1–18 feature selection and graphical displays for mul-
2. Singh A, Gautier B, Shannon CP et al (2016) ticlass problems. BMC Bioinformatics 12:253
DIABLO – an integrative, multi-omics, multi- 6. Groth D, Hartmann S, Klie S et al (2013)
variate method for multi-group classification. Principal components analysis. In: Reisfeld B,
bioRxiv. https://doi.org/10.1101/067611 Mayeno AN (eds) Computational toxicology,
3. Scholz M, Selbig J (2007) Visualization and Methods in molecular biology, vol II. Humana
analysis of molecular data. In: Weckwerth W Press, New York City. p chapter 22
(ed) Metabolomics methods protocol. 7. Meng C, Kuster B, Culhane AC et al (2014) A
Humana Press, Totowa, NJ, pp 87–104 multivariate approach to the integration of
56 Mónica Escandón et al.
multi-omics datasets. BMC Bioinformatics 21. Cramer RD (1993) Partial least squares (PLS):
15:162 its strengths and limitations. Perspect Drug
8. Uppal K, Go Y-M, Jones DP (2017) xMWAS: Discov Des 1:269–278
an R package for data-driven integration and 22. Lê Cao K-A, Rossouw D, Robert-Granie C
differential network analysis. bioRxiv:122432 et al (2008) A sparse PLS for variable selection
9. Von Mering C, Jensen LJ, Snel B et al (2005) when integrating omics data. Stat Appl Genet
STRING: known and predicted protein- Mol Biol 7:35
protein associations, integrated and transferred 23. Lee HK, Hsu AK, Sajdak J et al (2004) Coex-
across organisms. Nucleic Acids Res 33: pression analysis of human genes across many
D433–D437 microarray data sets. Genome Res
10. Pluskal T, Castillo S, Villar-briones A et al 14:1085–1094
(2010) MZmine 2 : modular framework for 24. Tenenhaus A, Philippe C, Guillemot V et al
processing, visualizing, and analyzing mass (2014) Variable selection for generalized
spectrometry-based molecular profile data. canonical correlation analysis. Biostatistics
BMC Bioinformatics 11:395 15:569–583
11. Haas BJ, Papanicolaou A, Yassour M et al 25. Szklarczyk D, Gable AL, Lyon D et al (2019)
(2013) De novo transcript sequence recon- STRING v11: protein-protein association net-
struction from RNA-seq using the Trinity plat- works with increased coverage, supporting
form for reference generation and analysis. Nat functional discovery in genome-wide experi-
Protoc 8:1494 mental datasets. Nucleic Acids Res 47:
12. Valledor L, Romero-Rodriguez MC, Jorrin- D607–D613
Novo JV (2014) Standardization of data pro- 26. Szklarczyk D, Franceschini A, Wyder S et al
cessing and statistical analysis in comparative (2015) STRING v10: protein–protein interac-
plant proteomics experiment. Methods Mol tion networks, integrated over the tree of life.
Biol 1072:51–60 Nucleic Acids Res 43:D447–D452
13. Escandon M, Valledor L, Pascual J et al (2017) 27. Szklarczyk D, Morris JH, Cook H et al (2017)
System-wide analysis of short-term response to The STRING database in 2017: quality-
high temperature in Pinus radiata. J Exp Bot controlled protein-protein association net-
68:3629–3641 works, made broadly accessible. Nucleic Acids
14. Pascual J, Cañal MJ, Escandón M et al (2017) Res 45:D362–D368
Integrated physiological, proteomic, and meta- 28. Ge SX, Jung D (2018) ShinyGO: a graphical
bolomic analysis of ultra violet (UV) stress enrichment tool for animals and plants.
responses and adaptation mechanisms in Pinus bioRxiv:315150
radiata. MCP 16:485–501 29. Shannon P, Markiel A, Ozier O et al (2003)
15. Branson OE, Freitas MA (2016) A multi- Cytoscape: a software environment for
model statistical approach for proteomic spec- integrated models of biomolecular interaction
tral count quantitation. J Proteome 144:23–32 networks. Genome Res 13:2498–2504
16. Robinson MD, Oshlack A (2010) A scaling 30. Marbach D, Costello JC, Küffner R et al
normalization method for differential expres- (2012) Wisdom of crowds for robust gene net-
sion analysis of RNA-seq data. Genome Biol. work inference. Nat Methods 9:796
https://doi.org/10.1186/gb-2010-11-3-r25 31. Schiffthaler B, Serrano A, Delhomme N et al
17. Lèbre S, Becq J, Devaux F et al (2010) Statisti- (2019) Seidr: a gene meta-network calculation
cal inference of the time-varying structure of toolkit. bioRxiv:250696
gene-regulation networks. BMC Syst Biol 32. Grimes T, Potter SS, Datta S (2019) Integrat-
4:130 ing gene regulatory pathways into differential
18. Nagarajan R, Scutari M, Lèbre S (2013) Bayes- network analysis of gene expression data. Sci
ian networks in R with applications in systems Rep 9:5479
biology. Springer-Verlag, New York. https:// 33. Rohart F, Gautier B, Singh A et al (2017)
doi.org/10.1007/978-1-4614-6446-4 mixOmics: an R package for ‘omics feature
19. Wold H (1966) Estimation of principal com- selection and multiple data integration. PLoS
ponents and related models by iterative least Comput Biol 13:e1005752
squares. Multivariate analysis, NewYork. Aca- 34. Singh A, Gautier B, Shannon CP et al (2018)
demic Press, Cambridge DIABLO: from multi-omics assays to bio-
20. Wold S, Sjöström M, Eriksson L (2001) marker discovery, an integrative approach.
PLS-regression: a basic tool of chemometrics. bioRxiv. https://doi.org/10.1101/067611
Chemom Intell Lab Syst 58:109–130
Chapter 4
Abstract
Proteomics encompasses efforts to identify all the proteins of a proteome, with most of studies about plant
proteomics based on a bottom-up mass spectrometry (MS) strategy, in which the proteins are subjected to
digestion by trypsin and the tryptic fragments are subjected to MS analysis. The identification of proteins
from MS/MS spectra has been performed using different algorithms (Mascot, Sequest) against plant
protein sequence databases such as UniProtKB or NCBI_Viridiplantae. But these databases are not the
best choice for nonmodel species where they are underrepresented, resulting in poor identification rates. A
high identification rate requires a sequenced and well-annotated genome of the species under investigation.
For nonmodel organisms, the identification of proteins is challenging since, in the best of the cases, only hits
or orthologs instead of gene products are identified. However, in the absence of a sequenced genome, this
situation can be improved using transcriptome data to generate a specific species database to compare
proteins. In this chapter, we report the protein database construction from RNA-Seq data in a nonmodel
species, in this particular case Holm oak (Q. ilex).
Key words Quercus ilex, Nonmodel species proteomics, Protein databases, RNA-Seq analysis, Cus-
tom protein databases
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_4, © Springer Science+Business Media, LLC, part of Springer Nature 2020
57
58 Vı́ctor M. Guerrero-Sanchez et al.
2 Materials
2.1 Nucleotide mRNA reads in FastQ format obtained from the Illumina sequenc-
Sequences ing platform.
2.1.1 Required Software All the software required for completing this protocol is publicly
available. Documentation and software can be freely downloaded
from the following addresses.
3 Methods
3.1 Nucleotide 1. Download raw transcriptome data of Holm oak from the
Sequences European Nucleotide Archive (ENA) at European Bioinfor-
matics Institute (EBI) in FastQ-Format (Illumina_ R1 and
Illumina_R2). Repository can be accessed via http://www.
ebi.ac.uk/ena. Sequencing reads were uploaded using the
accession number: SRR5815058 (see Note 1).
$ wget
ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR581/008/SRR5815058/SRR5815058_1.fastq.g
z
$ wget
ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR581/008/SRR5815058/SRR5815058_2.fastq.g
z
3.2 Raw Reads Quality control (QC) is critical to ensure that RNA-seq data are of
Quality Control (See high quality and suitable for subsequent analyses. Due to the
Note 2) presence of intrinsic biases and limitations, such as nucleotide
composition bias, GC bias, and RCA bias, a quality control should
be carried out in a RNA-seq. These biases directly affect the accu-
racy of many RNA-seq applications [8, 9] and they can be checked
from raw sequences using tools like FastQC. The FastQC software
provides a FastQC Report of a set of high throughputs sequencing
reads, allowing identifying sequencing errors or bias for a later
trimming. Basic statistics (total sequences, filtered sequences,
sequence length, GC %) are provided by using this tool.
1. Download and install the FastQC software v0.11.8.
$ wget
https://www.bioinformatics.babraham.ac.uk/projects/fastqc/fastqc_v0.11.8.zi
p
$ unzip fastqc v0.11.8.zip
$ gunzip SRR5815058_1.fastq.gz
$ gunzip SRR5815058_2.fastq.gz
$ ./fastqc SRR5815058_1.fastq
$ ./fastqc SRR5815058_2.fastq
3.3 Preprocessing Preprocess all raw data using Fastx_Toolkit version 0.0.13 and
Raw Data Cutadapt version 1.9 [10]. The aim of this step is retaining only
high-quality sequences by removing of low-quality reads with
ambiguous bases and adapter sequences.
1. Download and install Fastx_Toolkit.
$ wget
http://hannonlab.cshl.edu/fastx_toolkit/fastx_toolkit_0.0.13_binaries_Linux_
2.6_amd64.tar.bz2
$ tar -xjvf fastx_toolkit_0.0.13_binaries_Linux_2.6_amd64.tar.bz2
$ ./fastq_quality_filter -q 20 -i SRR5815058_1.fastq -o
SRR5815058_1_f.fastq
$ ./fastq_quality_filter -q 20 -i SRR5815058_2.fastq -o
SRR5815058_2_f.fastq
Protein Database Creation in Non-Model Organisms 61
3.4 Assembling The assembling of the de novo transcriptome can be carried out
Raw Data using different assemblers such as Velvet [11], Trans-ABySS [12],
SOAPdenovo [13], TRINITY [14], MIRA [15], or RAY [16],
among others. In Holm oak, we used TRINITY version 2.5.1,
RAY version 2.3.1, and MIRA V5rc1 (see Note 3). The use of
more than a single assembly should be carried out to assess the
quality of an assembly [17]. This is due to the read lengths, read
counts, and error profiles that are produced by different next-
generation sequencing technologies [17].
1. Download and install RAY.
$ wget https://github.com/trinityrnaseq/trinityrnaseq/archive/Trinity-
v2.8.4.zip
$ unzip Trinity-v2.8.4.zip
$ sudo apt-get instann cmake
$ make
$ make plugins
$ make install
3.6 Evaluating Evaluate the structure of the generated assemblies using the rna-
the Assembly QUAST (Quality Assessment Tool for Transcriptome Assemblies)
Structure software version 1.5.1 [20] (see Note 6).
and Completeness of a 1. Download and install rnaQUAST.
Transcriptome
Protein Database Creation in Non-Model Organisms 63
$ wget http://cab.spbu.ru/files/rnaquast/release1.5.2/rnaQUAST-1.5.2.tar.gz
$ tar -xjvf rnaQUAST-1.5.2.tar.gz$ sudo apt-get install python3-matplotlib
$ pip install joblib
$ pip install gffutils
$ sudo apt-get install ncbi-blast+
$ wget http://research-pub.gene.com/gmap/src/gmap-gsnap-2019-03-15.tar.gz
$ tar -xjvf gmap-gsnap-2019-03-15.tar.gz
$ ./configure
$ make
$ make install
$ wget http://deweylab.biostat.wisc.edu/detonate/detonate-1.11-precompiled.tar.gz
$ tar -xjvf precompiled.tar.gz
2. Run DETONATE.
$ ./rsem-eval-estimate-transcript-length-distribution clusteredassembly.fasta
parameter_file
$ ./rsem-eval-calculate-score -p 8 --transcript-length-parameters parameters_file
SRR5815058_1_fc.fastq SRR5815058_2_fc.fastq clusteredassembly.fasta
output_folder
64 Vı́ctor M. Guerrero-Sanchez et al.
$ export PATH="/path/to/AUGUSTUS/augustus-3.2.3/bin:$PATH"
$ export PATH="/path/to/AUGUSTUS/augustus-3.2.3/scripts:$PATH"
$ export AUGUSTUS_CONFIG_PATH="/path/to/AUGUSTUS/augustus-
3.2.3/config/"
$ sudo python setup.py install
$ wget https://busco.ezlab.org/datasets/embryophyta_odb9.tar.gz
$ tar –xf embryophyta_odb9.tar.gz
3.7 Annotation of a Annotate the transcriptome using the Sma3s annotator version
Transcriptome 2 [25, 26] (see Note 7). Sma3s (Sequence massive annotation by
Protein Database Creation in Non-Model Organisms 65
$ wget http://www.bioinfocabd.upo.es/node/11#sma3s.pl
$ wget http://www.bioinfocabd.upo.es/sma3s/db/uniref90.fasta.gz
$ gunzip uniref90.fasta.gz
$ wget http://www.bioinfocabd.upo.es/sma3s/db/uniref90.annot.gz
$ gunzip uniref90.annot.gz
2. Run Sma3s.
3.8 Construction of a Generate a six-frame translation for each sequence of the transcrip-
Custom Protein tome using the TransDecoder software version 5.5.0 [27].
Database
1. Download TransDecoder.
$ wget
https://github.com/TransDecoder/TransDecoder/archive/TransDecoder-
v5.5.0.zip
$ gunzip TransDecoder-v5.5.0.zip
$ ./TransDecoder.LongOrfs -t clusteredassembly.fasta
-m 75 –o outputfolder
4 Notes
Acknowledgments
References
1. Eng JK, McCormack AL, Yates JR (1994) An Characterization of Quercus ilex seed and
approach to correlate tandem mass spectral Pinus radiata needle proteomes by using
data of peptides with amino acid sequences in SEQUEST and custom databases. J Proteome
a protein database. J Am Soc Mass Sprectrom 105:85–91
5:976–989 5. Valledor L, Jorrı́n-Novo JV, Rodrı́guez JL et al
2. Perkins DN, Pappin DJ, Creasy DM et al (2010) Combined proteomic and transcrip-
(1999) Probability-based protein identification tomic analysis identifies differentially expressed
by searching sequence databases using mass pathways associated to Pinus radiata needle
spectra. Bioinformatics 20:3551–3567 maturation. J Proteome Res 9:3954–3979
3. Cox J, Neuhauser N, Michalski A et al (2011) 6. Guerrero-Sanchez VM, Maldonado-Alconada
Andromeda: a peptide search engine integrated AM, Amil-Ruiz F et al (2017) Holm oak
into the MaxQuant environment. J Proteome (Quercus Ilex) transcriptome. De novo
Res 10:1794–1805 sequencing and assembly analysis. Front Mol
4. Romero-Rodrı́guez MC, Pascual J, Valledor L Biosci 4:70
et al (2014) Improving the quality of protein 7. Guerrero-Sanchez VM, Maldonado-Alconada
identification in non-model species. AM, Amil-Ruiz F et al (2019) Ion torrent and
68 Vı́ctor M. Guerrero-Sanchez et al.
lllumina, two complementary RNA-seq plat- 19. Fu L, Niu B, Zhu Z et al (2012) CD-HIT:
forms for constructing the holm oak (Quercus accelerated for clustering the next-generation
ilex) transcriptome. PLoS One 14:e0210356 sequencing data. Bioinformatics
8. Benjamini Y, Speed TP (2012) Summarizing 28:3150–3152
and correcting the GC content bias in high- 20. Bushmanova E, Antipov D, Lapidus A et al
throughput sequencing. Nucleic Acids Res 40: (2016) RnaQUAST: a quality assessment tool
e72 for de novo transcriptome assemblies. Bioinfor-
9. Hansen K, Brenner S (2010) Biases in Illumina matics 32:2210–2212
transcriptome sequencing caused by random 21. Li B, Fillmore N, Bai Y et al (2014) Evaluation
hexamer priming. Nucleic Acids Res 38:e131 of de novo transcriptome assemblies from
10. Martin M (2011) Cutadapt removes adapter RNA-Seq data. Genome Biol 15:553
sequences from high-throughput sequencing 22. Li B, Dewey CN (2011) RSEM: accurate tran-
reads. EMBnet J 17:10–20 script quantification from RNA-Seq data with
11. Zerbino DR, Birney E (2008) Velvet: algo- or without a reference genome. BMC Bioinfor-
rithms for de novo short read assembly using matics 12:323
de Bruijn graphs. Genome Res 18:821–829 23. Simão FA, Waterhouse RM, Ioannidis P
12. Simpson JT, Wong K, Jackman SD et al (2009) (2015) BUSCO: assessing genome assembly
ABySS: a parallel assembler for short read and annotation completeness with single-copy
sequence data. Genome Res 9:1117–1123 orthologs. Bioinformatics 31:3210–3212
13. Li R, Zhu H, Ruan J et al (2010) De novo 24. Waterhouse RM, Seppey M, Simão FA et al
assembly of human genomes with massively (2017) BUSCO applications from quality
parallel short read sequencing. Genome Res assessments to gene prediction and phyloge-
20:265–272 nomics. Mol Biol Evol 35:543–548
14. Grabherr MG, Haas BJ, Yassour M et al (2011) 25. Muñoz-Merida A, Viguera E, Claros MG et al
Trinity: reconstructing a full-length transcrip- (2014) Sma3s: a three-step modular annotator
tome without a genome from RNA-Seq data. for large sequence datasets. DNA Res
Nat Biotechnol 29:644–652 21:341–353
15. Chevreux B, Wetter T, Suhai S (1999) Genome 26. Casimiro-Soriguer CS, Muñoz-Mérida A,
sequence assembly using trace signals and addi- Pérez-Pulido AJ (2017) Sma3s: a universal
tional sequence information. German Conf tool for easy functional annotation of pro-
Bioinformatics 99:45–56 teomes and transcriptomes. Proteomics
16. Boisvert S, Laviolette F, Corbeil J (2010) Ray: 17:1700071
simultaneous assembly of reads from a mix of 27. Haas B, Papanicolaou A (2017) TransDecoder.
high-throughput sequencing technologies. J https://transdecoder.github.io
Comput Biol 17:1519–1533 28. Bryant DM, Johnson K, DiTommaso T et al
17. Bradnam KR, Fass JN, Alexandrov A et al (2017) A tissue-mapped axolotl de novo tran-
(2013) Assemblathon 2: evaluating de novo scriptome enables identification of limb regen-
methods of genome assembly in three verte- eration factors. Cell Rep 18:762–776
brate species. Gigascience 2:10 29. Conesa A, Götz S (2008) Blast2GO: a compre-
18. Weizhong L, Godzik A (2006) Cdhit: a fast hensive suite for functional analysis in plant
program for clustering and comparing large genomics. Int J Plant Genomics 2008:619832
sets of protein or nucleotide sequences. Bioin- 30. Ma B (2015) Novor: real-time peptide de novo
formatics 22:16589 sequencing software. J Am Soc Mass Spectrom
26:1885–1894
Chapter 5
Abstract
The complexity of the plant cell proteome, exhibiting thousands of proteins whose abundance varies in
several orders of magnitude, makes impossible to cover most of the plant proteins using standard shotgun-
based approaches. Despite this general description of plant proteomes, the complexity is not a big issue
(current protocols and instrumentation allow for the identification of several thousand proteins per
injection), low or medium abundant proteins cannot be detected most of times, being necessary to fraction
or perform targeted analyses in order to detect and quantify them. Among fractioning choices, cell
fractioning in its different organelles is a good strategy for gaining not only a deeper coverage of the
proteome but also the basis for understanding organelle function, protein dynamics, and trafficking within
the cell, as nuclear and chloroplast communication. This approach is used routinely in many labs working
with model species; however, the available protocols focusing on tree species are scarce. In this chapter, we
provide a simple but robust protocol for isolating nuclei and chloroplasts in pine needles that is fully
compatible with later mass spectrometry–based proteome analysis.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_5, © Springer Science+Business Media, LLC, part of Springer Nature 2020
69
70 Laura Lamelas et al.
2 Materials
SUBCELLULAR PROTEOME
EXTRACTION
NUCLEUS CHLOROPLAST
• Organelle • Chloroplast
Extraction Isolation
Buffer PLASTID Buffer
• Filtration ISOLATION • Filtration
• Centrifugation • Differential
centrifugation
• Plastid
• Chloroplast
Disruption
ORGANELLE Isolation
Buffer
CLEANING Buffer
• Centrifugation
• Differential
• Washing Buffer
centrifugation
• Centrifugation
• Pellet
Suspension • Percoll-
Buffer PURIFICATION BY Sucrose
• Sucrose DISCONTINUOUS GRADIENT gradient
gradient
PROTEIN EXTRACTION
• Protein Extraction Buffer and SDS
• Incubate by shaking (15 min,RT)
• Buffer Z and phenol
• Centrifugation
• Protein Precipitation Buffer
• Incubate overnight (-20ºC)
PROTEIN PURIFICATION
• Centrifugation
• Washes with acetone
• Air dry
PROTEIN RESUSPENSION
• Protein Solubilization Solution
PROTEIN ASSESSMENT
3 Methods
3.1 Nuclei Isolation Unless otherwise specified, all steps of nuclei isolation must be
performed at 4 C
1. Cell lysis.
– Collect pine needles and keep them at 80 C or in liquid
nitrogen until use.
– Ground 1 g of plant material to fine powder using a mortar
and pestle in liquid nitrogen. Immediately transfer the pow-
der to a 50 mL tube with 10 mL of Organelle Extraction
Buffer (OEB) and mix gently by inversion.
– Incubate on ice for 30 min and mix it carefully by inversion
every 5 min to avoid the formation of clumps.
2. Nuclei isolation.
Pine Nuclei and Chloroplast Proteomics 75
– Centrifuge the tubes and wash the protein pellet with acetone
twice.
– Dry the pellets at room temperature and dissolve them in the
minimum amount of Protein Solubilization Solution (PSS)
(30–40 μL or more) (see Note 9).
– Protein content can be quantified by BCA assay [12]. And the
enrichment in nuclear proteins can be assessed by 1-DE SDS-
PAGE by comparing total protein fraction with nuclei/chloro-
plast fraction.
4 Expected Results
5 Notes
Table 1
Expected results of protein extraction yield and identifications
Acknowledgments
References
1. Hebert AS, Richards AL, Bailey DJ et al (2014) 7. Bouchnak I, Brugiere S, Moyet L et al (2019)
The one hour yeast proteome. Mol Cell Prote- Unraveling hidden components of the chloro-
omics 13:339–347 plast envelope proteome: opportunities and
2. Kislinger T, Gramolini AO, MacLennan et al limits of better MS sensitivity. Mol Cell Prote-
(2005) Multidimensional protein identifica- omics 18:1285–1306
tion technology (MudPIT): technical overview 8. Wu W, Yan Y (2018) Chloroplast proteome
of a profiling method optimized for the com- analysis of Nicotiana tabacum overexpressing
prehensive proteomic investigation of normal TERF1 under drought stress condition. Bot
and diseased heart tissue. J Am Soc Mass Spec- Stud 59:26
trom 16:1207–1220 9. Brown EC, Somanchi A, Mayfield SP (2001)
3. Pascual J, Alegre S, Nagler et al (2016) The Interorganellar crosstalk: new perspectives on
variations in the nuclear proteome reveal new signaling from the chloroplast to the nucleus.
transcription factors and mechanisms involved Genome Biol 2:REVIEWS1021
in UV stress response in Pinus radiata. J Pro- 10. Colombo M, Tadini L, Peracchio C et al
teome 143:390–400 (2016) GUN1, a jack-of-all-trades in chloro-
4. Millar AH, Taylor NL (2014) Subcellular plast protein homeostasis and signaling. Front
proteomics-where cell biology meets protein Plant Sci 7:1427
chemistry. Front Plant Sci 5:55 11. Zhao X, Huang J, Chory J (2019) GUN1
5. Goto C, Hashizume S, Fukao Y et al (2019) interacts with MORF2 to regulate plastid
Comprehensive nuclear proteome of Arabi- RNA editing during retrograde signaling.
dopsis obtained by sequential extraction. Proc Natl Acad Sci U S A 116:10162–10167
Nucleus 10:81–92 12. Smith PK, Krohn RI, Hermanson GT et al
6. Yin X, Komatsu S (2016) Plant nuclear prote- (1985) Measurement of protein using bicinch-
omics for unraveling physiological function. oninic acid. Anal Biochem 150:76–85
New Biotechnol 33:644–654
Chapter 6
Abstract
Proteins in the extracellular space (apoplast) play a crucial role at the interface between plant cells and their
proximal environment. Consequently, it is not surprising that plants actively control the apoplastic pro-
teomic profile in response to biotic and abiotic cues. Comparative quantitative proteomics of plant
apoplastic fluids is therefore of general interest in plant physiology. We here describe an efficient method
to isolate apoplastic fluids from Arabidopsis thaliana leaves inoculated with a nonadapted powdery mildew
pathogen.
Key words Apoplast, Protein secretion, Membrane trafficking, Arabidopsis thaliana, Cell wall
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_6, © Springer Science+Business Media, LLC, part of Springer Nature 2020
79
80 Ryohei Thomas Nakano et al.
2 Materials
2.2 Apoplastic Fluid 1. Infiltration Buffer: 5 mM sodium acetate, 0.2 M calcium chlo-
Preparation ride, pH 4.3 (see Note 1). Store at 4 C.
2. Protease inhibitor cocktail, EDTA-free (Roche).
3. Flat-bottom glass beakers (300 mL and 200 mL).
4. A vacuum pump and a vacuum desiccator (see Note 2).
5. Paper towel.
6. Blunt-end needleless syringe (20 mL).
7. 50-mL low-polymer conical tubes (see Note 3).
8. 2-mL low-polymer microtubes (see Note 3).
3 Methods
3.1 Plant Growth 1. To sterilize the seed surface, mix a scoop of A. thaliana seeds
and Pathogen (200~500 seeds) with 1 mL of 70% ethanol in a 1.5- or 2-mL
Inoculation microtube and shake or rotate for 10 min. Discard the ethanol
and briefly rinse the seeds with 99.9% ethanol. Leave on a
bench with its lid open for 30~60 min to dry the seeds (see
Note 5).
2. Sow a generous seed number (see Note 5) in 9 9 9.5 cm3
pots filled with greenhouse soil. Start plant cultivation in the
growth chamber for A. thaliana (see Note 6).
3. Keep watering the plants every 2–3 days. Avoid excessive water-
ing to minimize algae growth on soil surface.
4. After 2 weeks, remove the majority of seedlings so that each pot
harbors 4–5 plants of similar size. In case of plant dwarfism
(e.g., due to particular gene defects), this number can be
increased up to 10–25 plants/pot (see Note 7).
5. Continue plant cultivation under the same conditions for
another 2–3 weeks.
6. One week before pathogen inoculation, bulk up fresh coni-
diospores of Bgh isolate K1 on 7-day-old susceptible H. vulgare
seedlings and incubate in a growth chamber for barley
cultivation.
7. Inoculate Bgh conidiospores on A. thaliana plants by tapping
the leaf blades of infected barley plants (see Note 8). A subset of
plants is used as non-inoculated samples (0 hpi; hour post
inoculation) and proceed to preparation of apoplastic fluid
(see Note 8).
8. Incubate inoculated A. thaliana plants under the same condi-
tions for a desired time period.
3.2 Apoplastic Fluid 1. Collect entire shoots by cutting hypocotyls in a 300-mL glass
Preparation beaker containing 100 mL of the Infiltration Buffer, freshly
supplemented with two tablets of the Protease Inhibitor Cock-
tail (see Notes 9 and 10).
2. Submerge shoots in the infiltration buffer by placing on top an
empty 200-mL glass beaker (Fig. 1a).
3. Vacuum for 10 min and release gently. Vacuum release should
take longer than 10 min (see Note 11).
4. Remove excessive buffer on the leaf surface by blot drying on a
paper towel (Fig. 1b).
5. Introduce plant shoots in a 20-mL blunt-end needleless syringe
after removal of its plunger.
Leaf Secretome of Pathogen-Challenged Arabidopsis Thaliana 83
200-mL
beaker
300-mL
beaker
B C 20-mL blunt-end
needle-less syringe
1,000 g
20 min.
50-mL
conical tube
Fig. 1 Apoplast fluid preparation. (a) Buffer infiltration. A. thaliana shoots are submerged into the Infiltration
Buffer in a 300-mL flat-bottom beaker, and a 200-mL flat-bottom beaker is used as a weight during vacuum
infiltration. (b) Shoots are collected from the beaker, carefully flattened, and blot-dried on a paper towel. (c)
Shoots are introduced in a 20-mL blunt-end needleless syringe within a 50-mL conical tube. After centrifu-
gation, the apoplastic fluid will be extracted to the space between the syringe and the surrounding conical tube
4 Notes
Fig. 2 Randomized design of plant cultivation. Pots need to be randomized when multiple genotypes are to be
compared. A schematic example for six genotypes in a single technical replicate (four pots per replicate) is
shown
Acknowledgments
References
1. Delaunois B, Jeandet P, Clément C et al (2014) proteins and cell wall modifying enzymes.
Uncovering plant-pathogen crosstalk through BMC Plant Biol 1:24
apoplastic proteomic studies. Front Plant Sci 11. Konozy EHE, Rogniaux H, Causse M, Fauro-
5:249 bert M (2013) Proteomic analysis of tomato
2. Uemura T, Nakano RT, Takagi J et al (2019) A (Solanum lycopersicum) secretome. J Plant Res
Golgi-released subpopulation of the trans- 126:251–266
Golgi network mediates protein secretion in 12. Wen F, VanEtten HD, Tsaprailis G, Hawes MC
arabidopsis. Plant Physiol 179:519–532 (2007) Extracellular proteins in pea root tip
3. Ruhe J, Agler MT, Placzek A et al (2016) and border cell exudates. Plant Physiol
Obligate biotroph pathogens of the genus 143:773
albugo are better adapted to active host defense 13. Delannoy M, Alves G, Vertommen D et al
compared to niche competitors. Front Plant (2008) Identification of peptidases in Nicoti-
Sci 7:820 ana tabacum leaf intercellular fluid. Proteo-
4. Kusumawati L, Imin N, Djordjevic MA (2008) mics 8:2285–2298
Characterization of the secretome of suspen- 14. Soares NC, Francisco R, Ricardo CP, Jackson
sion cultures of Medicago species reveals pro- PA (2007) Proteomics of ionically bound and
teins important for defense and development. J soluble extracellular proteins in Medicago trun-
Proteome Res 7:4508–4520 catula leaves. Proteomics 7:2070–2082
5. Oh IS, Park AR, Bae MS et al (2005) Secre- 15. Witzel K, Shahzad M, Matros A et al (2011)
tome analysis reveals an Arabidopsis lipase Comparative evaluation of extraction methods
involved in defense against Alternaria brassici- for apoplastic proteins from maize leaves. Plant
cola. Plant Cell 17:2832 Methods 7(1):48
6. Okushima Y, Koizumi N, Kusano T, Sano H 16. Agrawal GK, Jwa N-S, Lebrun M-H et al
(2000) Secreted proteins of tobacco cultured (2010) Plant secretome: unlocking secrets of
BY2 cells: identification of a new member of the secreted proteins. Proteomics 10:799–827
pathogenesis-related proteins. Plant Mol Biol 17. Casasoli M, Spadoni S, Lilley KS et al (2008)
42:479–488 Identification by 2-D DIGE of apoplastic pro-
7. Cho WK, Chen XY, Chu H et al (2009) Pro- teins regulated by oligogalacturonides in Ara-
teomic analysis of the secretome of rice calli. bidopsis thaliana. Proteomics 8:1042–1054
Physiol Plant 135:331–341 18. Wang Y, Kim SG, Wu J et al (2014) Differential
8. Waghmare S, Lileikyte E, Karnik R et al (2018) proteome and secretome analysis during rice-
SNAREs SYP121 and SYP122 mediate the pathogen interaction. Methods Mol Biol
secretion of distinct cargo subsets. Plant 1072:563–572
Physiol 178:1679–1688 19. Zhou F, Kurth J, Wei F et al (2001) Cell-
9. Lohaus G, Pennewiss K, Sattelmacher B et al autonomous expression of barley Mla1 confers
(2001) Is the infiltration-centrifugation tech- race-specific resistance to the powdery mildew
nique appropriate for the isolation of apoplastic fungus via a Rar1-independent signaling path-
fluid? A critical evaluation with different plant way. Plant Cell 13(2):337–350
species. Physiol Plant 111:457–465 20. Adam L, Somerville SC (1996) Genetic char-
10. Delaunois B, Colby T, Belloy N et al (2013) acterization of five powdery mildew disease
Large-scale proteomic analysis of the grapevine resistance loci in Arabidopsis thaliana. Plant J
leaf apoplastic fluid reveals mainly stress-related
88 Ryohei Thomas Nakano et al.
Abstract
Shotgun proteomics allows for the comprehensive analysis of proteins extracted from plant cells, subcellular
organelles, and membranes. Previously, two-dimensional gel electrophoresis-based proteomics was used for
mass spectrometric analysis of plasma membrane proteins. However, this method is not fully applicable for
highly hydrophobic proteins with multiple transmembrane domains. In order to solve this problem, we
here describe a shotgun proteomics method using nano-LC-MS/MS for proteins in the plasma membrane
and plasma membrane microdomain fractions. The results obtained are easily applicable to label-free
protein semiquantification.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_7, © Springer Science+Business Media, LLC, part of Springer Nature 2020
89
90 Daisuke Takahashi et al.
2 Materials
2.1 Plasma Ultrapure water, the Polytron homogenizer, centrifuge, and ultra-
Membrane Purification centrifuge rotors should be kept and used at 4 C.
Components
1. Homogenizing medium: 0.5 M sorbitol, 50 mM Mops-KOH,
pH 7.6, 5 mM EGTA (pH 8.0), 5 mM EDTA (pH 8.0), 5%
(w/v) polyvinylpyrrolidone-40 (molecular weight 40,000),
0.5% (w/v) bovine serum albumin (BSA), 2.5 mM
Shotgun Proteomics of Plasma Membrane 91
2.3 In-Gel Tryptic 1. Running gel solution: 1.5 M Tris–HCl, pH 8.8. Add approxi-
Digestion mately 900 mL water to a 1 L glass beaker. Add 181.7 g Tris
and stir moderately using a stirring bar. After Tris is completely
2.3.1 SDS–
dissolved, adjust pH with HCl using a pH meter. Add water up
Polyacrylamide Gel
to 1 L with a graduated cylinder (see Note 6).
Components
2. SDS sample buffer (2): 2% (w/v) SDS, 50 mM Tris–HCl
(pH 6.8), 6% (v/v) β-mercaptoethanol, 10% (w/v) glycerol,
92 Daisuke Takahashi et al.
2.3.2 Tryptic Digestion All of these processes should be carefully performed at a clean
Components bench with gloves and clean lab coat throughout to avoid contami-
nation by keratin, dust, and other exogenous proteinaceous
materials.
1. Fixation solution: Mix 50 mL of water, 40 mL of ethanol
(99.5%, w/w) and 10 mL of acetic acid (100%, w/w).
2. 0.1 M ammonium bicarbonate: Weigh 3.95 g of ammonium
bicarbonate and add to 400 mL of water in a glass beaker. Stir
moderately using a stirring bar and add water up to 500 mL
with a graduated cylinder. Store at 4 C (see Note 10).
3. Acetonitrile (LC-MS grade) (Wako Pure Chemical Industries,
Tokyo, Japan). Store at 4 C (see Note 10).
4. 25 mM ammonium bicarbonate/50% (v/v) acetonitrile: Mix
50 mL of acetonitrile (LC-MS grade), 25 mL of 0.1 M ammo-
nium bicarbonate, and 25 mL of water. Store at 4 C (see Note
10).
5. Reduction buffer: Weigh 7.7 mg DTT and add to 5 mL of
0.1 M ammonium bicarbonate in a conical tube (see Note 11).
6. 55 mM iodoacetamide (IAA)/0.1 M ammonium bicarbonate:
Weigh 51 mg IAA and add to 5 mL of 0.1 M ammonium
bicarbonate in a conical tube (see Note 11).
7. Protease solution: Add 2 mL of 0.1 M ammonium bicarbonate
into a vial containing 20 μg of trypsin (Sequence grade
Shotgun Proteomics of Plasma Membrane 93
2.4 In-Solution In order to avoid keratin, dust, and other exogenous proteinaceous
Tryptic Digestion materials, all of these processes must be carefully performed at a
clean bench with gloves and a clean lab coat throughout the
in-solution tryptic digestion.
1. MPEX PTS reagents kit (GL Sciences, Tokyo, Japan): Make
solution B, DTT solution, IAA solution, and trypsin solution
according to the manufacturer’s instruction manual. Only
solution B should be stored at 4 C. Prepare fresh DTT solu-
tion, IAA solution, and trypsin solution immediately
before use.
2. 5% (v/v) acetonitrile/0.1% (v/v) TFA: Add 50 μL of acetoni-
trile and 1 μL of TFA into 949 μL of water and mix well (see
Note 11).
3. Pierce BCA protein assay kit (Thermo Fisher Scientific, MA,
USA): Store at room temperature.
2.5 Peptide 1. SPE C-TIP T-300 (Nikkyo Technos Co., Ltd., Tokyo, Japan).
Purification 2. 1.5 mL microtubes: Make a hole of 3 mm in diameter on the
Components cap with a soldering iron. Prepare two tubes per sample (Two
microtubes with a hole on the lid per one sample preparation
will be used in two steps of the peptide purification, Subhead-
ing 3.4, steps 1 and 6, below).
3. Solution A: Add 800 μL of acetonitrile and 5 μL of TFA into
195 μL of water and mix well (see Note 11).
4. Solution B: Add 40 μL of acetonitrile and 5 μL of TFA into
955 μL of water and mix well (see Note 11).
5. 0.1% (v/v) TFA: Quickly add 1 μL of TFA into 999 μL of water
and mix well (see Note 11).
3 Methods
Wear gloves and a clean lab coat throughout the processes to avoid
contamination by keratin, dust and other exogenous proteinaceous
materials. It is preferable to use low protein absorption microtubes
at all stages.
94 Daisuke Takahashi et al.
Fig. 1 Schematic outline of the plasma membrane preparation. All steps from harvesting leaves to suspending
the purified plasma membrane fractions are described
3.1 Plasma Perform all steps on crushed ice (unless indicated otherwise). Sche-
Membrane Purification matic outline of the procedure is described in Fig. 1.
1. Cut out the aerial parts of oat seedlings and weigh the samples
(10–70 g in fresh weight is suitable for the plasma membrane
purification). Put the harvested plants on a plastic container
and wash with 500 mL of chilled water. Wash twice. Drain on a
paper towel and put on crushed ice.
2. Cut into small pieces with razor blades. Immediately, put into
four volumes of chilled homogenizing medium and mix well
with a spatula. The homogenizing medium containing the
samples should be again cooled on crushed ice (see Note 12).
3. Homogenize with a chilled Polytron generator (PT10SK,
Kinematica Inc., Lucerne, Switzerland) until the samples are
broken down into tiny pieces (speed 6 for 60–90 s). Filter the
homogenates through four layers of gauze and squeeze tightly.
Shotgun Proteomics of Plasma Membrane 95
11. Centrifuge at 650 g for 5 min and split the resultant upper
phase of tube C into two ultracentrifuge tubes. Fill up the
tubes with PM suspension medium and balance them. Ultra-
centrifuge at 231,000 g for 50 min, as described in step 4 (see
Note 15).
12. Discard the supernatant with an aspirator. Add an appropriate
quantity of PM suspension medium to each tube and homoge-
nize the pellets with a Teflon–glass homogenizer. Collect the
plasma membrane suspensions with a large Pasteur pipette into
ultracentrifuge tubes. Balance ultracentrifuge tubes in pairs
with PM suspension medium. Ultracentrifuge again at
231,000 g for 35 min.
13. Discard the supernatant with an aspirator. Add a minimal
quantity of PM-suspension medium to the plasma membrane
pellets. Homogenize the pellets with a glass rod. Transfer into a
Teflon–glass homogenizer and homogenize well using an elec-
tric Teflon–glass homogenizer (moving up and down five
times) with cooling on ice. Transfer into a 1.5 mL microtube.
14. Measure the protein content using the Bradford assay (Bio-Rad
Protein Assay Kit). Use 10 μg of protein for tryptic digestion
and LC-MS/MS analysis. The remaining PM fractions should
be frozen in liquid nitrogen immediately and stored at 80 C.
3.2 Detergent- Perform all steps on crushed ice (unless indicated otherwise).
Resistant Membrane
1. Prepare PM with approximately 2.5 mg protein and dilute with
Extraction PM-suspension medium in an ultracentrifuge tube. After bal-
ancing the tubes in pairs, ultracentrifuge at 231,000 g for
35 min (see Note 16).
2. Add 2000 μL of PM-suspension medium in an ultracentrifuge
tube and grind pellets with a glass rod. Transfer into a Teflon–
glass homogenizer and homogenize well using an electric
homogenizer (moving up and down five times). Measure the
protein content by Bradford assay and place PM samples with
2 mg of protein into a 35 mL swing rotor tube. Adjust the
volume to 2.7 mL by adding PM suspension medium.
3. Add 300 μL of 10% (w/v) Triton X-100 buffer and mix well
(at this point, protein–detergent ratio is 1:15). Incubate for
30 min.
4. Add 12 mL of 65% (w/w) sucrose solution and mix well (at this
point, the final concentration of sucrose is 52%). Overlay 5 mL
of 48%, 35%, 30%, and 5% (w/w) sucrose solution slowly in
sequence (see Note 17).
5. Balance the swing rotor tubes in pairs by adding 5% (w/w)
sucrose solution and ultracentrifuge in a swing rotor at
141,000 g for 20 h.
Shotgun Proteomics of Plasma Membrane 97
3.3 In-Gel Tryptic 1. Mix 2.5 mL running gel solution, 3.35 mL 30% (w/v) acryl-
Digestion amide solution, and 3.95 mL water in a conical flask. Degas
with a vacuum pump for 5 min. Add 100 μL of 10% (w/v)
3.3.1 SDS–
ammonium persulfate, 100 μL of 10% (w/v) SDS and 5 μL of
Polyacrylamide Gel
TEMED. Cast gel into a 90 mm (W) 83 mm (H) 1 mm
Electrophoresis
(T) gel cassette immediately. Insert a 14-well comb without
introducing air bubbles. Incubate at room temperature for 1 h
(see Note 19).
2. Mix 5 μg of PM or DRM protein samples (within 10 μL) and
equal volume of SDS sample buffer. Vortex and centrifuge
tubes briefly. Heat at 95 C for 5 min. Centrifuge and cool to
room temperature.
3. Wash out the wells by pipetting up and down. Slowly load the
samples onto the gel. Electrophorese at 100 V until the upper
end of sample dye band enters 2 mm from the well (see Note
20).
4. Pry the gel plates open with a knife. Cut out the gel slice from
the well to 2 mm in front of the BPB dye with a scalpel on a
glass plate. Cut the gel slice into four equal pieces (Fig. 2) and
put into 1.5 mL microtubes (see Note 21).
5. Add 200 μL of fixation solution and agitate for 10 min. Cen-
trifuge briefly and discard the supernatant. Repeat these steps
twice.
3.3.2 In-Gel Tryptic All of these procedures should be performed at room temperature
Digestion for Nano-LC-MS/ (unless otherwise specified).
MS
1. Add 200 μL of water and agitate for 10 min. Centrifuge briefly
and discard the supernatant.
98 Daisuke Takahashi et al.
Fig. 2 Excision of a protein band from gel. Wells are separated to avoid
contamination of different samples during loading and electrophoresis. After
the BPB dye migrates into a gel (about 2 mm), a 4 mm of gel piece centered on
the BPB dye band is cut out. Subsequently, the gel piece is cut into four equal
pieces. Each of the gel pieces is separately put into 1.5 mL microtubes
3.6 Nano-LC-MS/MS Separate digested and purified peptide solutions with a C18 column
Analysis by nano-flow LC. Make a linear gradient of acetonitrile (from 5%
[v/v] to 45% [v/v]) at a flow rate of 500 nL/min for 100 min.
Detect and analyze the separated and ionized peptides in a mass
spectrometer. Examples of analyzed results using 5 μg of oat PM
and DRM proteins are shown in Figs. 4, 5, and 6. You can see
detailed results in figure legends.
4 Notes
1. Mops-KOH (pH 7.6), EGTA (pH 8.0), and EDTA (pH 8.0)
should be prepared as 0.5 M stock solutions and stored at 4 C.
The pH of EGTA and EDTA should be adjusted using NaOH.
When BSA is dissolved, BSA powder should be preset at room
temperature. PMSF and SHAM should be prepared as 1 M and
1.6 M stock solutions in DMSO, respectively, and stored at
4 C. DTT should be stored at 30 C as a 1 M stock solution.
PMSF, SHAM, DTT should be diluted only as needed just
before use. If you prepare Arabidopsis PM fraction, the homo-
genizing medium should consist of 0.5 M sorbitol, 50 mM
Mops-KOH (pH 7.6), 5 mM EGTA (pH 8.0), 5 mM EDTA
(pH 8.0), 1.5% (w/v) polyvinylpyrrolidone-40 (molecular
weight 40,000), 0.5% (w/v) BSA, 2 mM PMSF, 4 mM
SHAM, and 2.5 mM DTT.
2. KH2PO4/K2HPO4 (K-P) buffer (pH 7.8) should be prepared
as a 0.5 M stock solution and diluted to make the MS suspen-
sion medium. First, 200 mL of 0.5 M K2HPO4 and 30 mL of
0.5 M KH2PO4 are prepared. The pH of the 0.5 M K2HPO4 is
adjusted to 7.8 by adding 0.5 M KH2PO4, monitored by a pH
meter. If you prepare Arabidopsis PM fraction, the MS sus-
pending medium should contain 10 mM KH2PO4/K2HPO4
(K-P) buffer (pH 7.8), 0.3 M sucrose.
3. If you prepare Arabidopsis PM fraction, the final concentration
of NaCl should be adjusted to 100 m M in the MS suspension
medium.
4. Mops-KOH (pH 7.3), EGTA (pH 8.0) should be prepared as a
0.5 M stock solution and stored at 4 C. If you prepare Arabi-
dopsis PM fraction, the PM suspending medium consists of
102 Daisuke Takahashi et al.
Fig. 4 The number of transmembrane domains in oat PM proteins separated and identified following in-gel or
in-solution tryptic digestion. Peptide sequences were searched against the NCBI database (version
20,120,216, comprising 17,282,984 sequences), taxonomy viridiplantae. Transmembrane domains were
estimated by SOSUI engine ver. 1.10 (http://bp.nuap.nagoya-u.ac.jp/sosui/). (a) Proteins with up to 24 trans-
membrane domains were identified in oat PM by in-gel digestion in four biological replicates. On average,
700 proteins with transmembrane domains were identified in the four replicates. (b) Proteins with up to
24 transmembrane domains were identified in oat PM by in-solution digestion in four biological replicates. On
average, 397 proteins with transmembrane domains were identified in the four replicates
Fig. 5 Distribution of the number of transmembrane domains in oat DRM proteins identified using the “In-
Solution Digestion” protocol. Peptides were analyzed as described in Fig. 4. Compared to PM (Fig. 4b), in DRM,
more transmembrane proteins, especially those containing more than five transmembrane domains, could be
identified
Fig. 6 Estimated number of proteins with secretory signal peptides in oat PM. 173 and 81 proteins with signal
peptides were identified from four replicates of in-gel or in-solution tryptic digests of 5 μg oat PM proteins,
respectively
104 Daisuke Takahashi et al.
Acknowledgments
References
1. Aebersold R, Mann M (2003) Mass 7. Gorg A, Weiss W, Dunn M-J (2004) Current
spectrometry-based proteomics. Nature two-dimensional electrophoresis technology
422:198–207 for proteomics. Proteomics 4:3665–3685
2. Domon B, Aebersold R (2006) Mass spec- 8. Rabilloud T (2009) Membrane proteins and
trometry and protein analysis. Science proteomics: love is possible, but so difficult.
312:212–217 Electrophoresis 30:174–180
3. Kersten B, Burkle L, Kuhn E-J et al (2002) 9. Rabilloud T (2002) Two-dimensional gel elec-
Large-scale plant proteomics. Plant Mol Biol trophoresis in proteomics: old, old fashioned,
48:133–141 but it still climbs up the mountains. Proteomics
4. van Wijk KJ (2001) Challenges and prospects 2:3–10
of plant proteomics. Plant Physiol 10. Simons K, Ikenen E (1997) Functional rafts in
126:501–508 cell membranes. Nature 387:569–572
5. Santoni V, Kieffer S, Desclaux D et al (2000) 11. Peskan T, Westermann M, Oelmuller R (2000)
Membrane proteomics: use of additive main Identification of low-density triton
effects with multiplicative interaction model X-100-insoluble plasma membrane microdo-
to classify plasma membrane proteins accord- mains in higher plants. Eur J Biochem
ing to their solubility and electrophoretic prop- 267:6989–6995
erties. Electrophoresis 21:3329–3344 12. Mongrand S, Morel J, Laroche J et al (2004)
6. Luche S, Santoni V, Rabilloud T (2003) Eval- Lipid rafts in higher plant cells. J Biol Chem
uation of nonionic and zwitterionic detergents 279:36277–36286
as membrane protein solubilizers in 13. Bhat RA, Panstruga R (2005) Lipid rafts in
two-dimensional electrophoresis. Proteomics plants. Planta 223:5–19
3:249–253
106 Daisuke Takahashi et al.
14. Martin SW, Glover BJ, Davies JM (2005) Lipid proteome in oat and rye: similarities and dis-
microdomains: plant membranes get similarities between two monocotyledonous
organized. Trends Plant Sci 10:263–265 plants. J Proteome Res 11:1654–1665
15. Grennan AK (2007) Lipid rafts in plants. Plant 19. Li B, Takahashi D, Kawamura Y et al (2012)
Physiol 143:1083–1085 Comparison of plasma membrane proteomic
16. Thakur S-S, Geiger T, Chatterjee B et al (2011) changes of Arabidopsis suspension cells (T87
Deep and highly sensitive proteome coverage line) after cold and abscisic acid treatment in
by LC-MS/MS without prefractionation. Mol association with freezing tolerance develop-
Cell Proteomics 10:M110.003699 ment. Plant Cell Physiol 53:543–554
17. Matros A, Kasper S, Witzel K et al (2011) 20. Masuda T, Tomita M, Ishihama Y (2008)
Recent progress in liquid chromatography- Phase transfer surfactant-aided trypsin diges-
based separation and label-free quantitative tion for membrane proteome analysis. J Prote-
plant proteomics. Phytochemistry 72:963–974 ome Res 7:731–740
18. Takahashi D, Kawamura Y, Yamashita T et al
(2011) Detergent-resistant plasma membrane
Chapter 8
Abstract
Subcellular proteome analysis is one of the most effective ways to reduce the complexity of total proteome.
With the advancement in protein extraction methodologies, it is now possible to fractionate and isolate the
proteins from subcellular compartments without significant contamination from the cytoplasm and other
organelles. Of the different subcellular proteomes, plasma membrane remained largely uncharacterized
because of the difficulties in isolation of contamination free plasma membrane proteins. Moreover, prote-
ome analysis in the past two decades majorly relied on the two-dimensional gel electrophoresis which
showed limited protein loading ability and poor separation of highly hydrophobic plasma membrane
proteins. Development of shotgun proteomics methods has facilitated the identification and quantification
of hydrophobic proteins isolated from plasma membrane or other cellular membranes. Here, we present a
simplified procedure for the isolation of plasma membrane proteins by a two-phase partitioning method
and their identification by shotgun proteomics approach using rice as a model plant.
Key words Plasma membrane, Shotgun proteomics, Rice, Two-phase partitioning, Signaling
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_8, © Springer Science+Business Media, LLC, part of Springer Nature 2020
107
108 Ravi Gupta et al.
2 Materials
3 Methods
3.1 Extraction of 1. Take 20 g of fresh healthy green leaves of rice and grind it in the
Plasma Membrane prechilled pestle and mortar using liquid nitrogen.
Proteins 2. Add 40 mL of homogenization buffer and vortex the homog-
enate for 5 min.
110 Ravi Gupta et al.
Fig. 1 (a) SDS-PAGE showing the protein profile of total cellular (T) as well as plasma membrane (PM) proteins
isolated from young rice leaves. (b) Immunoblots showing total (T) and plasma membrane proteins probed
with anti-glutamine synthase (GS), anti-histone 1 (H1), and anti-plasma membrane intrinsic protein 2 (PIP2)
antibodies used as cytosolic, nuclear, and plasma membrane markers, respectively. (c) Functional annotation
of the MS identified proteins using MapMan program showing signaling overview
3.3 Desalting of 1. Centrifuge the acidified digest at 2000 g for 15 min. Transfer
Peptides the supernatant to a new protein low bind tube and discard the
precipitate.
112 Ravi Gupta et al.
3.5 Data Processing 1. Export all the raw files and load in the MaxQuant software (see
Using MaxQuant Note 14).
Software 2. Download rice protein database (Osativa_373, 52424
sequences) file from Phytozome and upload to the MaxQuant
as the database file (see Note 15).
Rice Leaf Plasma Membrane Proteome 113
4 Notes
Acknowledgments
References
3. Cordwell SJ, Thingholm TE (2010) Technolo- involvement of a novel cysteine protease in its
gies for plasma membrane proteomics. Proteo- pathogenicity. J Proteome 169:202–214
mics 10:611–627 7. Wang Y, Wu J, Kim SG et al (2016) Magna-
4. Voothuluru P, Anderson JC, Sharp RE et al porthe oryzae-secreted protein MSP1 induces
(2016) Plasma membrane proteomics in the cell death and elicits defense responses in rice.
maize primary root growth zone: novel insights Mol Plant-Microbe Interact 29:299–312
into root growth adaptation to water stress. 8. Santoni V (2007) Plant plasma membrane pro-
Plant Cell Environ 39:2043–2054 tein extraction and solubilization for proteomic
5. Gupta R, Lee SE, Agrawal GK et al (2015) analysis. In: Plant proteomics. Springer, Cham,
Understanding the plant-pathogen interactions pp 93–109
in the context of proteomics-generated apoplas- 9. Meng Q, Gupta R, Min CW et al (2019) A
tic proteins inventory. Front Plant Sci 6:352 proteomic insight into the MSP1 and flg22
6. Wang Y, Gupta R, Song W et al (2017) Label- induced signaling in Oryza sativa leaves. J Prote-
free quantitative secretome analysis of Xantho- ome 196:120–130
monas oryzae pv. Oryzae highlights the
Chapter 9
Abstract
Subcellular proteomics include, in its experimental workflow, steps aimed at purifying organelles. The purity
of the subcellular fraction should be assessed before mass spectrometry analysis, in order to confidently
conclude the presence of associated specific proteoforms, deepening the knowledge of its biological
function. In this chapter, a protocol for isolating endoplasmic reticulum (ER) and purity assessment is
reported, and it precedes the proteomic analysis through a gel-free/label-free proteomic approach. Dys-
function of quality-control mechanisms of protein metabolism in ER leads to ER stress. Additionally, ER,
which is a calcium-storage organelle, is responsible for signaling and homeostatic function, and calcium
homeostasis is required for plant tolerance. With such predominant cell functions, effective protocols to
fractionate highly purified ER are needed. Here, isolation methods and purity assessments of ER are
described. In addition, a gel-free/label-free proteomic approach of ER is presented.
Key words Endoplasmic reticulum isolation, Endoplasmic reticulum purity, Subcellular proteomics,
Shotgun proteomics
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_9, © Springer Science+Business Media, LLC, part of Springer Nature 2020
117
118 Xin Wang and Setsuko Komatsu
2 Materials
2.6 Search Engine, 1. Search engine: Mascot search engine (version 2.5.1; Matrix
Software, Science, London, UK).
and Database 2. Software: Xcalibur software (version 2.0.7; Thermo Fisher Sci-
for Proteomic Analysis entific), Proteome Discoverer software (version 1.4.0.288;
Thermo Fisher Scientific), SIEVE software (version 2.1.377;
Thermo Fisher Scientific), MapMan software (version
3.6.0RC1), SUBA3 (http://suba3.plantenergy.uwa.edu.au/),
MultiLoc2 (http://abi.inf.uni-tuebingen.de/Services/Multi
Loc2), and WoLF PSORT (http://www.genscript.com/wolf-
psort.html).
3. Database: soybean peptide database constructed from soybean
genome database (Phytozome version 12; http://www.
phytozome.net/soybean) and Gmax_109_peptide (http://
mapman.gabipd.org/web/guest/mapman).
3 Methods
3.1 Isolation of Total The procedure is conducted on ice or at 4 C in the cold room.
ER Fraction
1. Fresh sample is ground in a glass mortar and pestle with isos-
motic homogenization buffer (see Note 1).
2. The homogenate is transferred to a Falcon tube and centri-
fuged at 3000 g for 10 min at 4 C.
3. The pellet is collected as Fraction 1. The supernatant is col-
lected and centrifuged at 12,000 g for 15 min at 4 C.
4. The pellet is collected as Fraction 2. The supernatant is col-
lected and centrifuged at 90,000 g for 60 min at 4 C.
5. The supernatant is discarded and the pellet collected as total
ER fraction (Fig. 1).
Sample homogenization
Pellet
(Total ER fraction)
Sample homogenization
Supernatant precipitation
Pellet
(Rough ER fraction)
3.5 Proteomic 1. ER fraction (Subheading 3.1, step 5 and Subheading 3.2, step
Analysis of ER Proteins 8) is dissolved in lysis buffer followed by sonication in cold
water for 20 min. The suspension is centrifuged at 20,000 g
3.5.1 Preparation
for 20 min at 4 C. The solubilized proteins kept in the
of Peptides for Gel-Free/
supernatant.
Label-Free Proteomic
Analysis 2. Proteins concentration is determined as described in Subhead-
ing 3.4, step 3.
3. Proteins (100 μg) are added to 400 μL of methanol and mixed
thoroughly before adding 100 μL of chloroform and 300 μL of
water.
4. Mixed sample is centrifuged at 20,000 g for 10 min at room
temperature to achieve phase separation.
5. Upper aqueous phase is discarded and 300 μL of methanol is
slowly added to lower phase.
6. Mixture is centrifuged at 20,000 g for 10 min at room
temperature. Supernatant is discarded and the pellet allowed
drying at room temperature.
7. Dried pellet is re-suspended in 20 μL of 50 mM ammonium
bicarbonate.
8. Proteins are reduced with 5 μL of 250 mM dithiothreitol in
50 mM ammonium bicarbonate for 30 min at 56 C.
9. Proteins are alkylated with 5 μL of 300 mM iodoacetamide in
50 mM ammonium bicarbonate for 30 min at 37 C in
darkness.
10. Alkylated proteins are resuspended in 40 μL of 100 mM
ammonium bicarbonate.
11. Proteins are digested with 10 μL of 0.1 μg/μL trypsin (Wako)
and 10 μL of 0.1 μg/μL lysyl endopeptidase (Wako) for 16 h at
37 C.
12. Peptides are acidified with 20 μL of 20% (v/v) formic acid
(pH < 3) and centrifuged at 20,000 g for 10 min at room
temperature.
13. Supernatant is collected and acidified peptides are desalted with
C18-pipette tips (SPE C-TIP, Nikkyo Technos, Tokyo, Japan).
14. Desalted peptides are subjected to nano-liquid chromatogra-
phy (LC)-MS/MS analysis.
Proteomic Analysis of Endoplasmic Reticulum 127
3.5.2 Mass Spectrometry Peptides are separated using an Ultimate 3000 nanoLC system
Analysis (Dionex, Germering, Germany) equipped with a C18 PepMap
trap column (300 mm ID 5 mm; Dionex) equilibrated with
0.1% formic acid and eluted with a linear acetonitrile gradient
(8–30% over 150 min) in 0.1% formic acid at a flow rate of
200 nL/min on a C18 Tip column (75 μm 1D 120 mm; Nikkyo
Technos) with a spray voltage of 1.8 kV. Peptide ions are detected
using a nanospray LTQ Orbitrap Discovery MS (Thermo Fisher
Scientific) in data-dependent acquisition mode with installed Xca-
libur software (version 2.0.7; Thermo Fisher Scientific). Full-scan
mass spectra are acquired in mass spectrometer over 400–1500 m/z
with a resolution of 30,000. A lock mass function is used to obtain
high mass accuracy. Ions of C24H39O4+ (m/z 391.28429),
C14H46NO7Si7+ (m/z 536.16536), and C16H52NO8Si8+ (m/z
610.18416) are used as lock mass standards [30]. Top ten most
intense precursor ions are selected for collision-induced fragmenta-
tion in linear ion trap at a normalized collision energy of 35%.
Dynamic exclusion is employed within 90 sec to prevent repetitive
selection of peptides [31].
3.5.4 Analysis of Relative Acquired Mascot results are exported into SIEVE software (version
Protein Abundance Using 2.1.377; Thermo Fisher Scientific) for quantitation analysis
Acquired Mass between the control and experimental groups. Chromatographic
Spectrometry Data peaks detected by MS are aligned, and peptide peaks are detected as
a frame on all parent ions scanned by MS/MS using 5 min of frame
time width and 10 ppm of frame m/z width. Areas of chro-
matographic peak within a frame are compared for each sample,
and ratios between samples are determined for each frame. Frames
with MS/MS scan are matched to Mascot results. Peptide ratios
128 Xin Wang and Setsuko Komatsu
3.5.5 Analysis Exported XML files from Mascot are used to analyze absolute
of Absolute Protein Amount protein abundance. The term of exponentially modified protein
Using Acquired Mass abundance index (emPAI) is used to indicate absolute protein
Spectrometry Data amount. The emPAI value of each identified protein is divided by
sum of emPAI values of all identified proteins and multiplied by
100. The absolute protein amount is determined by molar percent-
age (mol %) [34].
3.5.7 Protein Localization Protein localization is predicted using intracellular targeting predic-
Prediction tion programs of SUBA3 (http://suba3.plantenergy.uwa.edu.au/)
[36], MultiLoc2 (http://abi.inf.uni-tuebingen.de/Services/Multi
Loc2) [37], and WoLF PSORT (http://www.genscript.com/wolf-
psort.html) [38].
4 Notes
Acknowledgments
References
1. Healy SJ, Verfaillie T, Jag̈er R et al (2012) 7. Chen X, Karnovsky A, Sans MD et al (2010)
Biology of the endoplasmic reticulum. In: Molecular characterization of the endoplasmic
Agostinis P, Samali A (eds) Endoplasmic retic- reticulum: insights from proteomic studies.
ulum stress in health and disease. Springer, Proteomics 10:4040–4052
Dordrecht, pp 3–22 8. Maltman DJ, Gadd SM, Simon WJ et al (2007)
2. Kleizen B, Braakman I (2004) Protein folding Differential proteomic analysis of the endoplas-
and quality control in the endoplasmic reticu- mic reticulum from developing and germinat-
lum. Curr Opin Cell Biol 16:343–349 ing seeds of castor (Ricinus communis)
3. Howell SH (2013) Endoplasmic reticulum identifies seed protein precursors as significant
stress responses in plants. Annu Rev Plant components of the endoplasmic reticulum.
Biol 64:477–499 Proteomics 7:1513–1528
4. Papp S, Dziak E, Michalak M, Opas M (2003) 9. Qian D, Tian L, Qu L (2015) Proteomic anal-
Is all of the endoplasmic reticulum created ysis of endoplasmic reticulum stress responses
equal? The effects of the heterogeneous distri- in rice seeds. Sci Rep 5:14255
bution of endoplasmic reticulum Ca2+- 10. Barba-Espı́n G, Dedvisitsakul P, H€agglund P
handling proteins. J Cell Biol 160:475–479 et al (2014) Gibberellic acid-induced aleurone
5. Liu L, Cui F, Li Q et al (2011) The endoplas- layers responding to heat shock or tunicamycin
mic reticulum-associated degradation is neces- provide insight into the N-glycoproteome,
sary for plant salt tolerance. Cell Res protein secretion, and endoplasmic reticulum
21:957–969 stress. Plant Physiol 164:951–965
6. Wang X, Komatsu S (2016) Gel-free/label-free 11. Komatsu S, Kuji R, Nanjo Y et al (2012) Com-
proteomic analysis of endoplasmic reticulum prehensive analysis of endoplasmic reticulum-
proteins in soybean root tips under flooding enriched fraction in root tips of soybean under
and drought stresses. J Proteome Res flooding stress using proteomics techniques. J
15:2211–2227 Proteome 77:531–560
130 Xin Wang and Setsuko Komatsu
crop species, maize. Plant Cell Environ 37. Blum T, Briesemeister S, Kohlbacher O (2009)
32:1211–1229 MultiLoc2: integrating phylogeny and gene
36. Tanz SK, Castleden I, Hooper CM et al (2013) ontology terms improves subcellular protein
SUBA3: a database for integrating experimen- localization prediction. BMC Bioinformatics
tation and prediction to define the SUBcellular 10:274
location of proteins in Arabidopsis. Nucleic 38. Horton P, Park KJ, Obayashi T et al (2007)
Acids Res 41:D1185–D1191 WoLF PSORT: protein localization predictor.
Nucleic Acids Res 35:W585–W587
Chapter 10
Abstract
Dimethyl labeling is a type of stable-isotope labeling suitable for creating isotopic variants of peptides and
thus be utilized for quantitative proteomics experiments. Labeling is achieved through a reductive amina-
tion/alkylation reaction using the low-cost reagents formaldehyde and cyanoborohydride, resulting in
dimethylation of free amine groups of Lys and N-termini. Availability of isotopomeric forms of these
reagents allows for the generation of up to six different isotopic variants. Here we describe the application of
dimethylation to create two isotopic variants, light and heavy, differing in 4 Da, to label the total tryptic
digest peptides of cocoa pod extracted from healthy pods from cultivars susceptible and resistant to the
fungal disease called “frosty pod” caused by Moniliophthora roreri.
Key words Dimethyl labeling, Stable-isotope labeling, Quantitative proteomics, Plant proteomics,
Cocoa pod, Moniliophthora roreri, Fungal disease
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_10, © Springer Science+Business Media, LLC, part of Springer Nature 2020
133
134 Yoel Esteve-Sánchez et al.
intermediate. For each methyl group, the carbon and two hydro-
gens come from formaldehyde and the third hydrogen from the
reducing agent as depicted in Fig. 1; thus, combining isotopomers
of both reagents, a number of resulting isotopic dimethyl variants
can be generated. There are three commercial isotopomers of
formaldehyde, CH2O, CD2O and 13CD2O, and two of cyanobor-
ohydride, NaBH3CN and NaBD3CN, useful for dimethyl labeling.
Combinations of these reagent lead to six isotopic variants that
bring about an increase in nominal mass per –NH2 site of 28, 30,
32, 34, 34, and 36 Da. Actually, the two variants of +34 are pseudo-
isobaric differing in 0.00584 Da [10]. It allows for designing
different multiplex methods for quantitative proteomics depending
on the reagent combinations used, including the mostly used
duplex (+28, +32), triplex (+28, +32, +36) [11], fourplex (+28,
+30, +32, +34) [12], and fiveplex (+28, +30, +32, +34, +36)
[13]. This multiplex-labeling ability of dimethyl labeling compares
with the aforementioned chemical methods. Furthermore, its
much lower cost per sample (as low as €0.10 for 25 3 μg protein
triplex labeling [14] and much less for 25 2 μg duplex labeling)
makes this strategy highly cost-effective versus other labeling
options.
Dimethyl labeling has been shown compatible with major
chromatographic methods used in proteomics [15–18] and, except
for extremely large peptides bearing a single dimethyl label, there is
no significant overlapping of isotopic envelopes between isotopic
variants that might affect quantitation accuracy [19].
Dimethyl labeling has been widely applied in global protein
abundance profiling (search of human diseases biomarkers, basic
research in cellular pathways in disease models) and quantitative
PTM analysis including protein phosphorylation and glycosylation
(reviewed in [20]). In the case of microorganisms, dimethyl
136 Yoel Esteve-Sánchez et al.
labeling has been applied rather scarcely, to basic research with the
models E. coli [21] and yeast [22], and for studying performance of
microbial polysaccharide substrate degradation for biofuel produc-
tion [23]. Yet in plants, dimethyl labeling has not been applied for
global protein abundance profiling to the best of our knowledge.
Cocoa pod is a highly recalcitrant tissue for protein extraction
and exhaustive tissue washing is necessary to extract good quality
and quantity of protein [24]. It is affected by a major fungal disease
known as “frosty pod” [25] due to the attack of Moniliophthora
roreri causing large economic losses to producers in tropical regions
of the Americas and affecting the economy of a large number of
small farms [26]. Here we describe a method to apply dimethyl
labeling to carry out a global quantitative proteomic profiling of
cocoa pods with different susceptibility phenotype to this fungal
disease.
2 Materials
3 Methods
3.1 Sample 1. Grind plant tissue (0.1–0.3 g) with mortar and pestle in liquid
Preparation and nitrogen to a fine powder and place it in 2 ml microtubes.
Labeling 2. Resuspend samples in 1 ml of cold 20% TCA in acetone.
3.1.1 Protein Extraction 3. Vortex thoroughly for 30 s and centrifuge at 10,000 g at 4 C
for 5 min.
4. Discard the supernatant and repeat washing steps until it
becomes colorless.
5. Wash the pellet with 1 ml of cold 20% TCA in water twice as in
step 3.
6. Wash the pellet with 1 ml of cold 80% acetone twice as in step 3.
7. Dry the pellet at room temperature.
138 Yoel Esteve-Sánchez et al.
3.1.2 Sample Follow this stage in case you have your proteins extracted and
Precipitation quantified but resuspended in a different solution to urea 6 M.
Otherwise, skip to Sample trypsin digestion (Subheading 3.1.3).
1. Aliquot 25 μg of protein from each sample in a new tube.
2. Add 0.5 volumes of 2% (w/v) DOC and incubate on ice for
15 min.
3. Add 0.5 volumes of 24% (w/v) TCA and incubate on ice for
20–30 min.
4. Centrifuge at 10,000 g at 4 C for 10 min. Remove the
supernatant.
5. Wash twice the pellet with 80% acetone (20 C) as in step 4.
Dry the pellet at room temperature.
6. Resuspend the pellet in 25 μl of 6 M (see Note 3).
3.1.3 Sample Trypsin 1. Reduce disulfide bridges by adding 0.2 volumes of 0.2 M DTT
Digestion in 25 mM TEAB. Vortex and incubate at 37 C for 1 h.
2. Alkylate cysteines thiol groups by adding 0.7 volumes of 0.2 M
IAM in 25 mM TEAB. Vortex and incubate in the dark at room
temperature for 1 h.
Proteomics of Cocoa Pod Using Dimethyl Labelling 139
3.1.4 Sample Dimethyl 1. Dissolve samples in 100 μl of 0.1 M TEAB with vortex.
Labeling 2. Label one of the two samples with 4 μl of 4% (v/v) formalde-
hyde light variant (CH2O). On the other hand, label the sec-
ond sample with 4 μl of 4% (v/v) formaldehyde heavy variant
(CD2O).
3. Complete labeling by adding 4 μl of 0.6 M sodium cyanobor-
ohydride. Vortex and incubate at room temperature with gen-
tle shaking for 1 h at room temperature.
4. Quench labeling by adding 16 μl of 1% (v/v) ammonium for
each tube. Add this reagent in a fume hood. Vortex.
5. Add 8 μl of 5% (v/v) FA to both samples.
6. Mix samples finally in the same tube to run a single LC-MS/
MS assay later.
3.1.5 Sample Cleaning 1. For desalting samples with PepClean™ C18 spin columns
follow manufacturer’s instructions (Pierce®) (see Note 4).
Briefly, set up the resin with Activation and Equilibration solu-
tions. Load sample with Sample Buffer. Remove impurities
with Wash Solution. Finally, obtain peptides with Elution
Solution.
3.2 Sample Analysis The protocol described here is used with an Agilent 1200 UHPLC
by LC-MS/MS equipped with an AdvancedBio column (2.1 mm 250 mm,
2.7 μm particle size) coupled to an Agilent 6550 hybrid spectrom-
eter Q-TOF equipped with a Jet Stream® source.
1. 8 μl injections are programmed to ensure reproducible sample
injection in autosampler.
2. Peptide are separated in the aforementioned analytical column
using a 140-min linear gradient from 3 to 40% RPB-B flowing
at 0.4 ml/min.
3. Peptides were introduced to the mass spectrometer from the
LC by using a Jet Stream source (Agilent Technologies)
operating in positive-ion mode (3500 V) and in high sensitivity
140 Yoel Esteve-Sánchez et al.
3.3 Protein The protocol described here is based on the functionality of two
Identification and software packages Progenesis QI for proteomics (PQIp) (Nonlin-
Quantitation ear Dynamics, Waters) and Proteolabels (Omics Analytics). For
detailed handling of the software, refer to user guides and online
help (http://www.nonlinear.com/progenesis/qi-for-proteomics/
v3.0/user-guide/, http://www.omicanalytics.com/products/pro
teolabels/doc). PQIp provides a platform for MS and MS/MS
data extraction, LC-MS alignment across runs, MS feature detec-
tion, MS signal intensity normalization, and management of
MS/MS spectra-derived peptide and protein identification. The
PQIp output is imported in Proteolabels to carry out the quantita-
tive analysis based on heavy–light intensity ratio of MS feature pairs.
The previous alignment before fragment spectra identification in
PQIp allows for the propagation of identified fragment spectra
across runs, thus filling in identification gaps between runs; like-
wise, MS feature pairing in Proteolabels allows for the propagation
of identity within runs. As a result, this workflow (Fig. 2) leads to an
enhancement of identified spectra and quantified peptides and
proteins.
3.3.1 LC-MS and MS/MS 1. Import .d files generated from Agilent MassHunter Worksta-
Processing with Progenesis tion (i.e., the software implemented for data acquisition from
QI for Proteomics UHPLC-Q-TOF instrument) in PQIp (see Note 5). This soft-
ware has its own MS and MS/MS raw data extraction tool from
.d files, thus generating peak list files.
2. Select one of the runs as the reference and align to it all LC runs
making MS features overlap each other to a minimal alignment
score of 80% (see Notes 6 and 7) (Fig. 3).
3. Review peak picking automatically assigned by the software.
Add, edit, or delete peak detections manually to define couples
of precursors MS spectra (light and heavy) (see Note 8) (Fig. 4).
4. Export MS/MS spectra as .mgf files (see Note 9).
Proteomics of Cocoa Pod Using Dimethyl Labelling 141
Laboratory workflow
Cocoa Pod Protein Trypsin Stable-isotope Mix pairs of phenotypes UHPLC-MS analysis
tissue sampling extraction digestion dimethyl labeling differentially labeled (e.g. QTOF)
Bioinformatic processing
Fig. 2 Laboratory workflow and subsequent bioinformatic data treatment. Once proteins have been extracted
from plant tissue and digested, dimethyl labeling is carried out independently. Labeled samples are mixed per
pairs so that each pair contains both heavy and light dimethyl versions and the two phenotypes, and the mix is
then analyzed in a LC-MS instrument in data dependent acquisition mode. Bioinformatic processing comprises
LC peaks alignment across different runs and MS features detection. Those features belonging to light-labeled
peptides keep the expected m/z differences with the respective heavy-labeled counterpart and their MS/MS
spectra are used for database search selecting dimethyl labeling as quantitation parameter. After search
result import, association MS/MS-peptide sequence is accomplished. Quantitation by H/L ratio calculation at
peptide and protein levels is the last quantitative workflow step. Identified proteins with differential abundance
between experimental groups can be annotated by a Gene Ontology. Software packages used for each task in
this work is indicated between brackets
Fig. 3 LC-MS map showing peak signals on PQIp alignment stage. Refinement must focus on central area,
where points are gathered the most. Bottom area in the map (i.e., highest retention time and lowest m/z ratio)
may be due to nonpeptidic contaminant compounds. Hairpin-like alignment vectors are first seeded manually
and the added automatically throughout the map to make samples LC signals overlap each other at most.
Suitability degree is ranked in a color code, where green stands for good-quality alignment on depicted area.
High-quality alignment must be focus at least on central zone, where most of the peaks are present
142 Yoel Esteve-Sánchez et al.
Fig. 4 Example of a pair of differentially dimethylated peptides showed in PQIp interface. Top-left images
depict intensity and m/z values for light (m/z ¼ 638.3313, z ¼ 2) precursors’ isotopic envelope and its
chromatographic peak. Top-right images depict intensity and m/z values for heavy (m/z ¼ 640.3313, z ¼ 2)
precursors’ isotopic envelope and its chromatographic peak. Bottom-right image depicts complete signals
map in LC run to locate the showed feature. Bottom-right shows a zoomed area of the LC-MS map centered in
the selected precursor. Light and heavy precursor doublets can be seen throughout. Each precursor in the
doublet must have the same number of nonoverlapped isotopic signals detected (i.e., signals framed in the
same strip of linked squares), ideally four including the monoisotopic one. Charge state is depicted in a color
code (e.g., red peaks stand for doubly charged precursors). Reviewing must focus above all on peaks with high
MS/MS counts, good-quality chromatogram, and placed in central area of the map
3.3.2 Protein The exported file of peak lists is used for fragment ion database
Identification search using an appropriate search engine. Here we have used
MASCOT. If a different search engine is used, peak lists should
be exported in the appropriate format. This search will detect
dimethyl-labeled peptides correctly after previous LC runs refine-
ment. Label assignment is essential for protein quantitation.
1. The exported .mgf dataset is searched against a cocoa genome
encoded peptide database (https://www.cacaogenomedb.org/
databases.cacao11peptides_pub3i.aa.fasta) supplemented with
contaminant proteins selecting the following settings: enzyme
trypsin up to three missed cleavages, quantitation with
dimethylation, carbamidomethylation in cysteines as fixed
modification, oxidation in methionine as variable modification,
peptide tolerance of 20 ppm, MS/MS tolerance of 50 ppm,
monoisotopic mass, and peptide charge of 2+, 3+, and 4+. Data
format as Mascot generic is required.
2. Export results .xml files.
Proteomics of Cocoa Pod Using Dimethyl Labelling 143
Fig. 5 Experimental design setup stage in Proteolabels. Differentially labeled samples run in the same LC
analysis must be placed in the same position on its own Sample column so that they are correctly assigned to
be one replicate. In this case, the only experiment showed (i.e., Condition 1) is made up by four replicates,
each one consisting of differentially labeled peptides with light and heavy dimethyl labels. As an example,
Sample A may be the control group and Sample B would be the treated group. Four H/L ratios would be the
output
144 Yoel Esteve-Sánchez et al.
4 Notes
10. There may be peptides with the same identification score for a
single protein. In this case, those equally assigned peptides are
considered.
Acknowledgments
References
1. Hansen KC, Schmitt-Ulms G, Chalkley RJ et al cerebrospinal fluids by MS/MS using 6-plex
(2003) Mass spectrometric analysis of protein isobaric tags. Anal Chem 80:2921–2931
mixtures at low levels using cleavable 13C-iso- 9. Schmidt A, Kellermann J, Lottspeich F (2005)
tope-coded affinity tag and multidimensional A novel strategy for quantitative proteomics
chromatography. Mol Cell Proteomics using isotope-coded protein labels. Proteomics
2:299–314 5:4–15
2. Boutilier JM, Warden H, Doucette AA et al 10. Zhou Y, Shan Y, Wu Q et al (2013) Mass
(2012) Chromatographic behaviour of pep- defect-based pseudoisobaric dimethyl labeling
tides following dimethylation with H2/D2- for proteome quantification. Anal Chem
formaldehyde: implications for comparative 85:10658–10663
proteomics. J Chromatogr B 908:59–66 11. Boersema PJ, Aye TT, van Veen TA et al (2008)
3. Ong SE, Blagoev B, Kratchmarova I, Kristen- Triplex protein quantification based on stable
sen DB et al (2002) Stable isotope labeling by isotope labeling by peptide dimethylation
amino acids in cell culture, SILAC, as a simple applied to cell and tissue lysates. Proteomics
and accurate approach to expression proteo- 8:4624–4632
mics. Mol Cell Proteomics 1:376–386 12. Hsu JL, Huang SY, Chen SH (2006) Dimethyl
4. Conrads TP, Alving K, Veenstra TD et al multiplexed labeling combined with microcol-
(2001) Quantitative analysis of bacterial and umn separation and MS analysis for time course
mammalian proteomes using a combination study in proteomics. Electrophoresis
of cysteine affinity tags and 15N-metabolic 27:3652–3660
labeling. Anal Chem 73:2132–2139 13. Wu Y, Wang F, Liu Z et al (2014) Five-plex
5. Chahrour O, Cobice D, Malone J (2015) Sta- isotope dimethyl labeling for quantitative pro-
ble isotope labeling methods in mass teomics. Chem Commun (Camb)
spectrometry-based quantitative proteomics. J 50:1708–1710
Pharm Biomed Anal 113:2–20 14. Boersema PJ, Raijmakers R, Lemeer S et al
6. Ross PL, Huang YN, Marchese JN (2004) (2009) Multiplex peptide stable isotope
Multiplexed protein quantitation in Saccharo- dimethyl labeling for quantitative proteomics.
myces cerevisiae using amine-reactive isobaric Nat Protoc 4:484–494
tagging reagents. Mol Cell Proteomics 15. Hsu JL, Huang SY, Chow NH et al (2003)
3:1154–1169 Stable-isotope dimethyl labeling for quantita-
7. Choe L, D’Ascenzo M, Relkin NM (2007) tive proteomics. Anal Chem 75:6843–6852
8-plex quantitation of changes in cerebrospinal 16. Di Palma S, Raijmakers R, Heck AJ et al (2011)
fluid protein expression in subjects undergoing Evaluation of the deuterium isotope effect in
intravenous immunoglobulin treatment for zwitterionic hydrophilic interaction liquid
Alzheimer’s disease. Proteomics 7:3651–3660 chromatography separations for implementa-
8. Dayon L, Hainard A, Licker V (2008) Relative tion in a quantitative proteomic approach.
quantification of proteins in human Anal Chem 83:8352–8356
146 Yoel Esteve-Sánchez et al.
17. Wu CJ, Chen YW, Tai JH et al (2011) Quanti- 23. Tolonen AC, Haas W, Chilaka AC et al (2011)
tative phosphoproteomics studies using stable Proteome wide systems analysis of a cellulosic
isotope dimethyl labeling coupled with IMAC- biofuel-producing microbe. Mol Syst Biol
HILIC-nanoLC- MS/MS for estrogen 7:461
induced transcriptional regulation. J Proteome 24. Martı́nez-Márquez A, Morante-Carriel JA,
Res 10:1088–1097 Bru-Martinez R (2017) A comparison of tissue
18. Xu B, Wang F, Song C et al (2014) Large-scale preparation methods for protein extraction of
proteome quantification of hepatocellular car- cocoa (Theobroma cacao L.) pod. Acta Agron
cinoma tissues by a three-dimensional liquid 66:248–253
chromatography strategy integrated with sam- 25. Phillips-Mora W, Wilkinson MJ (2007) Frosty
ple preparation. J Proteome Res pod of cacao: a disease with a limited geo-
13:3645–3654 graphic range but unlimited potential for dam-
19. Cappadona S, Muñoz J, Spee WPE (2011) age. Phytopathology 97:1644–1647
Deconvolution of overlapping isotopic clusters 26. Phillips-Mora W, Aime M, Wilkinson M
improves quantification of stable isotope (2007) Biodiversity and biogeography of the
labeled peptides. J Proteome 74:2204–2209 cacao (Theobroma cacao) pathogen Moni-
20. Hsu J-L, Chen S-H (2016) Stable isotope liophthora roreri in tropical America. Plant
dimethyl labeling for quantitative proteomics Pathol 56:911–922
and beyond. Philos Trans R Soc A 27. Wang W, Scali M, Vignani R et al (2003) Pro-
374:20150364 tein extraction for two-dimensional electro-
21. Ji C, Li L (2005) Quantitative proteome analy- phoresis from olive leaf, a plant tissue
sis using differential stable isotopic labeling and containing high levels of interfering com-
microbore LCMALDIMS and MS/MS. J pounds. Electrophoresis 24:2369–2375
Proteome Res 4:734–742 28. Klammer AA, MacCoss MJ (2006) Effects of
22. Synowsky SA, van Wijk M, Raijmakers R et al modified digestion schemes on the identifica-
(2009) Comparative multiplexed mass spectro- tion of proteins from complex mixtures. J Pro-
metric analyses of endogenously expressed teome Res 5:695–700
yeast nuclear and cytoplasmic exosomes. J
Mol Biol 385:1300–1313
Chapter 11
Abstract
Proteins produce or regulate nearly every component of cells. Thus, the ability to quantitatively determine
the protein abundance and posttranslational modification (PTM) state is a critical aspect toward our
understanding of biological processes. In this chapter, we describe methods to globally quantify protein
abundance and phosphorylation state using isobaric labeling with tandem mass tags followed by phospho-
peptide enrichment.
Key words Plant proteomics, Tandem mass tags (TMT), Phosphoproteomics, Protein extraction,
Mass spectrometry
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_11, © Springer Science+Business Media, LLC, part of Springer Nature 2020
147
148 Gaoyuan Song et al.
Protein extraction
LC-MS/MS LC-MS/MS
Phospho-peptide
acquisition acquisition
enrichment
m/z m/z
Fig. 1 Quantitative proteomics workflow using tandem mass tags. Proteins from up to 11 samples are
extracted and digested into peptides using phenol-FASP. Peptides from each sample are then independently
labeled with a TMT reporter. Following labeling, the samples are pooled into a single tube. The pooled
TMT-labeled samples can directly be used to quantify protein abundance. The pooled TMT-labeled samples
can also be subjected to phosphopeptide (or other PTM) enrichment prior to LC-MS/MS
150 Gaoyuan Song et al.
2 Materials
3 Methods
3.1 Protein 1. Grind 0.1 g plant tissue in liquid nitrogen into fine power using
Extraction a ceramic mortar and pestle for 15 min (see Note 3).
2. Add 5 volumes (tissue–buffer; w:v) of Tris buffered phenol
pH 8 to the ground tissue and vortex for 1 min. For example,
for 0.1 g tissue add 5 ml of buffer.
3. Add 5 volumes (tissue–buffer; w:v) extraction buffer with 1
phosphatase inhibitor mix to the phenol–tissue solution, vortex
1 min.
4. Centrifuge at 13,000 g for 10 min at 4 C (see Note 4).
5. Transfer the phenol solution (top layer) to a new tube, add
same volume buffered phenol pH 8 as step 2 to the aqueous
phase, and vortex for 1 min.
6. Centrifuge at 13,000 g for 10 min at 4 C (see Note 4).
7. Transfer the phenol phase and combine with the phenol phase
from step 5.
8. Add 5 volumes of prechilled methanol with 0.1 M ammonium
acetate to the phenol solution, vortex to mix well.
9. Incubate at 80 C for 1 h.
10. Centrifuge at 4500 g, for 10 min at 4 C.
11. Discard the supernatant, add same volume of prechilled meth-
anol with 0.1 M ammonium acetate as step 8 to the tube.
12. Resuspend the pellet with probe sonication, then keep at
20 C for 30 min.
13. Centrifuge at 4500 g, for 10 min at 4 C.
14. Repeat once steps 11–13.
15. Add 5 ml prechilled 70% methanol to the tube, resuspend the
pellet with probe sonication, then keep at 20 C for 30 min to
overnight (see Note 5).
16. Centrifuge at 4500 g for 10 min at 4 C.
17. Discard the supernatant, remove the residual methanol by
speedvac at room temperature, resuspend in resuspension
buffer, and measure the protein concentration using the Brad-
ford assay.
152 Gaoyuan Song et al.
3.3 C18 Desalting 1. Setup a C18 column on Vacuum manifold and place a 15 ml
conical tube in vacuum chamber to hold flow through.
2. Rinse the column (for a 100 mg C18 column) with 1 ml
MeOH followed by 1 ml Water; repeat with other 1 ml water.
3. Load digested sample at a flow rate of less than 1 ml per min (see
Note 7).
4. Wash the column with 1 ml water, repeat once with other 1 ml
water.
5. Elute peptides stepwise with 250 μl 20% acetonitrile, 250 μl
40% acetonitrile, and 500 μl 80% acetonitrile.
6. Speedvac the peptides solution until almost dry.
Quantitative Phosphoproteomics Using TMT 153
3.4 TMT Labeling 1. Resuspend 100 μg vacuum dried peptides, per sample, with
100 μl of 0.2 M HEPES, pH 8.5 to a final concentration of
1 μg/μl. Vortex for 10 min to dissolve well (see Note 8).
2. Remove TMT labels from the freezer and warm to room
temperature.
3. Resuspend TMT labels with 60 μl of dry acetonitrile, vortex,
and leave at room temperature for 5 min. Then spin to collect
reagent at the bottom of the vial.
4. When labeling 100 μg of peptides, aliquot 10 μl of the resus-
pended TMT labeling reagent into a tube and add 30 μl of
ACN. The remaining 50 μl of TMT labeling reagent and be
used to label additional samples in parallel or vacuum dried for
later use (see Note 9).
5. Add TMT labels (i.e., 40 μl) to the resuspended peptides
(100 μl) with a ratio 0.4:1 (ACN–HEPES; v:v), vortex to mix
well, and leave at room temperature for 1–2 h (see Note 10).
6. Add 8 μl 5% hydroxylamine to each 140 μl TMT labeling
solution to quench the reaction, vortex to mix well, and leave
at room temperature for 15 min.
7. Pool the TMT-labeled peptides to a single tube and vortex
well, speedvac to almost dry (see Note 11).
3.5 Phosphopeptide 1. Resuspend Titansphere Phos-TiO2 beads with wash and bind-
Enrichment ing buffer (prepare 6 mg beads for each 1 mg peptides) by
vortexing, centrifuge at 3000 g for 1 min, and then remove
the supernatant. Repeat this step for a total three times (see
Note 12).
2. Resuspend the pooled TMT-labeled peptides with wash and
binding buffer to a final concentration of 1 μg/μl and vortex to
dissolve well.
3. Transfer resuspended peptides to the tube with TiO2 beads
(4 mg beads for each 1 mg peptides) and rotate at room
temperature for 1 h.
4. Centrifuge at 3000 g for 1 min, save the beads, and transfer
supernatant to a new tube containing the second aliquot of
TiO2 beads (2 mg beads for each 1 mg peptides), rotate at
room temperature for 1 h.
5. Centrifuge at 3000 g for 1 min; save the beads (see Note 13).
154 Gaoyuan Song et al.
4 Notes
References
1. Wu X, Gong F, Wang W (2014) Protein extrac- 9. Xie X, Kang H, Liu W et al (2015) Compre-
tion from plant tissues for 2DE and its applica- hensive profiling of the rice ubiquitome reveals
tion in proteomic analysis. Proteomics the significance of lysine ubiquitination in
14:645–658 young leaves. J Proteome Res 14:2017–2025
2. Wu X, Xiong E, Wang W et al (2014) Universal 10. Song G, Walley JW (2016) Dynamic protein
sample preparation method integrating tri- acetylation in plant–pathogen interactions.
chloroacetic acid/acetone precipitation with Front Plant Sci 7:421
phenol extraction for crop proteomic analysis. 11. Aguilar-Hernández V, Kim D-Y, Stankey RJ
Nat Protoc 9:362–374 et al (2017) Mass spectrometric analyses reveal
3. Song G, Hsu PY, Walley JW (2018) Assess- a central role for ubiquitylation in remodeling
ment and refinement of sample preparation the Arabidopsis proteome during photomor-
methods for deep and quantitative plant prote- phogenesis. Mol Plant 10:846–865
ome profiling. Proteomics 18:1800220 12. Liu S, Yu F, Yang Z et al (2018) Establishment
4. Finkemeier I, Laxa M, Miguet L et al (2011) of dimethyl labeling-based quantitative acetyl-
Proteins of diverse function and subcellular proteomics in Arabidopsis. Mol Cell Proteo-
location are lysine acetylated in Arabidopsis. mics 17:1010–1027
Plant Physiol 155:1779–1790 13. Walley JW, Shen Z, McReynolds MR et al
5. Silva-Sanchez C, Li H, Chen S (2015) Recent (2018) Fungal-induced protein hyperacetyla-
advances and challenges in plant phosphopro- tion in maize identified by acetylome profiling.
teomics. Proteomics 15:1127–1141 Proc Natl Acad Sci 115:210–215
6. Rao RSP, Thelen JJ, Miernyk JA (2014) Is 14. Kelley DR (2018) E3 ubiquitin ligases: key
Lys-N(ε)-acetylation the next big thing in regulators of hormone signaling in plants.
post-translational modifications? Trends Plant Mol Cell Proteomics 17:1047–1054
Sci 19:550–553 15. Pinkse MWH, Uitto PM, Hilhorst MJ et al
7. Hartl M, Füßl M, Boersema PJ et al (2017) (2004) Selective isolation at the femtomole
Lysine acetylome profiling uncovers novel his- level of phosphopeptides from proteolytic
tone deacetylase substrate proteins in Arabi- digests using 2D-NanoLC-ESI-MS/MS and
dopsis. Mol Syst Biol 13:949 titanium oxide precolumns. Anal Chem
8. Fang X, Chen W, Zhao Y et al (2015) Global 76:3935–3943
analysis of lysine acetylation in strawberry 16. Nakagami H, Sugiyama N, Mochida K et al
leaves. Front Plant Sci 6:739 (2010) Large-scale comparative
156 Gaoyuan Song et al.
Abstract
The proteomics of orphan, unsequenced, and recalcitrant organisms is highly challenging. This is the case of
the typical Mediterranean forest tree Holm oak (Quercus ilex). Proteomics has moved on quite fast from the
classical 2DE-MS to shotgun or gel-free/label-free approaches, with the latter possessing a series of
advantages over the gel-based ones. Before translating proteomics data into biological knowledge, a few
questions as to the analysis technique itself have to be answered including its confidence in protein
identification and quantification. It is important to clearly differentiate a hit from an ortholog and gene
product identification, with the difference depending on the database and the confidence parameters (score,
number of peptides, and coverage). With respect to quantification and for comparative purposes it is
important to make sure that we are within the linear dynamic range. For that, a calibration curve based
on mass spectrometry analysis of a serial dilution of the extracts should be performed. Thus, just by
validating our data with the aim of improving the quality of the analysis enables us to give a correct
interpretation of our results. We show a method that aims to improve the confidence in protein identifica-
tion and quantification in the orphan species Q. ilex using a shotgun proteomics approach.
Key words Holm oak, Orphan species, Plant proteomics, Shotgun, Confidence parameters
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_12, © Springer Science+Business Media, LLC, part of Springer Nature 2020
157
158 Isabel Gómez-Gálvez et al.
2 Material
2.1 Plant Material In the present experiment, a mixture of different organs (embryo,
cotyledons, leaves, and roots) from Q. ilex are used (see Note 1).
2.2 Protein If not specified, reagents are of analytical grade. Solutions are
Extraction prepared in distilled water. Stock solutions are stored at 20 C,
unless otherwise stated.
Optimizing Shotgun Proteomics Analysis in Orphan Plant Species 159
2.3 SDS 1. Running buffer: for a 10 stock mix 30.2 g Tris base, 144 g of
Polyacrylamide Gel glycine, and 1 g of SDS; add H2O up to 1 L.
2. Resolving gel: amount required for 1 mini-Protean (Bio-Rad)
of 12% acrylamide gel: mix 3 mL of 40% (w/v) acrylamide–
bisacrylamide solution (19:1), 2.5 mL 1.5 M Tris–HCl,
pH 8.8, 100 μL of 10% (w/v) SDS, and 4.35 mL of
H2O. Add 50 μL of 10% (w/v) APS and 5 μL of TEMED for
starting the polymerization. Add the mix to the 7 cm gel
cassette (see Note 2). Let the gels polymerize for 1 h.
3. Stacking gel: amount required for 1 mini-Protean (Bio-Rad) of
4% acrylamide gel: mix 0.25 mL of 40% (w/v) acrylamide–
bisacrylamide solution (19:1), 0.63 mL 0.5 M Tris–HCl,
pH 6.8, 25 μL of 10% (w/v) SDS, and 1.59 mL of
H2O. Add 50 μL of 10% (w/v) APS and 5 μL of TEMED for
starting the polymerization (see Note 3).
4. Laemmli buffer (5): 0.3 M Tris–HCl (pH 6.8), 10% SDS,
25% β-mercaptoethanol, 0.005% bromophenol blue, 50%
glycerol.
5. Coomassie stain solution: 40% (v/v) methanol, 10% (v/v)
acetic acid, 0.1% (w/v) Coomassie R-250, in distilled water
(see Note 4).
6. Destaining solution: 40% (v/v) methanol, 10% (v/v) acetic
acid in distilled water.
2.6 Solutions for LC- 1. Mobile phase A: 0.1% (v/v) formic acid in water.
MS/MS 2. Mobile phase B: 0.1% (v/v) formic acid in 80% acetonitrile.
2.7 Equipment and 1. Proteome Discoverer (version 2.1, Thermo Scientific), using
Software the SEQUEST algorithm to perform the search.
2. Viridiplantae database obtained from UniProtKB and a
species-specific database of Quercus ilex developed from the
transcriptome [8].
3. MERCATOR Software for protein classification into MapMan
functional plant categories (http://www.plabipd.de/portal/
mercator-sequence-annotation) complemented with the use
of the KEGG database (https://www.genome.jp/kegg/path
way.html).
3 Methods
3.1 Protein The proteins are extracted by using the method of Wang et al. [9]
Extraction by TCA/ with some modifications.
Acetone/Phenol 1. Add 1 mL of solution 1 precooled at 20 C from protein
extraction protocol to 200 mg of plant material previously
pulverized in liquid nitrogen.
2. Sonicate 10 min at maximum speed (P Selecta Ultrasons) (see
Note 5).
3. Centrifuge at 14,000 g for 10 min (4 C) and discard the
supernatant.
4. Add 1 mL of solution 2 precooled at 20 C and solubilize the
pellet.
5. Centrifuge at 14,000 g for 10 min (4 C) and discard the
supernatant.
6. Add 1 mL of solution 3 precooled at 20 C and solubilize the
pellet.
Optimizing Shotgun Proteomics Analysis in Orphan Plant Species 161
3.2 SDS-PAGE 1. Prepare a mixture of different holm oak plant organs (embryo,
Electrophoresis seed, root, leaf) extracts with 300 μg of protein each.
2. Prepare a serial dilution of proteins in the range of 1–200 μg
BSA equivalents, mixed with Laemmli buffer and heat them to
95 C for 5 min.
3. Perform a SDS-PAGE in a 12% acrylamide gel. Before loading
the protein, mark a line 1 cm below the staking-resolving
interphase. Load the samples and run the electrophoresis at
80 V, constantly until the bromophenol blue reaches the
marked line.
4. Stop the electrophoresis and immediately transfer the gel to a
plate with Coomassie staining solution [11]. Incubate in an
orbital shaker for 1 h.
5. Distain the gels in distaining solution for 80 min. Finish
bleaching with distilled water.
6. Cut the unique bands from the gel, all of them in the same way.
Transfer gel pieces to individual 1.5 mL tubes and cover them
162 Isabel Gómez-Gálvez et al.
with distiller water. At this point the gel pieces can be stored at
4 C.
3.3 Sample 1. Cut the gel bands with a scalpel into small fragments (around
Preparation for MS 1 mm3). Transfer the gel pieces to a 1.5 mL low binding tube.
Analysis 2. Add 1 mM ammonium bicarbonate/acetonitrile (1:1) (v/v).
3.3.1 Protein Digestion Mix equal volumes of washing solution and acetonitrile. Stir for
30 min at 37 C. Repeat this step.
3. Remove the supernatant and add acetonitrile. Incubate for
5 min at room temperature and remove the supernatant.
4. For the reduction and alkylation of the proteins, add 20 mM
DTT/100 mM ammonium bicarbonate. Subsequently add
55 mM iodoacetamide/100 mM ammonium bicarbonate.
Incubate for 30 min at room temperature in each solution.
5. Wash twice with 25 mM ammonium bicarbonate and with
25 mM ammonium bicarbonate/acetonitrile 50%.
6. Digest with the trypsin solution (12.5 ng/μL trypsin) and
incubate overnight with shaking at 37 C.
3.3.2 Peptides Extraction 1. Spin the tubes and transfer the supernatant to a new tube “A”.
2. Add 150 μL (enough volume to cover the gel) of 20% acetoni-
trile/1% formic acid and incubate for 5 min at room tempera-
ture. Sonicate for 3 min. Transfer the peptide solution to the
tube A.
3. Step 2 is repeated two more times, with 150 μL of, respectively,
50% and 90% acetonitrile/1% formic acid. Transfer the final
solutions to the tube A.
4. Dry out in SpeedVac. Keep at 20 C or 80 C for long term
storage.
5. Resuspend the samples in 100 μL of 0.1% formic acid with
sonication.
3.3.3 Peptides Desalting 1. Activate the C18 column with 0.4 mL 70% acetonitrile/0.1%
trifluoroacetic acid. Then wash it with 0.5 mL 0.1%
trifluoroacetic acid.
2. Add the sample to the column and keep it for 5 min at
RT. Collect the flow through and repeat this step twice (see
Note 8).
3. Wash the columns with 0.5 mL 0.1% trifluoroacetic acid (see
Note 9).
4. Elute the peptides with 100 μL of 70% acetonitrile three times
to a final volume of 300 μL and dry in a SpeedVac.
Optimizing Shotgun Proteomics Analysis in Orphan Plant Species 163
3.4 nLC-MS/MS 1. Prepare a serial dilution of the recovered peptides from the
concentrations of protein loaded in the gel (see Note 10).
2. Load the samples onto a nano-LC-MS-UHPLC Ultimate
3000 using a flow of 300 nL/min and a gradient of B in A
from 4 to 35% (120 min), 35–55% (6 min), and 55–90%
(3 min). Finally, elute the column with 90% of B over 8 min
before wash and reequilibration with a total time of chroma-
tography of 150 min.
3. The eluent from the column is introduced in the electrospray
ionization source of an MS/MS instrument (Orbitrap Fusion,
Thermo Fisher Scientific) operating in positive ion mode.
4. Perform survey scans of peptide precursors from 400 to
1500 m/z at 120 K resolution (at 200 m/z) with a 4 105
ion count target. Tandem MS by isolation at 1.2 Da with the
quadrupole, CID fragmentation with normalized collision
energy of 35, and rapid scan MS analysis in the ion trap.
3.5 Protein 1. Use the MS2 spectra for identification, using the SEQUEST
Identification algorithm with the Proteome Discover software (version 2.1.,
Thermo Scientific). The following parameters were set: theo-
retical tryptic digestion allowing up to one missed cleavage,
carbamidomethylation of cysteines as fixed modification and
oxidation of methionine as a variable modification. Precursor
mass tolerance of 10 ppm and product ions search at 0.1 Da
tolerance. Validate peptide spectral matches (PSM) using per-
colator based on q-values at a 1% FDR. To group peptide
identifications into proteins use the law of parsimony and filter
to 1% FDR and a minimum XCorr of 2.
2. Use two databases for protein identification: one generic, Vir-
idiplantae database from UniprotKB and a custom species-
specific Q. ilex database developed from the transcriptome [8].
3. Assign functional categorization of proteins with the MERCA-
TOR software and the KEGG database.
3.6 Protein 1. Protein quantification by peak area: using the peak area values
Quantification given for each protein, following the normalization of data
with the total sum of the peak area values per each sample.
2. Protein quantification by proteotypic peptides: using the proteo-
typic peptides (specific of protein) from different proteins and
representing the peak area intensity in each of the serial dilu-
tions studied (see Note 11).
164 Isabel Gómez-Gálvez et al.
a b
8000 2500
7000
Number of proteins
2000
Number of peptides
6000
5000 Viridiplantae Viridiplantae
1500
4000 Quercus Quercus
1000
3000
2000 500
1000
0
0 0 200 400 600 800 1000
0 200 400 600 800 1000
Total amount of protein (ng)
Total amount of protein (ng)
c d
Quercus UniprotKB- Quercus UniprotKB-
ilex Viridiplantae ilex Viridiplantae
Fig. 1 Number of peptides (a) and proteins (b) identified, using the UniprotKB-Viridiplantae and Quercus ilex
databases. The values correspond to the different amounts of proteins used, in the range of 10–1000 ng. Venn
diagram showing the number of peptides (c) and proteins (d) identified from the two databases used
(UniprotKB-Viridiplantae and Quercus ilex), corresponded to the 600 ng of protein sample dilution
4 Anticipated Results
Table 1
Number of peptides and proteins identified. The values correspond to a serial dilution used in the
range of 10–1000 ng of protein. The UniprotKB (Viridiplantae) and Quercus ilex databases were used,
showing orthologs, for the first, and gene products, for the second one
5 Notes
Fig. 2 Relative number of proteins identified in the sample dilution of 600 ng,
grouped according to the confidence parameter ranges (% coverage, score
value, and number of peptides)
10000000
9000000
8000000
Peak area (Intensity)
7000000 [K].AEYDESGPSIVHR.[K]
6000000
[K].AGEDADTLGLTGHER.[Y]
5000000
[K].AGIVASLDELVK.[E]
4000000
[K].GAPVVAAPAK.[E]
3000000
[K].ILDGPPGTAER.[A]
2000000
1000000 [K].VGNFLNR.[F]
0
0 200 400 600 800 1000
ng of protein
Fig. 3 Peak area for several proteotypic peptides, determined at the different protein dilutions. The selected
peptides corresponded to the following proteins, from top to bottom: actin-97, aconitate hydratase, disulfide-
isomerase A6, elongation factor 1-beta 1, flowering locus K homology domain and UTP-glucose-1phosphate
uridylyltransferase
Acknowledgments
References
1. Barbier-Brygoo H, Jouard J (2004) Focus on quantitative proteomics. J Biol Chem 286
plant proteomics. Plant Physiol Biochem (29):25443–25449
42:913–917 8. Guerrero-Sanchez VM, Maldonado-Alconada
2. Cánovas FM, Dumas-Gaudot E, Recorbet G AM, Amil-Ruiz F, Jorrin-Novo J (2017)
et al (2004) Plant proteome analysis. Proteo- Holm oak (Quercus ilex) transcriptome. De
mics 4:285–298 novo sequencing and assembly analysis. Front
3. Jorrı́n-Novo JV (2014) Plant proteomics: Mol Biosci 4:70
methods and protocols. In: Jorrin-Novo JV, 9. Wang W, Vignani R, Scali M, Mauro C (2006)
Komatsu S, Weckwerth W, Wienkoop S (eds) A universal and rapid protocol for protein
Methods in molecular biology, vol 1072. extraction from recalcitrant plant tissues for
Humana Press, Totowa, pp 3–13 proteomic analysis. Electrophoresis
4. Valledor L, Wolfram W (2014) Standardization 27:2782–2786
of data processing and statistical analysis in 10. Bradford MM (1975) A rapid and sensitive
comparative plant proteomics experiment. In: method for the quantitation of microgram
Jorrin-Novo JV, Komatsu S, Weckwerth W, quantities of protein utilizing the principle of
Wienkoop S (eds) Plants proteomics: methods protein-dye binding. Anal Biochem
and protocols. Methods in molecular biology, 72:248–254
vol 1072. Humana Press, Totowa, pp 347–358 11. Neuhoff V, Arold N, Taube D, Ehrhardt W
5. Romero-Rodriguez MC, Pascual J, Valledor L, (1988) Improved staining of proteins in poly-
Jorrin-Novo J (2014) Improving the quality of acrylamide gels including isoelectric focusing
protein identification in non-model species. gels with clear background at nanogram sensi-
Characterization of Quercus ilex seed and tivity using Coomassie Brilliant Blue G-250
Pinus radiata needle proteomes by using and R-250. Electrophoresis 9:255–262
SEQUEST and custom databases. J Proteome 12. Bonner FT, Vozzo JA (1987) Seed biology and
105:85–91 technology of Quercus. General technical
6. Zhu W, Smith JW, Huang CM (2010) Mass report, SO-66. U.S. Dept. of Agriculture, For-
spectrometry-based label-free quantitative pro- est Service, Southern Forest Experiment Sta-
teomics. J Biomed Biotechnol:1–6. https:// tion, New Orleans, LA, p 21
doi.org/10.1155/2010/840518 13. Hoagland DR, Arnon DI (1950) The water-
7. Xie F, Liu T, Qian WJ et al (2011) Liquid culture method for growing plants without
chromatography-mass spectrometry-based soil. California Agricultural Experiment Sta-
tion, Circular-347
Chapter 13
Abstract
Most quantitative proteomics experiments either target a limited number of selected proteins for quantifi-
cation or quantify proteins on a broad scale in an untargeted manner. However, we recently demonstrated
that experiments that have both targeted and untargeted components can be particularly advantageous.
Using a combined targeted and untargeted liquid chromatography–tandem mass spectrometry data acqui-
sition strategy termed TDA/DDA (shorthand for targeted data acquisition/data-dependent acquisition),
which we applied to a model quantitative plant proteomics experiment performed on Arabidopsis, we
demonstrated improved quantification of both targeted and untargeted proteins relative to purely untar-
geted experiments performed using conventional data-dependent acquisition (Hart-Smith et al. Front
Plant Sci 8:1669, 2017). This suggests that many quantitative proteomics datasets earmarked for collection
using data-dependent acquisition are likely to benefit from the use of TDA/DDA instead.
This chapter describes how TDA/DDA liquid chromatography–tandem mass spectrometry methods can
be created on commonly used mass spectrometric instrument platforms. It described how, using freely
available software, tandem mass spectrometry inclusion lists designed to target proteins of hypothesized
interest can be generated. Best practice implementation of these inclusion lists in TDA/DDA strategies is
then described. Relative to conventional data-dependent acquisition, the liquid chromatography–tandem
mass spectrometry methods created using these guidelines increase the chances of quantifying targeted
proteins and can produce widespread improvements in the reproducibility of untargeted protein quantifi-
cation, without compromising the total numbers of proteins quantified. They are compatible with different
quantitative proteomics methodologies, including metabolic labeling, chemical labeling and label-free
approaches, and can be used to create tailored assay libraries to aid the interpretation of quantitative
proteomics data collected using data-independent acquisition.
Key words Quantitative proteomics, Shotgun proteomics, Inclusion lists, Targeted data acquisition
(TDA), Data-dependent acquisition (DDA)
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_13, © Springer Science+Business Media, LLC, part of Springer Nature 2020
169
170 Gene Hart-Smith
Fig. 1 TDA/DDA LC-MS/MS methods employ TDA inclusion lists for the hypothesis driven selection of peptides
for MS/MS (green arrows), followed by DDA for the non–hypothesis driven selection of peptides for MS/MS
(blue arrows). An illustrative MS survey scan is shown with green signals representing peptides derived from
targeted proteins, and black signals representing other peptides. In this example, up to 5 TDA events and
5 DDA events are employed per MS/MS scan cycle
2 Materials
2.1 Creation 1. Benchtop computer with Skyline [17] installed (see Note 1).
of Inclusion Lists 2. Benchtop computer with mass spectrometry instrument soft-
and TDA/DDA LC-MS/ ware (e.g., Xcalibur if using Thermo Scientific equipment)
MS Methods installed (see Note 2).
3 Methods
3.2 Creation of TDA/ 1. Open the mass spectrometry instrument software and navigate
DDA LC-MS/MS to the method editor. For example, if using Thermo Scientific
Methods equipment, open Xcalibur and click: Roadmap View ! Instru-
ment Setup.
2. Open a preoptimized DDA LC-MS/MS method to use as the
starting point for the new TDA/DDA LC-MS/MS method.
This preoptimized DDA LC-MS/MS method should contain
appropriate parameters for the following: survey scans, precur-
sor ion selection, and MS/MS (elaborated on in step 3,
below).
3. Create a TDA component to the LC-MS/MS method, which
should be prioritized over the DDA component. Specify how
many TDA and DDA events to perform per MS/MS scan cycle
(see Note 8). The steps required to perform these actions will
be dependent on the mass spectrometry instrument software.
For many instruments it will be possible to perform these
actions by modifying the DDA LC-MS/MS method opened in
step 2. This is, for example, possible on Thermo Scientific
Fusion (see Note 9) or Q Exactive (see Note 10) series
instruments.
For other instruments, a new method will need to be
created. This may, for example, be necessary on LTQ Orbitrap
series instruments (see Note 11). Ensure that any such new
method is populated with parameters found in the preopti-
mized DDA LC-MS/MS method (see Note 12).
4. Import the inclusion list created in Subheading 3.1 and specify
that the list should be used with a 10 ppm mass tolerance. The
174 Gene Hart-Smith
4 Notes
References
1. Hart-Smith G, Reis RS, Waterhouse PM et al proteomics strategy. Nat Biotechnol
(2017) Improved quantitative plant proteo- 28:710–721
mics via the combination of targeted and untar- 3. Gillet LC, Leitner A, Aebersold R (2016) Mass
geted data acquisition. Front Plant Sci 8:1669 spectrometry applied to bottom-up proteo-
2. Domon B, Aebersold R (2010) Options and mics: entering the high-throughput era for
considerations when selecting a quantitative
178 Gene Hart-Smith
hypothesis testing. Annu Rev Anal Chem 11. Schmidt A, Gehlenborg N, Bodenmiller B et al
9:449–472 (2008) An integrated, directed mass spectro-
4. Picotti P, Bodenmiller B, Mueller LN et al metric approach for in-depth characterization
(2009) Full dynamic range proteome analysis of complex peptide mixtures. Mol Cell Proteo-
of S. cerevisiae by targeted proteomics. Cell mics 7:2138–2150
138:795–806 12. Gillet LC, Navarro P, Tate S et al (2012) Tar-
5. Schmidt A, Claassen M, Aebersold R (2009) geted data extraction of the MS/MS spectra
Directed mass spectrometry: towards generated by data-independent acquisition: a
hypothesis-driven proteomics. Curr Opin new concept for consistent and accurate prote-
Chem Biol 13:510–517 ome analysis. Mol Cell Proteomics 11:O111.
6. Picotti P, Aebersold R (2012) Selected reaction 016717
monitoring–based proteomics: workflows, 13. Kalli A, Smith GT, Sweredoski MJ et al (2013)
potential, pitfalls and future directions. Nat Evaluation and optimization of mass spectro-
Methods 9:555–566 metric settings during data-dependent acquisi-
7. Peterson AC, Russell JD, Bailey DJ et al (2012) tion mode: focus on LTQ-Orbitrap mass
Parallel reaction monitoring for high resolu- analyzers. J Proteome Res 12:3071–3086
tion and high mass accuracy quantitative, tar- 14. Reis RS, Hart-Smith G, Eamens AL, Wilkins
geted proteomics. Mol Cell Proteomics MR, Waterhouse PM (2015) Gene regulation
11:1475–1488 by translational inhibition is determined by
8. Domon B, Bodenmiller B, Carapito C et al Dicer partnering proteins. Nat Plants 1:1–6
(2009) Electron transfer dissociation in con- 15. Reis RS, Hart-Smith G, Eamens AL et al
junction with collision activation to investigate (2015) MicroRNA regulatory mechanisms
the Drosophila melanogaster phosphopro- play different roles in Arabidopsis. J Proteome
teome. J Proteome Res 8:2633–2639 Res 14:4743–4751
9. Hart-Smith G, Low JK, Erce MA et al (2012) 16. Arsova B, Kierszniowska S, Schulze WX (2012)
Enhanced methylarginine characterization by The use of heavy nitrogen in quantitative pro-
post-translational modification-specific tar- teomics experiments in plants. Trends Plant Sci
geted data acquisition and electron-transfer 17:102–112
dissociation mass spectrometry. J Am Soc 17. MacLean B, Tomazela DM, Shulman N et al
Mass Spectrom 23:1376–1389 (2010) Skyline: an open source document edi-
10. Savitski MM, Fischer F, Mathieson T et al tor for creating and analyzing targeted proteo-
(2010) Targeted data acquisition for improved mics experiments. Bioinformatics 26:966–968
reproducibility and robustness of proteomic 18. Consortium U (2014) UniProt: a hub for pro-
mass spectrometry assays. J Am Soc Mass Spec- tein information. Nucleic Acids Res 43:
trom 21:1668–1679 D204–D212
Chapter 14
Abstract
Phosphorylation is a posttranslational reversible modification related to signaling and regulatory mechan-
isms. Protein phosphorylation is linked to structural changes that modulate protein activity, interaction, or
localization and therefore the cell signaling pathways. The use of techniques for phosphoprotein enrich-
ment along with mass spectrometry has become a powerful tool for the characterization of signal transduc-
tion in model organisms. However, limited efforts have focused on the establishment of protocols for the
analysis of the phosphoproteome in nonmodel organisms such as tropical fruits. This chapter describes a
potential pipeline for sample preparation and enrichment of phosphorylated proteins/peptides before MS
analysis of peels of some species of tropical fruits.
1 Introduction
Arabidopsis thaliana has been the plant model for studying several
biological processes including the application of omics tools and
the massive profiling of posttranslational modifications (PTM)
[1]. The establishment of proteomics pipelines in A. thaliana
represents an invaluable tool in the study of molecular mechanisms
of plants. In most of the cases, the extrapolation of these proteo-
mics protocols to other plant species is not feasible. In fact, in
nonmodel plant species, such as tropical fruits with recalcitrant
tissue and limited genomics information exponentially increase
the complexity of proteomics protocols. Working with fruit tissues
such as peels has some pitfalls due to the presence of a thick cuticle,
cell wall, lining, proteases, storage polysaccharides, phenolic com-
pounds, lipids, and secondary metabolites, and the high dynamic
range of the proteome. Therefore, a protocol should be optimized
for each plant species and tissue.
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_14, © Springer Science+Business Media, LLC, part of Springer Nature 2020
179
180 Janet Juarez-Escobar et al.
Fig. 1 Schematic representation of a phosphoproteomics workflow. Strategies often combine two orthogonal
separation modes or multiple enrichment techniques to enhance phosphoproteome coverage. Colored
diagram is the representation of the methodology presented in this chapter
l SCX (strong cation exchange; please note that this may require a
further desalting step):
– POROS™ XS strong cation exchange resin (Thermo).
– POROS™ XS resin (Thermo).
– Pierce SCX columns (Thermo).
– Amberlite IR-120 (Merck).
– Dowex 50 WX-8 (Merck).
– Amberlyst 15 (Merck).
– Dowex 50 WX-4 (Merck).
– Polysulfoethyl aspartamide.
– RESOURCE S column (GE Healthcare, Sweden).
1.2 Enrichment The most efficient enrichment strategy is carried out after peptide
Methods digestion. It should be considered that not all proteins can be later
identified using the fragmented peptides, since the nonphosphory-
lated “part” is lost during the enrichment step [13]. Phosphopro-
tein enrichment alone (without prefractionation) is less used since
the complexity of vegetal samples; it is also is preferred when
working with proteins associated with subcellular fractions or
isolated organelles [14]. Phosphopeptide enrichment strategies
Phosphoproteomic in Peels 183
Table 1
Affinity chromatography-based techniques commercially available
Stationary
phase Method Component Examples
Metals IMAC Fe, Ga, Al, Zr HisPur™ Cobalt Superflow Agarose, by
Ni Thermo Fisher
Ni-NTA spin column (P/N 31014), Qiagen
MOAC TiO2 Titansphere Phos-TiO kits are available from GL
Sciences Inc. (Torrance, CA, USA)
Antibodies Immunoprecipitation Phosphotyrosine pTyr-100 (Cell Signaling Technology)
peptides
1.4 Plant There are many tools developed as predictors for phosphorylation
Phosphoproteome sites classified under kinase-specific or non–kinase-specific queries.
Tools and Databases For example, in the case of kinase-specific tools, users should pro-
vide protein sequence and the type of kinase under consideration.
Databases of phosphorylation sites and prediction tools are sum-
marized in Table 2.
Table 2
Prediction tools and databases of phosphorylation sites in plant proteomics
(continued)
Phosphoproteomic in Peels 185
Table 2
(continued)
(continued)
186 Janet Juarez-Escobar et al.
Table 2
(continued)
(continued)
Phosphoproteomic in Peels 187
Table 2
(continued)
2 Materials
2.7 Other Materials 1. Gel documentation system (e.g., Gel Doc™ XR System,
Bio-Rad®, and software Image Lab™).
2. Centrifugal vacuum concentrator (e.g., CentriVap,
LABCONCO®).
3. Liquid chromatography–mass spectrometry (LC-MS) system.
3 Methods
3.1 Tissue Protein 1. Grind approximately 3 g of fruit tissue to fine powder in liquid
Extraction nitrogen, adding PVPP (1:10, w/w) while grinding. Keep into
a 50-mL tube. If not processed after sampling, store at 80 C
until analysis.
2. Add 6 mL of extraction buffer to every 3 g of fruit tissue.
3. Add 6 mL Tris-saturated phenol pH 8.0 and incubate the
samples with agitation in crushed ice for 30 min.
4. Centrifuge at 10,000 g for 30 min at 4 C and transfer each
upper phenolic phase to a new centrifuge tube.
5. Add 4 volumes of ice-cold acetone with 0.07% (v/v)
β-mercaptoethanol for soluble protein precipitation at
20 C overnight.
6. Centrifuge at 3000 g for 30 min at 4 C.
7. Remove the supernatant after centrifugation, and let the pellet
dry under a laboratory fume hood.
8. Resuspend the dried pellet with 350 μL of phosphate-buffered
saline (PBS) 1 (Sigma) with SDS (1%). Vortex and sonicate
for 20 min.
9. Centrifuge at 15,000 g for 10 min at RT. Transfer the
supernatant to a new tube.
10. Measure protein concentration of the extract.
11. Store at 80 C until use.
3.2 Subject the 1. Run the gels at 10 mA/gel through the stacking gel and
Extract to SDS– increase to 25 mA/gel when the samples have entered the
Polyacrylamide Gel separation gel.
Electrophoresis (SDS-
PAGE) According to
Laemmli [56]
4 Notes
Fig. 2 Phosphorylated amino acids: T1, T16, S22 (79.96633 Da), charge: +3. Identified with: Mascot (v1.36);
fragments used for search: a, a-H2O, a-NH3; b, b-H2O, b-NH3; y-H2O, y-NH3
Acknowledgments
References
1. Friso G, van Wijk KJ (2015) Posttranslational phosphoproteomics: methods and protocols.
protein modifications in plant metabolism. Springer, New York, pp 25–46
Plant Physiol 169:1469–1487 8. Yang Z, Li N (2015) Absolute quantitation of
2. Mann M, Ong SE, Gronborg M et al (2002) protein posttranslational modification isoform.
Analysis of protein phosphorylation using mass In: Schulze WX (ed) Plant phosphoproteo-
spectrometry: deciphering the phosphopro- mics: methods and protocols. Springer,
teome. Trends Biotechnol 20:261–268 New York, pp 105–119
3. Kumar V, Khare T, Sharma M et al (2018) 9. McNulty DE, Annan RS (2008) Hydrophilic
Engineering crops for the future: a phospho- interaction chromatography reduces the com-
proteomics approach. Curr Protein Pept Sci plexity of the phosphoproteome and improves
19:413–426 global phosphopeptide isolation and detection.
4. de la Fuente van Bentem S, Roitinger E, Mol Cell Proteomics 7:971–980
Anrather D et al (2006) Phosphoproteomics 10. Gan CS, Guo T, Zhang H et al (2008) A
as a tool to unravel plant regulatory mechan- comparative study of electrostatic repulsion-
isms. Physiol Plant 126:110–119 hydrophilic interaction chromatography
5. Macek B, Mann M, Olsen JV (2009) Global (ERLIC) versus SCX-IMAC-based methods
and site-specific quantitative phosphoproteo- for phosphopeptide isolation/enrichment. J
mics: principles and applications. Annu Rev Proteome Res 7:4869–4877
Pharmacol Toxicol 49:199–221 11. Beltran L, Cutillas PR (2012) Advances in
6. Cutillas PR, Timms JF (2010) Approaches and phosphopeptide enrichment techniques for
applications of quantitative LC-MS for proteo- phosphoproteomics. Amino Acids
mics and activitomics. In: Cutillas PR, Timms 43:1009–1024
JF (eds) LC-MS/MS in proteomics. Springer, 12. Silva-Sanchez C, Li H, Chen S (2015) Recent
New York, pp 3–17 advances and challenges in plant phosphopro-
7. Schweighofer A, Meskiene I (2015) Phospha- teomics. Proteomics 15:1127–1141
tases in plants. In: Schulze WX (ed) Plant
Phosphoproteomic in Peels 195
13. Fı́la J, Honys D (2012) Enrichment techniques peptides on hafnium oxide prior to mass spec-
employed in phosphoproteomics. Amino Acids trometric analysis. Analyst 134:31–33
43:1025–1047 26. Ye J, Zhang X, Young C et al (2010) Opti-
14. Ito J, Taylor NL, Castleden I et al (2009) A mized IMAC-IMAC protocol for phosphopep-
survey of the Arabidopsis thaliana mitochon- tide recovery from complex biological samples.
drial phosphoproteome. Proteomics J Proteome Res 9:3561–3573
9:4229–4240 27. Larsen MR, Thingholm TE, Jensen ON et al
15. Villén J, Gygi SP (2008) The SCX/IMAC (2005) Highly selective enrichment of phos-
enrichment approach for global phosphoryla- phorylated peptides from peptide mixtures
tion analysis by mass spectrometry. Nat Protoc using titanium dioxide microcolumns. Mol
3:1630 Cell Proteomics 4:873–886
16. Batth TS, Francavilla C, Olsen JV (2014) 28. Aryal UK, Ross AR (2010) Enrichment and
Off-line high-pH reversed-phase fractionation analysis of phosphopeptides under different
for in-depth phosphoproteomics. J Proteome experimental conditions using titanium dioxide
Res 13:6176–6186 affinity chromatography and mass spectrome-
17. Wu J, Warren P, Shakey Q et al (2010) Inte- try. Rapid Commun Mass Spectrom
grating titania enrichment, iTRAQ labeling, 24:219–231
and Orbitrap CID-HCD for global identifica- 29. Salomon AR, Ficarro SB, Brill LM et al (2003)
tion and quantitative analysis of phosphopep- Profiling of tyrosine phosphorylation pathways
tides. Proteomics 10:2224–2234 in human cells using mass spectrometry. Proc
18. Ren L, Li C, Shao W et al (2017) TiO2 with Natl Acad Sci U S A 100:443–448
tandem fractionation (TAFT): an approach for 30. Rush J, Moritz A, Lee KA et al (2005) Immu-
rapid, deep, reproducible, and high- noaffinity profiling of tyrosine phosphorylation
throughput phosphoproteome analysis. J Pro- in cancer cells. Nat Biotechnol 23:94
teome Res 17:710–721 31. Bergström Lind S, Molin M, Savitski MM et al
19. Adelmant GO, Cardoza JD, Ficarro SB et al (2008) Immunoaffinity enrichments followed
(2011) Affinity and chemical enrichment for by mass spectrometric detection for studying
mass spectrometry-based proteomics analyses. global protein tyrosine phosphorylation. J Pro-
In: Ivanov A, Lazarev A (eds) Sample prepara- teome Res 7:2897–2910
tion in biological mass spectrometry. Springer, 32. Zhang G, Neubert TA (2006) Use of deter-
Dordrecht, pp 437–486 gents to increase selectivity of immunoprecipi-
20. Reinders J, Sickmann A (2005) State-of-the-art tation of tyrosine phosphorylated peptides
in phosphoproteomics. Proteomics prior to identification by MALDI quadrupole-
5:4052–4061 TOF MS. Proteomics 6:571–578
21. Li W, Backlund PS, Boykins RA et al (2003) 33. Hogrebe A, von Stechow L, Bekker-Jensen DB
Susceptibility of the hydroxyl groups in serine et al (2018) Benchmarking common quantifi-
and threonine to b-elimination/Michael addi- cation strategies for large-scale phosphopro-
tion under commonly used moderately high- teomics. Nat Commun 9:1045
temperature conditions. Anal Biochem 34. Boersema PJ, Aye TT, van Veen TA et al (2008)
323:94–102 Triplex protein quantification based on stable
22. Warthaka M, Karwowska-Desaulniers P, Pflum isotope labeling by peptide dimethylation
MK (2006) Phosphopeptide modification and applied to cell and tissue lysates. Proteomics
enrichment by oxidation-reduction condensa- 8:4624–4632
tion. ACS Chem Biol 1:697–701 35. Bonhomme L, Valot B, Tardieu F et al (2012)
23. Pinkse MW, Uitto PM, Hilhorst MJ et al Phosphoproteome dynamics upon changes in
(2004) Selective isolation at the femtomole plant water status reveal early events associated
level of phosphopeptides from proteolytic with rapid growth adjustment in maize leaves.
digests using 2D-NanoLC-ESI-MS/MS and Mol Cell Proteomics 11:957–972
titanium oxide precolumns. Anal Chem 36. Boex-Fontvieille E, Daventure M, Jossier M
76:3935–3943 et al (2013) Photosynthetic control of Arabi-
24. Kweon HK, Håkansson K (2006) Selective zir- dopsis leaf cytoplasmic translation initiation by
conium dioxide-based enrichment of phos- protein phosphorylation. PLoS One 8:e70692
phorylated peptides for mass spectrometric 37. Weckwerth W, Willmitzer L, Fiehn O (2000)
analysis. Anal Chem 78:1743–1749 Comparative quantification and identification
25. Rivera JG, Choi YS, Vujcic S et al (2009) of phosphoproteins using stable isotope label-
Enrichment/isolation of phosphorylated ing and liquid chromatography/mass
196 Janet Juarez-Escobar et al.
Abstract
The unicellular alga Chlamydomonas reinhardtii is a model photosynthetic organism for the study of
microalgal processes. Along with genomic and transcriptomic studies, proteomic analysis of Chlamydomo-
nas has led to an increased understanding of its metabolic signaling as well as a growing interest in the
elucidation of its phosphorylation networks. To this end, mass spectrometry-based proteomics has made
great strides in large-scale protein quantitation as well as analysis of posttranslational modifications (PTMs)
in a high-throughput manner. An accurate quantification of dynamic PTMs, such as phosphorylation,
requires high reproducibility and sensitivity due to the substoichiometric levels of modified peptides, which
can make depth of coverage challenging. Here we present a method using TiO2-based phosphopeptide
enrichment paired with label-free LC-MS/MS for phosphoproteome quantification. Three technical
replicate samples in Chlamydomonas were processed and analyzed using this approach, quantifying a
total of 1775 phosphoproteins with a total of 3595 phosphosites. With a median CV of 21% across
quantified phosphopeptides, implementation of this method for differential studies provides highly repro-
ducible analysis of phosphorylation events. While the culturing and extraction methods used are specific to
facilitate coverage in algal species, this approach is widely applicable and can easily extend beyond algae to
other photosynthetic organisms with minor modifications.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_15, © Springer Science+Business Media, LLC, part of Springer Nature 2020
197
198 Megan M. Ford et al.
Fig. 1 Phosphoproteomic workflow for Chlamydomonas reinhardtii cells. Briefly, Chlamydomonas cultures are
harvested, resuspended in lysis buffer and sonicated. The lysate is collected and soluble proteins are reduced,
alkylated and digested with trypsin. Phosphopeptides are enriched for using a titanium dioxide-based (TiO2)
enrichment before being subjected to LC-MS/MS analysis. For the data reported here, samples were pooled
after resuspension and aliquoted into three technical replicates to remove any biological variation
2 Materials
2.1 Cell Culture 1. Hutner’s Trace Elements stock [24]. This can be purchased as a
stock solution or prepared in lab (see Note 1).
2. TRIS–Acetate–Phosphate (TAP) Media: 20 mM TRIS base,
17.5 mM acetic acid, 1.65 mM K2HPO4, 945 μM KH2PO4,
287 μM CaCl2, 405 μM MgSO4, 7.01 mM NH4Cl, and Hut-
ner’s Trace Elements. Stock solutions can be made for easy
preparation of TAP media (see Note 2).
3. TAP agar media plates, 1.5% agar: To TAP media (see Subhead-
ing 2.1, item 2), add Bacto Agar and autoclave. Cool media to
52 C and pour plates into petri dishes, 100 15 mm, in
biosafety cabinet, about 10 mL per plate. Let plates solidify
overnight, Parafilm to seal each plate and store at 4 C.
4. Chlamydomonas reinhardtii, strain CC-2895 (6145c mt-).
5. 100 μE m 2
s 1
white light source.
6. Platform shaker.
7. Liquid nitrogen, 0.5 L.
Algal Phosphoproteomics 201
2.4 Desalting 1. Waters (Milford, MA, USA) Sep-Pak C18 1 cc Vac Cartridge,
50 mg, 55–105 μm particle size.
2. 0.1% TFA (LC-MS grade).
3. 80% acetonitrile (ACN, LC-MS grade), 0.1% TFA (LC-MS
grade).
4. Vacuum manifold with 24-port cover (Phenomenex, Torrance,
CA, USA) or equivalent setup.
2.5 Phosphopeptide 1. Wash Buffer: 80% ACN (LC-MS grade), 1% TFA (LC-MS
Enrichment grade).
2. Resuspension Buffer: 80% ACN (LC-MS grade), 1% TFA
(LC-MS grade), 25 mg/mL phthalic acid. This can be made
by adding phthalic acid to the Wash Buffer.
202 Megan M. Ford et al.
2.6 Sample 1. 1% formic acid (FA, LC-MS grade), 2% ACN (LC-MS grade).
Purification 2. 0.1% FA (LC-MS grade).
3. 60% ACN (LC-MS grade), 0.1% FA (LC-MS grade).
4. Millipore (Burlington, MA, USA) C18 ZipTips.
2.8 Data Analysis 1. Progenesis QI for Proteomics v2.0 (Nonlinear Dynamics, Dur-
ham, NC, USA).
2. Mascot Daemon v3.5.1 (Matrix Science, Boston, MA, USA).
3. R script for processing phosphoproteome data. The code used
for processing these data is available on GitHub (https://
github.com/hickslab/QuantifyR).
3 Methods
3.2 Protein 1. Resuspend cell pellets in 4 mL lysis buffer (see Note 5) and
Extraction transfer to Covaris 2 mL tubes. Keep samples on ice during
resuspension.
2. Sonicate samples in a 4 C water bath for 3 min at 200 cycles/
burst, 100 W power, and 13% duty cycle using an E220 focused
ultrasonicator (Covaris, Woburn, MA, USA).
3. Transfer samples from Covaris tubes to 2 mL centrifuge tubes,
keeping the samples on ice.
4. Centrifuge cell lysates at 16,000 g for 10 min at 4 C and
collect the supernatant into a 50 mL conical tube.
5. Add 1 mL of fresh lysis buffer to the pelleted cell debris and
vortex.
6. Centrifuge this sample again at 16,000 g for 10 min at 4 C.
Collect the supernatant and combine with the first extraction in
a 15 mL conical tube.
7. Precipitate proteins by adding 5 volumes (about 30 mL) of
cold 100 mM ammonium acetate in MeOH. Incubate samples
overnight at 80 C.
8. Collect protein pellet by centrifuging for 5 min at 2000 g.
Decant the supernatant without disturbing the pellet.
9. Perform two additional washes with 30 mL fresh 100 mM
ammonium acetate in MeOH followed by a wash with 30 mL
70% EtOH. For each wash, resuspend the pellet by vortexing
before centrifuging for 5 min at 2000 g.
204 Megan M. Ford et al.
10. Allow protein pellets to dry for 5 min in a fume hood at room
temperature.
11. Resolubilize the pellets in 1–2 mL minimal resuspension
buffer. Incubate for 1 hr. to ensure protein is fully dissolved.
12. Use a 10 μL aliquot of each replicate to perform protein
quantification using the CB-X Protein Assay. Complete assay
using manufacturer’s protocol (see Note 6).
13. Normalize each replicate to 4 mg/mL and use a 0.5 mL aliquot
(2 mg) of each sample to continue through the remaining steps
in the protocol.
3.4 Desalting 1. Thaw samples on ice and centrifuge them for 5 min at
10,000 g to pellet. Remove undigested protein pellet from
soluble peptide mixture to avoid clogging the cartridges.
2. Set up one cartridge for each sample on a vacuum manifold
using test tubes to collect the flow through from the cartridges.
3. Wet cartridges by adding 1 mL of 80% ACN, 0.1% TFA (see
Note 8).
4. Equilibrate cartridges using 2 mL of 0.1% TFA.
5. Load peptide samples onto the cartridge and recover the flow
through in a new test tube.
6. Reapply this flow through to the cartridge.
Algal Phosphoproteomics 205
3.5 Phosphopeptide 1. Each sample uses one TiO2 tip placed in a microcentrifuge tube
Enrichment using an adaptor. Preelute the tips using 100 μL of elution
buffer (see Note 9).
2. Condition each tip with 100 μL of wash buffer twice, for a total
of 200 μL, followed by 3 washes using 100 μL of resuspension
buffer.
3. Resuspend the dried peptides in 150 μL of resuspension buffer.
Centrifuge the samples at 10,000 g for 5 min to prevent
clogging and load onto the tips. Use a new centrifuge tube to
recover the sample flow through.
4. Reapply the flow through five times.
5. Following binding using a new centrifuge tube, wash the tips
using 100 μL of resuspension buffer twice and then wash three
times with 100 μL of wash buffer.
6. Using a new centrifuge tube to collect the buffer, elute the
phosphopeptide-enriched samples using two aliquots of
100 μL of elution buffer, combining them for a total of
200 μL of elution.
7. Flash-freeze the elution with liquid nitrogen and vacuum cen-
trifuge to dryness with the concentrator set to room
temperature.
while keeping the resin wet. Repeat twice for a total of three
preelution steps.
6. Equilibrate the ZipTip by pipetting 0.1% FA three times, dis-
carding the solvent each time while keeping the resin wet.
7. Pipet the sample 10 times to load the peptides onto the ZipTip.
8. Wash six times with 0.1% FA.
9. Elute the peptides by pipetting 10 times using aliquoted elu-
tion solvent from step 4, expelling all of the solvent from the
pipette tip.
10. Dry down all of the eluted peptide samples.
3.8 Data Analysis 1. Upload acquired spectral files (∗.raw) into Progenesis QI for
Proteomics (Nonlinear Dynamics). Use automatically assigned
reference spectrum to align the total ion chromatograms to
minimize run-to-run differences in retention time and normal-
ize peak abundances. Design experiment so that replicates are
grouped together as one subject. Export a combined peak list
(∗.mgf).
2. Upload and determine peptide sequence and protein inference
using Mascot (Matrix Science). Use the following search para-
meters: Search against the database containing the proteome
for the organism of interest, in this case the Phytozome Chla-
mydomonas proteome appended with the NCBI mitochon-
drial and chloroplast databases, along with the sequence for
common laboratory contaminants (www.thegpm.org/cRAP;
116 entries). Use a target decoy MS/MS search with trypsin
protease specificity with up to two missed cleavages, a peptide
mass tolerance of 15 ppm, and a fragment mass tolerance of
0.1 Da. Set a fixed modification of carbamidomethylation at
cysteine and include the following variable modifications: acet-
ylation at the protein N-terminus, oxidation at methionine,
and phosphorylation at serine, threonine, and tyrosine. After
the search is complete, adjust the false discovery rate of the
significant peptide identifications to be less than 1% using the
embedded Percolator algorithm. Export matches (∗.xml) and
reupload data to Progenesis.
3. From Progenesis, export the “Peptide Measurements” from
the “Review Proteins” tab (Table S1). These data can be used
to determine the number of phosphosites, and phosphopro-
teins identified in each replicate (Fig. 2a) and the reproducibil-
ity can be assessed (Fig. 3).
4. The proteomics data have been deposited to the ProteomeX-
change Consortium (www.proteomexchange.org) via the
PRIDE partner repository [25] with the dataset identifiers
PXD012261.
5. Parse data using custom R script found at GitHub (https://
github.com/hickslab/QuantifyR) or using similar parsing
technique. This script groups together features matched with
identical sequence, modifications, and score with differing pro-
tein accessions, representing them by the protein accession
with the highest number of unique peptides and largest confi-
dence score assigned by Progenesis. Features duplicated by
multiple peptide identifications are reduced to a single peptide
with the highest Mascot ion score. The results are then limited
to only peptide with one or more phosphosites. Identifiers are
made by joining the protein accession of each feature with the
single-letter amino acid code of the modified residue and
208 Megan M. Ford et al.
Fig. 2 Summary of quantitation results between three replicate samples. A. Number of phosphopeptides,
phosphoproteins, and statistics for each individual replicate and combined data with filtered and imputed
data. B. Histogram of the % CV for quantitated phosphosites
Fig. 3 Plots comparing the log2 transformed abundances between replicate samples
4 Notes
Acknowledgments
References
1. Harris EH (2001) Chlamydomonas as a model 5. Miller R, Wu G, Deshpande RR et al (2010)
organism. Annu Rev Plant Physiol Plant Mol Changes in transcript abundance in Chlamydo-
Biol 52:363–406 monas reinhardtii following nitrogen depriva-
2. Hu Q, Sommerfeld M, Jarvis E et al (2008) tion predict diversion of metabolism. Plant
Microalgal triacylglycerols as feedstocks for Physiol 154:1737–1752
biofuel production: perspectives and advances. 6. Wang H, Alvarez S, Hicks LM (2012) Com-
Plant J 54(4):621–639 prehensive comparison of iTRAQ and label-
3. Merchant SS, Prochnik SE, Vallon O et al free LC-based quantitative proteomics
(2007) The Chlamydomonas genome reveals approaches using two Chlamydomonas rein-
the evolution of key animal and plant func- hardtii strains of interest for biofuels engineer-
tions. Science 318:245–250 ing. J Proteome Res 11:487–501
4. Zones JM, Blaby IK, Merchant SS et al (2015) 7. Roustan V, Bakhtiari S, Roustan P-J et al
High-resolution profiling of a synchronized (2017) Quantitative in vivo phosphoproteo-
diurnal transcriptome from Chlamydomonas mics reveals reversible signaling processes dur-
reinhardtii reveals continuous cell and meta- ing nitrogen starvation and recovery in the
bolic differentiation. Plant Cell 27:2743–2769 biofuel model organism Chlamydomonas rein-
hardtii. Biotechnol Biofuels 10:280. https://
doi.org/10.1186/s13068-017-0949-z
Algal Phosphoproteomics 211
8. Krebs EG, Fischer EH (1955) Phosphorylase recovery from complex biological samples. J
activity of skeletal muscle extracts. J Biol Chem Proteome Res 9:3561–3573
216:113–120 18. Aryal UK, Ross ARS (2010) Enrichment and
9. Fischer EH, Krebs EG (1955) Conversion of analysis of phosphopeptides under different
phosphorylase b to phosphorylase a in muscle experimental conditions using titanium dioxide
extracts. J Biol Chem 216:121–132 affinity chromatography and mass spectrome-
10. Eriksson J, Fenyö D (2010) Modeling experi- try. Rapid Commun Mass Spectrom
mental design for proteomics. Methods Mol 24:219–231
Biol 673:223–230 19. Werth EG, McConnell EW, Lianez IC et al
11. Blackburn K, Goshe MB (2009) Challenges (2019) Investigating the effect of target of
and strategies for targeted phosphorylation rapamycin kinase inhibition on the Chlamydo-
site identification and quantification using monas reinhardtii phosphoproteome: from
mass spectrometry analysis. Brief Funct Geno- known homologs to new targets. New Phytol
mic Proteomic 8:90–103 221:247–260
12. Dunn JD, Reid GE, Bruening ML (2010) 20. Neilson KA, Ali NA (2011) Less label, more
Techniques for phosphopeptide enrichment free: approaches in label-free quantitative mass
prior to analysis by mass spectrometry. Mass spectrometry. Proteomics 11:535–553
Spectrom Rev 29:29–54 21. Bantscheff M, Schirle M, Sweetman G et al
13. Kokubu M, Ishihama Y, Sato T et al (2005) (2007) Quantitative mass spectrometry in pro-
Specificity of immobilized metal affinity-based teomics: a critical review. Anal Bioanal Chem
IMAC/C18 tip enrichment of phosphopep- 389:1017–1031
tides for protein phosphorylation analysis. 22. Werth EG, McConnell EW, Gilbert TSK et al
Anal Chem 77:5144–5154 (2017) Probing the global kinome and phos-
14. Ruprecht B, Koch H, Medard G et al (2015) phoproteome in Chlamydomonas reinhardtii
Comprehensive and reproducible phosphopep- via sequential enrichment and quantitative pro-
tide enrichment using iron immobilized metal teomics. Plant J 89:416–426
ion affinity chromatography (Fe-IMAC) col- 23. Wang H, Gau B, Slade WO et al (2014) The
umns. Mol Cell Proteomics 14:205–215 global phosphoproteome of Chlamydomonas
15. Larsen MR, Thingholm TE, Jensen ON et al reinhardtii reveals complex organellar phos-
(2005) Highly selective enrichment of phos- phorylation in the flagella and thylakoid mem-
phorylated peptides from peptide mixtures brane. Mol Cell Proteomics 13:2337–2353
using titanium dioxide microcolumns. Mol 24. Hutner SH, Provasoli L, Schatz A et al (1950)
Cell Proteomics 4:873–886 Some approaches to the study of the role of
16. Tsai C-F, Wang Y-T, Chen Y-R et al (2008) metals in the metabolism of microorganisms.
Immobilized metal affinity chromatography Proc Am Philos Soc 94:152–170
revisited: pH/acid control toward high selec- 25. Vizcaı́no JA, Côté RG, Csordas A et al (2013)
tivity in phosphoproteomics. J Proteome Res The PRoteomics IDEntifications (PRIDE)
7:4058–4069 database and associated tools: status in 2013.
17. Ye J, Zhang X, Young C et al (2010) Opti- Nucleic Acids Res 41:D1063–D1069
mized IMAC protocol for phosphopeptide
Chapter 16
Abstract
Parallel reaction monitoring (PRM) is a liquid chromatography–mass spectrometry (LC-MS)-based tar-
geted peptide/protein quantification method that was initially implemented for Orbitrap mass spectro-
meters. Here, we describe detailed workflows that utilize the freely available MaxQuant and Skyline
software packages to target peptides of interest, primarily focusing on phosphopeptides.
Key words Parallel reaction monitoring (PRM), Targeted quantification, Orbitrap mass spectrome-
ter, Phosphopeptide, Phosphorylation, Posttranslational modification (PTM)
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_16, © Springer Science+Business Media, LLC, part of Springer Nature 2020
213
214 Sara Christina Stolze and Hirofumi Nakagami
2 Materials
3 Methods
3.3 DDA Data After DDA measurement, analyze derived RAW data using Max-
Processing Quant software.
with MaxQuant
1. Download, install, and open the MaxQuant software (https://
Software maxquant.org).
2. Go to the “Raw files” tab, and click “Load” to import Thermo
RAW files. If more than two files are going to be analyzed,
specify the experimental design in the same tab.
3. Go to the “Group-specific parameters” tab, and choose the
“Modifications” option. Select phosphorylation of serine, thre-
onine, and tyrosine as variable modifications besides the default
settings of alkylation (e.g., carbamidomethylation) of cysteine
residues as fixed and oxidation of methionine residues and
protein N-terminal acetylation as variable modifications.
4. Choose the “Digestion” option and select an enzyme. We
usually keep the default setting for “Max. missed cleavages,”
which is “2.”
5. (Optional for quantification). Choose the “Label-free quantifi-
cation” option and select the “LFQ.” Set the “LFQ min. ratio
count” parameter to “1,” and deselect the “Fast LFQ.” Quan-
tification is optional and not needed for constructing a PRM
method. We often select this option to evaluate the reproduc-
ibility of the replicates.
6. Go to the “Global parameters” tab, and choose the
“Sequences” option. Click “Add file” to import a FASTA-
formatted protein database that is suitable for the analyzed
samples.
7. (Optional for quantification). Choose the “Adv. identification”
option, and enable the “match between runs.”
8. (Optional for quantification). Choose the “Label-free quantifi-
cation” option, and enable the “iBAQ.”
9. Click “Start” to run the analysis.
216 Sara Christina Stolze and Hirofumi Nakagami
3.4 Target List 1. Download, install, and open the Skyline software (https://
Construction skyline.ms).
with Skyline Software 2. Open a “Blank Document.”
Using the MaxQuant
3. If you have used Skyline before, go to the “Settings” menu, and
Output File click “Default” to restore the default settings. Skyline automat-
ically restores settings that were used on the previous occasion,
and, therefore, it is necessary/desirable to clear previous
settings.
4. Go to the “Settings” menu, and open the “Peptide
Settings” form.
5. Go to the “Digestion” tab, and select an enzyme. For “Max
missed cleavages,” set the values that was defined for the Max-
Quant search.
6. Go to the “Filter” tab, and set the “Min length” option to “7”
and the “Max length” option to “25,” which are the default
settings for the MaxQuant search. If you have changed these
parameters for the MaxQuant search, match the parameters
accordingly. Set the “Exclude N-terminal AAs” option to “0.”
7. Go to the “Settings” menu, and open the “Transition
settings” form.
8. Go to the “Filter” tab, and set the “Precursor charges” option
to “2, 3, 4”, the “Ion charges” option to “1,” and the “Ion
types” option to “p”(precursor). Set the “To: (of the “Product
ion selection”)” option to the “last ion.”
9. Go to the “Instrument” tab, and set the “Min m/z” and “Max
m/z” options to the values that were used for the DDA
measurement.
10. Go to the “Full-Scan” tab, and set parameters for the “MS1
filtering.” Set the “Isotope peaks included” option to “Count”
and the “Precursor mass analyzer” option to “Orbitrap.” Set
the “Resolving power: At:” option to the values that were used
for the DDA measurement. Then, close the “Transition
settings” form.
11. Save the document with the adjusted settings.
12. Import the MaxQuant output file. Go to the “File” menu, and
click “Peptide Search” under the “Import” option to open the
“Import Peptide Search” form.
13. Select “DDA with MS1 filtering” for the “Workflow” option.
Click “Add Files” to select the MaxQuant output “msms.txt”
file, then click “Next.” The “mqpar” file for the MaxQuant
search is also needed for building a spectral library and has to
be stored in a same location as the “msms.txt” file.
14. Skyline will build a spectral library from the “msms.txt” file and
automatically search for the Thermo RAW files that were used
PRM of Phosphopeptides 217
for the MaxQuant search. If the Thermo RAW files are stored
in a different location, manually select the files, then click
“Next.”
15. Select the option of adding all modifications and click “Next”
(see Note 3).
16. Click “Browse” to import the FASTA-formatted protein data-
base that was used for the MaxQuant search. If you intend to
make a target list for proteins of interest, import a database
with the selected proteins only. Alternatively, the FASTA-
formatted database can be copy-pasted into the box from a
text document. Click “Finish” to start importing the data.
17. If you want to extract peptides that are unique to the proteins
of interest, select the “Remove duplicate peptides” and click
“OK.” The uniqueness will only be assessed within the
imported database, and, therefore, extra evaluation is needed
to ensure uniqueness within the samples you plan to measure.
18. Arrange the Skyline window for selecting targets. We usually
arrange the window with the following panels, “Peak Areas—
Replicate Comparison”, “Retention Times—Replicate Com-
parison”, and “Retention Times—Scheduling”, as shown in
Fig. 1. These panels can be opened from the “View” menu.
19. Inspect the data for precursors of interest. Good results can be
achieved with sharp, defined peaks without coelution. An
“idotp” value of >0.95 is recommended. High reproducibility
is also a good indicator. In some cases, the software does not
3.5 PRM Data 1. Create a PRM method using the Thermo Xcalibur software. In
Acquisition our case, with a Thermo Q-Exactive Plus, a method combining
one full scan followed by n PRM (n ¼ number of targeted
precursors) events was used to acquire PRM data (see Note 6).
Figure 2 shows the setup of the instrument method on a
Q-Exactive Plus.
2. Measure samples with the PRM method.
3.6 PRM Data 1. Process acquired RAW data using the MaxQuant software, as
Analysis stated above (A). Alternatively, import the Thermo RAW file
into the Skyline software for analysis (B). See Note 7 for limita-
tions of option B. Data inspection with the Skyline software is
basically the same for both approaches (C).
A1. MaxQuant analysis: Analyze the PRM data using the Max-
Quant software, applying the same settings as stated under
Subheading 3.3.
A2. Skyline analysis: Prepare a document for data analysis.
Open the document that was generated as stated in Sub-
heading 3.4, step 23.
PRM of Phosphopeptides 219
3.7 Target List 1. Open a “Blank Document,” and save the document.
Construction 2. Adjust the “Peptide settings.” Go to the “Digestion” tab, and
with the Skyline set appropriate “Enzyme” and “Max missed cleavages.”
Software by In Silico
3. Go to the “Filter” tab, and set the “Min length” option to “7”
Digest and the “Max length” option to “25,” which are the default
settings for the MaxQuant search. You can also attempt to
target shorter or longer peptides, and later adjust the para-
meters for the MaxQuant search. If you want to target the
site in the N-terminal region, set the “Exclude N-terminal
AAs” option to “0.”
4. Go to the “Modifications” tab, and click “Edit list” for the
“Structural modifications.” Add “Phospho (ST)” and “Phos-
pho (Y)” as variable modifications. Other possible modifica-
tions, namely alkylation (e.g., carbamidomethylation) of
cysteine residues as fixed, oxidation of methionine residues
and protein N-terminal acetylation as variable modifications,
also need to be added.
PRM of Phosphopeptides 221
Fig. 4 Transition list created by in silico digest from a FASTA file containing a
protein of interest
4 Notes
Acknowledgments
References
1. Walley JW, Sartor RC, Shen Z et al (2016) 7. Lehmann U, Wienkoop S, Tschoep H et al
Integration of omic networks in a developmen- (2008) If the antibody fails--a mass western
tal atlas of maize. Science 353:814–818 approach. Plant J 55:1039–1046
2. Marx H, Minogue CE, Jayaraman D et al 8. Peterson AC, Russell JD, Bailey DJ et al (2012)
(2016) A proteomic atlas of the legume Medi- Parallel reaction monitoring for high resolu-
cago truncatula and its nitrogen-fixing endo- tion and high mass accuracy quantitative, tar-
symbiont Sinorhizobium meliloti. Nat geted proteomics. Mol Cell Proteomics
Biotechnol 34:1198–1205 11:1475–1488
3. Seaton DD, Graf A, Baerenfaller K et al (2018) 9. Cox J, Mann M (2008) MaxQuant enables
Photoperiodic control of the Arabidopsis pro- high peptide identification rates, individualized
teome reveals a translational coincidence mech- p.p.b.-range mass accuracies and proteome-
anism. Mol Syst Biol 14:e7962 wide protein quantification. Nat Biotechnol
4. Vidova V, Spacil Z (2017) A review on mass 26:1367–1372
spectrometry-based quantitative proteomics: 10. Tyanova S, Temu T, Cox J (2016) The Max-
targeted and data independent acquisition. Quant computational platform for mass
Anal Chim Acta 964:7–23 spectrometry-based shotgun proteomics. Nat
5. Arsova B, Watt M, Usadel B (2018) Monitor- Protoc 11:2301–2319
ing of plant protein post-translational modifi- 11. MacLean B, Tomazela DM, Shulman N et al
cations using targeted proteomics. Front Plant (2010) Skyline: an open source document edi-
Sci 9:1168 tor for creating and analyzing targeted proteo-
6. Bourmaud A, Gallien S, Domon B (2016) Par- mics experiments. Bioinformatics 26:966–968
allel reaction monitoring using quadrupole- 12. Nakagami H (2014) StageTip-based HAM-
Orbitrap mass spectrometer: principle and MOC, an efficient and inexpensive phospho-
applications. Proteomics 16:2146–2159 peptide enrichment method for plant shotgun
phosphoproteomics. Methods Mol Biol
1072:595–607
Chapter 17
Abstract
N-linked glycans are a ubiquitous posttranslational modification and are essential for correct protein folding
in the endoplasmic reticulum of plants. However, this likely represents a narrow functional role for the
diverse array of glycan structures currently associated with N-glycoproteins in plants. The identification of
N-linked glycosylation sites and their structural characterization by mass spectrometry remains challenging
due to their size, relative abundance, structural heterogeneity, and polarity. Current proteomic workflows
are not optimized for the enrichment, identification and characterization of N-glycopeptides. Here we
describe a detailed analytical procedure employing hydrophilic interaction chromatography enrichment,
high-resolution tandem mass spectrometry employing complementary fragmentation techniques (higher-
energy collisional dissociation and electron-transfer dissociation) and a data analytics workflow to produce
an unbiased high confidence N-glycopeptide profile from plant samples.
Key words N-linked glycans, Glycoproteomics, HILIC, Higher-energy collisional dissociation, Elec-
tron-transfer dissociation
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_17, © Springer Science+Business Media, LLC, part of Springer Nature 2020
225
226 Eduardo Antonio Ramirez-Rodriguez and Joshua L. Heazlewood
2 Materials
2.1 Microsomal 1. Approximately 1 g plant tissue (fresh weight) (see Note 1).
Preparation 2. Ceramic mortar and pestle (medium size).
228 Eduardo Antonio Ramirez-Rodriguez and Joshua L. Heazlewood
3 Methods
3.1 Preparation 1. Harvest 1 g fresh weight of plant material (see Note 1).
of Microsomal Fraction 2. Place material in 8 mL of Microsomal Extraction Buffer and a
and Peptide Digestion prechilled mortar and pestle and grind on ice until tissue is
homogenized.
3. Place two layers of Miracloth into a funnel and filter homoge-
nate into 10 mL centrifuge preparative tube on ice. Gently
squeeze the Miracloth to extract as much homogenate as pos-
sible (see Note 9).
4. Centrifuge the homogenate at 3000 g for 10 min at 4 C.
5. Carefully transfer the supernatant into prechilled 12 mL ultra-
centrifuge tubes.
6. Centrifuge the supernatant at 100,000 g for 30 min.
7. Discard supernatant being careful not to disturb the pellet.
230 Eduardo Antonio Ramirez-Rodriguez and Joshua L. Heazlewood
Fig. 1 Graphical summary of the HILIC-based N-glycan peptide analysis workflow from plant material
3.4 Spectral Data 1. Spectral data were interrogated using Byonic™ (Protein
Interrogation Metrics) against a plant specific protein database in FASTA
format (see Note 20).
2. Default search parameters were employed with the following
changes: precursor mass tolerance—5 ppm, Fragmentation
type—Both: HCD and ETD, Fragment mass tolerance
(HCD)—10 ppm, Fragment mass tolerance (ETD)—
20 ppm, Fixed modifications—Carbamidomethyl (Cys), Vari-
able modification—Oxidation (M), Charge states—3,4,5
Applied to unassigned spectra, Precursor isotype off by x—
Too high (wide).
N-linked Glycoproteomics in Plants 233
4 Notes
Table 1
N-glycan structures used to generate plant glycan database for MS/MS data interrogation
(continued)
N-linked Glycoproteomics in Plants 235
Table 1
(continued)
(continued)
236 Eduardo Antonio Ramirez-Rodriguez and Joshua L. Heazlewood
Table 1
(continued)
GlcNAc Gal
a GlcNAc
Man Xyl
204.1 m/z
y11y10y9 y8 y7 y6 y5 y4 y3 Fuc
Glc
INATGVVAPVGFK
b2 b3 b4 b5 b6 b10 b12 1475.84 (M+H)+
Intensity (cps)
1271.75 (M+H)+
2+
ManGlcNAc 636.86 (M+2H)
|
738.42 (M+2H)2+
366.14 m/z | (M+H)+
1621.95
- y7
- y9
- y11
- y5
- y6
- y10
- y8
- b3
- y12
- b2
1679.93 (M+H)+
- b12
b6
- b10
- y3
- b4
- b5
m/z
b
1222.08 (M+2H)2+
INATGVVAPVGFK
Intensity (cps)
c2 c3 c4 c5 c6 c7
- z6
- c7
- z7
- c6
- z2
- z3
- z4
- c5
- z10
- z11
- c4
- c3
- c2
m/z
Fig. 2 An example of HCD triggered ETD fragmentation spectra. (a) A peak of 204.1 m/z (GlcNAc) and
366.1396 m/z (ManGlcNAc) in the HCD fragmentation spectra triggered ETD fragmentation of this precursor
ion 815.05 [M+3H]3+. (b) Resultant ETD spectra (See Notes 18 and 19)
References
1. Hebert DN, Lamriben L, Powers ET et al 9. Elbers IJW, Stoopen GM, Bakker H et al
(2014) The intrinsic and extrinsic effects of (2001) Influence of growth conditions and
N-linked glycans on glycoproteostasis. Nat developmental stage on N-glycan heterogene-
Chem Biol 10:902–910 ity of transgenic immunoglobulin G and
2. Stanley P, Taniguchi N, Aebi M (2015) N- endogenous proteins in tobacco leaves. Plant
Glycans. In: rd VA, Cummings RD et al (eds) Physiol 126:1314–1322
Essentials of glycobiology. Cold Spring Har- 10. Strasser R, Stadlmann J, Svoboda B et al
bor Laboratory Press, Cold Spring Harbor (2005) Molecular basis of N-acetylglucosami-
(NY), pp 99–111 nyltransferase I deficiency in Arabidopsis thali-
3. Liu Y, Li J (2014) Endoplasmic reticulum- ana plants lacking complex N-glycans.
mediated protein quality control in Arabidop- Biochem J 387:385–391
sis. Front Plant Sci 5:162 11. Pedersen CT, Loke I, Lorentzen A et al (2017)
4. Rips S, Bentley N, Jeong IS et al (2014) Multi- N-glycan maturation mutants in Lotus japoni-
ple N-glycans cooperate in the subcellular tar- cus for basic and applied glycoprotein research.
geting and functioning of Arabidopsis Plant J 91:394–407
KORRIGAN1. Plant Cell 26:3792–3808 12. Song W, Mentink RA, Henquet MG et al
5. Strasser R (2016) Plant protein glycosylation. (2013) N-glycan occupancy of Arabidopsis N-
Glycobiology 26:926–939 glycoproteins. J Proteome 93:343–355
6. Fanata WI, Lee KH, Son BH et al (2013) 13. Zielinska DF, Gnad F, Schropp K et al (2012)
N-glycan maturation is crucial for cytokinin- Mapping N-glycosylation sites across seven
mediated development and cellulose synthesis evolutionarily distant species reveals a diver-
in Oryza sativa. Plant J 73:966–979 gent substrate proteome despite a common
7. Zeng W, Ford KL, Bacic A et al (2018) N- core machinery. Mol Cell 46:542–548
linked glycan micro-heterogeneity in glycopro- 14. Ma J, Wang D, She J et al (2016) Endoplasmic
teins of Arabidopsis. Mol Cell Proteomics reticulum-associated N-glycan degradation of
17:413–421 cold-upregulated glycoproteins in response to
8. Henquet M, Lehle L, Schreuder M et al (2008) chilling stress in Arabidopsis. New Phytol
Identification of the gene encoding the alpha 212:282–296
1,3-mannosyltransferase (ALG3) in Arabidop- 15. Xu SL, Medzihradszky KF, Wang ZY et al
sis and characterization of downstream (2016) N-glycopeptide profiling in
N-glycan processing. Plant Cell 20:1652–1664
240 Eduardo Antonio Ramirez-Rodriguez and Joshua L. Heazlewood
Arabidopsis inflorescence. Mol Cell Proteomics assignment of their glycosylation sites using
15:2048–2054 HILIC enrichment and partial deglycosylation.
16. Wilson IB, Zeleny R, Kolarich D et al (2001) J Proteome Res 3:556–566
Analysis of Asn-linked glycans from vegetable 19. Ford KL, Zeng W, Heazlewood JL et al (2015)
foodstuffs: widespread occurrence of Lewis a, Characterization of protein N-glycosylation by
core alpha1,3-linked fucose and xylose substi- tandem mass spectrometry using complemen-
tutions. Glycobiology 11:261–274 tary fragmentation techniques. Front Plant Sci
17. Strasser R, Schoberer J, Jin C et al (2006) 6:674
Molecular cloning and characterization of Ara- 20. Rice RH, Means GE, Brown WD (1977) Sta-
bidopsis thaliana Golgi alpha-mannosidase II, a bilization of bovine trypsin by reductive meth-
key enzyme in the formation of complex N- ylation. Biochim Biophys Acta 492:316–321
glycans in plants. Plant J 45:789–803 21. Goebel-Stengel M, Stengel A, Tache Y (2011)
18. Hagglund P, Bunkenborg J, Elortza F et al The importance of using the optimal plastic-
(2004) A new strategy for identification of N- ware and glassware in studies involving pep-
glycosylated proteins and unambiguous tides. Anal Biochem 414:38–46
Chapter 18
Abstract
Acetylation of lysine side chains at their ε-amino group is a reversible posttranslational modification (PTM),
which can affect diverse protein functions. Lysine acetylation was first described on histones, and nowadays
gains more and more attention due to its more general occurrence in proteomes, and its possible crosstalk
with other protein modifications. Here we describe a workflow to investigate the acetylation of lysine-
containing peptides on a large scale. For this high-resolution lysine acetylome analysis, dimethyl-labeled
peptide samples are pooled and offline-fractionated using hydrophilic interaction liquid chromatography
(HILIC). The offline fractionation is followed by an immunoprecipitation and liquid chromatography–-
tandem mass spectrometry (LC-MS/MS) for data acquisition and subsequent data analysis.
Key words Lysine acetylation, HILIC, Offline fractionation, Dimethyl labeling, MaxQuant
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_18, © Springer Science+Business Media, LLC, part of Springer Nature 2020
241
242 Jonas Giese et al.
Fig. 1 Workflow for quantitative lysine acetylome profiling. The workflow comprises six crucial steps:
(1) Experimental setup to investigate up to three different conditions/genotypes or treatments. (2) Protein
extraction from harvested leaf tissue under denaturing and reducing conditions with subsequent alkylation and
trypsin digestion using a modified FASP protocol. (3) Desalting and isotopic dimethyl labeling, followed by
pooling of the labeled samples in equal amounts of peptide. (4) Offline fractionation of peptides utilizing a ZIC-
HILIC column on an HPLC system collecting seven peptide fractions. (5) Enrichment of lysine-acetylated
peptides by immunoprecipitation (IP) using an anti-acetyl lysine antibody. (6) Measuring desalted samples
(enriched and IP input for whole proteome analysis) on a nano-LC MS/MS system. Repeat analysis for at
least four biological replicates
mAU %B
70
150
60
50
100
40
30
50
20
10
0
Fractions 1 2 3456 7
Fig. 2 Stepped-linear gradient and peptide distribution of ZIC-HILIC offline fractionation. The gradient was set
up with a flow rate of 0.5 mL per min for 22 column volumes (CV, 1 CV ¼ 2493 mL), which corresponds to
115 min. The gradient started with 10 % buffer BZH in buffer AZH for the duration of three CV. The concentration
of buffer BZH was then increased to 58 % with a linear gradient over 12 CV. Another three CV ran at 79 %
buffer BZH. The gradient was then reset to 10 % for another three CV. Twenty-two fractions of peptides were
collected during the first 19 CV; twenty-one of those fractions were pooled into seven fractions with about
equal peptide amounts. The last fraction was discarded, as it contained no peptides
2 Materials
2.4 ZIC-HILIC Offline 1. Buffer AZH: 95 % (v/v) ACN (LC-MS-grade) with 2 % (v/v)
Fractionation FA and 5 mM ammonium acetate.
2. Buffer BZH: 0.07 % (v/v) FA and 5 mM ammonium acetate.
3. ZIC-HILIC column (e.g., Sequant ZIC-HILIC column,
150 4.6 mm, 3.5 μm, 200 Å, Merck).
246 Jonas Giese et al.
3 Methods
3.1 Protein The following protocol is optimized for Arabidopsis leaves (see
Extraction Note 3). Unless, indicated otherwise all steps are carried out at
room temperature (see Note 4).
1. Leaves are harvested and immediately frozen in liquid nitrogen.
Material is then ground under liquid nitrogen to a fine powder.
Required amount of material for protein extraction is weighed
into a precooled reaction tube (2 mL for up to 300 mg or a
15 mL tube) (see Note 5).
2. Heat a water bath to 95 C and preheat SDT extraction buffer
to 95 C (see Note 6).
3. Mix the samples with 2.5 volumes (v/w) of hot SDT-lysis
buffer. Vortex immediately to resuspend the powder in the
buffer. Incubate samples for 5 min at 95 C in the water bath
and vortex twice in between for 15–20 s.
4. Place samples in an ultrasonic bath and sonicate for 15 min.
5. Centrifuge extract for 30 min in a benchtop centrifuge at Vmax
(15,000–21,000 g) (see Note 7).
6. Transfer the supernatant to a new tube without disturbing the
pelleted material.
7. Repeat steps 5 and 6.
8. Determine protein concentration using the Pierce 660 nm
protein assay with ionic detergent compatibility reagent
according to the manufacturer’s instructions.
3.3 Desalting Labeling reagent should be prepared shortly before use as shown in
and Dimethyl Labeling Table 1. Solutions must be kept cold until use to prevent unwanted
of Peptides on C18 side-reactions.
Columns 1. Sep-Pak C18 (360 mg) cartridges are assembled with a 5 mL
syringe (without piston) in a rack above a tray to collect flow-
through.
2. Flush Sep-Paks with 3 mL methanol to condition the C18
matrix (see Note 12).
3. Repeat step 2 once with 3 mL buffer BC18 and afterward with
3 mL buffer AC18.
4. Load peptide sample on Sep-Paks (see Note 13).
5. Add 1 mL buffer AC18.
6. Add 3 mL buffer AC18.
7. Add dimethyl-labeling reagent on Sep-Paks (see Notes 13 and
14).
8. Add 1 mL buffer AC18.
9. Add 3 mL buffer AC18.
10. Transfer SepPaks to 2 mL reaction tubes to collect eluate. Elute
samples twice by loading 700 μL buffer BC18 (see Note 13).
Offline Fractionation for Lysine Acetylome Profiling 249
Table 1
Scheme to prepare dimethyl labeling reagent. 5 mL reagent per sample is used
3.4 ZIC-HILIC Offline 1. Dissolve dried peptides in 34 μL buffer AZH and 900 μL buffer
Fractionation BZH. Sonicate for 4 min in ultrasonic bath and vortex shortly.
Centrifuge for 1 min at 12,000 g.
2. Transfer supernatant into a new 2 mL reaction tube. Dissolve
pellet in 34 μL buffer AZH and 500 μL buffer BZH. Sonicate for
4 min in ultrasonic bath and vortex shortly. Centrifuge for
1 min with maximum speed.
3. Combine supernatants and repeat step 2 if a pellet is persistent.
4. Collect supernatant and combine with previous supernatants
(see Note 16).
5. Samples are fractionated on an HPLC unit using a 3.5 μm
ZIC-HILIC column. Up to 10 mg can be loaded on the
column. A segmented linear gradient from 10 to 58% buffer
BZH and a flow rate of 500 μL per minute is used (Fig. 2).
6. Twenty-two fractions are collected and combined into seven
fractions in 1.5 mL reaction tubes aiming at equally distributed
peptide amounts.
7. Fractions are then dried in a vacuum centrifuge.
250 Jonas Giese et al.
500 5000
250 2500
0 0
g
g
/m
/m
/m
/m
/m
/m
AB
AB
AB
AB
AB
AB
µL
µL
µL
µL
µL
µL
10
25
50
10
25
50
Fig. 3 Optimization of antibody bead amount for lysine acetylome analysis. (a) The number of identified lysine-
acetylated peptides (Kac) differs between different antibody bead (AB) amounts used for the IP (10, 25, 50 μL
antibody solution, respectively, with 1 mg peptide). (b) The nonenriched total proteome samples showed
identical numbers of protein groups (mean SD, n ¼ 4). Peptides and protein groups were identified with
MaxQuant with settings reported in Hartl et al. [29]
3.5 Enrichment All steps regarding the anti-acetyl-lysine antibody bound to agarose
of Lysine-Acetylated beads should be performed on ice, as the antibody is sensitive to
Peptides temperature changes. All buffers should be precooled.
1. Resuspend samples in a maximum volume of 1 mL TBS buffer
(see Note 17).
2. The samples should be adjusted to pH 7–8.
3. Prepare an aliquot of 10 μg of peptide sample for the total
proteome analysis and acidify with 10 % (v/v) TFA to a final
concentration of 1 % (v/v).
4. Use 25–50 μL of antibody–bead slurry per milligram of peptide
(Fig. 3). Pipetting of the antibody should be done using cut
tips. Up to 500 μL of antibody can be pooled into one 1.5 mL
low-binding reaction tube for equilibration (see Note 18).
5. Add 1 mL precooled TBS and incubate the antibody for 5 min
on a rolling wheel in the fridge or a cold room. Centrifuge
afterward for 1 min at 1000 g, 4 C.
6. Carefully remove the supernatant (see Note 19).
7. Repeat steps 4 and 5 twice.
8. Distribute corresponding amounts of the washed antibody
beads into fresh tubes and add the peptides.
9. Incubate overnight on a rolling wheel at 4 C.
10. Centrifuge samples for 2 min at 1000 g, 4 C.
Offline Fractionation for Lysine Acetylome Profiling 251
3.6 Desalting 1. Stack three layers of SDB-RPS matrix on top of each other.
of Peptides Punch one disk consisting of three stacked layers out and push
on SDB-RPS into a 200 μL pipette tip (see Note 20).
Stop-and-Go- 2. Assemble the Stage Tip with a 2 mL reaction tube joint by an
Extraction Tips adaptor.
(Stage Tips) 3. Add 100 μL ACN onto the Stage Tip and centrifuge tips at
1500 g (approximately 1–2 min).
4. Discard flow-through.
5. Add 100 μL 30 % (v/v) MeOH, 1 % (v/v) TFA onto the Stage
Tip and centrifuge the tips at 1500 g (approximately
1–2 min).
6. Discard flow-through.
7. Add 100 μL 0.2 % (v/v) TFA onto the Stage Tip and centrifuge
the tips at 1500 g (approximately 1–2 min).
8. Discard flow-through.
9. Load each sample (enriched for acetylome and input for total
proteome analyses) onto a Stage Tip and centrifuge tips at
252 Jonas Giese et al.
4 Notes
Acknowledgments
References
1. Calfapietra C, Peñuelas J, Niinemets U€ (2015) yeast histone H4 acetyltransferase. J Biol
Urban plant physiology: adaptation-mitigation Chem 270:24674–24677
strategies under permanent stress. Trends Plant 10. Drazic A, Myklebust LM, Ree R et al (2016)
Sci 20:72–75 The world of protein acetylation. Biochim Bio-
2. Nunes-Nesi A, Fernie AR, Stitt M (2010) Met- phys Acta 1864:1372–1401
abolic and signaling aspects underpinning the 11. Koskela MM, Brünje A, Ivanauskaite A et al
regulation of plant carbon nitrogen interac- (2018) Chloroplast acetyltransferase NSI is
tions. Mol Plant 3:973–996 required for state transitions in Arabidopsis
3. Dietz K-J (2015) Efficient high light acclima- thaliana. Plant Cell 30(8):1695–1709
tion involves rapid processes at multiple mech- 12. Wagner GR, Payne RM (2013) Widespread
anistic levels. J Exp Bot 66:2401–2414 and enzyme-independent Nε-acetylation and
4. Hartl M, Finkemeier I (2012) Plant mitochon- Nε-succinylation of proteins in the chemical
drial retrograde signaling: post-translational conditions of the mitochondrial matrix. J Biol
modifications enter the stage. Front Plant Sci Chem 288:29036–29045
3:1–7 13. König A-C, Hartl M, Boersema PJ et al (2014)
5. Johnová P, Skalák J, Saiz-Fernández I et al The mitochondrial lysine acetylome of Arabi-
(2016) Plant responses to ambient temperature dopsis. Mitochondrion 19:252–260
fluctuations and water-limiting conditions: a 14. Hosp F, Lassowskat I, Santoro V et al (2017)
proteome-wide perspective. Biochim Biophys Lysine acetylation in mitochondria: from
Acta 1864:916–931 inventory to function. Mitochondrion
6. Huber SC, Hardin SC (2004) Numerous post- 33:58–71
translational modifications provide opportu- 15. Alinsug MV, Yu C-W, Wu K (2009) Phyloge-
nities for the intricate regulation of metabolic netic analysis, subcellular localization, and
enzymes at multiple levels. Curr Opin Plant expression patterns of RPD3/HDA1 family
Biol 7:318–322 histone deacetylases in plants. BMC Plant Biol
7. Allfrey VG, Faulkner R, Mirsky AE (1964) 9:37
Acetylation and methylation of histones and 16. Shen Y, Wei W, Zhou D-X (2015) Histone
their possible role in the regulation of RNA acetylation enzymes coordinate metabolism
synthesis. Proc Natl Acad Sci U S A and gene expression. Trends Plant Sci
51:786–794 20:614–621
8. Yang X-J, Seto E (2008) Lysine acetylation: 17. Pandey R, Müller A, Napoli CA et al (2002)
codified crosstalk with other posttranslational Analysis of histone acetyltransferase and his-
modifications. Mol Cell 31:449–461 tone deacetylase families of Arabidopsis thali-
9. Kleff S, Andrulis ED, Anderson CW et al ana suggests functional diversification of
(1995) Identification of a gene encoding a chromatin modification among multicellular
eukaryotes. Nucleic Acids Res 30:5036–5055
256 Jonas Giese et al.
18. König A, Hartl M, Pham PA et al (2014) The histone deacetylase substrate proteins in Ara-
Arabidopsis class II sirtuin is a lysine deacety- bidopsis. Mol Syst Biol 13:949
lase and interacts with mitochondrial energy 30. Füßl M, Lassowskat I, Née G et al (2018)
metabolism. Plant Physiol 164:1401–1414 Beyond histones: new substrate proteins of
19. Choudhary C, Mann M (2010) Decoding sig- lysine deacetylases in Arabidopsis nuclei.
nalling networks by mass spectrometry-based Front Plant Sci 9:461
proteomics. Nat Rev Mol Cell Biol 31. He D, Wang Q, Li M et al (2016) Global
11:427–439 proteome analyses of lysine acetylation and suc-
20. Zhang K, Zheng S, Yang JS et al (2013) Com- cinylation reveal the widespread involvement of
prehensive profiling of protein lysine acetyla- both modification in metabolism in the
tion in Escherichia coli. J Proteome Res embryo of germinating rice seed. J Proteome
12:844–851 Res 15:879–890
21. Henriksen P, Wagner SA, Weinert BT et al 32. Smith-Hammond CL, Hoyos E, Miernyk JA
(2012) Proteome-wide analysis of lysine acety- (2014) The pea seedling mitochondrial N-
lation suggests its broad regulatory scope in ε-lysine acetylome. Mitochondrion
Saccharomyces cerevisiae. Mol Cell Proteomics 19:154–165
11:1510–1522 33. Xiong Y, Peng X, Cheng Z et al (2016) A
22. Lundby A, Lage K, Weinert B et al (2012) comprehensive catalog of the lysine-acetylation
Proteomic analysis of lysine acetylation sites in targets in rice (Oryza sativa) based on proteo-
rat tissues reveals organ specificity and subcel- mic analyses. J Proteome 138:20–29
lular patterns. Cell Rep 2:419–431 34. Zhang Y, Song L, Liang W et al (2016) Com-
23. Weinert BT, Wagner SA, Horn H et al (2011) prehensive profiling of lysine acetylproteome
Proteome-wide mapping of the drosophila analysis reveals diverse functions of lysine acet-
acetylome demonstrates a high degree of con- ylation in common wheat. Sci Rep 6:21069
servation of lysine acetylation. Sci Signal 4:ra48 35. Weinert BT, Iesmantavicius V, Moustafa T et al
24. Svinkina T, Gu H, Silva JC et al (2015) Deep, (2014) Acetylation dynamics and stoichiome-
quantitative coverage of the lysine acetylome try in Saccharomyces cerevisiae. Mol Syst Biol
using novel anti-acetyl-lysine antibodies and 10:716
an optimized proteomic workflow. Mol Cell 36. Lassowskat I, Hartl M, Hosp F et al (2017)
Proteomics 14:2429–2440 Dimethyl-labeling-based quantification of the
25. Zhou H, Finkemeier I, Guan W et al (2018) lysine acetylome and proteome of plants. In:
Oxidative stress-triggered interactions between Fernie AR, Bauwe H, Weber APM (eds) Pho-
the succinyl- and acetyl-proteomes of rice torespiration. Springer New York, New York,
leaves. Plant Cell Environ 41:1139–1153 NY, pp 65–81
26. Walley JW, Shen Z, McReynolds MR et al 37. Wiśniewski JR, Zougman A, Nagaraj N et al
(2018) Fungal-induced protein hyperacetyla- (2009) Universal sample preparation method
tion in maize identified by acetylome profiling. for proteome analysis. Nat Methods
Proc Natl Acad Sci U S A 115:210–215 6:359–362
27. Finkemeier I, Laxa M, Miguet L et al (2011) 38. Boersema PJ, Raijmakers R, Lemeer S et al
Proteins of diverse function and subcellular (2009) Multiplex peptide stable isotope
location are lysine acetylated in Arabidopsis. dimethyl labeling for quantitative proteomics.
Plant Physiol 155:1779–1790 Nat Protoc 4:484–494
28. Wu X, Oh M-H, Schwarz EM et al (2011) 39. Tyanova S, Temu T, Cox J (2016) The Max-
Lysine acetylation is a widespread protein mod- Quant computational platform for mass
ification for diverse proteins in Arabidopsis. spectrometry-based shotgun proteomics. Nat
Plant Physiol 155:1769–1778 Protoc 11:2301–2319
29. Hartl M, Füßl M, Boersema PJ et al (2017)
Lysine acetylome profiling uncovers novel
Chapter 19
Abstract
Protein functions often rely on protein–protein interactions. Hence, knowledge about the protein interac-
tion network is essential for an understanding of protein functions and plant physiology. A major challenge
of the postgenomic era is the mapping of protein–protein interaction networks. This chapter describes a
mass spectrometry-based label-free quantification approach to identify in vivo protein interaction networks.
The procedure starts with the extraction of intact protein complexes from transgenic plants expressing the
protein of interest fused to a GFP-Tag (bait-GFP), as well as plants expressing a free GFP as background
control. Enrichment of the GFP-tagged protein together with its interaction partners, as well as the free
GFP, is performed by immunoaffinity purification. The pull-down quality can be evaluated by simple
gel-based techniques. In parallel, the captured proteins are trypsin-digested and relatively quantified by
label-free mass spectrometry-based quantification. The relative quantification approach largely relies on the
normalization of protein abundances of background-binding proteins, which occur in both bait-GFP and
free GFP pull-downs. Therefore, relative quantification of the protein pull-down is superior over methods
that solely rely on protein identifications and removal of often copurified high-abundance proteins from the
bait-GFP pull-downs, which might remove real interaction partners. A further strength of this method is
that it can be applied to any soluble GFP-tagged protein.
1 Introduction
Guillaume Née and Priyadarshini Tilak have contributed equally to this work.
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_19, © Springer Science+Business Media, LLC, part of Springer Nature 2020
257
258 Guillaume Née et al.
3
SN SN
Bait protein
2.5
complexes
GFP protein
-log10 p-value
2
complexes
1.5
Bait enrichment
1
SN SN
0.5
Non-bound
proteins
0
Indirect interaction -4 -3 -2 -1 0 1 2 3 4
log2 fold enrichment
Direct interaction Interacting
Bait protein proteins
P P
Contaminant proteins GFP Contaminant
proteins
Washes
SN SN
No proteins
Enriched bait
protein complexes
LC MS/MS analysis
P P
Contaminant proteins
MS2
Intensity
Elution MS1
Interacting proteins
SN SN
Intensity
GFP GFP
Relative abundance
bait
m/z
Contaminant
proteins m/z
P P
Elution Time
optional
Pull-down procedure analysis Tryptic digest and desalting
EXP CTR EXP CTR
ML
ML
ML
ML
1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4
C18 Stage tip
Trp
Fig. 1 Workflow for the mass spectrometry-based label-free quantification of protein interaction partners of
GFP-tagged proteins. Proteins are extracted from tissue under native conditions. The GFP-tagged protein and
possible interaction partners (EXP) are enriched on an agarose-coupled GFP antibody matrix. Next to the
Quantification of Protein-Protein Interactions with GFP-Trap Beads 259
Fig. 1 (continued) proteins of interest, background-binding proteins (grey and black dots) are copurified.
These unspecific binders are important for the correct normalization of the label-free quantification. After
washing, the bound proteins are eluted by acidic denaturation. The quality of the pull-down can be assessed
by gel electrophoresis and Western blotting. The eluted protein samples can be directly processed by tryptic
digestion followed by a C18 desalting steps. Peptide samples are analyzed by LC-MS/MS and the acquired
spectra are processed by computational analysis using MaxQuant [32, 33]. Label-free quantification values
(LFQ) are used to calculate the fold-enrichment and a t-test p-value from the biological replicates. Data can be
presented as a volcano plot, in which the interacting protein (blue dots), which satisfy quality thresholds (e.g.,
adjusted p-value <0.05 and at least twofold enrichment), appear on the upper right part, while the unspecific
GFP-agarose interactors (black dots) are located outside the threshold limits. SN supernatant, P pellet, ML
molecular ladder, Trp trypsin
260 Guillaume Née et al.
2 Materials
2.1 Plant Tissue 1. Leaf tissue harvested from Arabidopsis thaliana plants expres-
sing the bait-GFP protein and leaf tissue harvested from Ara-
bidopsis thaliana plant lines expressing a free GFP (see Note 1).
3 Methods
3.2.2 Western Blot 1. Rinse the gel with water and transfer it to a container filled with
Analysis Western blot transfer buffer.
2. Cut a nitrocellulose membrane to the size of the gel and
immerse it with Western blot transfer buffer (see Note 20).
3. Soak four to eight pieces of Whatman filter paper (same size as
the nitrocellulose membrane) in the transfer buffer and place
them onto the anode side of the transfer apparatus. Position
the nitrocellulose membrane onto the Whatman paper. Place
the gel on top of the membrane and place four to eight pieces
of Whatman filter paper soaked in the transfer buffer on top (see
Note 21).
4. Assemble the blotting system and transfer proteins at 1.2 mA
per cm2 of membrane for 1 h.
5. After transfer, carefully recover the membrane and place it,
protein face up, into a clean plastic container.
Quantification of Protein-Protein Interactions with GFP-Trap Beads 265
4 Notes
Acknowledgments
References
1. Bolger M, Schwacke R, Gundlach H et al 12. Zhang Y, Beard KF, Swart C (2017) Protein-
(2017) From plant genomes to phenotypes. J protein interactions and metabolite channel-
Biotechnol 261:46–52 ling in the plant tricarboxylic acid cycle. Nat
2. Schatz MC, Witkowski J, Mccombie WR Commun 8:15212
(2012) Current challenges in de novo plant 13. Liebert MA, Rouhier N, Villarejo A et al
genome assembly. Genome Biol 13:2–7 (2005) Identification of plant glutaredoxin tar-
3. Hümann U, Woetzel S, Madrid-Herrero E et al gets. Antioxid Redox Signal 7:919–929
(2017) Improving and correcting the contigu- 14. Hao Y, Wang H, Qiao S et al (2016) Histone
ity of long-read genome assemblies of three deacetylase HDA6 enhances brassinosteroid
plant species using optical mapping and chro- signaling by inhibiting the BIN2 kinase. Proc
mosome conformation capture data. Genome Natl Acad Sci U S A 113:2–7
Res 27:778–786 15. Jones AM, Xuan Y, Xu M et al (2014) Border
4. Bontinck M, Van Leene J, Gadeyne A et al control — a membrane-linked interactome of
(2018) Recent trends in plant protein complex Arabidopsis. Science 1:711–717
analysis in a developmental context. Front 16. Krishnakumar V, Hanlon MR, Contrino S et al
Plant Sci 9:1–14 (2015) Araport: the Arabidopsis information
5. Sako K, Yanagawa Y, Kanai T et al (2014) portal. Nucleic Acids Res 43:D1003–D1009
Proteomic analysis of the 26S proteasome 17. Luhua S, Hegie A, Suzuki N et al (2013) Link-
reveals its direct interaction with transit pep- ing genes of unknown function with abiotic
tides of plastid protein precursors for their deg- stress responses by high-throughput pheno-
radation. J Proteome Res 13:3223–3230 type screening. Physiol Plant 148:322–333
6. Dose A, Sindlinger J, Bierlmeier J et al (2016) 18. Née G, Kramer K, Nakabayashi K et al (2017)
Interrogating substrate selectivity and compo- DELAY of GERMINATION1 requires PP2C
sition of endogenous histone deacetylase com- phosphatases of the ABA signalling pathway to
plexes with chemical probes. Angew Chem Int control seed dormancy. Nat Commun 8:1–8
Ed 55:1192–1195 19. König A, Hartl M, Pham PA et al (2014) The
7. Inoshima MM, Ikuchi KK (2015) Chemical Arabidopsis class II sirtuin is a lysine deacety-
tools for probing histone deacetylase (HDAC) lase and interacts with mitochondrial energy
activity. Anal Sci 31:287–292 metabolism. Plant Physiol 164:1401–1414
8. Kramer K, Finkemeier I, Humpf H et al (2016) 20. Trigg SA, Garza RM, MacWilliams A et al
The SAGA complex in the rice pathogen Fusar- (2017) CrY2H-seq: a massively multiplexed
ium fujikuroi: structure and functional charac- assay for deep-coverage interactome mapping.
terization. Mol Microbiol 102:951–974 Nat Methods 14:819–825
9. Gao X, Liu CZ, Li DD et al (2016) The Arabi- 21. Bock R (2016) Lighting the way to protein-
dopsis KIN βγ subunit of the SnRK1 complex protein interactions: recommendations on best
regulates pollen hydration on the stigma by practices for bimolecular fluorescence comple-
mediating the level of reactive oxygen species mentation analyses. Plant Cell 28:1002–1008
in pollen. PLoS Genet 12(7):e1006228 22. Senkler J, Senkler M, Eubel H et al (2017) The
10. Editor D (2014) Dual-targeting of Arabidopsis mitochondrial complexome of Arabidopsis
chloroplasts and peroxisomes involves interac- thaliana. Plant J 89:1079–1092
tion with Trx m2 in the cytosol. Mol Plant 23. Cui B, Fang S, Xing Y et al (2015) Crystallo-
7:252–255 graphic analysis of the Arabidopsis thaliana
11. Garagounis C, Kostaki K, Hawkins TJ et al BAG5 – calmodulin protein complex research
(2017) Microcompartmentation of cytosolic communications. Acta Crystallogr F Struct
aldolase by interaction with the actin cytoskel- Biol Cryst Commun 71:870–875
eton in Arabidopsis. J Exp Bot 68:885–898 24. Bai Y (2015) Detecting protein-protein inter-
actions by gel filtration chromatography. In:
Quantification of Protein-Protein Interactions with GFP-Trap Beads 271
Abstract
Cross-linking converts noncovalent interactions between proteins into covalent bonds. The now artificially
fused molecules are stable during purification steps (e.g., immunoprecipitation). In combination with a
variety of techniques, including Western blotting, mass spectrometry (MS), and bioinformatics, this
technology provides improved opportunities for modelling structural details of functional complexes in
living cells and protein–protein interaction networks. The presented strategy of immunoaffinity purification
and mass spectrometry (AP-MS) coupled with in vivo cross-linking can easily be adapted as a robust
workflow in interactome analyses of various species, also nonmodel organisms.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_20, © Springer Science+Business Media, LLC, part of Springer Nature 2020
273
274 Heidi Pertl-Obermeyer and Gerhard Obermeyer
2 Materials
2.1 Plant Material All plant tissues or adequate plant cell cultures, for example, pollen
cultures from lily (Lilium longiflorum Thunb.), thale cress (Arabi-
dopsis thaliana), tobacco (Nicotiana tabacum), and tomato (Sola-
num lycopersicum) or seedlings as well as liquid cultures from
specific plant tissues or organs (explants from roots, stems, leaves,
flowers) as well as yeast cell cultures can be used for in vivo cross-
linking experiments.
Interactome Analysis in Living Cells 275
250-- 250--
250--
150-- 150--
150--
100-- 100-- *
75-- 100--
75-- PM H+
75-- ATPase
50--
50--
50--
37-- 37-- 37--
14-3-3s
25-- 25-- 25--
M
250-- 250--
250--
150-- 150-- 150--
100-- 100-- * 100--
PM H+
75-- 75-- 75-- ATPase
50-- 50-- 50--
37-- 37-- 37--
14-3-3s
25-- 25--
25--
M 0 10 20 30 45 60 min 0 10 20 30 45 60 min 0 10 20 30 45 60 min
c cell lysates (0.5% PFA, 20 min) cell lysate (0.5 % PFA, 20 min) cell lysate (0.5 % PFA, 20 min)
kDa kDa kDa
250--
148-- 250-- 250--
150-- 150--
100-- * 100--
98-- 75-- 75--
64-- 50-- 50--
50-- 37-- 37--
14-3-3s
36--
25-- 25--
22--
M 15 5 10 15 20 30 min 15 5 10 15 20 30 min 5 10 15 20 30 min
RT 95 °C RT 95 °C 95 °C
Fig. 1 Optimization of in vivo cross-linking of lily pollen grains. (a) To determine the optimal PFA concentration
pollen grains from 5 flowers (~0.5 g fresh weight) were incubated in 0.0625–1% (w/v) PFA for 20 min at room
temperature. Cell lysate proteins were then separated by SDS-PAGE and visualized by Coomassie stain (left)
and analyzed by immunodetection with monoclonal anti-14-3-3 antibody (middle) and monoclonal anti-PM H+
ATPase antibody (right). (b) To define the optimal incubation time pollen grains were incubated for 10–60 min
in 0.5% (w/v) PFA at RT. Incubation times of 20 min and more resulted in band smearing, which is an
indication that the cross-linking is too excessive. 0 ¼ untreated pollen grains; 5 μl cell lysate were loaded per
gel lane; (a) and (b) 15 min denaturing of samples at RT prior SDS-PAGE. (c) To reverse PFA cross-links, cell
lysates (pollen grains in 0.5% (w/v) PFA for 20 min) were incubated in 6 sample loading buffer, incubated for
5–30 min at 95 C and analyzed by CBB (Coomassie Brilliant Blue R250) or Western Blot. A shift from 14-3-3
proteins cross-linked in complexes (∗) to 14-3-3 monomers and a losing of high molecular weight PM H+
ATPase complexes (arrow) are clearly visible. 5 μl cell lysate were loaded per gel lane [4]
276 Heidi Pertl-Obermeyer and Gerhard Obermeyer
2.4 Immunoaffinity Prepare fresh solutions for each purification. Solutions can be kept
Purification at room temperature, if not otherwise indicated.
1. Magnetic beads (see Note 7).
2. Magnetic particle concentrator.
3. PBS, pH 7.4.
4. Wash Buffer: PBS + 0.1% (w/v) BSA (bovine serum albumin),
0.01 g BSA and fill up to 10 ml with PBS. Store at 4 C.
5. Storage Buffer: PBS + 0.1% (w/v) BSA + 0.02% (v/v) NaN3,
weigh 0.01 g BSA, add 20 μl of 1 M NaN3 stock solution and
fill up to 10 ml with PBS. Store at 4 C.
Interactome Analysis in Living Cells 277
2.5 In-Solution Always use the highest grade reagents available. To avoid keratin
Trypsin Digestion contamination of the samples, wear powder-free gloves and work
clean. To avoid contaminations with released softening agents do
not use autoclaved tips and tubes.
1. 10 mM Tris–HCl, pH 8.0: Weigh 0.121 g Tris and make up to
100 ml with double distilled water. Adjust pH value with 1 N
HCl. Store at room temperature.
2. UTU Buffer: 6 M urea, 2 M thiourea, in 10 mM Tris–HCl,
pH 8.0, weigh 9.009 g urea, 3.806 g thiourea and fill up to
25 ml with 10 mM Tris–HCl, pH 8.0. Store as 1 ml aliquots at
20 C.
3. Reduction buffer: 6.5 mM DTT (DL-dithiothreitol), weigh
5 mg DTT and add 5 ml of 10 mM Tris–HCl, pH 8.0. Store
500 μl aliquots at 20 C.
4. Alkylation buffer: 27 mM iodoacetamide (IAA), weigh
2.49 mg iodoacetamide and add 5 ml of 10 mM Tris–HCl,
pH 8.0. Keep the 500 μl aliquots protected from light at
20 C.
5. Endoproteinase Lys-C: 0.5 μg/μl stock solution, dissolve
15 μg Lys-C in 30 μl resuspension buffer (provided by manu-
facturer). Keep at 20 C for short-term storage or at 80 C
for long-term storage.
6. Sequencing grade modified trypsin: 0.5 μg/μl stock solution,
dissolve 20 μg trypsin in 40 μl resuspension buffer (provided by
manufacturer). Keep at 20 C for short-term storage or at
80 C for long-term storage.
278 Heidi Pertl-Obermeyer and Gerhard Obermeyer
2.6 Peptide Prepare fresh solutions and store them at room temperature. Use
Desalting with C18- analytical grade or hypergrade agents.
StageTips 1. Acetonitrile (ACN).
2. Buffer A: 5% (v/v) ACN, 0.1% (v/v) TFA, to 9.4 ml ultra-pure
water add 500 μl ACN, and 100 μl of 10% (v/v) TFA in a glass
bottle.
3. Buffer B: 80% (v/v) ACN, 0.1% (v/v) TFA, to 1.9 ml ultrapure
water add 8 ml ACN, and 100 μl of 10% (v/v) TFA in a glass
bottle.
4. C18 StageTips (see Note 9).
5. Centrifuge.
6. Spin adaptors (Fig. 2).
7. Vacuum concentrator.
3 Methods
3.1 Formaldehyde This work flow was developed to study putative interaction partners
Cross-Linking and for the PM H+ ATPase in lily pollen [4]. Although the following
Preparation of Cell protocol uses pollen cultures as starting material, it can be easily
Lysates adapted to and optimized for other plant tissues or cell cultures.
1. Incubate pollen grains of 5 flowers (~0.5 g fresh weight) in
10 ml germination medium (Med B) for 10 min at room
temperature in petri dishes.
2. For optimization of the cross-linking reactions, incubate pollen
grains in germination medium with different formaldehyde
concentrations (0.0625–1% (w/v) PFA (see Fig. 1.a) (see
Note 10) for 20 min at room temperature. In addition, it is
very important to define the optimal incubation time for the
cross-linking reaction (see Fig. 1b). Therefore, incubate pollen
grains in the before tested optimal PFA concentration (e.g.,
0.5% (w/v) PFA) for 0, 10, 20, 30, 45, and 60 min at room
temperature. Keep controls (¼ untreated, not cross-linked pol-
len grains, 0% PFA, 0 min) on ice.
3. Stop the cross-linking reactions by adding glycine to a final
concentration of 125 mM (see Note 11) and incubate for
10 min at room temperature.
4. Filter the pollen grain culture through a 8 μm filter using a
vacuum filtration unit and immediately transfer the remainder
on the filter (¼ pollen grains) with a spatula into a prechilled
glass beaker filled with liquid nitrogen. Avoid thawing of the
pollen grains!
5. Homogenize the pollen grains in liquid N2 using a precooled
mortar and pestle to a very fine powder. Avoid thawing of the
powder!
6. Transfer the fine powder with a spatula into a precooled (liquid
N2) 1.5 ml microcentrifuge tube. Let the liquid nitrogen evap-
orate and cautiously add 1 ml ice-cold Lysis buffer (see Note
12).
7. Incubate on a rotational shaker at 4 C for 10 min.
8. Centrifuge at 14,000 g for 10 min at 4 C.
9. After centrifugation, collect the supernatant (¼ cell lysate)
using a pipette or a syringe with a fine needle. Do not disturb
the pellet.
10. Analyze the cross-linking reactions by SDS-PAGE and/or
immunodetection (see Fig. 1a and b.). In order to analyze the
sample by mass spectrometry, reverse the PFA cross-links of the
cell lysates by incubating the cell lysates in 6 sample loading
280 Heidi Pertl-Obermeyer and Gerhard Obermeyer
3.2 Covalent This protocol uses a monoclonal primary antibody for immunopre-
Coupling of Antibodies cipitation, but can be used for polyclonal antibodies, too.
to Magnetic Beads
1. Resuspend the magnetic beads in the original vial by briefly
vortexing and transfer the required amount of beads
(100–500 μl beads) into a 1.5 ml microcentrifuge tube (see
Note 13).
2. Place the tube in the particle concentrator (¼ magnet) for
2 min at room temperature.
3. Suck out the supernatant and discard. Do not touch the col-
lected magnetic beads inside the tube with the pipette tip.
4. Remove the tube from the magnet and add 1 ml PBS, pH 7.4
to the beads. Mix well by pipetting up and down.
5. Repeat steps 2–4 three times in total. Take a 10 μl aliquot (¼
“untreated beads”; washed beads before antibody capture, see
Fig. 3) for SDS-PAGE analysis.
6. Place the tube on the magnet for 2 min at room temperature.
The now washed magnetic beads are ready for capture of target
Ig (¼ primary antibody).
7. Discard the supernatant and add 100–1000 μl primary mono-
clonal antibody to the corresponding amount of magnetic
beads into the tube, for example, to 500 μl washed magnetic
beads 1 ml monoclonal antibody against a PM H+ ATPase
(hybridoma clone 46E5B11F6) [7] is added (see Note 14).
8. Incubate tube with slow rotation mixing for 2 h at 4 C.
9. Place the tube on the magnet for 2 min at room temperature.
10. Take out the supernatant (¼ unbound primary antibody) and
store at 4 C (see Note 15).
11. Wash magnetic beads with 1 ml PBS, pH 7.4 by slowly pipet-
ting up and down.
12. Place the tube on the magnet for 2 min at room temperature.
13. Remove from the magnet and pipette off the supernatant and
discard.
14. Repeat steps 11–14 three times in total. Take a 5 μl aliquot
from the third washing step (¼ “beads – DMP”; magnetic
beads before DMP treatment, see Fig. 3) for SDS-PAGE
analysis.
15. Place the tube on the magnet for 2 min, pipette off the super-
natant and discard. Remove the tube from the magnet and add
Interactome Analysis in Living Cells 281
kDa
Xlink Ig to beads mock elution
230--
150--
100--
80--
60--
50--
40--
30--
25--
20--
15--
beads + DMP
beads - DMP
untreated beads
Ab W1 W3 E1 E2
21. Place the tube on the magnet for 2 min and discard
supernatant.
22. Wash beads with 1 ml Wash Buffer by cautiously pipetting
up-and-down.
23. Repeat steps 21–22 three times in total. Take a 5 μl aliquot
from the third washing step (¼ “beads + DMP”; magnetic
beads after DMP treatment, see Fig. 3) for SDS-PAGE analysis.
24. Resuspend the beads in 1 ml Storage Buffer and store the now
activated beads at 4 C.
15. Place the tube on the magnet for 2 min, pipette off the super-
natant (¼ unbound protein) and transfer into a 1.5 ml micro-
centrifuge tube. Keep the tube on ice.
16. Remove the tube from the magnet, add 1 ml Lysis Buffer and
carefully wash beads with captured target proteins for 2 min by
cautiously vortexing.
17. Place the tube on the magnet for 2 min, and transfer superna-
tant into fresh 1.5 ml tubes (wash fraction, W1).
18. Repeat steps 16–17 three times in total. Keep the wash frac-
tions on ice (W1–W3).
19. Remove tube from the magnet, and add 200 μl Elution Buffer
to the beads. Vortex gently for 2 min.
20. Place the tube on the magnet for 2 min.
21. Transfer the supernatant into a fresh 1.5 ml microcentrifuge
tube and immediately add 20 μl of saturated 1 M Tris solution
to the elution fraction. Keep the tube on ice (¼ E1, eluate 1).
22. Repeat steps 19–21 for second eluate (¼ E2, eluate 2).
23. Store all samples at 20 C until used for analysis via immu-
nodetection and/or mass spectrometry.
24. Remove the tube from the magnet, add 1 ml PBS, pH 7.4, and
wash beads by slowly pipetting up-and-down.
25. Place the tube on the magnet for 2 min, and pipet off the
supernatant and discard.
26. Remove the tube from the magnet, and add 1 ml Storage
Buffer and store beads at 4 C.
3.5 Peptide 1. Insert the 200 μl C18-StageTip into a spin adapter (see Fig. 2)
Desalting and and place it into a fresh 2 ml microcentrifuge tube where the lid
Purification has been removed.
2. Place into a centrifuge and load 50 μl of Buffer B into the C18
tip for equilibration. Centrifuge at 1000 g for 2 min at RT.
3. Load 100 μl of Buffer A into the C18 tip, and centrifuge at
1000 g for 3 min at RT.
4. Repeat step 3.
5. Remove the tube from the centrifuge, discard the waste from
the microcentrifuge tube, and insert the tube with the C18
back into the centrifuge.
6. Load the tryptic peptide mixture into the C18 tip. Centrifuge
at 2650 g for 5 min at RT. If necessary repeat this step
to force the sample solution through.
7. Load 100 μl of Buffer A into the C18 tip for washing, and
centrifuge at 1000 g for 3 min at RT.
8. Repeat step 7.
9. Remove the tube from the centrifuge and discard the waste
from the microcentrifuge tube. Place the C18 tip with the spin
adapter into a fresh 1.5 ml microcentrifuge tube, and insert the
tube into the centrifuge.
10. Load 20 μl Buffer B into the C18 tip to elute the bound tryptic
peptides from the C18 matrix. Centrifuge at 1000 g for
2 min at RT.
11. Repeat step 10.
12. Spin down the final volume of the eluate (¼ 40 μl) to dryness in
a centrifugal vacuum concentrator (~ 45 min).
13. Store samples at 80 C until used for LC-MS/MS.
Interactome Analysis in Living Cells 285
4 Notes
Acknowledgments
References
1. Miernyk JA, Thelen JJ (2008) Biochemical with K15NO3 as a tool for quantitative
approaches for discovering protein-protein analysis of proteins and metabolites. Plant Meth-
interactions. Plant J 53:597–609 ods 2:14
2. Engelsberger WR, Erban A, Kopka J et al 3. Vasilescu J, Guo X, Kast J (2004) Identification
(2006) Metabolic labeling of plant cell cultures of the protein-protein interactions using in vivo
Interactome Analysis in Living Cells 287
cross-linking and mass spectrometry. Proteomics 6. Rappsilber J, Mann M, Ishihama Y (2007) Pro-
4:3845–3854 tocol for micro-purification, enrichment,
4. Pertl-Obermeyer H, Schulze WX, Obermeyer G pre-fractionation and storage of peptides for
(2014) In vivo cross-linking combined with proteomics using StageTips. Nat Protoc
mass spectrometry analysis reveals receptor-like 2:1896–1906
kinases and Ca2+ signalling proteins as putative 7. Villalba JM, Lützelschwab M, Serrano R (1991)
interaction partners of pollen plasma membrane Immunolocalisation of the plasma membrane
H+ ATPases. J Proteome 108:17–29 H+ ATPase in maize coleoptiles and enclosed
5. Pertl-Obermeyer H, Obermeyer G (2013) leaves. Planta 185:458–461
Pollen-cultivation and preparation for proteome
studies. Methods Mol Biol 1072:435–449
Chapter 21
Abstract
Tomato is a major crop plant and an important constituent of the human diet. Exclusive features such as
bearing fleshy fruits and undergoing a phase transition from partially photosynthetic to fully heterotrophic
metabolism make tomato fruit a model system for fruit development studies. Although the tomato genome
has been completely sequenced, functional proteomics studies are still at their starting stage. Proteomics
technologies, especially the combination of multiple approaches, provide a very powerful tool to accurately
identify functional proteins and investigate certain sets of proteins in more detail. The direct binding of
plant 14-3-3 proteins to their multiple target proteins modulates the functions of the latter, suggesting that
these 14-3-3 proteins are directly involved in various physiological pathways. This chapter outline methods
for the identification of 14-3-3 protein complexes in tomato fruit tissues. These methods include detailed
protocols for protein extraction, coimmunoprecipitation, SDS-PAGE, SYPRO Ruby staining, in-gel trypsin
digestion, and LC-MS/MS analysis for 14-3-3 interactomics.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_21, © Springer Science+Business Media, LLC, part of Springer Nature 2020
289
290 Yongming Luo et al.
Fig. 1 Illustration of plant 14-3-3 protein functions in various physiological pathways. The 14-3-3 proteins
preferentially form saddle-shaped homo- or heterodimers, in which a broad central groove is able to bind to
target proteins. These 14-3-3 proteins bind to the target protein mainly through the phosphorylated 14-3-3
binding motifs in the latter
2 Materials
2.1 Plant Material The tomato (Solanum lycopersicum L.) cultivar Micro-Tom, expres-
sing FLAG tag-fused 14-3-3λ driven by a CaMV 35S promoter
(FLAG-14-3-3λ), and wild-type Solanum lycopersicum L were
employed [10] (see Note 1). Tomato plants are grown in 9 cm
pots containing peat moss-based soil Jiffy-Mix soil (Sakata Seed,
Japan), supplemented with HYPONeX nutrient mixture (N:P:
K ¼ 6:10:5) (HYPONeX JAPAN, Japan), at 25 C and 16:8 h
light–dark cycles .
3 Methods
3.1 Immunopreci- 1. Harvest the expanding green tomato fruit from WT and
pitation of 14-3-3 FLAG-14-3-3λ plants and grind the fruits in liquid nitrogen
Complex from Tomato using mortar and pestle [10] (see Note 2).
Fruit Tissue 2. Equilibrate 20 μL of anti-FLAG M2 magnet beads for each
sample with extraction buffer at least for two times to remove
the stock buffer.
3. Add 3 μL protein extraction buffer supplemented with inhibi-
tors per mg fresh weight of tomato fruit; transfer the suspen-
sion to a prechilled 1.5 mL tube and place on ice.
4. Centrifuge twice at 20,000 g for 5 min at 4 C and remove
the insoluble residues.
5. Determine the protein concentration in the supernatant by the
Bradford method and transfer lysates containing 3 mg proteins
to new 1.5 mL tubes. Add sufficient extraction buffer to equal-
ize lysate volume between negative control (WT) and FLAG-
14-3-3λ.
6. Add 20 μL anti-FLAG M2 magnetic beads to the protein
extracts.
7. Incubate samples at 4 C for 1 h on a rotary shaker.
8. Spin down magnet beads particles at 8000 g for 30 s at 4 C
and discard the supernatant by placing the tube in the appro-
priate magnetic separator. Then wash the beads for three times
with wash buffer and add 140 μL of wash buffer.
9. Add 60 μL of 3FLAG peptides solution and incubate at 4 C
for 1 h on a rotary shaker to elute the FLAG-14-3-3λ proteins
from the beads.
10. Centrifuge at 8000 g for 30 s at 4 C and transfer the
supernatant into a new 1.5 mL tube by placing the tube in
the appropriate magnetic separator.
14-3-3 Interactome in Tomato Fruit 293
3.3 In-Gel Trypsin 1. Excise the entire gel of each lane with a clean scalpel and chop
Digestion the excised gels into pieces of approximately 1 1 mm; transfer
these pieces into a 1.5 mL microcentrifuge tube (see Notes 5–7).
2. Add 400 μL 100% acetonitrile to each tube and shake for
15 min at room temperature (see Note 6).
3. Remove the acetonitrile and dry the gel with a vacuum centri-
fuge (approx. 15 min at 37 C).
4. Add 400 μL reduction solution to each tube and allow the dry
gel pieces to soak by shaking at 56 C for 45 min.
5. Cool the tube to room temperature, remove the reduction
solution, and add 400 μL alkylation solution; shake for
30 min in the dark.
294 Yongming Luo et al.
6. Discard the alkylation solution and wash the gel samples with
400 μL gel washing buffer for 10 min.
7. Discard the washing buffer and dehydrate again with 400 μL
dehydration solution for 10 min. Repeat the dehydration
protocol.
8. Dry the gel with a vacuum centrifuge for 15 min at 37 C.
9. Add a sufficient amount of trypsin solution to each dried gel
samples, making sure the trypsin solution covers all the gel
pieces, and incubate at 37 C for 16 h (see Note 8).
10. Add 100 μL extraction solution I and shake for 30 min at room
temperature. Transfer the supernatant to a new tube (see
Note 9). Repeat this procedure with extraction solution II.
11. Dry the solution with a vacuum centrifuge (approx. 1.5 h at
37 C).
12. Add 20 μL dissolving solution to dissolve the dried peptides,
and filter each with an Ultrafree-MC Centrifugal Filter to avoid
contamination of gel pieces.
4 Notes
Acknowledgments
References
1. Tohge T, Alseekh S, Fernie AR (2014) On the for growth phase transition in Arabidopsis
regulation and function of secondary metabo- seedlings. Plant J 60:852–864
lism during fruit development and ripening. J 7. Sato T, Maekawa S, Yasuda S et al (2011)
Exp Bot 65:4599–4611 Identification of 14-3-3 proteins as a target of
2. Shikata M, Hoshikawa K, Ariizumi T et al ATL31 ubiquitin ligase, a regulator of the C/N
(2016) TOMATOMA update: phenotypic response in Arabidopsis. Plant J 68:137–146
and metabolite information in the Micro-Tom 8. Yasuda S, Sato T, Maekawa S et al (2014)
mutant resource. Plant Cell Physiol 57:e11 Phosphorylation of Arabidopsis ubiquitin
3. The Tomato Genome Consortium (2012) The ligase ATL31 is critical for plant carbon/nitro-
tomato genome sequence provides insights gen nutrient balance response and controls the
into fleshy fruit evolution. Nature stability of 14-3-3 proteins. J Biol Chem
485:635–641 289:15179–15193
4. Oecking C, Jaspert N (2009) Plant 14-3-3 9. Yasuda S, Aoyama S, Hasegawa Y et al (2017)
proteins catch up with their mammalian ortho- Arabidopsis CBL-interacting protein kinases
logs. Curr Opin Plant Biol 12:760–765 regulate carbon/nitrogen-nutrient response
5. Chang IF, Curran A, Woolsey R et al (2009) by phosphorylating ubiquitin ligase ATL31.
Proteomic profiling of tandem affinity purified Mol Plant 10:605–618
14-3-3 protein complexes in Arabidopsis thali- 10. Lu Y, Yasuda S, Li X et al (2016) Characteriza-
ana. Proteomics 9:2967–2985 tion of ubiquitin ligase SlATL31 and proteo-
6. Sato T, Maekawa S, Yasuda S et al (2009) mic analysis of 14-3-3 targets in tomato fruit
CNI1/ATL31, a RING-type ubiquitin ligase tissue (Solanum lycopersicum L.). J Proteome
that functions in the carbon/nitrogen response 143:254–264
Chapter 22
Abstract
Most organisms are diploid, meaning they only have two copies of each chromosome (one set inherited
from each parent). Polyploid organisms have more than two paired (homologous) sets of chromosomes.
Many plant species are polyploid. Polyploid species cope better with stresses thanks to the redundancy in the
chromosome copy number and dispose in this way a greater flexibility in gene expression. Allopolyploid
species are polyploids that contain an alternative set of chromosomes by the cross of two (or more) species.
Gene variants unique for a preferential phenotype are most probable candidate markers controlling the
observed phenotype. Organ or tissue-specific silencing or overexpression of one parental homeolog is quite
common. It is very challenging to find those tissue-specific gene variants. High-throughput proteomics is a
successful method to discover them. This chapter proposes two possible workflows depending on the
available resources and the knowledge of the species. An example is given for an AAB hybrid and an ABB
hybrid. Allele-specific gene responses are picked up in this workflow as gene loci displaying genotype-
specific differential expression that often have single amino acid polymorphisms. If the resources are
sufficient, a genotype-specific mRNAseq database is recommended where a link is made to the allele-
specific transcription levels. If the resources are limited, allele-specific proteins can be detected by the
detection of genotype-specific peptides and the identification against existing genomics libraries of the
parents.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_22, © Springer Science+Business Media, LLC, part of Springer Nature 2020
297
298 Sebastien Christian Carpentier
Fig. 1 Example of an allele-specific protein in an AAB hybrid. The A genome is depicted in blue, the B genome
in Red. The sequence of this on chromosome 11 encoded protein is depicted and shows 2 SAAP indicated in
yellow
allows for SNP calling [5]. Yet most current-day read mapping
software have difficulties to process complex (polyploid) genomes.
The read mapping efficiency to the reference genome might be
biased, and the degree of heterozygosity greatly increases compu-
tational effort, hampering quantitative results per genome
[7]. Consequently, the RNA reads are not separated and traced
back to their (sub)genome. Algorithms like PolyCat and HANDS2
process reads based on classification toward their genome of origin
but heavily depend on the presence and the quality of reference
genomes [7, 8]. Since not all cultivars of interest carry the reference
sequence, mapping efficiency biases can still occur when one refer-
ence genome is more closely linked to a constituting genome than
the other [7]. Proteomics allows for picking up and quantifying the
actual differential products without previous genome knowledge
[3, 9]. 2DE-based proteomics is very useful to identify allele-
specific protein isoforms in complex protein families [10]. This is
exemplified in banana for the HSP70 protein family [4]. However,
2DE is no longer the tool of choice in high-throughput differential
proteomics because of the labor and time involved in producing
2DE gels. Via LC-MSMS multiple allele-specific products (tryptic
specific peptides) can be quantified in a high-throughput manner
without required prior knowledge [11]. Proteomes clearly provide
insights into the consequences of genomic merging and reorgani-
zation [3, 5, 12–15].
2 Methods
2.1 Workflow 1: No When no resources are available to construct a proper mRNA seq
Resources Available library, genotype-specific proteins can be identified by analyzing a
for mRNA Seq (Fig. 2) sufficient amount of biological replicates and confirming the allele
specificity by identifying the genotypic peptides against a relevant
library. An example is worked out for an AAA and an AAB genotype
of banana/plantain. One is the well-known dessert banana, the
other gives a fruit that is higher in starch content and is consumed
cooked as a starch source. How to identify plantain (AAB) specific
proteins? The plantain genome is not known. A reference genome
is known of a double haploid AA [16] variety that has been rese-
quenced [17] and a reference B genome where the genomic reads
were aligned to the AA reference genome to determine the chro-
mosome locus [18]. The first plantain proteome was published in
2018 [13].
1. Download the publicly available FASTA files of both parent
genotypes AA and BB (e.g., in the case of banana https://
banana-genome-hub.southgreen.fr/).
2. Download the commonly proteomic contaminants (keratins
and trypsin) (https://www.uniprot.org/).
300 Sebastien Christian Carpentier
Fig. 2 Workflow for the identification of allele-specific proteins when no resources are available to generate
mRNA seq libraries. Example of an AAB hybrid
Fig. 3 Label-free quantification of the peptides. The visualization of the peptide abundance in 3D (retention
time, intensity, m/z) helps to confirm the absence of the B allele–specific peptide in the AAA genotype (left
picture). Even if the peptide was not selected for MSMS in a run the abundance pattern confirms that the
peptide is present or absent
2.2 Workflow 2: When sufficient resources are available to construct a proper mRNA
Resources Are seq library, genotype-specific proteins can be conducted by analyz-
Available to Generate ing a sufficient amount of biological replicates and confirming the
mRNA Seq Libraries allele specificity by identifying the genotypic peptides against its
(Fig. 4) own mRNA library. An example is worked out for an AAA and an
ABB genotype of banana. How to identify B genome–derived
proteins in the ABB genotype?
302 Sebastien Christian Carpentier
Table 1
Example of protein inference between allelic isoforms
Match in Spectral
Max chromosome countsb
probability Locus Volcano plot Allele
Identified peptidea (%) 1B 1A unique selected AAA AAB specific
ADPNVDFAFCSQSL 100 1 1 1 0 1 B
R
GDPNVDFTFCSQSL 100 1 1 0 10 1 A
R
GLAIISLK 99 1 1 1c 1 0 33 B
SAAG 100 1 1 0 10 1 A
SDGADLHGLAII
SLK
SAAGSDGADLR 100 1 1 1 0 2 B
SCLDACR 100 1 1 0 0 46 80 0
RSIVGETCNQIAR 100 1 1 0 27 4 A
SIVGETCNQIAR 100 1 1 0 40 25 A
SIVSETCNQIAR 100 1 1 1 0 30 B
VVYADAASELR 100 1 1 0 0 3 0d
a
Single amino acid polymorphism is indicated in bold. The SAAP G48A in GDPNVDFTFCSQSLR has been evaluated by
PANTHER as probably damaging
b
Total number of spectral counts for 21 AAA samples and 30 AAB
c
The peptide GLAIISLK should not be present in the AAA sample since in the AA genome the sequence is (H)
GLAIISLK(L) while in the BB genome the sequence is (R)GLAIISLK(L)
d
A BLAST suggests that VVYADAASELR is also allele specific since the AA genome sequence would code for
EVYADAASDLR, but the low spectral counts and the high variability in abundance prevent us from confirming this
peptide
Spectral
Match in chromosome countsb
Maximum probability Locus Volcano plot Allele
Peptidea (%) 6B 6A 7B 6A 9A 4B unique selected AAA AAB specific
APGGCNNPCTVFK 100 1 1 1 0 0 89 131 0
CAADINGQCPAALK 100 1 1 1 0 13 B
CAADINGQCPAALKAPGGCNNPC 100 1 1 1 0 1 B
TVFK
CSYTVWAAAVPGGGR 100 1 1 0 0 71 126 0
DDQTSTFTCPGGANYR 100 1 1 1 0 25 B
DDQTSTFTCPGGTNYR 100 1 1 0 47 74 A
NCPDAYSYPK 100 1 1 1 1 1 0 0 48 92 0
NCPDAYSYPKDDQTSTF 100 1 1 1 0 75 B
TCPGGANYR
NNCPDAYSYPKDDATSTFTCPGG 100 1 1 0 0 19 40 0
TNYR
QLNQGQSWTINVNAGTTGGR 100 1 1 0 0 32 37 0
RNCPDAYSYPK 100 1 1 0 0 7 9 0
RNCPDAYSYPKDDQTSTF 100 1 1 1 0 16 B
TCPGGANYR
TDQYCCNSGSCGPTDYSR 100 1 1 1 0 76 B
(continued)
Identification and Quantification of Homeologs via LC MSMS
303
304
Table 2
(continued)
Sebastien Christian Carpentier
Spectral
Match in chromosome countsb
Maximum probability Locus Volcano plot Allele
Peptidea (%) 6B 6A 7B 6A 9A 4B unique selected AAA AAB specific
TGCSFDGSGR 100 1 1 0 0 193 342 0
TGCSFDGSGRGR 100 1 1 0 0 8 25 0
a
Single amino acid polymorphism is indicated in bold. The SAAP T218A in DDQTSTFTCPGGTNYR has been evaluated by PANTHER as probably benign. A BLAST
additionally confirms that CAADINGQCPAALK is also allele specific since the AA genome sequence would code for CAADINGQCPGALK. A BLAST confirms that
TDQYCCNSGSCGPTDYSR is also allele specific, since the AA genome sequence would code for TDQYCCNSGSCSPTDYSR
b
Total number of spectral counts for 21 AAA samples and 30 AAB
Identification and Quantification of Homeologs via LC MSMS 305
12. Eliminate all peptides that have a 100% match with more than
one gene locus and/or library (see Note 6).
13. Upload your BAM files and the reference genome to visualize
the read count per allele [24].
14. Confirm in Integrative Genomics Viewer (IGV) on read count-
ing that the peptide is unique for the genotype, confirm the
different alleles, and quantify the allele expression (see Note 7).
15. Send all the AA substitutions in the protein to Panther [20]
http://pantherdb.org/tools/csnpScoreForm.jsp to judge the
impact.
16. Send all the differential proteins to Panther [20] for a GO
enrichment.
3 Notes
Acknowledgments
References
18. Davey MW, Gudimella R, Harikrishna JA et al 23. Li H (2011) A statistical framework for SNP
(2013) A draft Musa balbisiana genome calling, mutation discovery, association
sequence for molecular genetics in polyploid, mapping and population genetical parameter
inter-and intra-specific Musa hybrids. BMC estimation from sequencing data. Bioinformat-
Genomics 14:683 ics 27:2987–2993
19. Li W, Godzik A (2006) Cd-hit: a fast program 24. Thorvaldsdóttir H, Robinson JT, Mesirov JP
for clustering and comparing large sets of pro- (2012) Integrative genomics viewer (IGV):
tein or nucleotide sequences. Bioinformatics high-performance genomics data visualization
22:1658–1659 and exploration. Brief Bioinform 14:178–192
20. Thomas PD, Campbell MJ, Kejariwal A et al 25. Benjamini Y, Hochberg Y (1995) Controlling
(2003) PANTHER: a library of protein families the false discovery rate: a practical and powerful
and subfamilies indexed by function. Genome approach to multiple testing. J Royal Stat Soc B
Res 13:2129–2141 57:289–300
21. Dobin A, Davis CA, Schlesinger F et al (2013) 26. Johansson P, Alm R, Emanuelsson C et al
STAR: ultrafast universal RNA-seq aligner. (2006) SPECLUST: a web tool for clustering
Bioinformatics 29:15–21 of mass spectra. J Proteome Res 5(4):785–792
22. Li H, Handsaker B, Wysoker A et al (2009)
The sequence alignment/map format and
SAMtools. Bioinformatics 25:2078–2079
Chapter 23
Abstract
The complexity in chemical composition alongside the genomic complexity of crop plants poses significant
challenges for the characterization of their proteomes. This chapter provides specific methods that can be
used for the extraction and identification of proteins from sweet potato, and a proteogenomic method for
the subsequent peptide mapping on the haplotype-derived sweet potato genome assembly. We outline two
basic methods for extracting proteins expressed in root and leaf tissues for the label-free quantitative
proteomics—one phenol-based procedure and one polyethylene glycol (PEG) 4000-based fractionation
method—and discuss strategies for the organ-specific protein extraction and increased recovery of
low-abundance proteins. Next, we describe computational methods for improved proteome annotation
of sweet potato based on aggregated genomics and transcriptomics resources available in our and public
databases. Lastly, we describe an easily customizable proteogenomics approach for mapping sweet potato
peptides back to their genome location and exemplify its use in improving genome annotations using a mass
spectrometry data set.
Key words Sweet potato proteomics, Phenol protein extraction, Polyethylene glycol protein extrac-
tion, Proteogenomics analysis
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_23, © Springer Science+Business Media, LLC, part of Springer Nature 2020
309
310 Thualfeqar Al-Mohanna et al.
and I. triloba) [3, 4], the proteome annotations of Ipomoea nil [5],
transcriptomics datasets available in public databases [6, 7]. Using a
new technique of haplotype-resolved genome assembly, Yang et al.
[2] de novo sequenced and assembled the hexaploid I. batatas
without the guidance of wild related diploid genomes. The study
examined the probable evolutionary history of the cultivated sweet
potato. As such, sweet potato genome contained two B1 and four
B2 component genomes (B1B1B2B2B2B2) and was proposed to
have resulted from an initial crossing between a tetraploid ancestor
and a diploid progenitor, followed by a whole genome duplication
event [2].
Current extraction methodologies and present technologies
enable analysis of almost complete proteomes in humans and ani-
mals [8]. In plant systems, although discovery and comparative
proteomics approaches have accelerated the pace of breakthroughs
in experimental and crop plants, significant challenges remain. The
depth by which proteomes are being explored and analyzed, and
the means of enhancing the confidence level of protein identifica-
tion continues to be important issues in this field [8, 9]. Conse-
quently, approaches for protein extraction and separation from
tissues are constantly evaluated for performance on parameters
such as protein extraction and detection, accurate quantification,
post-extraction artifacts [10], and amenability to combinatorial
utilization or multiplexing for improving the extraction of low-
abundance proteins and increased throughput [11]. In addition
to these generic challenges, additional hurdles in obtaining protein
preparations of sufficient quality and quantity for mass spectrome-
try have been described for sweet potato, such as low-protein
content in storage roots and high abundance of secondary meta-
bolites in leaves.
Given the rapid advances in genomics and proteomics technol-
ogies, proteogenomic methods are being developed to take advan-
tage of available genomics resources for resolving proteomes as well
as to improve annotations of complex genomes [12, 13]. A pro-
teogenomic analysis of Arabidopsis thaliana uncovered evidence
that 13% of the Arabidopsis proteome was incomplete due to
missing and incorrect gene models including 778 new protein-
coding genes and refined annotation of 695 gene models
[14]. For complex plant genomes with high ploidy, such as sweet
potato, proteogenomics tools can significantly improve genome
annotations while providing a basis for improved peptide
identification.
Recently we have performed a root and leaf LC-MS/MS anal-
ysis and identified 4321 nonredundant proteins from sweet potato
[15]. In this chapter, we describe two methods for protein extrac-
tion and solubilization using sweet potato leaves and tuberous
roots and a proteogenomic method to identify the composition of
leaf and root proteomes using a high-throughput label-free
methodology.
Optimization of Protein Identification for Label-Free Quantitative. . . 311
2 Materials
3 Methods
3.1 Overview In this section, we describe two protein extraction methods and the
of the Protein optimized protocols for sweet potato leaf and root tissue processing.
Extraction The pipeline for sample processing and protein extraction and
and Optimization solubilization is presented in Fig. 1a. Two distinct protocols were
Methodology optimized for the extraction of proteins from lyophilized sweet
potato tissue, a phenol-based Method 1 (M1) and polyethylene
glycol (PEG) 4000 fractionation-based method (M2) [16]. Briefly,
314 Thualfeqar Al-Mohanna et al.
Fig. 1 Methodology for sweet potato protein extraction and identification. (a) Workflow describing the
methodology for tissue processing, the phenol-based procedure (M1) and PEG fractionation (M2). (b and c)
Representative two-dimensional SDS-PAGE of total protein preparations obtained from sweet potato roots.
Total protein extracted using the Phenol-based procedure in shown in (b), and total protein extracted using the
PEG 4000 procedure in shown in (c). Molecular weights (MW, kDa) of proteins in the marker lanes are listed on
the left. (d) Phenol-based procedure for the extraction of proteins from root and leaves showing the upper
phase following phenol extraction
3.1.1 Tissue Collection Sweet potato leaves and storage roots are utilized for protein
extraction. Leaf samples harvested from plants in the vegetative
stage yield abundant protein than leaves from mature plants. Stor-
age roots are collected from fully developed mature plants. Follow-
ing collection, samples are immediately frozen in liquid nitrogen
and stored at 80 C until further processing. Frozen tissues can be
lyophilized or ground to a powder in liquid nitrogen using mortar
and pestle.
3.1.2 Protein Extraction In this section, we describe a phenol-based method for extraction and
Using Phenol Procedure solubilization of proteins from sweet potato root and leaf tissue.
(M1) (See Notes 1–4)
1. Add 4 μL of β-Mercaptoethanol for each mL of extraction
buffer to make the final percent is 0.4%.
2. Weigh 200 mg of powdered sample in 2 mL tube and add
3 BBs in it.
3. Add 750 μL of phenol saturated buffer, pH 8 and 750 μL of
extraction buffer, and vortexing vigorously for 1 h at RT to be
homogeneous.
4. Centrifuge the sample at 15,550 g for 10 min at 4 C. The
results for root and leaf are shown in Fig. 1d.
5. Transfer around 330 μL of the upper phase to a screw top
2 mL tube.
6. Add 5 volumes (1650 μL) of ice-cold precipitation solution
(stored at 20 C), and add 3 BBs in it, vortex the sample
briefly to mix it [5].
7. Store the sample at 20 C overnight (16 h) to precipitate the
proteins.
8. Invert the sample that has Precipitated proteins to mix it.
9. Centrifuge at 3,312 g for 10 min at RT.
10. Decant the supernatant and collect the pellets using a magnetic
stand.
11. Add 1 mL of ice-cold precipitation solution and vortex
vigorously.
12. Centrifuge the sample at 15,550 g for 5 min at RT.
13. Repeat step 10.
14. Wash the pellets with 1 mL 80% ice-cold acetone (stored at
20 C) and vortex vigorously.
316 Thualfeqar Al-Mohanna et al.
3.1.3 Protein Extraction In this section, we describe a PEG 4000-based method for extraction
Using Polyethylene Glycol and solubilization of proteins from sweet potato root and leaf tissue.
Procedure 4000 (M2)
1. Weigh 500 mg of powdered sample in 15 mL tube.
2. Add 5 mL of extraction buffer I, and vortexing vigorously for
15 min at RT to be homogeneous.
3. Centrifuge the sample at 828 g for 15 min at 4 C.
(a) Supernatant: keep it for step 4.
(b) Pellets: add 5 mL of extraction buffer II, and repeat
steps 2 and 3.
4. Filter the supernatant by using a 2.0 μm filter to remove any
impurities or insoluble residues in the supernatant.
5. Add 50% PEG 4000 to make final concentration is 15% in the
supernatant and mix the solution to be homogeneous.
6. Incubate the sample on the ice for 30 min.
7. Centrifuge the sample at 13,250 g for 15 min at 4 C and
keep the supernatant for next step [6, 7].
8. Add cold acetone (stored at 20 C) 2 volumes for sweet
potato root and 4 volumes for sweet potato leaves to precipitate
the proteins.
9. Incubate the sample at 20 C for 30 min.
10. Centrifuge at 15,550 g for 5 min [8].
11. Keep the pellets for the next step.
12. Wash the pellets with 1 mL 80% ice-cold acetone (stored at
20 C) and vortex vigorously.
13. Centrifuge the sample at 15,550 g for 5 min at RT.
14. Decant the supernatant and collect the pellets.
15. Wash the pellets with 1 mL 70% ice-cold ethanol (stored at
20 C) and vortex vigorously.
16. Repeat steps 13 and 14.
17. Resuspend the pellets in 500 μL 70% ethanol and store them at
20 C.
Optimization of Protein Identification for Label-Free Quantitative. . . 317
Fig. 2 Workflow for peptide search and identification by LC-MS/MS. A concatenated database containing
“target” and “decoy” sequences was employed to estimate the false discovery rate (FDR) [20]. Peptide
assignments from the database search were filtered down to a 1% FDR by a logistic spectral score as
previously described [20, 21]
3.2 LC-MS/MS Purified sweet potato protein preparations were identified using
and Peptide LC-MS/MS. The LC-MS/MS analysis was performed as described
Identification in [19] and outlined in Fig. 2. Briefly, the peptides were separated
through a linear reversed-phase gradient through a C18 column.
Survey full scan MS spectra (m/z 400–1800) were acquired at a
resolution of 17,500. The search was performed using MASCOT
v. 2.4 (Matrix Science, Ltd., London, UK). The resulting peptide-
spectrum matches (PSMs) were reduced to sets of unique PSMs by
eliminating lower scoring duplicates. To provide high confidence
data, the MASCOT results were filtered for Mowse Score (>20).
Peptide assignments from the database search were filtered down to
a 1% FDR as previously described [19, 20]. Peptide spectrum
matching of MS/MS spectra was searched against the NCBI Ipo-
moea taxon (txid4119) proteins dataset containing 58,282 pro-
teins (NCBI; downloaded 2/12/2018) using MASCOT v. 2.4
(Matrix Science, Ltd., London, UK).
3.3 Proteogenomic Unique peptides from the LC-MS/MS analysis were queried using
Analysis Workflow tBLASTn against either the haplotype resolved sweet potato
genome or the transcriptome available at http://public-genomes-
ngs.molgen.mpg.de/SweetPotato/DOWNLOADS/. To imple-
ment a new classification method of BLAST search results, that
allow mapping of peptides to existing genome annotations while
allows the discovery of novel peptides, composition-based filtering
was turned off, the word size was decreased to 2, and the e-value
318 Thualfeqar Al-Mohanna et al.
3.4 Proteogenomic Here we describe the computational methods needed to analyze the
Analysis Method (See peptides identified in the proteomics screen. The method assumes that
Note 5) genome and transcriptome annotations are available for matching
and validating identified peptides. The method allows for the discov-
ery and improvement of genomics annotations in the target genome.
3.4.1 Data Input 1. MASCOT output file for individual tissues, extraction meth-
Processing ods, and replicate numbers should first be obtained.
for Proteogenomic Analysis 2. Uniquely identified peptides from all tissues, extraction meth-
ods, and replicates should be parsed, modified to remove fea-
ture annotations, and pooled into a FASTA format query file.
(a) Read MASCOT output files into R using the “read.xlsx”
function in the “openxlsx” package.
(b) Parse unique peptides from read MASCOT files using the
R code: “all_peptides<-unique(c(file1$peptide_column,
file2$peptide_column, filex, peptide_column)”.
Optimization of Protein Identification for Label-Free Quantitative. . . 319
>SWPT_1
MGKGPGLYTDIGKK
>SWPT_2
KKKPVTVSYNGEDKPGFLKK
>SWPT_3
MTLGAGGSSVVVPRN
>SWPT_4
MASLLLPGGRT
...
3.4.2 Blast Peptides 1. Build a nucleotide database from the genome using the “make-
against Genome blastdb” function of BLAST 2.2.31.
and Transcriptome 2. Run tBLASTn to map peptides against the genome using the
Annotations following parameters:
(a) db: nucleotide database created in step 4,
(b) query: peptide query created in step 2,
(c) word_size: 2,
(d) outfmt: 6,
(e) comp_based_stats: F,
(f) evalue: 1000,
(g) max_target_seqs: 50.
320 Thualfeqar Al-Mohanna et al.
3.4.3 Classify Peptides 1. Design a peptide hit classification scheme based on perfect or
and Generate New imperfect matches within the genome or transcriptome and
Annotations (See Note 6) generate the corresponding category lists. Use the following
definitions:
(a) Perfect match:
l Coverage: 100%.
l Mismatches: 0.
l Gaps: 0.
(b) Imperfect match:
l Peptide hit was not previously classified as a perfect
match.
l Coverage: >90%.
l Sequence identity: >80%.
2. Generate lists of classified peptide hits as follows:
(a) Perfect match on the genome and perfect match on the
transcriptome—these are the peptides that support cur-
rent genome sequence and gene model annotations.
(b) Perfect match on the genome and imperfect match on the
transcriptome—this category includes short exons
extensions.
(c) Perfect match on the genome and no match on the tran-
scriptome—this category includes putative coding
sequences in intergenic regions, large exons extensions,
intron retentions, exon alternative ORFs and 50 and
30 UTRs.
(d) Imperfect match on the genome and no match on the tran-
scriptome—this category represents a combination of
events: putative coding sequences in intergenic regions,
exons extensions, intron retentions, exon alternative
ORFs and 50 and 30 UTRs combined with genome
Optimization of Protein Identification for Label-Free Quantitative. . . 321
3.5 Novel Peptide For well annotated genomes, newly identified peptides in shotgun
Analysis proteomics are typically assigned to one of the following categories
and Validation [13, 23]: pseudogenes, lncRNA, intergenic region, exon extension,
322 Thualfeqar Al-Mohanna et al.
Fig. 4 Visualization of sweet potato genome transcriptome and peptide annotations files in the Integrative
Genomics Viewer
4 Notes
Acknowledgments
References
1. International Potato Center (2017) Sweetpo- 7. NCBI (2019) The Sequence Read Archive
tato facts and figures. Accessed 15 Jan 2019. online at: https://www.ncbi.nlm.nih.gov/sra.
http://www.cipotato.org/sweetpotato/ Projects: SRA PRJNA79717, SRA
2. Yang J, Moeinzadeh M-H, Kuhl H et al (2017) PRJEB4145, SRA PRJNA72435
Haplotype-resolved sweet potato genome 8. Aebersold R, Mann M (2016) Mass-
traces back its hexaploidization history. Nat spectrometric exploration of proteome struc-
Plants 33:696–703 ture and function. Nature 537:347
3. Hirakawa H, Okada Y, Tabuchi H et al (2015) 9. Omenn GS, Lane L, Lundberg EK et al (2015)
Survey of genome sequences in a wild sweet Metrics for the human proteome project 2015:
potato, Ipomoea trifida (HBK) G. Don. DNA Progress on the human proteome and guide-
Res 22:171–179 lines for high-confidence protein identification.
4. Wu S, Lau KH, Cao Q et al (2018) Genome J Proteome Res 14:3452–3460
sequences of two diploid wild relatives of 10. Rose JKC, Bashir S, Giovannoni JJ et al (2004)
cultivated sweetpotato reveal targets for Tackling the plant proteome: practical
genetic improvement. Nat Commun 9:4580 approaches, hurdles and experimental tools.
5. Hoshino A, Jayakumar V, Nitasaka E et al Plant J 39:715–733
(2016) Genome sequence and analysis of the 11. Erickson BK, Rose CM, Braun CR et al (2017)
Japanese morning glory Ipomoea nil. Nat A strategy to combine sample multiplexing
Commun 7:13295 with targeted proteomics assays for high-
6. Leinonen R, Sugawara H, Shumway M et al throughput protein signature characterization.
(2010) The sequence read archive. Nucleic Mol Cell 65:361–370
Acids Res 39:D19–D21
324 Thualfeqar Al-Mohanna et al.
12. Jaffe JD, Berg HC, Church GM (2004) Pro- 19. Ahsan N, Belmont J, al CZ (2017) Highly
teogenomic mapping as a complementary reproducible improved label-free quantitative
method to perform genome annotation. Pro- analysis of cellular phosphoproteome by opti-
teomics 4:59–77 mization of LC-MS/MS gradient and analyti-
13. Nesvizhskii AI (2014) Proteogenomics: con- cal column construction. J Proteome
cepts, applications and computational strate- 165:69–74
gies. Nat Methods 11:1114–1125 20. Elias JE, Gygi SP (2007) Target-decoy search
14. Castellana NE, Payne SH, Shen Z et al (2008) strategy for increased confidence in large-scale
Discovery and revision of Arabidopsis genes by protein identifications by mass spectrometry.
proteogenomics. Proc Natl Acad Sci U S A Nat Methods 4:207
105:21034–21038 21. Yu K, Sabelli A, DeKeukelaere L et al (2009)
15. Al-Mohanna T, Ahsan N, Bokros NT et al Integrated platform for manual and high-
(2019) Tissue-specific proteomic and proteo- throughput statistical validation of tandem
genomic analysis of sweetpotato (Ipomoea mass spectra. Proteomics 9:3115–3125
batatas). J Proteome Res 18:2719–2734 22. Zhou K, Panisko EA, Magnuson JK et al
16. Lee DG, Ahsan N, Lee SH et al (2007) A (2008) Proteomics for validation of automated
proteomic approach in analyzing heat- gene model predictions. United States.
responsive proteins in rice leaves. Proteomics Accessed 15 Jan 2019 https://www.osti.gov/
7:3369–3383 servlets/purl/1241230
17. Atha DH, Ingham KC (1981) Mechanism of 23. Zhu Y, Orre LM, Johansson HJ et al (2018)
precipitation of proteins by polyethylene gly- Discovery of coding regions in the human
cols. Analysis in terms of excluded volume. J genome by integrated proteogenomics analysis
Biol Chem 256:12108–12117 workflow. Nat Commun 9:903
18. Ingham KC (1990) Precipitation of proteins 24. Robinson JT, Thorvaldsdóttir H, Winckler W
with polyethylene glycol. Methods Enzymol et al (2011) Integrative genomics viewer. Nat
182:301–306 Biotechnol 29:24
Chapter 24
Abstract
Functional analyses of peroxidases are a major challenge. In silico analysis appears to be a powerful tool to
overcome at least some of the problems that arose from (1) the numerous possible functions of peroxidases,
(2) their low substrate specificity, and (3) the compensation of knockout mutants by other isoenzymes.
Amino acid sequences and crystal structures of peroxidases were used for the prediction of tertiary
structures, posttranslational modifications, ligand and substrate binding sites, and so on of uncharacterized
peroxidases. This protocol presents tools and their applications for an in silico analysis of soluble and
membrane-bound peroxidases, but it may be used for other proteins, too.
Key words AtPrx47, AtPrx64, HRP, Tertiary structure, Topology, Posttranslational modification,
Substrate channel analysis
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_24, © Springer Science+Business Media, LLC, part of Springer Nature 2020
325
326 Sabine Lüthje and Kalaivani Ramanathan
2 Materials
SwissModel
2.7 Interactive Protein images were prepared either by UCSF Chimera v. 1.13.1 or
Visualization by PyMOL v. 2.2. The electrostatics of protein surfaces was calcu-
of Structures lated with the APBS plugin using PyMOL generated PQR and
existing hydrogens and termini. Docking models were prepared
with UCSF Chimera.
3 Methods
3.1 Amino Acid Amino acid sequences (FASTA-Format) of peroxidases were down-
Sequences loaded from PeroxiBase.
3.7 Interactive Protein images have been prepared by PyMOL (Fig. 1a–f) and
Visualization UCSF Chimera (Fig. 1g–o) as described below.
of Structures
3.7.1 PyMOL PyMOL is free for educational use. For publication of PyMOL
images, a license is necessary. Tutorials, Script Library, Plugins
and Commands, and so on are given at https://pymolwiki.org/
index.php/Main_Page.
Tertiary Structure 1. Open the target PDB (e.g., 1hch) in PyMOL v. 2.2 in the
Upper Control Panel, by file and open (see Note 12).
2. Modify the protein structure in the PyMOL Viewer window by
the Object Control Panel, in layer target (e.g., 1HCH): H
(Hide) everything; S (show) cartoon.
3. Visualize cofactors (heme and calcium atoms) of the peroxidase
by the following steps. PyMOL Viewer window, Selection
Tools: S (sequence).
4. PyMOL Viewer window, Selection Tools: Selecting: Atoms.
5. Move the gray bar at the bottom of the sequence to the other
end and select the heme (HEM),
6. Modify the molecule in the PyMOL Viewer window. Use the
Object Control Panel, choose the layer of the selected HEM
(sele): S (Show) sticks, C (Color) grays and gray 60.
7. Deselect the heme by a click to the background in the
Display Area.
8. Rotate the molecule according to the Mouse Legend (left
button and mouse movement). Select δMeso edge, and cen-
ter atom of the heme in the Display Area.
9. Change the color of both atoms in the PyMOL Viewer win-
dow, use the Object Control Panel: layer (sele): C (Color)
reds, red.
10. Deselect by click to background in the Display Area.
11. Select both calcium (CA) atoms at the end of the sequence.
12. Visualize the atoms in the PyMOL Viewer window. Use the
Object Control Panel; choose the layer of the selected CA
(sele): S (Show) spheres, C (Color) cyans, cyan.
13. Deselect by click to background in the Display Area.
14. Highlight β-Sheets in the PyMOL Viewer Window, Use Selec-
tion Tools: Selecting: Residues
15. Select the three amino acids of each β-sheet in the structure
according to the Mouse Control Legend. Mouse Mode:
3-Button Viewing, find β-sheets by rotation of the molecule
(left button and mouse movement). Zoom in if necessary
(right button and mouse movement).
Fig. 1 In silico analyses of AtPrx47 and AtPrx64 in comparison to HRP. Electrostatic potential surface of (a) the
reference protein HRP is shown in comparison to Phyre2 models of (b) AtPrx47 and (c) AtPrx64. Negative
electrostatic potential (red); positive electrostatic potential (blue); neutral electrostatic potential (white). (d)
HRP with helices (pink), β-sheets (yellow), heme (gray sticks), and Ca2+ ions (blue spheres) and four disulfide
bonds (blue sticks). Alignment of models predicted by SwissModel (orange) and Phyre2 (blue) for (e) AtPrx47
and (f) AtPrx64. Alignment of AtPrx47 models by the two prediction programs clearly reveals the missing
β-sheets from the modeling. (g) Ferulic acid binding in HRP, (h) caffeyl alcohol binding in AtPrx47, (i) sinapyl
alcohol binding in AtPrx64. Helices (green), substrates (blue sticks) showing different substrate affinity pattern
based on the channel electrostatics. (j) Active site of HRP with ferulic acid, (k) active site of AtPrx47 with
caffeyl alcohol, (l) active site of AtPrx64 with sinapyl alcohol. Substrate channel of (m) HRP, (n) AtPrx47, and
(o) AtPrx64. Hydrophobic residues (yellow), polar residues (green), basic residues (blue) and acidic residues
(red) are shown. Figures a–f were created by the PyMOL Molecular Graphics system and a–c with the ABPS
plugin. Figures g–o were created by UCSF Chimera Molecular Visualization Application. Further results of in
silico analysis of AtPrx47 and AtPrx64 may be found in Lüthje and Martinez-Cortes [10]
334 Sabine Lüthje and Kalaivani Ramanathan
Alignment 1. Open the target PDBs in PyMOL Upper control panel, by file
and open.
2. PyMOL Viewer window, Object Control Panel, layer all: H
(Hide) everything; S (Show) cartoon.
3. PyMOL Viewer Window, Object Control Panel, layer target 1:
C (Color) for example, blues, blue.
4. PyMOL Viewer Window, Object Control Panel, layer target 2:
C (Color) for example, oranges, orange.
5. PyMOL Viewer Window, Object Control Panel, layer target 1:
A (Action) align to molecule, Object: target 2.
6. Deselect by click to background in the Display Area.
7. Upper control panel, Ray.
8. Upper control window, File, Save Image as, for example, png.
APBS Plugin 1. Open the target in PDB format by PyMOL in the Upper
Control Panel, file and open.
2. Upper Control Panel, choose Plugin and APBS Tools.
3. Use “PyMOL generated PQR and PyMOL generated Hydro-
gens and termini” (see Note 13).
4. Set Grid and Run APBS (see Note 14).
5. Open visualization.
6. Update.
7. In the Molecular Surface Box choose solvent accessible surface
and Show.
8. Upper Control Panel, open File and Save Image as, for exam-
ple, png in the main menu for all Images of interest.
9. Upper Control Panel, type “rotate y, 180” in the command line
to get the back view.
10. Upper Control Panel, type “rotate x, 90” in the command line
to get the top view.
In Silico Analysis of Class III Peroxidases 335
Active Center 1. Select file menu and Open the required PDB model.
2. Select, Structure, Protein.
3. Action, Ribbon, hide.
Surface by Properties 1. Select file menu and Open the required PDB model.
of Residues 2. Open Select menu, select Residue, amino acid category (e.g.,
Hydrophobic).
3. Open Actions menu, select Color, yellow.
4. Open Select menu, select Residue, amino acid category, polar.
5. Open Actions menu, select Color, green.
6. Open Select menu, select Residue, standard amino acids, basic
amino acids: LYS, ARG, HIS.
7. Open Actions menu, select Color, blue.
8. Open Select menu, select Residue, standard amino acids, acidic
amino acids: GLU, ASP.
9. Open Actions menu, select Color, blue.
10. Open Select menu, select residue, all nonstandard, HEM.
11. Open Actions menu, select Color, by heteroatom.
3.8.2 Protein-Heme 1. The Protein Database (PDB) file from SwissModel was used in
Docking PatchDock server v. 1.3 to get a heme bound peroxidase
structure.
2. The receptor molecule and the substrate (heme) molecule are
given as PDB files.
3. Clustering RMSD (Root-Mean-Square Deviation) value is nor-
mally kept as 4.0.
4. An e-mail address is given to get the results.
5. The results obtained are redefined by FireDock server and
ranked based on the ACE (Atomic Contact Energy) values.
6. The best heme bound peroxidase PDB model is chosen for
substrate docking.
3.8.3 SwissDock 1. The heme bound protein (peroxidase) (PDB file) is used as the
Analysis target in the SwissDock server.
2. The substrate (ligand) obtained from ZINC database v. 12.0 is
Protein–Substrate Docking uploaded in MOL2 format.
3. The job name and the e-mail address are given and the docking
procedures are initiated by click on the Start Docking button.
4. A link to a zip file with all docking possibilities is obtained as a
result to the e-mail address.
3.8.4 Visualization 1. UCSF Chimera v 1.13.1 was used for docking analysis.
of Docking Results 2. After opening the required PDB file (target.pdb) obtained
from the SwissDock online tool, the substrate interaction is
analyzed.
3. Under Tools menu, Surface/binding analysis submenu, View
Dock option is used.
4. Open Dock results tab pops up, which helps in selecting the
substrate cluster file, “Clusters.dock4.pdb”.
5. Then the type selection is made by clicking “Dock 4, 5 or 6”
option.
6. This gives all the possible docking results of the substrate.
7. Then the substrate docking is checked manually and the plau-
sible product is decided by its access to the substrate channel
and based on “Delta G values.”
8. This procedure has been repeated for each protein with all
substrates tested (Table 1).
In Silico Analysis of Class III Peroxidases 337
Table 1
Docking analyses of natural and artificial peroxidase substrates
4 Notes
Acknowledgments
References
1. Welinder KG, Justesen AF, Kjaersgard IV et al 9. Shigeto J, Nagano M, Fujita K et al (2014)
(2002) Structural diversity and transcription of Catalytic profile of Arabidopsis peroxidases,
class III peroxidases from Arabidopsis thaliana. AtPrx-2, 25 and 71, contributing to stem lig-
Eur J Biochem 269:6063–6081 nification. PLoS One 9:e105332
2. Zámocký M, Furtmüller PG, Obinger C 10. Lüthje S, Martinez-Cortes T (2018)
(2010) Evolution of structure and function of Membrane-bound class III peroxidases: unex-
class I peroxidases. Arch Biochem Biophys pected enzymes with exciting functions. Int J
500:45–57 Mol Sci 19:E2876
3. Hiraga S, Sasaki K, Ito H et al (2001) A large 11. Artimo P, Jonnalagedda M, Arnold K et al
family of class III plant peroxidases. Plant Cell (2012) ExPASy: SIB bioinformatics resource
Physiol 42:462–468 portal. Nucleic Acids Res 40:W597–W603
4. Passardi F, Cosio C, Penel C et al (2005) Per- 12. UniProt Consortium T (2018) UniProt: the
oxidases have more functions than a Swiss army universal protein knowledgebase. Nucleic
knife. Plant Cell Rep 24:255–265 Acids Res 46:2699
5. Cosio C, Dunand C (2009) Specific functions 13. Sayers EW, Agarwala R, Bolton EE et al (2019)
of individual class III peroxidase genes. J Exp Database resources of the National Center for
Bot 60:391–409 Biotechnology Information. Nucleic Acids Res
6. Lüthje S, Meisrimler CN, Hopff D et al (2011) 47:D23–D28
Phylogeny, topology, structure and functions 14. Schwacke R, Schneider A, Van Der Graaff E
of membrane-bound class III peroxidases in et al (2003) ARAMEMNON, a novel database
vascular plants. Phytochemistry 72:1124–1135 for Arabidopsis integral membrane proteins.
7. Herrero J, Esteban-Carrasco A, Zapata JM Plant Physiol 131:16–26
(2013a) Looking for Arabidopsis thaliana per- 15. Fawal N, Li Q, Savelli B et al (2013) Peroxi-
oxidases involved in lignin biosynthesis. Plant Base: a database for large-scale evolutionary
Physiol Biochem 67:77–86 analysis of peroxidases. Nucleic Acids Res 41:
8. Herrero J, Fernández-Pérez F, Yebra T et al D441–D444
(2013b) Bioinformatic and functional charac- 16. Berman HM, Westbrook J, Feng Z et al (2000)
terization of the basic peroxidase 72 from Ara- The Protein Data Bank. Nucleic Acids Res
bidopsis thaliana involved in lignin 28:235–242
biosynthesis. Planta 237:1599–1612
In Silico Analysis of Class III Peroxidases 339
17. Lipman DJ, Pearson WR (1985) Rapid and 31. Krissinel E, Henrick K (2007) Inference of
sensitive protein similarity searches. Science macromolecular assemblies from crystalline
227:1435–1441 state. J Mol Biol 372:774–797
18. Gasteiger E, Hoogland C, Gattiker A et al 32. Moural TW, Lewis KM, Barnaba C et al (2017)
(2005) Protein identification and analysis Characterization of class III peroxidases from
tools on the ExPASy server. In: Walker JM Switchgrass. Plant Physiol 173:417–433
(ed) The proteomics protocols handbook. 33. Pettersen EF, Goddard TD, Huang CC et al
Humana Press, New York, pp 571–607 (2004) UCSF chimera a visualization system
19. Ren J, Wen L, Gao X et al (2008) CSS-Palm for exploratory research and analysis. J Comput
2.0: an updated software for palmitoylation Chem 25:1605–1612
sites prediction. Protein Eng Des Sel 34. Grosdidier A, Zoete V, Michielin O (2011)
21:639–644 SwissDock, a protein-small molecule docking
20. Sigrist CJA, de Castro E, Cerutti L et al (2012) web service based on EADock DSS. Nucleic
New and continuing developments at PRO- Acids Res 39:W270–W277
SITE. Nucleic Acids Res 21:D344–D347 35. Morris GM, Huey R, Lindstrom W et al (2009)
21. Blom N, Sicheritz-Ponten T, Gupta R et al Autodock4 and AutoDockTools4: automated
(2004) Prediction of post-translational glyco- docking with selective receptor flexiblity. J
sylation and phosphorylation of proteins from Comput Chem 16:2785–2791
the amino acid sequence. Proteomics 36. Seeliger D, de Groot BL (2010) Ligand dock-
4:1633–1649 ing and binding site analysis with PyMOL and
22. Fankhauser N, M€aser P (2005) Identification autodock/Vina. J Comput Aided Mol Des
of GPI anchor attachment signals by a Koho- 24:417–422
nen self-organizing map. Bioinformatics 37. Nanda T, Tripathy K, Ashwin P (2011) Inte-
21:1846–1852 gration of Bioinformatics Tools for Proteomics
23. Krogh A, Larsson B, von Heijne G et al (2001) Research. J Comput Sci Syst Biol S13. https://
Predicting transmembrane protein topology doi.org/10.4172/jcsb.S13-002
with a hidden Markov model: application to 38. Hetenyi C, van der Spoel D (2011) Toward
complete genomes. J Mol Biol 305:567–580 prediction of functional protein pockets using
24. Tusnády GE, Simon I (2001) The HMMTOP blind docking and pocket search algorithms.
transmembrane topology prediction server. Protein Sci 20:880–893
Bioinformatics 17:849–850 39. Andrusier N, Nussinov R, Wolfson HJ (2007)
25. Möller S, Croning MDR, Apweiler R (2001) FireDock: fast interaction refinement in molec-
Evaluation of methods for the prediction of ular docking. Proteins 69:139–159
membrane spanning regions. Bioinformatics 40. Mashiach E, Schneidman-Duhovny D, Andru-
17:646–653 sier N et al (2008) FireDock: a web server for
26. Nielsen H, Krogh A (1998) Prediction of sig- fast interaction refinement in molecular dock-
nal peptides and signal anchors by a hidden ing. Nucleic Acids Res 36:W229–W232
Markov model. Proc Int Conf Intell Syst Mol 41. Schneidman-Duhovny D, Inbar Y, Nussinov R
Biol 6:122–130 et al (2005) PatchDock and SymmDock: ser-
27. Nakai K, Horton P (1999) PSORT: a program vers for rigid and symmetric docking. Nucleic
for detecting the sorting signals of proteins and Acids Res 33:W363–W367
predicting their subcellular localization. Trends 42. Irwin JJ, Sterling T, Mysinger MM et al (2012)
Biochem Sci 24:34–35 ZINC: a free tool to discover chemistry for
28. Waterhouse A, Bertoni M, Bienert S et al biology. J Chem Inf Model 52:1757–1768
(2018) SWISS-MODEL: homology modelling 43. Lee Y, Rubio MC, Alassimone J et al (2013) A
of protein structures and complexes. Nucleic mechanism for localized lignin deposition in
Acids Res 46:W296–W303 the endodermis. Cell 153:402–412
29. Kelley LA, Mezulis S, Yates CM et al (2015) 44. Watanabe L, de Moura PR, Bleicher L et al
The Phyre2 web portal for protein modeling, (2010) Crystal structure and statistical cou-
prediction and analysis. Nat Protoc pling analysis of highly glycosylated peroxidase
10:845–858 from royal palm tree (Roystonea regia). J Struct
30. Wass MN, Kelley LA, Sternberg MJ (2010) Biol 169:226–242
3DLigandSite: predicting ligand-binding sites 45. Berglund GI, Carlsson GH, Smith AT et al
using similar structures. Nucleic Acids Res 38 (2002) The catalytic pathway of horseradish
(Suppl):W469–W473 peroxidase at high resolution. Nature 417:463
Chapter 25
Abstract
Mass spectrometry imaging is routinely used to visualize the distributions of biomolecules in tissue sections.
In plants, mass spectrometry imaging of metabolites is more often conducted, but the imaging of larger
molecules is less frequently performed despite the importance of proteins and endogenous peptides to the
plant. Here, we describe a matrix-assisted laser desorption/ionization mass spectrometry imaging method
for the imaging of peptides in Medicago truncatula root nodules. Sample preparation steps including
embedding in gelatin, sectioning, and matrix application are described. The method described is employed
to determine the spatial distribution of hundreds of peptide peaks.
Key words Medicago truncatula, Root nodules, Peptides, Mass spectrometry imaging, MSI, MALDI
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_25, © Springer Science+Business Media, LLC, part of Springer Nature 2020
341
342 Caitlin Keller et al.
2 Materials
3 Methods
Fig. 1 MALDI-MSI scheme showing the sample preparation, instrument analysis, and data analysis steps for a
typical experiment
344 Caitlin Keller et al.
3.2 MALDI-MSI 1. Take the embedded nodule and trim sample to rectangle with a
Sample Preparation couple mm of gelatin surrounding the tissue on all sides. Do
this quickly to minimize the time the sample is at room
temperature.
2. Attach sample to a cryostat chuck with a drop of OCT com-
pound (see Note 4).
3. Allow sample on the chuck to equilibrate in the cryostat at
20 C for 15 min.
4. Align the sample so that the cryostat is cutting sections evenly
across the root and root nodule. This can be done by taking
about five sections and adjusting the chuck if part of the sample
is being missed (see Note 5).
5. Once the center of the nodule (or other desired depth) is
reached, thaw mount sections onto a glass slide by warming
the back of the slide against your hand and then placing the
front of the slide gently onto the tissue section.
6. Continue until a desired number of sections across the z stack
(i.e., the depth) of the root nodule are obtained.
7. Keep the sections in a dry environment (i.e., dry box) while
preparing the TM Sprayer for matrix application (see Note 6).
8. Turn nitrogen gas on TM Sprayer to 10 psi, and the solvent
pump to 0.25 mL/min. The solvent for the pump should
match what the matrix is dissolved in (without the FA), so for
DHB this would be 50% methanol and for CHCA this would
be 50% acetonitrile. Turn on the TM Sprayer and laptop (see
Note 7).
9. Set the temperature on the software to the appropriate temper-
ature for the desired solvent and TM Sprayer system (see
Note 8). As a starting point, 80 C is the appropriate tempera-
ture for 50% methanol.
10. Load the dissolved matrix (i.e., DHB, CHCA, SA; see Note 9)
into the sample loop with the knob in the load position.
11. Load the TM Sprayer method and manually change gas pres-
sure and flow rate if method differs from the initial parameters
of 10 psi and 0.25 mL/min. The TM Sprayer has recom-
mended methods for specific matrices and analyte types,
although method parameters may need to be optimized for a
specific application. For DHB imaging of peptides, method
parameters typically used are 1250 velocity, 0.1 mL/min,
12 passes, 30 s dry time, rotate and offset (cc pattern), 10 psi,
80 C. For imaging of peptides using CHCA and SA as
Peptide Imaging in Medicago Truncatula 345
3.3 MSI Data 1. Place glass slide(s) with sample into the slide adapter. If import-
Acquisition ing the image of the glass slide, scan the slide in the adapter
on the MALDI LTQ with a scanner. Then add the backing plate and insert the plate
Orbitrap XL into the instrument. Alternatively, the slide can be scanned after
inserting the plate into the instrument with the camera in the
instrument (see Note 11).
2. Open the plate image in the MALDI source dialog box in the
Tune software. Zoom in as necessary to see sample, depending
on sample size. Draw boxes around the areas to be imaged (see
Note 12). Save this as a MALDI position file. For MS1 imag-
ing, using a rectangle box and raster motion works best. Also
set the desired spatial step size (75 μm is the smallest raster size
without oversampling).
3. In Xcalibur software (Thermo), set up the sequence by adding
the file name, path location, instrument method, and MALDI
position file. The instrument method contains parameters
controlling the mass resolution, mass range, and centroid/
profile data. The instrument method also requires a tune file,
which controls the laser energy and the microscans (micro-
scans/step is controlled in the instrument file). The microscans
and microscans/step should match to ensure that one pixel is
one mass spectrum in the data file.
4. Check the laser energy by shooting the laser on a matrix only
area that is not being imaged and checking the signal level. You
can adjust the laser energy in your tune file as necessary to get
the optimal signal.
5. Start the sequence.
346 Caitlin Keller et al.
3.4 Data Processing 1. Once the data is collected, it can be viewed in ImageQuest, or
exported to another software program. To visualize the data in
ImageQuest, use the average spectra within a selected area tool
to view an average spectrum of a certain area of the sample. In
the bottom window of ImageQuest, there should be a spec-
trum from the sample. Figure 2 shows example spectra aver-
aged over the nodules for peptide imaging results with DHB,
CHCA, and SA matrices.
2. Look through the peaks in the collected spectrum, zooming in
as appropriate, and when one wants to visualize the distribution
of a certain peak in the tissue, select add new data set. Select the
single dataset option with plot type Mass Range/TIC. Use the
m/z for the mass range and select the desired tolerance window
(i.e., 5 ppm). Repeat as necessary to visualize the m/z in the
sample. Under the 2D tab, there are other color bar options as
well as smoothing options.
3. To view in MSiReader [18], export the data in ImageQuest
into an imzML format, keeping the data in profile.
4. Load the imzML file into MSiReader and select the desired
mass tolerance, image smoothing, and color bar parameters.
Insert a m/z that is localized across the sample to visualize the
sample (one can find a good m/z for this in ImageQuest).
Normalize to the total ion count (TIC). To pull out m/z
unique to the sample, use the polygon tool to create interro-
gated and reference zones. Outline around the sample to create
an interrogated zone, then create a matrix only region for the
reference zone.
5. Use the extract peaks unique to the interrogated zone tool to
create a list of m/z present in the image. One will need to set
percentage numbers for the threshold a m/z needs to be above
in the interrogated zone and the threshold a m/z needs to be
below in the reference zone to be added to the list. Also set the
algorithm for peak centroid calculation (typically parabolic
centroid works well).
6. Once the list has been created, use the generate an image for
each peak in a list tool to create images for all the m/z. Manu-
ally go through the images and remove any bad images (i.e.,
images that have signal in the matrix as well as the sample or
do not appear to have any signal anywhere). Figure 3 shows
example MALDI-MSI images generated from peptide imaging
of root nodules with either DHB or CHCA as the matrix.
Different distributions across the root and root nodules are
observed.
Peptide Imaging in Medicago Truncatula 347
Fig. 2 Example spectra average over the entire root nodules for MALDI-MSI on
the root nodules with different matrices. The matrices are CHCA (a), DHB (b), and
SA (c)
348 Caitlin Keller et al.
Fig. 3 MALDI-MSI images of peptides with either DHB (a, b) or CHCA (c, d, e) as the matrix. The images are
generated at 5 ppm
4 Notes
1. For best results, select nodules that are red in color and elon-
gated rod in shape rather than round. These are the nodules in
which the symbiosis is well developed.
2. To make the sectioning process easier, ensure that the nodule is
as flat as possible with the root in line with the nodule. This will
help to get both the root and the nodule in the same plane
when sectioning.
3. If the nodule is not completely frozen when covered in gelatin,
it will not stick to the bottom of the cup and instead will float
up to the middle or top of the cup. This makes the nodule
harder to find and may result in the positioning of the nodule
being lost. After adding the gelatin, the cup should be kept
level while waiting for the rest of the gelatin to freeze. If the
gelatin freezes at an angle, it will be harder to level the nodule
while sectioning to get both the root and root nodule in a
single section. Avoid air bubbles close to the nodule when
adding the gelatin, as this also will make the nodules harder
to section.
4. OCT compound is beneficial as it provides a way to “glue” the
sample to the cryostat chuck during sectioning. However, it is a
polymeric species and will suppress analyte signal if it comes
into contact with the sample. Thus, care should be taken to
ensure that the OCT compound does not come into contact
with the sample or with the blade or stage of the cryostat. By
placing a small drop of OCT to the back of the gelatin sur-
rounded sample, where the OCT does not contact the sample
or blade, the sample can be secured onto the chuck without any
interference from the OCT.
Peptide Imaging in Medicago Truncatula 349
5. For plant root nodules, our lab typically uses 16 μm, but other
section thicknesses between 8 and 35 μm [19], can be used.
6. After sectioning and before matrix application, washing steps
to remove highly abundant lipid species can increase signal
intensity and observed protein peaks [20]. For protein imag-
ing, ethanol washes and potentially a Carnoy wash are typically
used to remove the lipid species that can suppress protein
signal. For endogenous peptide imaging, washes may (or may
not) remove the target peptides, depending on the chemical
properties of the peptides. Thus, care should be taken when
using washing techniques with peptides to ensure that they are
not being removed in the washing steps.
7. Here the TM Sprayer is used to apply the matrix evenly across
the sample. It is important that the matrix is applied in a
homogenous manner at all points on the tissue so that matrix
inhomogeneity does not skew the results. A matrix application
method should be reproducible run-to-run to ensure that
results remain consistent. Other automatic sprayers can be
used (i.e., home-built or the Bruker ImagePrep). Other matrix
application techniques include the airbrush and sublimation
[21]. Airbrush application can be achieved easily with minimal
expense, however, user-to-user variation can be high and
reproducibility can be a challenge. Sublimation provides very
small crystal size and good imaging results for metabolomics
studies, but due to the dry application, the method requires
further recrystallization steps for analysis of larger molecules
(i.e., peptides and proteins) [22].
8. The temperature of the TM Sprayer should be about 5 C
below the temperature at which the “puffing” sound starts.
This sound indicates that the matrix is not being sprayed in a
consistent manner. If run at a temperature when the solvent is
“puffing” the matrix will not cover the sample homogeneously,
which will negatively affect results.
9. There are many different matrices to choose from. DHB and
CHCA are both common matrices and can be used for a variety
of analytes. Other matrices may be used primarily for larger
peptides and proteins (i.e., SA) or primarily for negative mode
(i.e., 9-aminoacrilamide). Matrices other than DHB and
CHCA may work well depending on your desired analyte.
10. If this method is too wet, you can cut the flow rate in half and
double the number of passes to achieve the same matrix density
but with a drier spray.
11. The preferred scanning method depends on the sample and
time considerations. For the nodules, scanning in with the
camera on the instrument provides good alignment and
image quality, but this takes 25 min per slide. For larger tissues,
350 Caitlin Keller et al.
Acknowledgments
References
1. Caprioli RM, Farmer TB, Gile J (1997) Molec- 7. Chaurand P, Norris JL, Cornett DS et al
ular imaging of biological samples: localization (2006) New developments in profiling and
of peptides and proteins using MALDI-TOF imaging of proteins from tissue sections by
MS. Anal Chem 69:4751–4760 MALDI mass spectrometry. J Proteome Res
2. Goodwin RJ, Pennington SR, Pitt AR (2008) 5:2889–2900
Protein and peptides in pictures: imaging with 8. Lee YJ, Perdian DC, Song Z et al (2012) Use
MALDI mass spectrometry. Proteomics of mass spectrometry for imaging metabolites
8:3785–3800 in plants. Plant J 70:81–95
3. Buchberger AR, DeLaney K, Johnson J et al 9. Gemperline E, Keller C, Jayaraman D et al
(2018) Mass spectrometry imaging: a review of (2016) Examination of endogenous peptides
emerging advancements and future insights. in Medicago truncatula using mass spectrome-
Anal Chem 90:240–265 try imaging. J Proteome Res 15:4403–4411
4. Ye H, Gemperline E, Venkateshwaran M et al 10. Poth AG, Mylne JS, Grassl J et al (2012) Cyclo-
(2013) MALDI mass spectrometry-assisted tides associate with leaf vasculature and are the
molecular imaging of metabolites during nitro- products of a novel precursor in petunia (Sola-
gen fixation in the Medicago truncatula-Sinor- naceae). J Biol Chem 287:27033–27046
hizobium meliloti symbiosis. Plant J 11. Cavatorta V, Sforza S, Mastrobuoni G et al
75:130–145 (2009) Unambiguous characterization and tis-
5. Gemperline E, Jayaraman D, Maeda J et al sue localization of Pru P 3 peach allergen by
(2015) Multifaceted investigation of metabo- electrospray mass spectrometry and MALDI
lites during nitrogen fixation in Medicago via imaging. J Mass Spectrom 44:891–897
high resolution MALDI-MS imaging and 12. Grassl J, Taylor NL, Millar AH (2011) Matrix-
ESI-MS. J Am Soc Mass Spectrom assisted laser desorption/ionisation mass spec-
26:149–158 trometry imaging and its development for
6. Chen RB, Li L (2010) Mass spectral imaging plant protein imaging. Plant Methods 7:11
and profiling of neuropeptides at the organ and 13. Tavormina P, De Coninck B, Nikonorova N
cellular domains. Anal Bioanal Chem et al (2015) The plant peptidome: an
397:3185–3193
Peptide Imaging in Medicago Truncatula 351
expanding repertoire of structural features and 19. Qin L, Zhang Y, Liu Y et al (2018) Recent
biological functions. Plant Cell 27:2095–2118 advances in matrix-assisted laser desorption/
14. Batut J, Mergaert P, Masson-Boivin C (2011) ionisation mass spectrometry imaging
Peptide signalling in the rhizobium-legume (MALDI-MSI) for in situ analysis of endoge-
symbiosis. Curr Opin Microbiol 14:181–187 nous molecules in plants. Phytochem Anal
15. Van de Velde W, Zehirov G, Szatmari A et al 29:351–364
(2010) Plant peptides govern terminal differ- 20. Seeley EH, Oppenheimer SR, Mi D et al
entiation of bacteria in symbiosis. Science (2008) Enhancement of protein sensitivity for
327:1122–1126 MALDI imaging mass spectrometry after
16. Mortier V, Den Herder G, Whitford R et al chemical treatment of tissue sections. J Am
(2010) CLE peptides control Medicago trun- Soc Mass Spectrom 19:1069–1077
catula nodulation locally and systemically. 21. Gemperline E, Rawson S, Li L (2014) Optimi-
Plant Physiol 153:222–237 zation and comparison of multiple MALDI
17. Mortier V, De Wever E, Vuylsteke M et al matrix application methods for small molecule
(2012) Nodule numbers are governed by inter- mass spectrometric imaging. Anal Chem
action between CLE peptides and cytokinin 86:10030–10035
signaling. Plant J 70:367–376 22. Yang J, Caprioli RM (2011) Matrix sublima-
18. Robichaud G, Garrard KP, Barry JA et al tion/recrystallization for imaging proteins by
(2013) MSiReader: an open-source interface mass spectrometry at high spatial resolution.
to view and analyze high resolving power MS Anal Chem 83:5728–5734
imaging files on matlab platform. J Am Soc
Mass Spectrom 24:718–721
Chapter 26
Abstract
Protease inhibitors of the cystatin protein superfamily show potential in plant protection for the control of
herbivorous pests. Here, we describe a cystatin activity–based profiling procedure for the selection of potent
cystatin candidates, using single functional variants of tomato cystatin SlCYS8 and digestive Cys proteases
of the herbivore insect Colorado potato beetle as a case study. The procedure involves the capture of target
Cys proteases with biotinylated versions of the cystatins, followed by the identification and quantitation of
captured proteases by mass spectrometry. An example is given to illustrate usefulness of the approach as an
alternative to current procedures for recombinant inhibitor selection based on in vitro assays with synthetic
peptide substrates. A second example is given showing its usefulness as a tool to compare the affinity spectra
of inhibitor variants toward different subsets of target protease complements.
Key words Plant protease inhibitors, Herbivorous insect digestive proteases, Cystatin activity–based
protease profiling, Cys protease capture, Biotinylated cystatins
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_26, © Springer Science+Business Media, LLC, part of Springer Nature 2020
353
354 Marie-Claire Goulet et al.
2 Materials
2.2 Insect Midgut 1. Insect proteins from snap-frozen Colorado potato beetle (Lep-
Proteins tinotarsa decemlineata) fourth-instar larvae reduced to a fine
powder in liquid nitrogen (Praxair) [11].
2.4 Media, Buffers Culture media, buffers, and solvents are made up as aqueous solu-
and Other Solutions tions with ultrapure water and analytical grade reagents. All solu-
tions are stored at 4 C unless otherwise indicated. Working
buffers, reagents, and standard step-by-step protocols for SDS-
PAGE are used as described in [16].
1. Luria–Bertani (LB) medium: 10 g/L tryptone, 5 g/L yeast
extract, 5 g/L NaCl, pH 7.0.
2. 10 μg/mL chloramphenicol (Sigma-Aldrich).
3. 100 μg/mL carbenicillin (Sigma-Aldrich).
4. 1 mM isopropyl ß-D-1-thiogalactopyranoside (Sigma-Aldrich).
5. 50 μM D-biotin (Sigma-Aldrich).
6. Phosphate buffered-saline (PBS), pH 7.4.
7. 100 mM citrate phosphate buffer, pH 6.0.
356 Marie-Claire Goulet et al.
3 Methods
3.1 Capture of Target The whole procedure in this chapter includes two steps consisting
Proteases with of (1) capturing cystatin-sensitive target protease isoforms with
Biotinylated Cystatins biotinylated AviTagged cystatins (this section) (Fig. 1); and
(2) quantifying the captured proteases by LC-MS/MS analysis
(Subheading 3.2).
Fig. 1 Schematic overview of the cystatin activity-based protease capture procedure. PHASE 1: Bacterially
expressed AviTagged inhibitors (e.g., AviTagged cystatins) are ligated to D-biotin by the action a biotin ligase,
in vivo during their expression in E. coli or in vitro following their recovery from the bacteria (Subheading
3.1.1). Phase 2: The biotinylated inhibitors (cystatins) are incubated with NeutrAvidin™ agarose beads to
generate cystatin-embedded agarose matrices (or beads) for protease capture (Subheading 3.1.2). Phase 3:
The beads are incubated with a crude protein extract of the putative target proteases (Subheading 3.1.3) to
capture those protease isoforms that show affinity for the tested inhibitor variants (Subheading 3.1.4). These
beads bound to the inhibitor–target protease complexes then serve as source material for LC-MS/MS analyses
(Subheading 3.2)
3.1.3 Extraction of Target 1. Extract insect powder proteins in two volumes (e.g., 2 mL
Proteases buffer/g fresh powder) of 100 mM citrate phosphate buffer,
pH 6.0.
2. Keep the mixture on ice for 10 min.
3. Discard insoluble material by centrifugation at 20,000 g for
10 min at 4 C.
4. Assay soluble proteins in the supernatant using the Bio-Rad
Protein Assay™ kit (Bio-Rad), as described by the provider.
5. Use the supernatant as freshly prepared for Subheading 3.1.4,
or keep it at 80 C until use.
biotin/SlCYS8 biotin/Q47P
Crude Flow Wash Beads Wash Flow Beads+ Beads
Mr
extract biotin (empty)
200
116
97
66
45
31 a
21
b
14.4 c
6.5
Fig. 2 Avidin-affinity enrichment of Colorado potato beetle Cys proteases captured using biotinylated
AviTagged SlCYS8. Q47P, a single functional variant of SlCYS8 with limited inhibitory activity against
papain-like Cys proteases [17], was here used as a negative control for the protease capture step. Biotinylated
cystatins bound to the avidin beads were incubated with the insect protein extract for target protease capture
(Subheading 3.1.4). Proteins in test and control (Q47P) samples were visualized by Coomassie Blue staining
following 12% (w/v) SDS-PAGE. The crude (Crude extract), flow-through (Flow), washing (Wash), and beads-
bound (Beads) protein fractions are shown on the gel. A 30-kDa protein was readily detected in the Beads
fraction using wild-type SlCYS8 (Box a), corresponding to the previously described Cys protease LdP30
purified from Colorado potato beetle midgut extracts by affinity chromatography with the model plant cystatin
OCI as a ligand [18]. Boxes b and c correspond to avidin and AviTagged SlCYS8 recovered from the affinity
beads, respectively. Mr, on the left, refers to molecular weight protein markers (kDa)
3.2 Mass Gel slices collected at Subheading 3.1.4 are used as source material
Spectrometric to identify and quantify cystatin-captured protease isoforms. This
Analysis of Captured part of the procedure first involves protein sample preparation for
Proteases mass spectrometry, followed by the LC-MS/MS analysis per se,
peptide-based identification of the captured proteases, and their
quantitation based on peptide spectral count sampling statistics.
3.2.1 Sample 1. Wash the gel slices for 5 min in water and destain proteins three
Preparation for Mass times with equal volumes of 100 mM ammonium bicarbonate
Spectrometry and 50% (v/v) acetonitrile.
2. Dry the gel slices by washing for 10 min in 50% (v/v)
acetonitrile.
3. Reduce and alkylate entrapped proteins with 10 mM dithio-
threitol and 55 mM iodoacetamide, respectively.
360 Marie-Claire Goulet et al.
3.2.2 LC-MS/MS Our LC-MS/MS analyses are performed at the Proteomics Plat-
Analysis form of CHU de Québec Research Center (http://proteomique.
ulaval.ca), Québec, QC, Canada. In brief, peptide samples pro-
duced at Subheading 3.2.1 are resolved by online reversed-phase
nanoscale capillary LC and analyzed by electrospray MS/MS. An
Eksigent ekspert™ nanoLC425 System is used, coupled to a
Triple-TOF 5600 plus mass spectrometer equipped with a nanoe-
lectrospray ion source (Sciex). Peptide separations take place in self-
pack PicoFrit columns [75 μm ID/15 μm tip] (New Objective)
packed with Reprosil-Pur C18 AQ media composed of 3-μm par-
ticles with pores of 120 A (Dr. Maisch, Woburn, MA, USA). The
peptides are eluted at 300 nL/min over 35 min along a 5–35%
(v/v) acetonitrile–0.1% (v/v) formic acid linear gradient. Full-scan
mass spectra [400–1250 m/z] are acquired under a data-dependent
acquisition mode using the Analyst software, version 1.7 (Sciex).
The 20 most intense ions are selected for collision-induced dissoci-
ation, with the dynamic exclusion period set a 20 s and a peptide ion
mass tolerance of 100 ppm.
3.2.3 Identification of MGF peak list files are generated with the Protein Pilot software,
Captured Proteases version 4.5 (Sciex) and analyzed using the Mascot software, version
2.5.1 (Matrix Science) to search the Uniprot protein sequence
database (http://www.uniprot.org/). Search parameters for pro-
tein matching are set as follows: a fragment ion mass tolerance of
0.1 Da; a parent ion tolerance of 0.1 Da; iodoacetamide derivatives
of Cys residues as fixed modification; oxidized Met residues as
variable modification; and a maximum allowed of two missed tryp-
sin cleavages. MS/MS-based peptide and protein identifications are
validated using the SCAFFOLD software, version 4.7.1 (Proteome
Software). A false discovery rate of 1%, as determined with the
Scaffold Local FDR algorithm, is applied for both peptides and
proteins. Proteins that contain similar peptides and cannot be
differentiated based on the MS/MS spectra are grouped to satisfy
the principle of parsimony.
Cystatin Activity-Based Protease Profiling 361
3.3 Working Spectral count data in Subheading 3.2.4 may be used to address
Examples questions of practical or scientific interest. We used the approach in
recent years to identify potent inhibitor variants for Colorado
potato beetle control [11, 12, 21] (Subheading 3.3.1). We also
used it to address basic questions about the evolution and struc-
ture/function relationships of protease–inhibitor interactions in
plant/insect systems, again taking the Colorado potato beetle as a
model [10, 22] (Subheading 3.3.2).
3.3.2 Example 2: The Complex protease inhibitor complements in plants are the result of
Protease Capture Approach evolutionary processes often involving gene duplication and posi-
as an Analytical Tool to tive selection of nonsynonymous mutations at functionally signifi-
Address Basic Questions cant amino acid sites [24]. A well-documented case is the 8-domain
on the Evolution and potato multicystatin, an 88-kDa protease inhibitor induced in leaf
Protease Binding tissue by Colorado potato beetle feeding [25]. The eight domains
Preferences of Plant of this protein present hypervariable amino acid sites at conserved
Cystatins protease inhibitory motifs, assumed to be instrumental in its broad
362 Marie-Claire Goulet et al.
A B C
Z-Phe-Arg-MCA 100 Z-Arg-Arg-MCA 10
20 25
2
T6 Loop 1
P2 Loop 2 0 0 0
0 10 20 30 40 50 WT T6R P2V WT T6R P2V
Inhibitor (nM) SlCYS8 variant (1 µM) SlCYS8 variant
Fig. 3 Affinity spectra of tomato SlCYS8 and single functional variants T6R and P2V toward Colorado potato
beetle digestive Cys proteases. (a) Structure model for SlCYS8 (GenBank Accession No. AF198390) showing
the approximate position of residues Pro-2 (P2) and Thr-6 (T6) targeted to produce P2V and T6R. Details for
the in silico modeling are given in ref. [10]. (b) Z-Arg-Phe-methylcoumarin (MCA) (cathepsin L-like) and Z-Arg-
Arg-MCA (cathepsin B-like) hydrolyzing activities in larval midgut extracts preincubated with the three cystatin
variants. Data on this panel were inferred from ref. [14]. Each bar is the mean of three independent (insect
replicate) values SE. (c) Relative spectral counts for digestive Cys protease (intestain) peptides captured with
biotinylated SlCYS8, P2V, or T6R in midgut extracts of fourth-instars. Data on this panel were inferred from ref.
[11]. Spectral counts are expressed relative to total spectra counted for wild-type SlCYS8 (mean value of 1).
Each bar is the mean of three independent (insect replicate) values SE
IntB IntD
Fig. 4 Affinity spectra of wild-type SlCYS8 and single variants P2I, P2L, and P2V toward Colorado potato beetle
digestive Cys proteases. (a) Relative spectral counts for intestain peptides captured with biotinylated wild-type
SlCYS8 (WT), P2I, P2L, or P2V in midgut extracts of fourth instars. Spectral counts are expressed relative to
wild-type SlCYS8 (mean value of 1) for all detected intestains (Total, corresponding to subfamilies IntA–F) or
for peptides specific to major intestain subfamilies IntB and IntD [12]. Each bar is the mean of three
independent (insect replicate) values SE. (b) Intestain subfamily preference patterns of wild-type SlCYS8,
P2I, P2L, and P2V for major intestain families IntB and IntD. Pie charts illustrate the relative proportions of
IntB- vs IntD-specific peptides detected in the insect crude extract. Data on this figure were inferred from
reference [22]
4 Notes
Acknowledgments
References
8. Michaud D, Nguyen-Quoc B (2000) Using 18. Visal-Shah SD, Vrain TC, Yelle S et al (2001)
natural and modified protease inhibitors. In: An electroblotting, two-step procedure for the
Michaud D (ed) Recombinant protease inhibi- detection of proteinases and the study of pro-
tors in plants. CRC Press, Boca Raton, FL, pp teinase/inhibitor complexes in gelatin-
114–127 containing polyacrylamide gels. Electrophore-
9. Srinivasan A, Giri AP, Gupta VS (2006) Struc- sis 22:2646–2652
tural and functional diversities in lepidopteran 19. Zhang B, VerBerkmoes NC, Langston MA et al
serine proteases. Cell Mol Biol Lett (2006) Detecting differential and correlated
11:132–154 protein expression in label-free shotgun prote-
10. Vorster J, Rasoolizadeh A, Goulet MC et al omics. J Proteome Res 5:2909–2918
(2015) Positive selection of digestive Cys pro- 20. Old WM, Meyer-Arendt K, Aveline-Wolf L
teases in herbivorous Coleoptera. Insect Bio- et al (2005) Comparison of label-free methods
chem Mol Biol 65:10–19 for quantifying human proteins by shotgun
11. Rasoolizadeh A, Munger A, Goulet MC et al proteomics. Mol Cell Proteomics
(2016) Functional proteomics-aided selection 4:1487–1502
of protease inhibitors for herbivore insect con- 21. Oppert B, Rasoolizadeh A, Michaud D (2014)
trol. Sci Rep 6:38827 The coleopteran gut and targets for pest con-
12. Sainsbury F, Rhéaume AJ, Goulet MC et al trol. In: Hoffmann K (ed) Insect molecular
(2012) Discrimination of differentially inhib- biology and ecology. CRC Press, Boca Raton,
ited cysteine proteases by activity-based FL, pp 291–317
profiling using cystatin variants with tailored 22. Rasoolizadeh A, Goulet MC, Sainsbury F et al
specificities. J Proteome Res 11:5983–5993 (2016) Single substitutions to closely related
13. Benchabane M, Schlüter U, Vorster J et al amino acids contribute to the functional diver-
(2010) Plant cystatins. Biochimie sification of an insect-inducible, positively
92:1657–1666 selected plant cystatin. FEBS J 283:1623–1635
14. Goulet MC, Dallaire C, Vaillancourt LP et al 23. Cingel A, Savic J, Lazarevic J et al (2016)
(2008) Tailoring the specificity of a plant cysta- Extraordinary adaptive plasticity of Colorado
tin toward herbivorous insect digestive cysteine potato beetle: “ten-striped spearman” in the
proteases by single mutations at positively era of biotechnological warfare. Int J Mol Sci
selected amino acid sites. Plant Physiol 17:1538
146:1010–1019 24. Christeller JT (2005) Evolutionary mechan-
15. Beckett D, Kovaleva E, Schatz PJ (1999) A isms acting on proteinase inhibitor variability.
minimal peptide substrate in biotin holoen- FEBS J 272:5710–5722
zyme synthetase-catalyzed biotinylation. Pro- 25. Bouchard E, Cloutier C, Michaud D (2003)
tein Sci 8:921–929 Oryzacystatin I expressed in transgenic potato
16. Smith BJ (1984) SDS polyacrylamide gel elec- induces digestive compensation in an insect
trophoresis of proteins. In: Walker JM natural predator via its herbivorous prey feed-
(ed) Methods in molecular biology, Proteins, ing on the plant. Mol Ecol 12:2439–2446
vol 1. Humana Press, Clifton, NJ, pp 41–55 26. Kiggundu A, Goulet MC, Goulet C et al
17. Arai S, Watanabe H, Kondo H et al (1991) (2006) Modulating the proteinase inhibitory
Papain-inhibitory activity of oryzacystatin, a profile of a plant cystatin by single mutations
rice seed cysteine proteinase inhibitor, depends at positively selected amino acid sites. Plant J
on the central Gln-Val-Val-Ala-Gly region con- 48:403–413
served among cystatin superfamily members. J
Biochem 109:294–298
Chapter 27
Abstract
In the era of high-throughput biology, it is necessary to develop a simple pipeline for metabolic pathway
reconstruction in plant orphan species. However, obtaining a global picture of the plant metabolism may be
challenging, especially in nonmodel species. Moreover, the use of bioinformatics tools and statistical
analyses is required. This chapter describes how to use different software and online tools for the recon-
struction of metabolic pathways of plant species using existing pathway knowledge. In particular, Quercus
ilex omics data is employed to develop the present pipeline.
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_27, © Springer Science+Business Media, LLC, part of Springer Nature 2020
367
368 Cristina López-Hidalgo et al.
2 Materials
2.1 Datasets The data employed in this work belongs to previously published
works [11–13].
2.1.3 Metabolomics The metabolites were obtained as indicated in [11]. Some pipelines
Datasets are implemented for metabolite identification, including both com-
mercial software such as Compound Discoverer 3.0 (Thermo Sci-
entific™) and Progenesis QI software (Nonlinear Dynamics) and
open and free software packages such as MZmine2 [14], XCMS
[15], and MSDIAL [16]. The former group of software identify
compounds using online database search tools including
mzCloud™, Chemspider™, KEGG, and METLIN [17], and
local or in-house databases. In the employed data, data raws are
analyzed by AMDIS (http://www.amdis.net/) and metabolites are
“tentatively assigned” based on GC retention times (RT) and m/z
values through searches in different databases, including the Gölm
Metabolome Database [18], Alkane, Fiehn library 1 y 2 [19],
GC-TSQ, MoSys, and NIST/EPA/NIH Mass Spectral Library.
The annotated metabolites are named using the KEGG com-
pound reference database. For MapMan visualization, the name of
the metabolites must be compatible with the MapMan metabolite
370 Cristina López-Hidalgo et al.
Fig. 1 The workflow for metabolic pathway reconstruction is divided in four steps: omics data collection,
bioinformatics for (semi)quantitative analyses, bioinformatics for annotation, and data visualization. Employed
software and tools are referenced (transcriptomics, proteomics, and metabolomics)
2.2 Integration Tools Different resources and web application are employed to integrate
the multiple omics information. The first one, Mercator [20], is an
online tool to batch classify protein or gene sequences into Map-
Man functional plant categories. This tool allows for the automatic
structuring of whole plant transcriptomes and/or proteomes. Once
the annotation and functional plant categorization have been con-
ducted, KEGG (Kyoto Encyclopedia for Genes and Genomes;
http://www.genome.jp/kegg/) and MapMan (http://mapman.
gabipd.org/) [21, 22] are used to visualize the data in different
plant metabolic pathways.
3 Methods
3.2 KEGG Metabolic The KEGG metabolic pathway database contains a collection of
Pathways pathway maps that allow for the representation of molecular inter-
actions and reactions. Both transcripts and proteins can be
employed. In order to see the presence of transcripts or proteins
related to enzymes, the process must be conducted twice.
1. Copy the EC and C numbers in the KEGG mapper (https://
www.genome.jp/kegg/tool/map_pathway1.html) or upload
the previously generated file with the list of these numbers
(Fig. 4).
372 Cristina López-Hidalgo et al.
Fig. 2 (a) Screenshot of Mercator sequence annotator. This tool performs Blast searches against Arabidopsis
TAIR 10, Swiss-Prot, and Uniref90. In the picture, other databases can be shown. The results are filtered by a
threshold (Blast_cutoff). (b) Screenshot of the Mercator sequence annotator status process. (c) Screenshot of
the Mercator finished status. When the status process indicates that it is finished, the results can be
downloaded
2. Select the organism in search against (Press org and write ath
for Arabidopsis thaliana (thale cress) or other species, such as
pop for Populus trichocarpa).
3. Press execute.
4. On the following page, the result is displayed (Fig. 5a) (see
Note 3).
5. Choose the metabolic pathway by clicking on the name (e.g.,
ath00020 Citrate cycle (TCA cycle)—Arabidopsis thaliana
(thale cress)).
6. A picture of the metabolic pathway with the metabolic reac-
tions is shown (Fig. 5b). The detected items are highlighted
in red.
Metabolic Pathway Reconstruction in Plant Species 373
Fig. 3 (a) Functional categorization and distribution in percentage of the proteins or genes, according to the
categories established by MERCATOR. The pie chart shows different functional categories: PS (Photosynthe-
sis), major CHO metabolism, minor CHO metabolism, glycolysis, fermentation, gluconeogenesis/glyoxylate
cycle, OPP (Oxidative Pentose Phosphate), TCA/org transformation, mitochondrial electron transport/ATP
synthesis, cell wall, lipid metabolism, N-metabolism, amino acid metabolism, S-assimilation, metal handling,
secondary metabolism, hormone metabolism, cofactor and vitamin metabolism, tetrapyrrole synthesis, stress,
redox, polyamine metabolism, nucleotide metabolism, biodegradation of xenobiotics, C1-metabolism, mis-
cellaneous RNA, DNA, protein, signaling, cell, micro RNA, natural antisense, development, transport, and not
assigned. (b) Screenshot of the functional categorization result file (txt format) which lists for each gene or
protein in the first column the BINCODE, in the second column the BINCODE name, in the third column the
fasta file identifier, the fourth column the description of the annotated gene or protein, and in the fifth column
the type of molecular component (T is transcript; P is protein; and M is metabolite. The description contains
the information indicated in Table 1
374 Cristina López-Hidalgo et al.
Table 1
Example of the information indicated in DESCRIPTION column
Annotated transcript/
protein Functions in Involved in Located in Expressed in
(p48715|rbl_sinal: Ribulose- Response to In 10 components 24 plant structures
98.2) Ribulose bisphosphate cadmium ion,
bisphosphate carboxylase carbon fixation,
carboxylase large activity peptidyl-
chain precursor cysteine
(EC 4.1.1.39) S-nitrosylation,
(RuBisCO large response to
subunit) abscisic acid
(Fragment)—Sinapis stimulus
alba (White mustard)
(Brassica hirta) &
(atcg00490: 97.1)
large subunit of
RUBISCO. Protein
is tyrosine-
phosphorylated, and
its phosphorylation
state is modulated in
response to ABA in
Arabidopsis thaliana
seeds. RBCL
Fig. 4 Screenshot of Search Pathway mapping tool. This tool searches against KEGG pathway maps the given
objects (genes, transcripts, proteins, and metabolites)
Fig. 5 (a) Screenshot of the KEGG pathways mapper results. The results consist of a list of the assigned
transcripts/proteins and the metabolites to each KEGG metabolic pathway. (b) Screenshot of the citrate cycle
pathway. The detected transcripts and metabolites are indicated in red
Metabolic Pathway Reconstruction in Plant Species 377
Fig. 6 Workflow with screenshots of the process that it is essential to carry out for visualizing the data on maps
of biological processes. The data must be uploaded to Experiment and Mapping folders
378 Cristina López-Hidalgo et al.
Fig. 7 Screenshots of different means to visualize the citrate cycle pathway in MapMan. (a) Core metabolism
overview. (b) Metabolites. (c) TCA representation. Each red square represents a metabolite or a transcript/
protein. More details can be found in [22]
Metabolic Pathway Reconstruction in Plant Species 379
4 Notes
References
1. Rai A, Saito K, Yamazaki M (2017) Integrated Eucalyptus globulus recovery from water deficit.
omics analysis of specialized metabolism in Metabolomics 12:141
medicinal plants. Plant J 90:764–787 9. Pascual J, Cañal MJ, Escandón M et al (2017)
2. Viant MR, Kurland IJ, Jones MR et al (2017) Integrated physiological, proteomic and meta-
How close are we to complete annotation of bolomic analysis of UV stress responses and
metabolomes? Curr Opin Chem Biol adaptation mechanisms in Pinus radiata. Mol
36:64–69 Cell Proteomics 16:485–501
3. Ernst M, Silva DB, Silva RR et al (2014) Mass 10. Qi Q, Li J, Cheng J (2014) Reconstruction of
spectrometry in plant metabolomics strategies: metabolic pathways by combining probabilistic
from analytical platforms to data acquisition graphical model-based and knowledge-based
and processing. Nat Prod Rep 31:784 methods. BMC Proc 8:1–10
4. Allen DK, Libourel IGL, Shachar-Hill Y 11. López-Hidalgo C, Guerrero-Sánchez VM,
(2009) Metabolic flux analysis in plants: coping Gómez-Gálvez I et al (2018) A multi-omics
with complexity. Plant Cell Environ analysis pipeline for the metabolic pathway
32:1241–1257 reconstruction in the orphan species Quercus
5. Fiehn O (2002) Metabolomics - the link ilex. Front Plant Sci 9:1–16
between genotypes and phenotypes. Plant 12. Guerrero-Sanchez VM, Maldonado-Alconada
Mol Biol 48:155–171 AM, Amil-Ruiz F et al (2017) Holm oak
6. Meijón M, Feito I, Oravec M et al (2016) (Quercus ilex) Transcriptome. De novo
Exploring natural variation of Pinus pinaster sequencing and assembly analysis. Front Mol
Aiton using metabolomics: is it possible to Biosci 4:70
identify the region of origin of a pine from its 13. Guerrero-Sanchez VM, Maldonado-Alconada
metabolites? Mol Ecol 25:959–976 AM, Amil-Ruiz F et al (2019) Ion torrent and
7. Valledor L, Carbó M, Lamelas L et al (2018) lllumina , two complementary RNA-seq plat-
When the tree let us see the forest: systems forms for constructing the holm oak (Quercus
biology and natural variation studies in forest ilex ) transcriptome. PLoS One 7454228:1–18
species. In: Progress in botany. Springer, Ber- 14. Pluskal T, Castillo S, Villar-Briones A et al
lin, Heidelberg, pp 345–367 (2010) MZmine 2: modular framework for
8. Correia B, Valledor L, Hancock RD et al processing, visualizing, and analyzing mass
(2016) Integrated proteomics and metabolo- spectrometry-based molecular profile data.
mics to unlock global and clonal responses of BMC Bioinformatics 11:395
380 Cristina López-Hidalgo et al.
Abstract
The detection and identification of low-abundance proteins from plant tissues is still a major challenge.
Among the reasons are the low protein content, the presence of few very high-abundance proteins, and the
presence of massive amounts of other biochemical compounds. In the last decade numerous technologies
have been devised to resolve the situation, in particular with methods based on solid-phase combinatorial
peptide ligand libraries. This methodology, allowing for an enhancement of low-abundance proteins, has
been extensively applied with the advantage of deciphering the proteome composition of various plant
organs. This general methodology is here described extensively along with a number of possible variations.
Specific guidelines are suggested to cover peculiar situations or to comply with other associated analytical
methods.
Key words Plant proteome, Low-abundance proteins, Combinatorial peptide ligand library
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_28, © Springer Science+Business Media, LLC, part of Springer Nature 2020
381
382 Egisto Boschetti and Pier Giorgio Righetti
6.1 General Capture The amount of neutral salt to reach physiological conditions is
Method under 150 mM. Most frequently the buffer used is a 25 mM phosphate
Physiological buffer containing 0.15 M sodium chloride, pH 7.2 (PBS). These
Conditions buffers mimic the conditions that reign within a cell; by definition
these conditions fully preserve the biological functions of proteins.
(a) Equilibrate the plant protein clear sample with the selected
physiological buffer (e.g., PBS). This operation is performed
by different ways. For instance, if the protein sample is a
lyophilized material, dissolve the powder in the buffer and
clarify by centrifugation. Otherwise a dialysis or a diafiltration
operation or a desalting chromatography or even a desalting
by centrifugation using dedicated membrane devices can be
adopted. For information on protein amounts and concentra-
tions, see Notes 3, 8–11. If the protein sample contains pro-
teases (this is frequently the case) a tablet of a cocktail of
protease inhibitors is added.
388 Egisto Boschetti and Pier Giorgio Righetti
Low-abundance proteins
Fig. 2 Protein capture phase of the dynamic range reduction process by means
of CPLLs. The operation is most generally performed under physiological con-
ditions; however, the reduction of ionic strength allows for capturing more
proteins especially those that have week affinity for the CPLLs. Specific condi-
tions can be used to enhance either or alkaline or hydrophobic low-abundance
protein capturing
6.2 Protein Capture The reduction of ionic strength of the capture buffer promotes or
in Low-Ionic Strength intensifies electrostatic interactions. Weakly charged proteins can
thus be more easily attracted by the electrical charge of the beads.
The result of the decrease of ionic strength is an increase of binding
capacity as this is the case when dealing with ion exchange chroma-
tography. Under these conditions the amount of proteins offered
Low-Abundance Plant Proteomics 389
6.3 The Capture Proteins carry electrical charges of both signs depending on the
of Dominantly Acidic pH. At low pH the positive charge is exacerbated and proteins that
Proteins at neutral pH are negatively charged will reverse their electrical sign.
In this case proteins that were captured by CPLLs because of
attractive electrical sign could be repulsed by them. This is why
for a good reproducibility a perfect control of pH is mandatory.
Since proteins are captured by the beads thanks also to other
interactions, the variation of pH may contribute to weaken the
interaction intensity with certain protein species to the point that
no capture occurs.
The range of pH values where the CPLLs can be operated is
between 3 and 10 [44]. Beyond these limits virtually all proteins are
charged, respectively, either positively or negatively.
As a general rule when the operation is performed in acidic
conditions the capture of anionic proteins is enhanced.
(a) Adjust the protein extract at the desired acidic pH (most
generally pH 4) by adding dropwise either acetic acid or citric
acid up to pH stabilization. This operation can also be per-
formed by buffer exchange (dialysis or diafiltration or gel
filtration). Remove possible materials in suspension.
(b) Equilibrate the CPLL beads using an acidic buffer of the same
pH selected for the protein extract. Then drain out the excess
of liquid by centrifugation under low-speed (about 1250 g
at 20 C for a few min).
(c) Mix the protein sample and the CPLL beads; stir gently to
maintain the beads in suspension for at least 2–3 h or over-
night at constant room temperature. A majority of acidic
proteins are captured.
(d) The protein capture extent will depend also on the ionic
strength of the buffer as described in Subheading 4.
(e) Eliminate the excess of supernatant by centrifugation.
(f) Wash 2–3 times the CPLL beads with the equilibration buffer
to remove the excess of proteins and proceed for the elution of
captured proteins by one of the methods described in
Subheading 7.
6.4 The Capture As stated above (Subheading 6.3), when varying the buffer pH,
of Dominantly Cationic proteins acquire a different net electrical charge. In alkaline condi-
Proteins tions the dominant charge is negative for proteins having an iso-
electric point below the environmental pH.
390 Egisto Boschetti and Pier Giorgio Righetti
6.5 Focus Among natural amino acids composing proteins the most hydro-
on Hydrophobic phobic are isoleucine, leucine, valine and phenylalanine. They con-
Protein Capture tribute to confer a certain degree of hydrophobicity to the entire
polypeptidic construct. A typical method of separating this cate-
gory of proteins is hydrophobic chromatography [45]. This
method is based on the use of structuring salts selected from the
Hofmeister series. The most common process is to equilibrate the
columns by using a buffer comprising at least 1 M ammonium
sulfate. Under these conditions the most hydrophobic proteins
are adsorbed by the CPLL beads and are thus subtracted from the
protein solution. Electrostatic interactions are minimized because
of the presence of strong salt ions. Within the present context with
an entire proteome, the capture of hydrophobic protein by CPLLs
can easily be enhanced. The technical details are as follows:
(a) To the protein extract add the desired amount of lyotropic salt
(generally this is 1 M ammonium sulfate final concentration).
In case of difficulties with protein precipitation the user should
refer to Note 13. Protein equilibration can alternatively be
equilibrated with a buffer comprising the lyotropic salt by
buffer exchange (dialysis or diafiltration or gel filtration). A
possible cloudy material may appear in the supernatant and
should be removed by centrifugation at 10,000 g for
10 min.
Low-Abundance Plant Proteomics 391
7.1 Global Protein Global protein elution from CPLLs is the most frequent option in
Harvesting protein harvesting for proteomics analysis. To this end all involved
interaction forces have to be challenged. In frequent cases it has
been observed that after elution some proteins are still present on
the beads. They are polypeptides retained with high association
constants, among them low-abundance proteins. If they are not
eluted they escape the proteomics analysis with a significant reduc-
tion efficiency of the CPLL treatment. It is within this context that
several global elution methods can be devised.
7.1.1 Global Protein This is one of the most efficient elution methods. It involves
Elution sodium dodecyl sulfate (SDS) as repeatedly described [46]. SDS is
with SDS-Containing known in electrophoresis to confer to proteins a similar global
Buffers charge by sticking on the proteins via hydrophobic associations
and exposing thus strong sulfonate groups. With this profound
restructuring, proteins desorb from the solid CPLL phase. This
operation is performed in the presence of dithiothreitol preventing
the formation of disulfur bonds while enhancing the solubility of
proteins. In addition the high temperature of treatment (boiling
water bath) accelerates the elution procedure to just a few minutes.
(a) Prepare an aqueous solution of 3% SDS (this concentration
could be as high as 10%) and add dithiothreitol (DTT) up to a
final concentration of 25 mM.
(b) To 100 μL of CPLL beads loaded with proteins add 200 μL of
SDS-DTT solution. Mix gently while preventing the forma-
tion of foam and then put in a boiling bath for 10 min.
(c) Cool down the bead suspension and separate the supernatant
by low-speed centrifugation (e.g., 2000 g for 10 min).
Low-Abundance Plant Proteomics 393
Fig. 3 Protein elution phase from CPLLs. A variety of protein desorption methods
can be combined either as global protein elution or as fractionated elution. The
latter can be composed of two, three, or more desorption steps. When more than
one desorption is involved, proteins are collected as a function of elution
stringency or by challenging individually the elemental molecular interactions
7.1.2 Global Protein Another efficient agent capable to desorb proteins from complex
Elution with Guanidine affinity column is guanidine hydrochloride. Such a solution is used
Hydrochloride Solutions at a quite high concentration. It easily competes against electro-
static interactions. Guanidine is a strong chaotropic agent able to
weaken hydrogen bondings and hydrophobic associations. The
final result is the total desorption of proteins that are captured by
CPLLs. Naturally after exposure with guanidine hydrochloride,
desorbed proteins are destructured and hence denatured.
(a) Prepare an aqueous solution of 6 M guanidine and adjust the
pH to 6 by addition of 3–6 M hydrochloric acid.
(b) To 100 μL of CPLL beads loaded with proteins add 200 μL of
the guanidine elution solution, mix gently for 10 min.
(c) Separate the proteins that are in solution in the supernatant by
low speed centrifugation (e.g., 2000 g for 10 min). Then
repeat the operation with the recovered CPLL pellet to be sure
394 Egisto Boschetti and Pier Giorgio Righetti
that all proteins located within the bead pores are extracted.
The second supernatant also recovered by centrifugation is
pooled with the first one.
(d) The assembled eluate solution is not directly analyzable by
current methods because of the presence of high concentra-
tions of guanidine. The protein solution must thus be dialyzed
against any appropriate buffer and if necessary concentrated
and precipitated.
(e) To check that all proteins are desorbed from CPLL beads a
recommendation is given with Note 15.
7.2 Fractionated Several sequenced elution methods have been reported. They are
Elution Approaches also detailed in a dedicated book where variations are described
[47]. The principle is to start with a relatively mild elution step
followed by other desorption steps each of them being performed
with chemical agents or displacers of increased stringency. The
reason behind this approach is first to be sure that all proteins are
desorbed and second that each fraction is populated by a lower
number of species compared to a global protein elution, thus facil-
itating the following analytical procedures.
7.2.1 Two-Step Elution (a) Prepare two different desorbing aqueous solutions. The first is
with Increased Stringency composed of 4 M urea, 1% CHAPS, 5% acetic acid, the second
is a 6 M guanidine-HCl, pH 6.0.
(b) To 100 μL of CPLL beads loaded with proteins add 200 μL of
the first elution solution. Mix gently for about 10 min.
(c) Separate proteins that are in the supernatant from the beads by
low speed centrifugation (e.g., 2000 g for 10 min). Treat
the CPLL beads a second time under exactly the same condi-
tions and pool the two supernatants. Store this first eluate in
the cold.
(d) Mix then CPLL beads pellets with 200 μL of guanidine-HCL
solution and gently shake for 10 min. Separate the supernatant
by centrifugation and repeat the operation. Separate the sec-
ond supernatant by low-speed centrifugation and pool with
the first one. Store this second eluate in the cold.
(e) The two eluates are ready for protein analysis by chromatog-
raphy, mass spectrometry or electrokinetic methodologies.
7.2.2 Three-Step Elution (a) Prepare three different desorbing aqueous solutions. The first
with Increased Stringency is composed of 2 M thiourea, 7 M urea, and 2% CHAPS (here
(Option 1) named TUC). The second solution is composed of 9 M urea
acidified to pH 3 by acetic acid or citric acid (here named
UCA). The third solution is a mixture of acetonitrile, isopro-
panol, ammonia at 20% and water (6, 12, 10 and 72% respec-
tively) (here named AIAW).
Low-Abundance Plant Proteomics 395
7.2.3 Three-Step (a) Prepare three different desorbing aqueous solutions. The first
Increased Stringency is composed of 1 M sodium chloride, the second composed of
Elution (Option 2) 3 M guanidine-HCl pH 6.0 and the third comprising 9 M
urea titrated with citric acid up to pH 3–3.5.
(b) Proceed as three steps elution described above in Subheading
7.2.2.
(c) Store the three eluates in the cold.
(d) The eluates are ready for protein composition analysis by mass
spectrometry, chromatography or electrokinetic methodolo-
gies (see Subheading 8).
7.3 Direct on-Bead When the analysis of captured proteins is performed by the
Protein Digestion so-called shotgun approach, the most direct way to proceed is to
make a digestion of the captured proteins directly on the beads. The
method is derived from the in-solution digestion of proteins
[48]. The operation requires some excess of trypsin, since part of
it will be captured by the CPLL beads. Basically the process is as
follows:
(a) After protein capture on the peptide library beads (whatever
the method or the physicochemical conditions), the beads are
rapidly washed twice with 200 μL of 100 mM ammonium
bicarbonate containing 0.1% Rapigest (this is not mandatory,
but it facilitates the proteolysis process). This is obtained by
adding 1 mL of 100 mM ammonium bicarbonate to the 1 mg
Rapigest vial lyophilizate and shake gently for few minutes).
The bead suspension is then vortexed for few min.
(b) Add 300 μL of 10 mM DTT and heat the bead suspension at
65 C for 1 h under gentle stirring or occasional shaking.
396 Egisto Boschetti and Pier Giorgio Righetti
Fig. 4 SDS–polyacrylamide gel electrophoresis analysis of various fruit pulp protein extracts before and after
treatment with combinatorial peptide ligand library. (a) banana pulp [55]; (b) mango pulp [57]; (c) lemon pulp
[58]; (d) orange pulp [59]; (e) wolfberry pulp [60]; (f) avocado pulp [54]; (g) olive pulp [56]. By courtesy from
Boschetti and Righetti [36]
sample and 190 present only in the control. Overall 648 new
proteins have been detected via CPLLs. In the case of banana, out
of a total number of 1131 proteins identified, 849 were attributed
to the CPLL technology.
From olive fruit pulp [56], where only native extraction was
applied, the number of unique gene products found was only
252, but already much higher compared to what was known from
the literature. Examples of analysis of fruit proteins before and after
treatment with CPLLs are illustrated on Fig. 4.
To the large list of known protein allergens from plants there
are molecules that are below the detection limits. They can be
evidenced after treatment with CPLLs. One of the most represen-
tative examples is the discovery of low-abundance allergens from
cypress pollen [61]. From patient serum exposure the list of cypress
pollen allergens has been enriched of several new, never-described
species such as chaperone protein HSP104, a Sigma factor SigB
regulation protein (a hydrolase involved in stress regulation mech-
anism), and Rab-like protein. A number of other allergens have
been discovered using CPLLs in Hevea latex [35], mango [62], and
banana [55].
Low-Abundance Plant Proteomics 399
10 Notes
References
1. Jorrı́n-Novo JV, Maldonado AM, Echevarrı́a- nuclear subproteome studies in rice (Oryza
Zomeño S et al (2009) Plant proteomics sativa) endosperm. Electrophoresis
update (2007–2008): second-generation pro- 29:604–617
teomic techniques, an appropriate experimen- 13. Ribeiro M, Nunes-Miranda JD, Branlard G
tal design, and data analysis to fulfill MIAPE (2013) One hundred years of grain omics:
standards, increase plant proteome coverage identifying the glutens that feed the world. J
and expand biological knowledge. J Proteome Proteome Res 12:4702–4716
72:285–314 14. Xiong E, Wu X, Yang L et al (2014)
2. Agrawal GK, Rakwal R (2008) Plant proteo- Chloroform-assisted phenol extraction
mics: technologies, strategies, applications. improving proteome profiling of maize
Wiley, Hoboken embryos through selective depletion of high-
3. Agrawal GK, Job D, Zivy M et al (2011) Time abundance storage proteins. PLoS One 9:
to articulate a vision for the future of plant e112724
proteomics - a global perspective: an initiative 15. Tavakolan M, Alkharouf NW, Matthews B et al
for establishing the international plant proteo- (2014) SoyProLow: a protein database
mics. Proteomics 11:1559–1568 enriched in low abundant soybean proteins.
4. Boschetti E, Hernandez-Castellano LE, Righ- Bioinformation 10:599–601
etti PG (2019) Progress in farm animal prote- 16. Carpentier SC, Panis B, Vertommen A et al
omics: the contribution of combinatorial (2008) Proteome analysis for non-model
peptide ligand libraries. J Proteome 197:1–13 plants: a challenging but powerful approach.
5. Boschetti D’AA, Candiano G, Righetti PG Mass Spectrom Rev 27:354–377
(2018) Protein biomarkers for early detection 17. Gengenheimer P (1990) Preparation of
of diseases: the decisive contribution of CPLLs. extracts from plants. Methods Enzymol
J Proteome 188:1–14 182:174–193
6. Boschetti E, Fasoli E, Righetti PG (2015) The 18. Boschetti E, Bindschedler L, Tang C et al
discovery of low-abundance allergens by prote- (2009) Combinatorial peptide ligand libraries
omics analysis involving combinatorial peptide and plant proteomics: a winning strategy at a
ligand libraries. Jacobs J Allergy Immunol price. J Chromatogr A 1216:1215–1222
2:015 19. Wang W, Vignani R, Scali M et al (2004)
7. Hijazi M, Velasquez SM, Jamet E et al (2014) Removal of lipid contaminants by organic sol-
An update on post-translational modifications vents from oilseed protein extract prior to elec-
of hydroxyproline-rich glycoproteins: toward a trophoresis. Anal Biochem 329:139–141
model highlighting their contribution to plant 20. Méchin V, Damerval C, Zivy M (2007) Total
cell wall architecture. Front Plant Sci protein extraction with TCA-acetone. Meth-
5:395–405 ods Mol Biol 355:1–8
8. Millar DJ, Whitelegge JP, Bindschedler LV et al 21. Wessel D, Flugge UI (1984) A method for the
(2009) The cell wall and secretory proteome of quantitative recovery of proteins in dilute solu-
a tobacco cell line synthesising a secondary tions in the presence of detergents and lipids.
wall. Proteomics 9:2355–2372 Anal Biochem 138:141–143
9. Xu MS, Chen S, Wang WQ et al (2013) 22. Isaacson T, Damasceno CM, Saravanan RS et al
Employing bifunctional enzymes for enhanced (2006) Sample extraction techniques for
extraction of bioactives from plants: flavonoids enhanced proteomic analysis of plant tissues.
as an example. J Agric Food Chem Nat Protoc 1:769–774
61:7941–7948
23. Faurobert M, Pelpoir E, Chaı̈b J (2007) Phe-
10. Cho WK, Hyun TK, Kumar D et al (2015) nol extraction of proteins for proteomic studies
Proteomic analysis to identify tightly-bound of recalcitrant plant tissues. Methods Mol Biol
cell wall protein in rice calli. Mol Cells 355:9–14
38:685–696
24. Cereda A, Kravchuk AV, D’Amato A et al
11. Demirevska-Kepova K, Simova-Stoilova L, (2010) Proteomics of wine additives: mining
Kjurkchiev S (1999) Barley leaf RuBisCO, for the invisible via combinatorial peptide
RuBisCO-binding protein and RuBisCO acti- ligand libraries. J Proteome 73:1732–1739
vase and their protein/protein interactions.
Bulg. J Plant Physiol 25:31–44 25. Kim YJ, Wang Y, Gupta R et al (2015) Prot-
amine sulfate precipitation method depletes
12. Li G, Nallamilli BR, Tan F et al (2008) abundant plant seed-storage proteins: a case
Removal of high-abundance proteins for
Low-Abundance Plant Proteomics 403
51. Jorrı́n-Novo JV, Valledor-González L, Castil- 57. Fasoli E, Righetti PG (2013) The peel and pulp
lejo-Sánchez MA et al (2018) Proteomics anal- of mango fruit: a proteomic samba. Biochim
ysis of plant tissues based on two-dimensional Biophys Acta 1834:2539–2545
gel electrophoresis, in Advances in Plant Eco- 58. Fasoli E, Colzani M, Aldini G et al (2015)
physiology Techniques Lemon peel and Limoncello liqueur: A proteo-
52. Campos NA, Swennen R, Carpentier SC mic duet. Biochim Biophys Acta
(2018) The plantain proteome, a focus on 1834:1484–1491
allele specific proteins obtained from plantain 59. Lerma-Garcı́a MJ, D’Amato A, Simó-Alfonso
fruits. Proteomics 18:1700227 EF et al (2016) Orange proteomic fingerprint-
53. Singh P, Pitambara, Rajput RS et al (2018) ing: from fruit to commercial juices. Food
Proteomics approaches to study host pathogen Chem 196:739–749
interaction. J Pharmacogn Phytochem 60. D’Amato A, Esteve C, Fasoli E et al (2013)
7:1649–1654 Proteomic analysis of Lycium barbarum
54. Esteve C, D’Amato A, Marina ML et al (2012) (Goji) fruit via combinatorial peptide ligand
Identification of avocado (Persea americana) libraries. Electrophoresis 34:1729–1736
pulp proteins by nanoLC-MS/MS via combi- 61. Shahali Y, Sénéchal H, Poncet P (2018) The
national peptide ligand libraries. Electrophore- use of combinatorial hexapeptide ligand library
sis 33:2799–2805 (CPLL) in allergomics. Methods Mol Biol
55. Esteve C, D’Amato A, Marina ML et al (2013) 1871:393–403
In-depth proteomic analysis of banana (Musa 62. Gomez Cardona EE, Heathcote K, Teran ML
spp.) fruit with combinatorial peptide ligand et al (2018) Novel low-abundance allergens
libraries. Electrophoresis 34:207–214 from mango via combinatorial peptide libraries
56. Esteve C, D’Amato A, Marina ML et al (2012) treatment: a proteomics study. Food Chem
Identification of olive (Olea europaea) seed and 269:652–660
pulp proteins by nLC-MS/MS via combinato-
rial peptide ligand libraries. J Proteome
75:2396–2403
Chapter 29
Abstract
Cereal proteins have formed the basis of human diet worldwide, and their level of consumption is expected
to increase. The knowledge of the protein composition and variation of the cereal grains is helpful for
characterizing cereal varieties and to identify biomarkers for tolerance mechanisms. Grains produce a wide
array of proteins, differing under conditions. Quantitative proteomics is a powerful approach allowing the
identification of proteins expressed under defined conditions that may contribute understanding the
complex biological systems of grains. Isobaric tags for relative and absolute quantitation (iTRAQ) is a
mass spectrometry–based quantitative approach allowing, simultaneously, for protein identification and
quantification from multiple samples with high coverage. One of the challenges in identifying grains
proteins is their relatively high content (~90–95%) of carbohydrate (starch) and low protein (~4–10%)
and lipid (~1%) fractions. In this chapter, we present a robust workflow to carry out iTRAQ quantification
of the starchy rice grains.
Key words Protein biomarkers, Chalkiness, Isobaric tags for relative and absolute quantification
(iTRAQ), Oryza sativa, Seed proteomics
1 Introduction
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8_29, © Springer Science+Business Media, LLC, part of Springer Nature 2020
405
406 Marouane Baslam et al.
2 Materials
2.1 Plant Material Seeds from rice plants (Oryza sativa L. cv. Koshihikari) grown in
paddy field or controlled (Biotron LPH-1.5PH-NCII, Nihon-ika,
Tokyo, Japan) conditions (see Note 1).
2.3 Protein Digestion 1. Block Bath (CB-100A, AS ONE Corporation, Osaka, Japan).
and iTRAQ Labeling 2. Urea, 8.0 M.
3. Endoproteinase Lys-C (Wako, Tokyo, Japan), 1 μg μL1 (see
Note 2).
4. Trypsin (Wako, Tokyo, Japan), 1 μg μL1 (see Note 2).
5. iTRAQ Reagent Multiplex kit: (4-plex: 114, 115, 116, 117;
AB SCIEX, Foster, CA).
6. iTRAQ Dissolution buffer provided in iTRAQ kit.
7. Absolute Ethanol.
8. Reducing buffer: 50 mM Tris-(2-carboxyethyl) phosphine
(TCEP) (see Note 2).
9. Alkylating solution: 200 mM methyl methanethiosulfonate
(MMTS) (see Note 2).
3 Methods
3.1 Sample 1. Husk rice seeds with a grain huller (see Note 3).
Preparation 2. Polish 10 g of grain samples for 30–40 min using a rice milling
machine in order to remove the embryo and aleurone layer.
3.3 Protein Digestion 1. Resuspend immediately the protein pellet in 8 M urea. Solubi-
lize completely the protein sample if necessary, by incubation
overnight at 4 C or alternatively by sonication.
2. Determine the protein concentration by Pierce 660 nm. Pro-
tein Assay kit (Thermo Fisher Scientific) using bovine serum
albumin (BSA) as a standard.
3. Add to the protein solution (50 μg total protein) 2 μL of
dissolution buffer from iTRAQ kit and 2 μL of reducing buffer
(TCEP); mix well by vortexing for 15 s, and spin down briefly.
4. Incubate the mixture at 60 C for 1 h, and spin down the
solution.
Proteomics of Starchy Rice Grains 409
3.4 iTRAQ Peptide 1. Bring the iTRAQ reagents to label peptides provided as set of
Labeling four (iTRAQ® Reagent 114, 115, 116, and 117) out of the
freezer to room temperature. Spin down to bring the solution
to the bottom of the vial.
2. Add 500 μL of absolute ethanol provided in iTRAQ kit to each
vial of the iTRAQ Reagent. Vortex each vial for 30 s and then
spin down the solution.
3. Transfer the entire contents of each freshly prepared iTRAQ
reagent to their respective tryptic peptide sample tube. Vortex
each tube for 30 s to mix, then spin.
4. Incubate the iTRAQ labeling reaction tubes for 1 h at room
temperature (see Notes 9–12).
5. Add 400 μL of ultrapure water, vortex for 30 s, and spin down.
6. Combine the content of each iTRAQ reagent-labeled sample
tube into one tube.
7. Vortex to mix, then spin.
3.5 Cation Exchange 1. Set the cation exchange column in a 0.5 mL syringe and clean it
Chromatography with 1 mL of cleaning buffer to condition the cartridge. Keep
the injection flow in this and following steps at 1 drop per
second. Divert to waste.
2. Inject 2 mL of the Cation Exchange Buffer-Load. Divert to
waste.
3. Slowly inject (¼1 drop/second) the mixed iTRAQ-labeled
peptide samples onto the cation-exchange cartridge and collect
the flow-through in a sample tube (see Notes 13 and 14).
4. Wash with 1 mL of loading buffer (see Note 15).
5. To elute the peptide, slowly inject (¼1 drop/s) 500 μL of
elution buffer. Collect the eluted peptides as a single fraction
in an Eppendorf tube.
410 Marouane Baslam et al.
3.6 C18 Spin Columns 1. Add 500 μL of 1% (v/v) formic acid to acidify the peptides
and Peptide Desalting solution.
2. Add 100 μL of 80% acetonitrile in 1% formic acid.
3. Centrifuge the C18 cartridge at 5000 g for 2 min.
4. Equilibrate the C18 column by adding 1% formic acid to the
cartridge. Centrifuge at 10,000 g for 2 min.
5. Transfer completely the iTRAQ-labeled peptides onto the
equilibrated C18 cartridge. Centrifuge at 10000 g for
2 min and collect the flow-through. Load the flow-through
fraction again onto the C18 column and centrifuge at
10,000 g for 1 min (see Note 16).
6. Wash off the column by adding 1.5 mL of 1% formic acid (see
Note 17).
7. Elute the peptides by adding 600 μL of 80% acetonitrile in 1%
formic acid. Collect the flow-through in new Eppendorf tube
the eluted peptides by centrifugation at 10,000 g for 2 min.
8. Dry the desalted peptides in a speed vacuum for further ana-
lyses by MS/MS.
4 Notes
Fig. 1 Photograph of starchy grain aspects during the steps of protein extraction process
Acknowledgments
References
type cotton (Gossypium hirsutum L.). J Prote- phosphoprotein characterization reveals the
ome 126:68–81 central metabolism changes involved in wheat
15. Cui Y, Yang MM, Dong J et al (2017) iTRAQ- grain development. BMC Genomics 15:1029
based quantitative proteome characterization 18. Dorfer V, Pichler P, Stranzl T et al (2014) A
of wheat grains during filling stages. J Integr universal identification algorithm optimized
Agric 16:20156–22167 for high accuracy tandem mass spectra. J Pro-
16. Yang MM, Yang J, Dong WC et al (2016) teome Res 13:3679–3684
Characterization of proteins involved in early 19. Elias JE, Haas W, Faherty BK et al (2005)
stage of wheat grain development by iTRAQ. J Comparative evaluation of mass spectrometry
Proteome 136:157–166 platforms used in large-scale proteomics inves-
17. Ma CY, Zhou JW, Chen GX et al (2014) tigations. Nat Methods 2:667–675
iTRAQ-based quantitative proteome and
INDEX
Jesus V. Jorrin-Novo et al. (eds.), Plant Proteomics: Methods and Protocols, Methods in Molecular Biology, vol. 2139,
https://doi.org/10.1007/978-1-0716-0528-8, © Springer Science+Business Media, LLC, part of Springer Nature 2020
415
PLANT PROTEOMICS: METHODS AND PROTOCOLS
416 Index
I Mass spectrometry (MS)..................................... 5, 69, 90,
108, 118, 133, 147, 157, 169, 189, 204, 226,
Identification ....................................................... 3, 23, 57, 242, 259, 273, 310, 341, 354, 382, 405
69, 80, 89, 109, 127, 133, 148, 158, 170, 183, Mass spectrometry imaging (MSI) ..................... 341–350
207, 215, 226, 244, 257, 274, 290, 297, 310, Matrix-assisted laser desorption/ionization
325, 359, 369, 386 (MALDI) .................................................. 341–350
Immunoaffinity ...............................................5, 260, 261, MaxQuant ................................. 109, 112, 113, 214–219,
276, 281, 286 221, 250, 253, 259, 260, 266, 267
Immunoaffinity purification ................................... 5, 260, Medicago truncatula ............................................ 341–350
276, 281, 286 Membrane trafficking................................................80, 90
Immunoprecipitation (IP) .................................. 241–255, Metabolic pathways.......................................... 8, 367–379
259, 280–283, 290–293, 295 Metabolites ............................................. 7, 12–16, 22–24,
In gel digestion............................................90–93, 97–99, 30, 32–37, 179, 241, 310, 341, 367–371, 373,
102, 291, 293, 294 375, 376, 378, 379, 383
Inhibitors ............................................................. 6, 74, 77, Metabolomics ................................. 23, 30, 349, 368–370
81, 82, 119, 128, 150, 151, 180, 188, 189, 198, Microalgae ....................................................7, 11–20, 197
201, 209, 228, 260, 276, 281, 285, 292, Microdomain ...........................................................89–105
353–365, 386, 387, 400 Moniliophthora roreri .................................................... 136
In silico analysis ....................................... 6, 325–338, 368 Multiple co-inertia analysis (MCIA) .............................. 22
In solution digestion..........................................90, 93, 98,
102, 103, 110, 111, 113, 273, 277, 283, 284, 395 N
Interaction networks.................................. 5, 21–55, 257,
259, 267, 375 Nano-LC-MS/MS .........................................89–105, 125
Interactome ....................................................................... 4 N-linked glycans................................................... 225, 226
In vivo cross-linking............................................. 273–286 Non-model species........................................................ 158
Isobaric tags for relative and absolute quantification Nucleus .................................................... 70, 71, 124, 242
(iTRAQ)...............................................3, 118, 134,
O
148, 183, 405–413
Isolation .......................................................11–20, 70–76, Offline fractionation............................................. 241–255
108, 117–129, 163, 173, 193, 194, 206, 218, Orbitrap ...........................................................3, 109, 112,
221, 223, 253 127, 144, 163, 172, 174–177, 192–194, 202,
Isotopic variants ...........................................133–135, 144 206, 213–216, 219, 229, 266, 269, 292, 294,
345, 408, 410, 411
L Orphan plant species........................................ 8, 157–167
Label-free.........................................................3, 5, 6, 118,
P
119, 125, 148, 183, 197–210, 215, 257–269,
300–302, 310 Parallel reaction monitoring (PRM) ...................... 5, 170,
Label-free quantification (LQF)............................. 5, 148, 213–224
197–210, 215, 257–269, 301 Partial least squares (PLS) ...........................22, 25, 30–33
Ligand binding-sites ..................................................... 326 Partial least square-discriminant analysis
Lipids .................................................... 7, 12–16, 89, 107, (PLS-DA).......................................................22, 35
147, 179, 268, 349, 373, 383, 387, 399 Peptides ........................................................ 4, 65, 86, 89,
Liquid chromatography coupled to tandem mass 108, 121, 133, 147, 158, 169, 182, 198, 213,
spectrometry (LC MS/MS) ................. 5, 89–105, 230, 242, 260, 274, 291, 299, 310, 325, 342,
112, 125, 137, 139, 144, 147, 149, 160, 353, 382, 409
169–174, 176, 199, 200, 202, 206, 229, 232, Peroxidases class III ............................................. 325–338
238, 252–253, 260, 262, 266, 284, 290, 291, Phenol protein extraction .................................... 314, 316
294, 295, 310, 317, 354, 356, 357, 360, 397, 411 Phosphopeptides ........................................ 148, 149, 151,
Low-abundance protein......................180, 310, 381–401 153, 155, 180, 181, 189, 191–193, 198–201,
Lysine acetylation........................................ 148, 242, 253 205, 206, 208, 213–224
Phosphoproteome...................................... 181, 183, 184,
M 198, 199, 202, 242, 385
Mascot ..................................................57, 121, 127, 128, Phosphoproteomics ................................... 148, 179–194,
142–144, 193, 202, 207, 317–319, 360 197–210
PLANT PROTEOMICS: METHODS AND PROTOCOLS
Index 417
Phosphorylation ..................................... 7, 135, 147–155, SEQUEST .......................................................57, 65, 160,
180, 184, 198, 199, 207, 215, 223, 224, 241 163, 294, 295, 411
Pigments ........................................................7, 12–16, 19, Shotgun .......................................................................3, 25
147, 383, 384, 387 Signaling .................................................70, 71, 107, 108,
Pinus ................................................................................ 58 111, 180, 198, 241, 289, 373, 382, 399
Plasma membrane ...........................................79, 89–105, Single amino acid polymorphisms (SAAP).................298,
107–114, 119, 327 302, 304
Pollen .......................................................... 274, 275, 279, Skyline......................................... 171–174, 214, 216–222
285, 286, 383, 398 Sodium dodecyl sulfate polyacrilamide gel electrophoresis
Polyacrylamide gel electrophoresis (PAGE) ................. 97, (SDS-PAGE)..................................... 77, 110, 111,
158, 259, 261, 357, 358, 398 123, 161, 190, 263–264, 273, 275, 279–282,
Polyethylene glycol (PEG) ........................ 109, 312–314, 291, 293, 314, 355, 359, 383, 396, 401
316, 383 Sparse partial least squares (sPLS)........ 30–34, 44, 45, 47
Polyploidy...................................................................... 298 Stable-isotope labeling......................................... 133, 148
Post-translational modification (PTM)................ 4–6, 54, Subcellular ....................................................4, 69–78, 118
58, 70, 135, 147–149, 179, 180, 194, 198, 213, Substrate ....................................................... 80, 136, 223,
225, 226, 241, 259, 298, 325–338, 405 242, 354, 362, 367
Principal components analysis (PCA) ......................21, 22 Substrate channel analysis.................................... 333, 336
Protease inhibitors .............................................. 6, 74, 77, Substrates.............................................................. 325–338
81, 82, 119, 128, 188, 201, 209, 260, 276, 281, Sweetpotato ...................................................... 6, 309–323
285, 353–365, 386 SYPRO Ruby............................................... 291, 293, 295
Proteases ....................................................... 6, 16, 74, 77,
81, 82, 92, 99, 119, 128, 134, 179, 180, 188, T
201, 204, 207, 209, 253, 260, 274, 276, 281, Tandem mass tags (TMT) ..................134, 147–155, 183
285, 353–365, 382, 383, 386, 387, 400
Targeted data acquisition (TDA) ....................... 170–172,
Protein networks .......................................................42, 49 174–176
Protein-protein interaction...............................25, 28, 37, Targeted quantification................................171, 213–224
40, 49, 70, 225, 259, 267, 268, 273, 274, 289
Tertiary structure ....................................... 326, 328, 331,
Proteogenomics ............................................... 6, 309–323 332, 334, 335
Purification ................................................. 3, 5, 6, 16–19, TiO2-based phosphopeptide enrichment ........... 199, 202
69–78, 81, 83–84, 90–91, 93–96, 100–101, 183, Tomato ...................................5, 274, 289–295, 354, 362
202, 205, 210, 259, 273, 276, 281, 284, 286,
Topologies ...................................... 47, 54, 326, 327, 330
356, 357, 364 Transcriptomics .6, 7, 23, 30, 49, 57–67, 198, 213, 310,
322, 368–370
Q
Transmembrane domains ..............................90, 102, 107
Quantitative proteomics .......................... 6, 80, 133–145, Tropical fruits ....................................................... 179–194
147–149, 169–177, 405, 406 Trypsin ...................................................... 89, 92, 93, 100,
Quercus ilex.......................................... 8, 57–67, 157–167 108–111, 113, 114, 121, 125, 127, 134, 136,
138, 142, 150, 152, 160, 162, 172, 180, 189,
R 191, 200, 201, 204, 207, 228, 230, 233, 237,
Rice (Oryza sativa) .......................................57, 107–114, 243, 245, 248, 253, 254, 259, 260, 262, 265,
118, 225, 309, 382, 405–413 273, 277, 284, 286, 291, 293–295, 299, 355,
RNA-seq analysis.......................... 24, 26, 49, 57, 60, 298 360, 395–397, 407, 409–412
Two-dimensional gel electrophoresis (2-DE) ....... 89, 90,
Root nodules ........................................................ 341–350
157, 382, 384
S Two-phase partitioning ...................................91, 95, 104