Professional Documents
Culture Documents
2016 Proteomics IIa Sequencing PDF
2016 Proteomics IIa Sequencing PDF
Proteomics IIa
Analyzing Protein Sequence
Learning objectives
1. To learn the steps and method in direct protein
sequencing
Edman degradation
In practice peptides
cannot be much longer
than about 50 residues.
1.
2.
3.
4.
5.
6.
7.
8.
L-dopa
N -CH2OH
Ethanol
COOH
HO
-CH2CH-NH2
CH3CH2OH
HO
HO
MW = 327.1
MW = 197.2
MW = 46.1
Used for;
Determining protein ID
Determining the sequence and structure of protein
Detecting protein modifications
Mass Spectrometer
Q Exactive Orbitrap
LTQ Orbitrap XL
Ionizer
Mass analyser
Ion trap
Orbitrap
Detector
Principal Workflow of MS
MALDI
ESI
TOF
Quadrupole
Ion trap
Orbitrap
nicotinic acid:
absorption at 266 nm
sinapinic acid:
absorption 337-353 nm
-cyano-4-hydroxycinnamic acid:
absorption 337-353 nm
Requires 10 L of
1 pmol/mL sample
ion source
detector
time of flight
detector
reflector
time of flight
detector
time of flight
ion source
detector
reflector
time of flight
Reflectron
focuses ions with the same m/z values, making these ions reach
the detector at the same time.
more accurate in the reflectron mode than in the linear mode
Electron Multiplier
(Dynode)
Characterized
by sharp,
narrow peaks
X-axis position
indicates the
m/z ratio of a
given ion
Relative Abundance
Dibutylphthalate
Enzymatic fragmentation
Chemical fragmentation
trypsin, chymotrypsin,
pepsin, staphylococcal
protease
cyanogen bromide
Trypsin
Most important
Cleaves peptide bond after
positively charged AAs
Principles of Fingerprinting
Sequence
Protein 1
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
Protein 2
acekdfhsadfqea
sdfpkivtmeeewe
nkdadnfeqwfe
Protein 3
acedfhsadfqeka
sdfpkivtmeeewe
ndakdnfeqwfe
Mass (M+H)
Tryptic Fragments
4842.05
acedfhsak
dfgeasdfpk
ivtmeeewendadnfek
gwfe
4842.05
acek
dfhsadfgeasdfpk
ivtmeeewenk
dadnfeqwfe
4842.05
acedfhsadfgek
asdfpk
ivtmeeewendak
dnfegwfe
Principles of Fingerprinting
Sequence
Mass (M+H)
Protein 1
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
4842.05
Protein 2
acekdfhsadfqea
sdfpkivtmeeewe
nkdadnfeqwfe
4842.05
Protein 3
acedfhsadfqeka
sdfpkivtmeeewe
ndakdnfeqwfe
4842.05
Mass Spectrum
57.02147
71.03712
87.03203
97.05277
99.06842
101.04768
103.00919
113.08407
113.08407
114.04293
Aspartic acid
Glutamine
Lysine
Glutamic acid
Methionine
Histidine
Phenylalanine
Arginine
Tyrosine
Tryptophan
115.02695
128.05858
128.09497
129.0426
131.04049
137.05891
147.06842
156.10112
163.06333
186.07932
Tryptic Frags
Calc. Mass
P12345 (Protein 1)
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
acedfhsak
dfgeasdfpk
ivtmeeewendadnfek
gwfe
1007.4251 (P12345)
1183.5266 (P12345)
2098.8909 (P12345)
609.2667 (P12345)
P21234 (Protein 2)
acekdfhsadfqea
sdfpkivtmeeewe
nkdadnfeqwfe
acek
dfhsadfgeasdfpk
ivtmeeewenk
dadnfeqwfe
450.2017 (P21234)
1740.7501 (P21234)
1407.6462 (P21234)
1300.5116 (P21234)
P89212 (Protein 3)
acedfhsadfqeka
sdfpkivtmeeewe
ndakdnfeqwfe
acedfhsadfgek
asdfpk
ivtmeeewendak
dnfegwfe
1526.6211 (P89212)
664.3300 (P89212)
1593.7101 (P89212)
1114.4416 (P89212)
Database Search
Query Masses
450.2201
609.3667
698.3100
1007.5391
1199.4916
2098.9909
Results
2 Unknown masses
1 hit on P21234
3 hits on P12345
800
1200
1600
2000
m/z
Experimental
P12345
P21234
P89212
2400
Protein ID
Theoretical
Theoretical
Theoretical
60 kDa; 57 461 Da
pI = 4.75
http://us.expasy.org/tools/peptide-mass.html
533.3
544.3
545.3
614.4
634.3
674.3
675.4
701.4
726.4
822.4
855.5
861.4
879.4
921.5
953.4
974.5
988.5
1000.6
1196.6
1217.6
1228.5
1232.6
1233.7
1249.6
1249.6
1344.7
1455.8
1484.6
1514.8
1582.9
1583.9
1616.8
1726.7
1759.9
1775.9
1790.6
1853.9
1869.9
2286.2
2302.2
2317.2
2419.2
2526.4
2542.4
3329.6
4211.4
http://us.expasy.org/tools/peptide-mass.html
2D-PAGE approach
Multidimensional LC approach
Fragmented
using trypsin
Spectrum of
fragments
generated
MALDI-TOF
Library
Mascot
Artificial
spectra built
MATCH
Artificially
trypsinated
Database of
sequences
(i.e. SwissProt)
http://www.uniprot.org/
Mascot
Tutorial
http://www.matrixscience.com/help/pmf_help.html
Data entry
http://www.matrixscience.com/cgi/search_form.pl?FORMVE
R=2&SEARCH=PMF
Search result
http://www.matrixscience.com/cgi/master_results.pl?file=../
data/F981122.dat
Peptide Sequence
Mass (M+H)
vgahageygaealer
1529.74
igghgaeygaealer
1529.74
Capillary,
3-4 kV
The applied
voltages affect the
trajectory of ions
traveling down the
flight path
Orbitrap
Full-scan mode
Collision
MS-MS mode
Ion trap
Collision-induced
dissociation (fragmentation)
of a selected ion
A-V-A-G-C-A-G-A-R
769
Collision Cell
A-V-A-G-C-A-G-A
A-V-A-G-C-A-G
A-V-A-G-C-A
A-V-A-G-C
A-V-A-G
A-V-A
A-V
A
601
530
473
402
299
242
171
72
Accumulative
mass
Nomenclature for
fragmentation of peptide ions
b1 A+
y8 V-A-G-C-A-G-A-R+
b2 A-V+
y7 A-G-C-A-G-A-R+
b3 A-V-A+
y6 G-C-A-G-A-R+
b4 A-V-A-G+
y5 C-A-G-A-R+
b5 A-V-A-G-C+
y4 A-G-A-R+
b6 A-V-A-G-C-A+
y3 G-A-R+
b7 A-V-A-G-C-A-G+
y2 A-R+
b8 A-V-A-G-C-A-G-A+
y1 R+
MS
m/z profile of
peptide ion
MS/MS
Fragments of
peptide ion
Identification of
amino acid
Theoretical MS/MS
Spectra
100
95
#1
CALCULATE #2
#3
#4
#5
90
85
80
75
658.36
70
65
900.36
Relative Abundance
60
55
1031.40
50
45
913.42
40
1240.53
782.23
896.29
35
546.19
771.24
25
1028.41
721.31
20
431.15
15
801.38
427.27
559.13
651.14
408.74
399.24
217.91
1241.39
914.34
317.17
10
5
1032.43
895.33
30
K.TVLIMELINNVAK.K
L.NAKMELLIDLVKA.Q
E.ELAILMQNNIIGE.N
A.CGPSRQNLLNAMP.S
L.FAPLQEIINGILE.G
432.40
669.39
1027.22
915.53
986.50
882.07
600.24
481.13
869.23
1258.56
1033.60
1312.35
1142.43
1123.49
1356.10
1195.44
0
200
300
400
500
600
700
800
900
1000
1100
1200
1300
1400
m/z
COMPARE
SCORE
Sequest
Paragon Algorithm
Mascot
Proteome Discoverer
Maxquant
Analyst QS
FAST, Gel-free
Determines MW and AA sequence
Can be used on complex mixtures-including low abundant
proteins
Can detect post-translational modification
High-thoughput capability
Disadvantages
Hardware: $1000
Setup: $300
1 run: $1000
Create ions
Separate ions
Ionization
methods
MALDI
ESI
Mass analyzers
TOF
MALDI-TOF: MW
Quadrupole
ESI-QQQ: AA seq
MALDI-QTOF: MW & AA seq
Ion trap
Orbitrap
Detect ions
Mass spectra
Database search
& matching
MS spectra
Mascot search program
MS/MS spectra