Professional Documents
Culture Documents
It is certain that proteins are extremely complex molecules but they are no
longer beyond the reach of the chemist.
— F. Sanger
Key concepts
• Proteins are made up of twenty different amino acids which are arranged
in a unique order in each protein. This is known as the primary structure
of a protein.
• Modern protein sequencing is done by using automated protein
sequencers which are based on the principle of Edman degradation.
• Tandem mass spectrometry is also used to sequence the proteins and
has many advantages over Edman sequencing.
• For protein sequencing, proteins have to be purified to homogeneity
using liquid chromatography.
• Intact proteins cannot be sequenced using a mass spectrometer. Purified
protein is treated with trypsin to convert it into peptides of a small size
suitable for mass spectrometric analysis.
• The Edman degradation method has some limitations and is one of the
important methods for protein sequencing. Mass spectrometry has
become an alternative to the Edman degradation method where
N-terminally modified amino acid cannot be modified.
14.1 Introduction
1. Edman degradation.
2. De novo sequencing by mass spectrometry.
Since the disulfide bonds between two cysteine amino acids interfere with
protein sequencing, they must be reduced. Usually this is done by the
reducing agent beta mercaptoethanol followed by iodoacetic acid to convert
the reduced SH group of the disulfide bonds. It can also be oxidized to
cysteic acid using performic acid.
The samples can be supplied in lyophilized form but it is preferable for them
to be on a PVDF membrane. The liquid sample may contain salts, SDS,
buffers, free amino acids which will contaminate the sequencing machine.
Sequencing cannot be done with large proteins. They must be cleaved with
some proteases to get the smaller peptides that are suitable for sequencing.
Recommended reading
Banderia, N., Clauser, K.R. and Pevzner, P.A. (2007) ‘Shotgun protein sequencing
assembly of peptide from tandem mass spectra from mixture of modified
protein’, Molecular and Cellular Proteomics, 6: 1123–34.
Roepstorff, P. and Fohlman, J. (1984) ‘Proposal for a common nomenclature for
sequence ions in mass spectra of peptides’, Biomedical Mass Spectrometry, 11: 601.
Smith, B.J. (2007) Protein Sequencing Protocols in Methods in Molecular Biology,
Vol. 211, 2nd edn. Totowa, NJ: Humana Press.
Standing, K.G. (2003) ‘Peptide and protein de novo sequencing by mass
spectrometry’, Current Opinion in Structural Biology, 13: 595–601.
Stretton, A.W. (2002) ‘The first sequence: Fred Sanger and insulin’, Genetics, 162:
527–32.
Zhong, H., Zhang, Y., Wen, Z. and Li, L. (2004) ‘Protein sequencing by mass
analysis of polypeptide ladders after controlled protein hydrolysis’, Nature
Biotechnology, 22: 1291–6.