Professional Documents
Culture Documents
Introduction (Sequence-Structure-Function
Relationships)
• Proteins of similar sequences fold into similar
structures and perform similar biological
functions.
• The protein sequence has the intrinsic
information to encode the protein structure.
Introduction
• Structure-based drug design, analysis of
protein function, interactions, antigenic
behaviour, and rational design of proteins with
increased stability or novel functions.
• Many proteins are simply too large for NMR
analysis and cannot be crystallized for X-ray
diffraction.
Structure Prediction from sequence
If two aligned residues differ, only the backbone coordinates (N,Cα,C and O)
can be copied.
Loop Modeling
• In the majority of cases, the alignment between
model and template sequence contains gaps.
• Either gaps in the model sequence (deletion) or in
the template sequence (insertions).
• In the first case, one simply omits residues from the
template, creating a hole in the model that must be
closed.
• In the second case, one takes the continuous
backbone from the template, cuts it, and inserts the
missing residues.
Loop Modeling
• Knowledge based: one searches the PDB for
known loops with endpoints that match the
residues between which the loop has to be
inserted, and simply copies the loop conformation.
• Energy based: as in true ab initio fold prediction,
an energy function is used to judge the quality of a
loop. Then this function is minimized, using Monte
Carlo (Simons et al., 1999) or molecular dynamics
techniques
Quantitative comparison between model
and experimental 3D structure using
RMSD
• 0.0-0.5 Å • Essentially Identical
• <1.5 Å • Very good fit
• < 5.0 Å • Moderately good fit
• 5.0-7.0 Å • Structurally related
• > 7.0 Å • Dubious relationship
• > 12.0 Å • Completely unrelated
Taken from: http://redpoll.pharmacy.ualberta.ca/bioinfo301/Bioinf301-3Dvisualize.ppt
BIOINFORMATICS PRACTICAL ON
HOMOLOGY MODLING
• Q. Perform a homology model of the given
protein sequence and generate the structure
of the structure in PDB format. The sequence
of the model is given below.
> protein
MPHKEKHPLQDMFTSAIEAVARDSGWAELSAVGSY
LAKNDPSFDPRNWGHGRLSQMVKKLDFLTVQESRN
GSKLHSEIRLRHDG
Model Validation
• Various model validation parameters can be
considered such as RMSD , Ramachandran
Plot, Z-score value from ProSA-web server and
data from other server such as PROCHECK,
ERRAT and Verify3D programs, available at the
SAVES server.
• SUPERPOSE SERVER can be used for
calculation of RMSD value.