Professional Documents
Culture Documents
Bioinformatics is:
driven by the generation of data,
moderated by hardware and analysis methods
Computing power
Analysis methods
V
L
S
E
G
E
W
Q
L O2
V
L
V
.
.
.
Sequence structure function
to perform a
folds into a 3D
Deciphering protein structure and function
• Commercial/Proprietary SW-Tools
• Public Domain Software
• Internet OnLine Resources
NAILS
=
DATA
HAMMER
=
BIOINFORMATICS
Bioinformatics Tools
ClustalW(-mpi)
produces biologically meaningful multiple sequence
alignments of divergent sequences.
t-coffee (CNRS, France)
is a multiple sequence alignment package.
Sequence Analysis Tools
blast (NCBI)
fast similarity searches in biological sequence databases.
Mutations on MutDB
are mapped to protein
structure
Extension in Chimera
queries MutDB
Controller window
identifies mapped
mutation positions
which are highlighted
structurally
Future work
Web services for identifying regions of structural similarity
between a query protein and a database of protein structures
Chimera PyMOL
matplotlib
Examples of Bioinformatics databases
Database interfaces
Genbank/EMBL/DDBJ, Medline, SwissProt, PDB, …
Sequence alignment
BLAST, FASTA
Gene finding
Genscan, GenomeScan, GeneMark, GRAIL
Pattern Identification/Characterization
Gibbs Sampler, AlignACE, MEME
Genbank/Genpept/Structures
BLAST server(s)
Five-plus flavors of blast
Curation!!!
Error rate in the information is greatly reduced in comparison
to most other databases.
Extensive cross-linking to other data sources
SwissProt is the ‘gold-standard’ by which other
databases can be measured, and is the best place
to start if you have a specific protein to investigate
A few more resources to be aware of
Human Genome Working Draft
http://genome.ucsc.edu/
Celera
http://www.celera.com/
Arabidopis: http://www.tair.org/
Mouse: http://www.jax.org/
Fruitfly: http://www.fruitfly.org/
Nematode: http://www.wormbase.org/
Functional genomics
Annotation, experimental design, integration
Pathways
Current DBs incomplete
Data model?
Processes
How to model?
Market definition
“Bioinformatics” is poorly defined/segmented
Commodity pricing
Customers are conditioned to use “point & click” black boxes
Value is disguised
Service mentality
Technology seen as subservient to wet lab data
Bioinformatics In Pakistan