Professional Documents
Culture Documents
in a database.
1970 : Term Bionformatics coined by Pauline Hogweg and
Ben Hesper.
1970 : Needleman Wunch algorithm
1972 : Smith Waterman algorithm
1973 : Multiple Sequence alignment
1979 : First DNA database
1980 : First complete gene sequence for an organism was
published
1989 : First complete genome map was published
(Haemophilia influenza)
1990 : Human genome project started
1970 : PDB
1974 : Genbank
1980 : EMBL
1984 : DDBJ
1984 : SwissProt
1988 : FASTA by Person and Lipman
1990 : BLAST by Altshul Gish and Lipman
Scope of Bioinformatics
1. Structural bioinformatics
Prediction of structure from sequence
Analysis of 3D Structure
Structural databases
Analysis and comparison of biomolecular structures
2. Genomics
Study of genes at mRNA, protein and DNA level
Includes structural genomics, functional genomics, comparative genomics.
Genome sequencing
Genome annotation
Genome assembling
Study gene mutations
Techniques used are PCR amplification, DNA sequencing, DNA
hybridisation
3. Proteomics
Study of proteome using various technologies.
Assess activities, modifications, localisation and interactions of
proteins.
Identification and characterization of proteins.
Protein separation
Consists of structural proteomics, functional proteomics and
expression proteomics.
Various proteomics techniques are SDS PAGE, mass spectrometry
etc.
4. Cheminformatics
The use of physical chemistry theory with computer and
information science techniques.
Encompasses the design, creation, organisation, management,
retrieval, analysis and use of chemical information.
Molecular similarity
QSAR
5. Animal Bioinformatics
Deals with computer added study of genomics, proteomics and
metabolomics in various animal species.
The study includes study of gene mapping, gene sequencing,
animal breed etc.
Illustration of the relationships within the animal data and
also between animals.
6. Plant bioinformatics
Deals with computer aided study of plant species.
It is further divided into agriculture bioinformatics,
horticulture bioinformatics, medicinal plant
bioinformatics and forest plant bioinformatics
Major research fields
Sequence alignment
Gene finding
Genome assembly
Drug design
Drug discovery
Protein structure alignment
Protein structure prediction
Gene expression
Protein – protein interaction
NGS
Biological databases
Heterogeneity
High volume data
Data integration
Data curation
Data sharing
Types of biological databases
It is a protein database.
It consists of entries describing the protein families, domains
and functional sites as well as amino acid patterns.
They are manually curated by a team of Swiss Institute of
Bioinformatics.
Classification of proteins in Prosite is determined using single
conserved motifs, i.e., short sequence regions (10–20 amino
acids) that are conserved in related proteins
Pfam