Professional Documents
Culture Documents
Introduction To Bioinformatics
Introduction To Bioinformatics
Introduction to Bioinformatics
English Courses for Graduate Students
1
Introduction to Bioinformatics
English Courses for Graduate Students
Chapter 1
Introduction
2
Introduction to Bioinformatics
English Courses for Graduate Students
About me
• Dr. rer. nat. Jing Gong
• Bachelor Degree in Marine Biology at the China
Ocean University (former Qingdao Ocean
University)
• Bachelor, Master & Doctoral Degree in
Bioinformatics at the Ludwig Maximilians
Universität München, Germany
• Affiliation: Cancer Research Center of SDU
• Tel: 0531-88380202
• Email: gongjing@sdu.edu.cn
• Office: Dianjing Building, Rm.106, Baotuquan
Campus
3
Introduction to Bioinformatics
English Courses for Graduate Students
Chapter 5 : Tree
4
Introduction to Bioinformatics
English Courses for Graduate Students
Literatures:
1. Bioinformatics - An Introduction, 2nd Edition, Jeremy Ramsden, 2009, Springer.
2. Bioinformatics For Dummies, 2nd Edition, Jean-Michel Claverie, Cedric Notredame, 2007, Wiley.
5
Introduction to Bioinformatics
English Courses for Graduate Students
6
Introduction to Bioinformatics
English Courses for Graduate Students
What is Bioinformatics?
biophysics
biohazards
biometrics
biomathematics
biochemistry
bioterrorism
biopotato
bioinformatics
7
Introduction to Bioinformatics
English Courses for Graduate Students
a biology/medical
researchers, just like you
a professional in the
pharmaceutical industry
a computer scientist
developing bio-databases
a consumer concerned
about GMOs (Genetically
Modified Organisms)
……
8
Introduction to Bioinformatics
English Courses for Graduate Students
What is Bioinformatics?
Definition:
Bioinformatics – the science of collecting and analyzing complex
biological data such as genetic codes. [Oxford Dictionary]
Bioinformatics – the computational branch of molecular biology.
[Bioinformatics for Dummies]
9
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1809, French biologist Jean Baptiste Lamarck published “Philosophie
Zoologique”. Lamarck stressed two main themes in his biological work:
1. The environment gives rise to changes in animals, i.e. changes
through use and disuse.
2. Life was structured in an orderly manner and that many different parts
of all bodies make it possible for the organic movements of animals.
“blind as a mole” “show your teeth” “birds have no teeth?” Jean Baptiste Lamarck
(1744-1829)
10
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1859, English naturalist Charles Darwin published “On the Origin of
Species by Means of Natural Selection, or the Preservation of Favoured
Races in the Struggle for Life”.
Charles Darwin
(1809-1882)
11
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
12
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1869, Swiss physician and biologist Friedrich Miescher isolated
DNA from the white blood cells at Felix Hoppe-Seyler's laboratory at
the University of Tübingen, Germany.
Friedrich Miescher
(1844-1895)
13
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
Thomas Hunt Morgan, American geneticist, famous for his experimental
research with the fruit fly by which he established the chromosome theory
of heredity. He showed that genes are linked in a series on chromosomes
and are responsible for identifiable, hereditary traits. Morgan’s work
played a key role in establishing the field of genetics. He received the
Nobel Prize for Physiology or Medicine in 1933.
Thomas H. Morgen
(1866-1945)
nobel prize 1933
14
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1944, American physician and medical researcher Oswald Avery and
his co-workers Colin MacLeod and Maclyn McCarty demonstrated that
DNA is the material of which genes and chromosomes are made.
History of Bioinformatics
In 1950, American biochemist Erwin Chargaff noticed a pattern in the
amounts of the four bases: adenine (A) , thymine (T) , cytosine (C) ,
guanine (G). He discovered that the amounts of adenine (A) and
thymine (T) in DNA were roughly the same, as were the amounts of
cytosine (C) and guanine (G). This later became known as Chargaff's
rule.
%A = %T and %G = %C
Erwin Chargaff
(1905-2002)
16
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1953, James D. Watson and Francis Crick
suggested the first correct double-helix model of
DNA structure in the journal Nature. Their double-
helix model of DNA was based on a single X-ray
diffraction image taken by Rosalind Franklin and
Maurice Wilkins in 1952.
17
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
The sequence of 77 nucleotides of a yeast
alanine tRNA was found by an American
biochemist Robert W. Holley in 1965. Holley
was awarded the 1968 Nobel Prize in
Physiology or Medicine for describing the
structure of this tRNA, linking DNA and
protein synthesis.
Robert W. Holley
(1922-1993)
nobel prize 1968
18
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1977, Frederick Sanger and
Colleagues introduced the
“dideoxy” chain-termination
method for sequencing DNA
molecules, also known as the
“Sanger method”. Hence, in
1980, he shared Nobel Prize in
chemistry with Walter Gilbert.
History of Bioinformatics
Read protein sequence directly in the DNA sequence!
20
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
Marshall Warren Nirenberg shared a
Nobel Prize in Physiology or Medicine
in 1968 with Har Gobind Khorana and
Robert W. Holley for "breaking the
genetic code" and describing how it
operates in protein synthesis.
21
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
22
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
insulin = (
# 1-letter 3-letter Nmae 30 glycines +
1 A Ala Alanine
A given type of protein 44 alanines +
2 R Arg Arginine always contains the same 5 tyrosines +
3 N Asn Asparagine
number of total amino acids 14 glutamines
4 D Asp Aspartic acid
+ . . .)
5 C Cys Cysteine in the same proportion.
6 Q Gln Glutamine
8 G Gly Glycine
Amino acids are linked
9 H His Histindine together as a chain.
10
11
I
L
Ile
Leu
Isoleucine
Leucine
The first amino acid
12 K Lys Lysine sequence of a protein,
13 M Met Methionine
Insulin, was determined
14 F Phe Phenylalanine Frederick Sanger (1918-)
15 P Pro Proline
in 1951 by Dr. Sanger. nobel prize 1958
16 S Ser Serine
17 T Thr Threonine
insulin = MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHL
18 W Trp Trytophan
VEALYLVCGERGFFYTPKTRREAEDLQVGQVELGGGPGAGSLQPL
19 Y Tyr Tyrosine ALEGSLQKRGIVEQCCTSICSLYQLENYCN
20 V Val Valine 23
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
In 1956, Symposium on Information Theory in Biology (Gatlinburg, USA).
In 1979, GenBank was established at Los Alamos National Laboratory (USA).
In 1982, nucleotide sequence database of European Molecular Biology Laboratory
(EMBL) was created (Europe).
In 1986, DNA Data Bank of Japan (DDBJ) began data bank activities at NIG (Japan).
in the early 1990s, International Nucleotide Sequence Database Collaboration
(INSDC) was founded in cooperation of Genebank/EMBL/DDBJ.
In 1987, a Chinese-American scientist LIN Hua-an first created the word
“bioinformatics”. At the very beginning, he created the word “compbio”, then
“bioinformatique”, and then “bio-informatics”. But at that time, the email title did not
support the hyphen symbol, thus “bioinformatics” was born.
Since at least the late 1980s, the term “bioinformatics” has been primary used in
genomics and genetics, particularly in those areas of genomics involving large-scale
DNA sequencing.
25
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
26
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
Publicly funded project: Privately funded project
James D. Watson & Francis Collins President Clinton (2000) Craig Venter
History of Bioinformatics
28
Introduction to Bioinformatics
English Courses for Graduate Students
History of Bioinformatics
Shenzhen
AB SOLiDTM
Illumina HiSeq 2000 4.0 System
Shanghai X 137 X 27
Beijing
29
Introduction to Bioinformatics
English Courses for Graduate Students
9 Analyzing DNAs
9 Analyzing RNAs
9 Analyzing Proteins
9 Others: Pathway, Bioimaging, etc.
30
Introduction to Bioinformatics
English Courses for Graduate Students
31
Introduction to Bioinformatics
English Courses for Graduate Students
M
E
V Protein
F
ATGGAAGTATTTAA…… K
A
DNA P
…
32
Introduction to Bioinformatics
English Courses for Graduate Students
Maestro
Function
Pymol
Structure 36
Introduction to Bioinformatics
English Courses for Graduate Students
37
Introduction to Bioinformatics
English Courses for Graduate Students
Super-
computer
39
Introduction to Bioinformatics
English Courses for Graduate Students
statistic graph
CT
magnetic
resonance
40
Introduction to Bioinformatics
English Courses for Graduate Students
Gene Sequence
Specialist
in
Bioinformatics
42
Introduction to Bioinformatics
English Courses for Graduate Students
dUTPase
43
Introduction to Bioinformatics
English Courses for Graduate Students
44
Introduction to Bioinformatics
English Courses for Graduate Students
45
Introduction to Bioinformatics
English Courses for Graduate Students
46
Introduction to Bioinformatics
English Courses for Graduate Students
47
Introduction to Bioinformatics
English Courses for Graduate Students
48
Introduction to Bioinformatics
English Courses for Graduate Students
Author Name
49
Introduction to Bioinformatics
English Courses for Graduate Students
50
Introduction to Bioinformatics
English Courses for Graduate Students
51
Introduction to Bioinformatics
English Courses for Graduate Students
52
Introduction to Bioinformatics
English Courses for Graduate Students
53
Introduction to Bioinformatics
English Courses for Graduate Students
Laboratory
address
authors
54
Introduction to Bioinformatics
English Courses for Graduate Students
Search “Down” in
field “Title [TI]”
Search “Down” in
field “Laboratory
address [AD]”
Search “Down”
everywhere
55
Introduction to Bioinformatics
English Courses for Graduate Students
Beijing
Beijing
2
Tel : 86 - 10 - 6275-5002
Fax : 86 - 10 - 6276-2292
New Life Science Building, Peking
3
University, Summer Palace Road
No. 5, Beijing, P. R. China 100871 56
Introduction to Bioinformatics
English Courses for Graduate Students
dUTPase
Searching
PubMed
using limits
57
Introduction to Bioinformatics
English Courses for Graduate Students
Searching
PubMed
using limits
58
Introduction to Bioinformatics
English Courses for Graduate Students
59
Introduction to Bioinformatics
English Courses for Graduate Students
61
Introduction to Bioinformatics
English Courses for Graduate Students
dUTPase coli
63
Introduction to Bioinformatics
English Courses for Graduate Students
64
Introduction to Bioinformatics
English Courses for Graduate Students
65
Introduction to Bioinformatics
English Courses for Graduate Students
66
Introduction to Bioinformatics
English Courses for Graduate Students
1 2 3
67
Introduction to Bioinformatics
English Courses for Graduate Students
68
Introduction to Bioinformatics
English Courses for Graduate Students
1 2 3
69
Introduction to Bioinformatics
English Courses for Graduate Students
70
Introduction to Bioinformatics
English Courses for Graduate Students
71
Introduction to Bioinformatics
English Courses for Graduate Students
72
Introduction to Bioinformatics
English Courses for Graduate Students
Tab FASTA
Excel
73
Introduction to Bioinformatics
English Courses for Graduate Students
74
Introduction to Bioinformatics
English Courses for Graduate Students
9
75
Introduction to Bioinformatics
English Courses for Graduate Students
“Cross-references”
point to data collections
other than UniProtKB.
76
Introduction to Bioinformatics
English Courses for Graduate Students
right click
“sequences” provides
you with the actual
amino acid sequence of
the protein.
9
retrieve DNA sequence relevant
to dUTPase protein of E. coli.
79
Introduction to Bioinformatics
English Courses for Graduate Students
P06968
80
Introduction to Bioinformatics
English Courses for Graduate Students
81
Introduction to Bioinformatics
English Courses for Graduate Students
82
Introduction to Bioinformatics
English Courses for Graduate Students
83
Introduction to Bioinformatics
English Courses for Graduate Students
84
Introduction to Bioinformatics
English Courses for Graduate Students
1. Summary Section
2. Reference Section
85
……
Introduction to Bioinformatics
English Courses for Graduate Students
4. Sequence
Section
…… 86
Introduction to Bioinformatics
English Courses for Graduate Students
1. Summary Section
2. Reference Section
87
……
Introduction to Bioinformatics
English Courses for Graduate Students
1. Summary Section
2. Reference Section
88
……
Introduction to Bioinformatics
English Courses for Graduate Students
9
retrieve DNA sequence relevant perform a BLAST search
to dUTPase protein of E. coli.
89
Introduction to Bioinformatics
English Courses for Graduate Students
91
Introduction to Bioinformatics
English Courses for Graduate Students
92
Introduction to Bioinformatics
English Courses for Graduate Students
Open “P06968.fasta” at
2 your Desktop, and paste
the sequence here.
Give a name here.
http://1.51.212.243/P06968.fasta
3
93
Introduction to Bioinformatics
English Courses for Graduate Students
94
Introduction to Bioinformatics
English Courses for Graduate Students
95
Introduction to Bioinformatics
English Courses for Graduate Students
96
Introduction to Bioinformatics
English Courses for Graduate Students
97
Introduction to Bioinformatics
English Courses for Graduate Students
98
Introduction to Bioinformatics
English Courses for Graduate Students
What is Alignment?
Alignment is the result of a comparison of two or more
gene or protein sequences in order to determine their
degree of base or amino acid similarity.
Pairwise Alignment
Multiple Alignment
99
Introduction to Bioinformatics
English Courses for Graduate Students
9 9
retrieve DNA sequence relevant perform a BLAST search
to dUTPase protein of E. coli.
100
Introduction to Bioinformatics
English Courses for Graduate Students
101
Introduction to Bioinformatics
English Courses for Graduate Students
102
Introduction to Bioinformatics
English Courses for Graduate Students
103
Introduction to Bioinformatics
English Courses for Graduate Students
http://1.51.212.243/multi.fasta
Select all
Copy
104
Introduction to Bioinformatics
English Courses for Graduate Students
Paste
105
Introduction to Bioinformatics
English Courses for Graduate Students
106
Introduction to Bioinformatics
English Courses for Graduate Students
* identical
: similar
. related
different
107
Introduction to Bioinformatics
English Courses for Graduate Students
108
Introduction to Bioinformatics
English Courses for Graduate Students
Conserved region
109
Introduction to Bioinformatics
English Courses for Graduate Students
110
Introduction to Bioinformatics
English Courses for Graduate Students
111
Introduction to Bioinformatics
English Courses for Graduate Students
9 9
retrieve DNA sequence relevant perform a BLAST search
to dUTPase protein of E. coli.
112
Introduction to Bioinformatics
English Courses for Graduate Students
protein sequence
3D structure
113
Introduction to Bioinformatics
English Courses for Graduate Students
Beijing
Beijing
Tel : 86 - 10 - 6275-5002
Fax : 86 - 10 - 6276-2292
New Life Science Building, Peking
University, Summer Palace Road
No. 5, Beijing, P. R. China 100871 114
Introduction to Bioinformatics
English Courses for Graduate Students
Su XD dUTPase
115
Introduction to Bioinformatics
English Courses for Graduate Students
116
Introduction to Bioinformatics
English Courses for Graduate Students
117
Introduction to Bioinformatics
English Courses for Graduate Students
118
Introduction to Bioinformatics
English Courses for Graduate Students
119
Introduction to Bioinformatics
English Courses for Graduate Students
Action
Rotate View Left Click and Drag
Shift + Left Click
Zoom drag mouse up or down /
roll mouse middle button
Select/
Deselect Left Click
Residue
Jmol Menu Right-Click
120
Introduction to Bioinformatics
English Courses for Graduate Students
121
Introduction to Bioinformatics
English Courses for Graduate Students
123