Professional Documents
Culture Documents
Wikipedia says:
Microbial bioinformatics Bioinformatics is an interdisciplinary field that develops methods
and software tools for understanding biological data. As an
interdisciplinary field of science, bioinformatics combines computer
1. What is bioinformatics? Why does science, statistics, mathematics, and engineering to analyze and
microbiology need bioinformatics? interpret biological data.
2. Bioinformatics and molecular evolution: ...so, using computers to analyse biological data
comparing genomes of bacterial species
(in 1977)
Be a critical thinker!
2. Bioinformatics and molecular evolution: Coding sequences and intergenic regions
comparing genomes of bacterial species
https://www.flickr.com/photos/arkadyevna/227697075 Perna et al 2001 Nature 409: 529-533 Perna et al 2001 Nature 409: 529-533
Pan-genome:
all the genes you can
find across genomes in
a species
Core genome:
genes shared by all Salmonella Mycobacterium Escherichia
genomes in a species enterica tuberculosis coli
A phylogeny is an Which tree shows the true Using an evolution model, compute
relationships among the three taxa? the likelihood of each scenario
evolutionary tree showing
the relationships among
Use genetic information to infer the phylogeny taxa (often species)
or or ?
Se Mt Ec Mt Se Ec Ec Mt Se
Statistical calculations can be used for Statistical calculations can be used for
phylogenetic inference phylogenetic inference
For a given tree, the likelihood is the probability of observing genetic Select the tree with the highest Finding high likelihood trees can be
sequences according to a model of evolution probability. This is called the computationally challenging if there
maximum likelihood tree are many species.
or or ?
Se Mt Ec Mt Se Ec Ec Mt Se Se Mt Ec Mt Se Ec Ec Mt Se
L1 L2 L3
Phylogeny of bacteria
Can we use genetic sequences to establish evolutionary using genome data
relationships among all cellular lifeforms?
5591 sites in 31
proteins
A
B
A B C D E
C A
D B 3
E C 4 1
D 3 2 3
E 4 3 4 1
A vs B: 3 differences
A vs C: 4 differences A B C D E
A vs D: 3 differences A
A vs E: 4 differences
B 3 A B C D E
B vs C: 1 differences C 4 1
B vs D: 2 differences D 3 2 3
B vs E: 3 differences E 4 3 4 1
…
Where is the root of the tree? 3. Bioinformatics and molecular epidemiology:
Variation within species
C A
C A
C G
Species 1
C G
C G
C G
Polymorphic
A B C D E Outgroup A B C D E Substitution sites
A T
B D A T
A T
Species 2
A T
C E A G
A A G
Another simulation
model including
contact network
among hosts
A simulation model of an
epidemic with genome evolution
of pathogens within hosts
Worby et al 2014 PLoS Comput Biol 10(3): e1003549 Worby CJ, Read TD (2015) PLoS ONE 10(6): e0129745