in Bioinformatics
Example Domain: Gene Finding
Colin Cherry
colinc@cs
Learning Objectives
When I'm done, you should know:
1.
2.
3.
4.
Outline
Statistical Models
Definition:
Assumptions
Parameters
Estimation
Usage
HMM Assumptions
HMM Parameters
Lots of parameters, represented in two tables.
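As a rough illustration of the two tables, here is a minimal sketch assuming a toy two-state model. The state names `gene` and `intergenic` and every probability are invented for the example and do not come from the talk. One table holds transition probabilities between states, the other holds emission probabilities of nucleotides from each state.

```python
# Hypothetical two-state HMM; every number below is made up for illustration.

# Table 1: transition probabilities P(next state | current state).
transitions = {
    "gene": {"gene": 0.9, "intergenic": 0.1},
    "intergenic": {"gene": 0.05, "intergenic": 0.95},
}

# Table 2: emission probabilities P(nucleotide | state).
emissions = {
    "gene": {"A": 0.2, "C": 0.3, "G": 0.3, "T": 0.2},
    "intergenic": {"A": 0.3, "C": 0.2, "G": 0.2, "T": 0.3},
}


def count_free_parameters(transitions, emissions):
    """Each row sums to 1, so a row with k entries has k - 1 free parameters."""
    rows = list(transitions.values()) + list(emissions.values())
    return sum(len(row) - 1 for row in rows)
```

Even this toy model has several free parameters, and the count grows quickly as states are added, since the transition table has one row per state and one column per state.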
HMM Estimation
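When training sequences come with state labels, both tables can be estimated by counting. The sketch below shows only that labelled case and is an assumption about the setup, not the talk's own procedure; the function name `estimate_tables` and the tiny training pair are hypothetical. Without labels, training typically relies on an iterative procedure such as Baum-Welch, which is not shown here.

```python
from collections import Counter, defaultdict


def estimate_tables(labelled_sequences):
    """Estimate transition and emission tables by counting.

    labelled_sequences: iterable of (observations, states) pairs of equal length.
    """
    trans_counts = defaultdict(Counter)
    emit_counts = defaultdict(Counter)
    for observations, states in labelled_sequences:
        # Count which symbol each state emitted.
        for symbol, state in zip(observations, states):
            emit_counts[state][symbol] += 1
        # Count each state-to-next-state step.
        for prev, nxt in zip(states, states[1:]):
            trans_counts[prev][nxt] += 1

    def normalise(counts):
        # Turn each row of counts into relative frequencies.
        return {
            row: {key: n / sum(counter.values()) for key, n in counter.items()}
            for row, counter in counts.items()
        }

    return normalise(trans_counts), normalise(emit_counts)


# Hypothetical labelled example: G = gene, N = non-gene.
training = [("ACGT", "NGGN")]
transitions, emissions = estimate_tables(training)
```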
HMM Usage
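One common use of a trained HMM is decoding: finding the most probable state path for an observed sequence. The sketch below uses the Viterbi algorithm in log space. It assumes the parameters are given as a `start` dictionary plus the two tables as nested dictionaries (these names are illustrative), and that every probability it looks up is non-zero.

```python
import math


def viterbi(observations, states, start, transitions, emissions):
    """Return the most probable state path for the observations."""
    # best[i][s]: log-probability of the best path ending in state s at position i.
    first = observations[0]
    best = [{s: math.log(start[s]) + math.log(emissions[s][first]) for s in states}]
    back = []  # back[i][s]: best predecessor of state s at position i + 1
    for symbol in observations[1:]:
        scores, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda p: best[-1][p] + math.log(transitions[p][s]))
            pointers[s] = prev
            scores[s] = (
                best[-1][prev]
                + math.log(transitions[prev][s])
                + math.log(emissions[s][symbol])
            )
        best.append(scores)
        back.append(pointers)
    # Start from the best final state and follow the pointers backwards.
    path = [max(states, key=lambda s: best[-1][s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]
```

Working in log space avoids numerical underflow on long sequences, which matters for genome-scale input.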
Gene Finding
(An Ideal HMM Domain)
Our Objective:
Our Motivation:
Scoring of sequences
(Gene Finding)
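One way to score a sequence under an HMM is its total probability, summed over every possible state path. The forward algorithm computes this efficiently. The sketch below is a plain-probability version under assumed inputs (a `start` dictionary and the two tables as nested dictionaries, names chosen for illustration); it is not necessarily the scoring scheme used in the talk.

```python
def forward_probability(observations, states, start, transitions, emissions):
    """Return P(observations) under the HMM, summed over all state paths."""
    # alpha[s]: probability of the prefix seen so far, ending in state s.
    alpha = {s: start[s] * emissions[s][observations[0]] for s in states}
    for symbol in observations[1:]:
        alpha = {
            s: emissions[s][symbol] * sum(alpha[p] * transitions[p][s] for p in states)
            for s in states
        }
    return sum(alpha.values())
```

For long sequences the raw probabilities underflow, so practical implementations usually rescale `alpha` at each step or work with logarithms.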
HMM Requirements
So you've decided you want to build an HMM. Here's what you need:
An architecture
HMM Requirements (continued)
Training data
HMM Advantages
Statistical Grounding
Modularity
Statistics:
Modularity:
Prior Knowledge:
HMM Disadvantages
Markov Chains
P(y)
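For reference, and assuming the first-order chain the term usually denotes, with y read as a state sequence y_1, ..., y_n (the slide's own definition is not recoverable here), the probability of the sequence factorizes as:

```latex
P(y) = P(y_1) \prod_{i=2}^{n} P(y_i \mid y_{i-1})
```

Each state depends only on its immediate predecessor, so dependencies between distant positions cannot be expressed directly.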
HMM Disadvantages (continued)
Avoid over-fitting
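One widely used guard against over-fitting count-based estimates is to add pseudocounts, so that symbols never seen in training still get a small non-zero probability. The sketch below illustrates that general technique; the helper name `smoothed_row` and the default pseudocount of 1 are assumptions for the example, not the talk's stated method.

```python
def smoothed_row(counts, alphabet, pseudocount=1.0):
    """Estimate one table row from raw counts, adding a pseudocount per symbol."""
    total = sum(counts.get(sym, 0) for sym in alphabet) + pseudocount * len(alphabet)
    return {sym: (counts.get(sym, 0) + pseudocount) / total for sym in alphabet}


# Hypothetical counts: G and T were never observed in this state.
row = smoothed_row({"A": 3, "C": 1}, "ACGT")
```

Without the pseudocount, G and T would get probability zero, and any sequence containing them would be scored as impossible.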
HMM Disadvantages (continued)
Speed!!!
Conclusions
Advantages:
Statistics
Modularity
Transparency
Prior Knowledge
Disadvantages:
State independence
Over-fitting
Local Maxima
Speed
Questions
Any Questions?