Professional Documents
Culture Documents
Model Development
Experimental
Knowledge Data
BioLingua
Interactive
Guidance
from Biologists
Genomic
Tools
BioLingua: Computational Biology Workbench
• Integrates Genomic and Data Analysis Tools
Standard
analytic tools
Standard
analytic tools,
plus discovery
Computed An ever-expanding library of computations that tools that
Concepts produce complex, virtual, biological concepts, such as
combine know-
pathways, complexes, regulons, etc.
Layer ledge and data
Unified Basic Structures provided for important biological under user
Concepts concepts: e.g., reactions, molecules, enzymes, control.
experiments, expression-levels, etc.
Layer
Integrated
K/DB Layer KEGG BioCyc GO Remote Access
Other K/DBs
Model Development
Goals of Computational Biology
Prochlorococcus MED4
Prochlorococcus MIT9313
How do cells control response to light?
I.e., What genes are related to the adaptation to high light?
Outline Protocol
Look for:
• Gene present in Prochlorococcus MED4
MED4 is naturally adapted to grow in high light.
• Ortholog absent in Prochlorococcus MIT9313
MIT9313 is naturally adapted to grow in low light
• Ortholog present in Synechocystis PCC 6803
In order to make contact with annotation and microarray data
(http://nostoc.stanford.edu:8002/biologin)
Set: Light-specific genes
Syny6803:sll0990 Formaldehyde dehydrogenase (glutathione dependent)
Syny6803:srl7009 trnR tRNA Arg (UCU)
Syny6803:slr1331 Processing protease
Syny6803:slr1332 fabF beta ketoacyl acyl carrier protein synthase
Syny6803:sll0337 Sensor histidine kinase
Syny6803:sll0335 Hypothetical
Syny6803:sll0789 Response regulator (OmpR)
Syny6803:sll0788 Hypothetical protein
Syny6803:sll0576 Putative epimerase/hydratase
Goals of Computational Biology
Sampling mRNA/cDNA
Time
Light
20030303 Cyclodyn (Cy5) Light Correlated
-2 subunit (cpcB)
SLL1867 50.749165 photosystem II
D1 protein (psbA 3)
SLL1745 105.46256 50S ribosomal
OmpR subfamily
-8
phycocyanin b subunit (cpcB) hypothetical protein (atp1)
phycocyanin a subunitHours
(cpcA)
after midnight
photosystem II CP43 protein (psbC)
allophycocyanin a chain (apcA) photosystem II D1 protein (psbA3)
photosystem II D1 protein (psbA2) 50S ribosomal protein L10 (rpl10)
phycocyanin associated linker protein (cpcC) P700 apoprotein subunit Ia (psaA)
OmpR subfamily ATP synthase subunit a (atpI)
hypothetical protein (atp1) photosystem I subunit III (psaF)
photosystem II CP43 protein (psbC) photosystem II D2 protein (psbD2)
photosystem II D1 protein (psbA3) sensory transduction histidine kinase
50S ribosomal protein L10 (rpl10)
Verbal Model:
When "awake" (day) the cell regulates its photosystem (PS) genes so as to match
photosynthetic output to energy demands. When the available light exceeds its needs, the PS
is down-regulated, leading to an "M" pattern of expression. At night, the cell sleeps, leading
to another drop in expression patterns at night.
Graphical Model:
Computable Model:
(photosynthesis isa process with
inputs (chloroplast-inside.water everywhere.light chloroplast-outside.nadph+
chloroplast-outside.adp chloroplast-outside.pi)
outputs (chloroplast-outside.atp chloroplast-outside.nadph everywhere.o2)
implemented-by photosystem)
(photosystem composition (psii antenna-array atpase pq-pool))
(light-absorption isa process with
inputs (everywhere.light)
outputs (chlorophyll.energy)
function absorption
implemented-by chlorophyll)
(light-energy-concentration isa process with
outputs psii.energy
driver chlorophyll.energy
function concentration
implemented-by antenna-array)
(psii-water-breakdown isa process with
inputs (chloroplast-inside.water)
driver psii.energy
outputs (psii.e- psii.e- chloroplast-inside.h+ chloroplast-inside.o2)
function molecular-splitting
implemented-by psii)
(psii-pq-reduction isa process with
inputs (psii.e- chloroplast-membrane.h+ chloroplast-membrane.plastoquinone)
outputs (chloroplast-membrane.plastoquinol)
function reduction
implemented-by psii
inhibited-by dcmu)
Explanation by Pathway Tracing:
(photosynthesis isa process with
inputs (chloroplast-inside.water everywhere.light chloroplast-outside.nadph+
chloroplast-outside.adp chloroplast-outside.pi)
outputs (chloroplast-outside.atp chloroplast-outside.nadph everywhere.o2)
implemented-by photosystem)
(photosystem composition (psii antenna-array atpase pq-pool))
(light-absorption isa process with
inputs (everywhere.light)
outputs (chlorophyll.energy)
function absorption
implemented-by chlorophyll)
(light-energy-concentration isa process with
outputs psii.energy
driver chlorophyll.energy
function concentration
implemented-by antenna-array)
(psii-water-breakdown isa process with
inputs (chloroplast-inside.water)
driver psii.energy
outputs (psii.e- psii.e- chloroplast-inside.h+ chloroplast-inside.o2)
function molecular-splitting
implemented-by psii)
(psii-pq-reduction isa process with
inputs (psii.e- chloroplast-membrane.h+ chloroplast-membrane.plastoquinone)
outputs (chloroplast-membrane.plastoquinol)
function reduction
implemented-by psii
inhibited-by dcmu)
Explanation by Pathway Tracing:
(track-object 'chloroplast-inside.water)
Tracking CHLOROPLAST-INSIDE.WATER
-> PHOTOSYNTHESIS:
Tracking CHLOROPLAST-OUTSIDE.ATP
Tracking CHLOROPLAST-OUTSIDE.NADPH
Tracking EVERYWHERE.O2
-> PSII-WATER-BREAKDOWN:
Tracking PSII.E-
-> PSII-PQ-REDUCTION:
Tracking CHLOROPLAST-MEMBRANE.PLASTOQUINOL
-> E-FUNNLING-PSII-TO-PSI:
Tracking PSI.E-
-> PSI-NADPH-FORMATION:
Tracking CHLOROPLAST-INSIDE.H+
-> ATP-FORMATION:
Tracking CHLOROPLAST-INSIDE.O2
-> O2-DIFFUSSION:
Explanation by Pathway Tracing:
(track-object 'chloroplast-inside.water)
Tracking CHLOROPLAST-INSIDE.WATER
-> PHOTOSYNTHESIS:
Tracking CHLOROPLAST-OUTSIDE.ATP
Tracking CHLOROPLAST-OUTSIDE.NADPH
Tracking EVERYWHERE.O2
-> PSII-WATER-BREAKDOWN:
Tracking PSII.E-
-> PSII-PQ-REDUCTION:
Tracking CHLOROPLAST-MEMBRANE.PLASTOQUINOL
-> E-FUNNLING-PSII-TO-PSI:
Tracking PSI.E-
-> PSI-NADPH-FORMATION:
Tracking CHLOROPLAST-INSIDE.H+
-> ATP-FORMATION:
Tracking CHLOROPLAST-INSIDE.O2
-> O2-DIFFUSSION:
Current representational practice:
Model Development
Model Development
Model Development as Search in “Model Space”
Generic Biological Experimental
Biological Constraints Data
Processes
- PhotoSynthesis
[Product] UnControlled
Degradation
[Redox [Reactive
- Potential] Oxygen
Species]
[Product] Controlled
[Control Degradation
Species] Rate
[Control
- Species] Type I
Regulation
[RNA] [Protein]
Translation
[Control Rate
Species] Type II
Rate [RNA]
Regulation -
Transcription
With no constraints, any genes could be co-
regulated:
How many regulatory models are there?
n=300
L=4
How many regulatory models are there?
2
1/2(n - n) 89700
L = ~4
Model
n=300 Identification
L=4 n
requires ~2
observations!
Model Development Protocol
Generic Biological Experimental
Biological Constraints Data
Processes
Regulon
Architecture
(simplified)
+ -
DFR NBLR NBLA
AntiCorrelated
+ -
DFR NBLR NBLA
Correlated AntiCorrelated
Goals of Computational Biology
Model Development
Model Development Protocol
Generic Biological Experimental
Biological Constraints Data
Processes
Model Development
Model of Photosynthetic Light Regualtion
+ -
NBLR NBLA PBS
+
-
DFR psbA1 Health
+
- -
+
- +
+ RR psbA2 Photosynthetic
activity
-
Light cpcB
Model Development Protocol
Biological
Preliminary
Process
Hypotheses
Knowledge Experimental
Data
+ -
NBLR NBLA PBS
+
-
DFR psbA1 Health
+
- -
+
- +
+ RR psbA2 Photosynthetic
activity
-
Light cpcB
Adding Regulon Constraints
+ -
NBLR NBLA PBS
+
-
Regulon Constraints
DFR psbA1 Health
+
- -
+
- +
+ RR psbA2 Photosynthetic
activity
-
Light cpcB
Adding Biological Process Knowledge
energy
+ - +
NBLR NBLA PBS +
+
+ damage
Si
Regulon Constraints -
DFR Ca gna psbA1 Health
sc l +
ad - -
e
Detection
Signal
+
- +
+ RR psbA2 Photosynthetic
activity
-
Light cpcB
Model Development Protocol
Biological
Preliminary
Process
Hypotheses
Knowledge Experimental
Data
energy
+ - +
NBLR NBLA PBS +
+
+ damage
Si
Regulon Constraints -
DFR Ca gna psbA1 Health
sc l +
ad - -
e
Detection
Signal
-- - +
+ RR psbA2 Photosynthetic
activity
-
Light cpcB
Goals of Computational Biology
Model Development
BioLingua: Computational Biology Workbench
• Integrates Genomic and Data Analysis Tools
• Integrates Organism-specific as well as General Knowledge
• Unifies Important Knowledge Bases
• Integrates Model Development and Refinement tools
• Offers a Flexible “Open Programming” Methodology
• Provides Convenient Universal Access (fully web-enabled)
BioLingua: Computational Biology Workbench
• Integrates Genomic and Data Analysis Tools
• Integrates Organism-specific as well as General Knowledge
• Unifies Important Knowledge Bases
• Integrates Model Development and Refinement tools
• Offers a Flexible “Open Programming” Methodology
• Provides Convenient Universal Access (fully web-enabled)
• And 8th-graders can learn it!
BioLingua:
• JP Massar
• Mike Travers
• Stephen Bay
• Devaki Bhaya
• Jeff Elhai
• Bob Haxo
• Sumudu Watagala
• Monica Jain
• Ashvin Kumar
• Pat Langley
• Andrew Pohorille
• Karl Schweighofer
• Colin Smith
• Serdar Uckun
With support from NASA, Stanford, and Arthur and Kevin’s Labs
Discovery Tools:
• Pat Langley
• Stephen Bay
• Andrew Pohorille
• Lonnie Chrisman
• Kazumi Saito
• Dileep George