Professional Documents
Culture Documents
Go and Pathways May 2011 PDF
Go and Pathways May 2011 PDF
Ståle Nygård
stale.nygard@medisin.uio.no
• Long list of
differentially
expressed genes
• Possibly hundreds
of papers
describing the
functions of the
genes
• Misleading names
• Different names in
different organisms
Genes seldomly operate on it's
own
-Genes are by nature not independent.
Biologically related genes will often show
expression changes together
– Biological process:
• Cellular events to which the gene product
contributes.
– Cellular component:
• Location or complex of gene/protein.
Molecular Function
• activities or “jobs” of a gene product
Insulin binding
Insulin transport activity
Biological Process
• a commonly recognized series of events
cell division
Cellular Component
• where a gene product acts
Content of GO
As of May 2010
GO Annotation
• Association between gene product and
applicable GO terms
• Provided by member databases. Collaborating
databases annotate their gene products (or genes)
with GO terms, providing references and indicating
what kind of evidence is available to support the
annotations.
• Made by manual or automated methods.
• GO Annotation
• Database object: gene or gene product
• GO term ID
• Evidence supporting annotation
• Reference
– publication or computational method
Overrepresentation of GO terms
• We have a subset of genes
– List of differentially expressed genes
– List of genes that cluster together
0 2 4 6 8 10 12 14 16 18 20 22 24 hours
human fibroblasts -
24 h time course thymidine-block
release
Questions
0 2 4 6 8 10 12 14 16 18 20 22 24 hours
human fibroblasts -
24 h time course thymidine-block
release
173 genes up-regulated 0-4 hours compared to all
genes on the array
Ordered by significance:
146 genes down-regulated 0-4 hours compared to
all genes on the array
homeostasis
0 2 4 6 8 10 12 14 16 18 20 22 24 hours
human fibroblasts -
24 h time course thymidine-block release
Biological pathways
Type of pathways
• Metabolic pathways
– convert raw materials from the environment
into value-added products and recycle or
dispose of intracellular materials
• Signaling pathways
– convert mechanical/chemical stimulus to a cell
into a specific cellular response
• Regulatory pathways
– alter the output of the genetic program
through transcriptional and translational
regulation
• Signaling,
regulatory
and
Signaling
metabolic
events are
often linked Regulatory
Metabolic
Types of pathway
representations
• Cartoons
– Textbooks
– Biocarta
• Circuit diagrams
– KEGG
– Reactome
– geneRifs
• Computational networks
– SBML models
– Transcription factor
networks
KEGG
• A large collection of signaling, metabolic
and regulatory pathways
• Organised by separate pathways with
hand drawn diagrams
• Academic (freely available)
• The pathways can be used to look for
overrepresentation or enrichment
• Can be used to visually check for path-
ness or direction
TGF Beta signalling patway
Same pathway in Biocarta
GO vs. Pathways
• Overview • Detail view
• Can handle a large • Focused sets of
number of genes genes
• Many genes • Scattered data
annotated sources
• Every gene • Focuses on
considered on its own interactions between
genes
Network construction
• Information about established pathways
(e.g. in KEGG) is (not at all) complete
• Pathways interact and depend on context
• An alternative approach to using
established pathways is to construct
networks from the data.
Network construction
• Networks can be inferred inferred from
– correlation in the data (recall gene clustering)
and/or
– interaction databases:
• Protein-protein interactions: BioGRID,
IntACT, DIP,HPRD ++
• Transcription factor data bases:
TRANSFAC, JASPAR ++
• Literature: PubGENE
Network construction: case
study
WT AB CXCR5 KO AB
WT SHAM KO SHAM WT AB KO AB
(n=3) (n=3) (n=4) (n=4)