Professional Documents
Culture Documents
Poster 2
Poster 2
Nuria Alina Chandra, Alexander Sasse, Sara Mostafavi, & the Immunological Genome Project
Paul G. Allen Center for Computer Science & Engineering
Problem
All somatic cells, from heart cells to immune cells, have the same The Model Profile prediction
Average Pool
Total Count Loss λ = Weight on Profile Loss 2.25
Profile loss
Results
12
The model successfully learns to predict both the total
number of ATAC-seq reads and the base-pair resolution
ATAC-seq profile of open chromatin regions.
Profile Prediction for λ=1e-1
Total Count Prediction Performance Profile Prediction Performance Celltype: B.FrE.BM peak 102436
ATAC-seq Data
Assays for transposase-accessible chromatin
(ATAC-seq) measures chromatin accessibility by
counting the number of cuts from the Tn5
enzyme regions of DNA. The enzyme cuts where
λ Weight on profile loss λ Weight on profile loss
DNA is accessible to the transcriptional
Model performance on the profile prediction task improves as the profile loss is weighted more heavily.
machinery. More counts are a proxy for stronger
transcriptional activity through binding of A More Complex Model Modified scalar output head Total Count Prediction Performance
transcription factors (TF). Base-pair resolution
Modifications: the number of
use ATAC-seq data from 90 different mouse model was increased to 300 and 3x
immune cell types collected by the
Immunological Genome Project1 . Hypothesis: a
maxpooling and convolutions were
added to the scalar output head.
Maxpool …
model learning base-pair resolution ATAC-
seq accessibility will have improved total Conclusion: In the more complex model
chromatin accessibility prediction accuracy. ATAC-seq footprints change with TF interactions adding base-pair resolution information
improved ATAC-seq total count λ Weight on profile loss