Professional Documents
Culture Documents
• Excluding cells with a low number of genes (< 0.02) by computing the 2nd percentile of gene counts per library and
removing cells below this threshold.
• Performing Scran normalization, feature selection, dimensionality reduction (PCA), and clustering.
Total merged libraries
Filtered_out 86k
• True_DBL 64226
• low_Gene_Counts 16280
• Cryptic_doublet 5443
Total merged libraries
Total merged libraries
Cryptic doublets accuracy
DD&knn10 &scr
DD&KNN5 TPR 89%
TPR 84% DD&scr
TPR 0.81%
DD&knn5&scr
DD&knn10 TPR 85 %
True postive ratio (TPR) :88%
Cryptic doublets accuracy
Filtered_out
• True_DBL 64226
• low_Gene_Counts 16280
• Cryptic_doublet 5443
• Perform feature selection by computing the coefficient of variation for all genes.
• Select N genes based on their coefficient of variation and mean expression and utilize these features for subsequent
downstream analysis. means>0.0125b& dispersions>0.2
means>0.001 &dispersions<1
Features selection
• Testing deviance for feature selection which works on raw counts [Germain et al., 2020]
• Quantifies whether genes show a constant expression profile across cells
• means>0.0125b&dispersions>0.2
• means>0.001 &dispersions<1
• Highly deviant in all 88 libraries
• Computing the neighborhood graph through calculating a Euclidean distance matrix on the PC-reduced
expression space for all cells and then connect each cell to its K most similar cells.
• Using CellID
• 2 references used to automatically annotate cell type( hao20 “same one yann used”)
Cell type annotation
• Using CellID
• 2 references used to automatically annotate cell type (“using 150k cells from yann datatset ”)
To Do
• Enhance integration accuracy by re-running integration using scANVI, a scvi model that incorporates
annotated cell types for more precise results.
• Conduct an in-depth analysis of pilot 5 libraries, with a specific focus on comparing individuals
sampled in both Pilot 5 and V3, involving different strains of COVID and influenza.
• Obtain preliminary insights into the effects of different viruses by November 28.