Professional Documents
Culture Documents
William John Thrift†, Sasha Ronaghi‡, Muntaha Samad§, Hong Wei†, Dean Gia Nguyen¶,
Antony Superio Cabuslayǁ, Chloe E. Groome†, Peter Joseph Santiago†, Pierre Baldi§, Allon I.
Hochbaum†ǁ¶ǂ, Regina Ragan†¶ *
Table of Contents
Supporting Figure S2: OD600 of a P. aeruginosa cell culture at indicated growth times after adjustment to OD 0.5 at
time, t = 0 h, a) without antibiotic treatment, b) with 50 μg/mL carbenicillin introduced at 0 h, and c) with 400
μg/mL rifampicin introduced at 0 h.
Supporting Figure S3 a, c) dose response t-SNE of P. aeruginosa (a) and E. coli (c). 0, 0.1, 0.5, 1, and 10 μg/mL
carbenicillin dosed lysate are depicted in red, purple, yellow, blue, and green, respectively. b, d) Temporal response
t-SNE of P. aeruginosa (b) and E. coli (d). Lysate processed after 0, 5, 10, 20, and 40 minutes of 10 μg/mL
carbenicillin dosage are depicted in red, purple, yellow, blue, and green, respectively.
Supporting Figure S4 depicts a t-SNE visualization of clustering of the SERS spectra acquired for Figure 4, the AST
dataset.
Table S1 and S2 lists 2-class and 5-class, respectively, DNN model mean accuracy,
sensitivity, and specificity for temporal and dosage dependent response data when using 10-fold
cross validation.
E. coli Temporal P. aeruginosa Temporal
Accuracy Sensitivity Specificity Accuracy Sensitivity Specificity
(0min) | (5min, 10min,20min,40min) 99% ± 0.1% 100% 99% 99% ± 0.2% 100% 99%
(0min, 5min) | (10min,20min,40min) 99% ± 0.1% 99% 100% 99% ± 0.1% 99% 100%
(0min, 5min, 10min) | (20min,40min) 99% ± 0.2% 99% 100% 99% ± 0.2% 99% 99%
(0min, 5min, 10min, 20min) | (40min) 99% ± 0.1% 99% 100% 99% ± 0.1% 99% 99%
(0 μg/mL) | ( 0.1 μg/mL, 0.5 μg/mL, 1 99% ± 1% 100% 99% 98% ± 1% 99% 95%
μg/mL, 10 μg/mL)
(0 μg/mL, 0.1 μg/mL) | (0.5 μg/mL, 1 97% ± 1% 98% 95% 98% ± 1% 98% 98%
μg/mL, 10 μg/mL)
(0 μg/mL, 0.1 μg/mL, 0.5 μg/mL) | (1 95% ± 1% 93% 95% 99% ± 1% 98% 99%
μg/mL, 10 μg/mL)
(0 μg/mL, 0.1 μg/mL, 0.5 μg/mL, 1 99% ± 1% 90% 99% 98% ± 1% 94% 99%
μg/mL) | (10 μg/mL)
Supporting Figure S6 depicts a) VAE visualization of 2-dimensional VAE latent space and
b) t-SNE visualization of a 32-dimensional VAE latent space trained on the metabolite
combination dataset. Table S2 provides the legend for the dataset. The ‘combined’ dataset greatly
benefits from a higher (32) dimensional latent space, which is observed by utilizing t-SNE to
visualize clustering as is observed in Figure S6. Increasing the latent space from 2-dimensions to
32-dimensions not only greatly improves the clustering of the different metabolite conditions, it
also reduces the test loss of the VAE model by nearly 50%.
Table S3: Color labels for combined metabolite mixture data shown in Figure S6. Labels are defined as follows: 2-
methyl napthalene (A), o-cresol (B), 2-amino acetophenone (C), pyrrole (D), 2-pentyl furan (E), and indole (F).
Figure S6: a) VAE visualization and b) t-SNE visualization of the combined metabolite mixture and AST dataset
latent space. All 63 possible mixture combinations of metabolites 2-methyl napthalene (A), o-cresol (B), 2-amino
acetophenone (C), pyrrole (D), 2-pentyl furan (E), and indole (F) are plotted with legend of color and corresponding
mixture in Table S2.
In addition to the superior clustering that the combination VAE latent space provides, it
also enhances the difference between outliers and inliers. Supporting Figure S7 depicts an isolation
forest outlier detection applied to the AST spectra that has been encoded into the combination
VAE model. Outliers are easily identified by isolation forest and removed for processing in the
predictive models.
Supporting Figure S7: Isolation forest predictions of AST spectra that have been encoded with the combination
VAE model. Outliers are shown as red and inliers are shown as blue.
The generative aspect of the VAE model is demonstrated in Supporting Video 1. The
animation is a series of 100 SERS spectra sampled in a line drawn between the center of the
encoded/decoded 0.5 h carbenicillin-treated lysate data and that of the untreated 2 h control data
in the VAE latent space. The shifts in the spectral data between these two antibiotic treatment
classes can be visualized and interpreted in contrast to non-generative machine learning
techniques.