Professional Documents
Culture Documents
Acs Nanolett 1c00416
Acs Nanolett 1c00416
org/NanoLett Letter
■ INTRODUCTION
The integration of machine-learning-driven chemometrics with
when introducing new data or attempting to predict properties
of an unknown sample. Consequently, establishing a strong
surface-enhanced Raman scattering (SERS) presents significant correlation between chemical knowledge and chemometric
potential in translating research-based SERS platforms for model outputs is critical to harness the inherent strengths of
diverse applications.1−3 These applications demand SERS machine learning approaches in extracting and comprehending
platforms to achieve ultratrace detection of multiple molecules complex SERS fingerprints.
with weak Raman scattering cross sections, via either direct Herein, we design a machine-learning-driven “SERS taster”,
detection or indirect analyte capturing with molecular capable of achieving multiplex profiling of five wine flavor
receptors.4,5 Vibrational fingerprints resulting from direct molecules with 100% accuracy at parts-per-million levels (Figure
detection of these molecules are often insignificant, whereas 1). Our strategy employs multiple carefully designed receptor−
subtle changes in receptor fingerprints are difficult to pinpoint flavor chemical interactions to capture all active chemical
through manual visual inspection. Chemometrics employ robust functionalities within flavor molecules. By serially combining all
statistical models to perform automated analyses of large SERS receptor−flavor spectra as a “SERS superprofile”, we construct a
spectral data sets across entire spectral windows, eliminating more complete spectroscopic profile for each molecule. These
subjective judgements to attain unparalleled accuracies.6,7 More
importantly, machine learning algorithms unveil intricate data Received: January 30, 2021
patterns which enable predictive analytics crucial for Raman/ Revised: March 6, 2021
SERS-based applications.8−10 However, there is significant risk Published: March 12, 2021
of over-relying on these powerful algorithms to achieve desired
outcomes without thoroughly understanding the underlying
chemical interactions. The resulting overfit models crumble
Figure 3. Chemometric classification of flavor molecules using superprofiles obtained from our SERS taster. (A) Scheme depicting the construction of
a SERS superprofile by horizontally combining all four receptor spectra in series. (B) PCA score plot of the first two principal components showing the
relative flavor data cluster separation using our SERS taster. (C) PCA biplot of the first two principal components, which highlights key spectral regions
where variations exert dominant influence over component scores. The specific spectral regions, their corresponding receptor−flavor interactions and
vibrational assignment are summarized based on their influence in the flavor data cluster positions in each of the quadrants of the score plot: (i)
quadrant 1, (ii) quadrant 2, (iii) quadrant 3, and (iv) quadrant 4. (D) Graphical representation of the PCA biplot. (E) Confusion matrices obtained
using SVM-DA for (i) our SERS taster and (ii) a bare Ag surface.
window with minimal analytical errors, effectively identifying plots (Figure 3B(i)), our SERS taster exhibits distinct and
even subtle spectral variations. On the basis of the 2D PCA score compact clusters of data representing individual flavor molecules
2645 https://dx.doi.org/10.1021/acs.nanolett.1c00416
Nano Lett. 2021, 21, 2642−2649
Nano Letters pubs.acs.org/NanoLett Letter
including the control. Each data cluster is encapsulated within a separation of linalool and limonene from the flavorless control.
95% confidence ellipse. The clear separation between these Notably, we demonstrate that our SERS taster distinguishes
confidence ellipses indicates that our SERS taster effectively primary (MH), secondary (menthol), and tertiary (linalool)
differentiates all five flavor molecules. alcohols by classifying them in different quadrants with distinct
In contrast, as the number of receptors decreases, the relative SERS spectral changes. This successful classification is driven by
ability to separate these data clusters diminishes (Figure S4). our information-rich SERS superprofiles, which amalgamate and
The use of a single receptor (Ag) results in overlaps between the magnify all spectral variance arising from individual receptor−
confidence ellipses of menthol, limonene, and the control flavor interactions.
(Figure 3B(ii)). This overlap shows that the single receptor By examining the PCA biplot, we used our knowledge of
model cannot differentiate flavors because of insufficient SERS chemical interactions occurring at the molecular level to unravel
spectral variances. In addition, the larger confidence ellipses how the chemometric model classifies different flavors as
signify high intracluster variance. This comparison highlights the distinct clusters. This bridges the gap between SERS spectral
advantage of our SERS superprofiles over single receptor−flavor inputs and chemometric model outputs, ensuring our model is
interactions for distinct flavor identification. built upon valid receptor−flavor spectral variation and not
To further elucidate the underlying chemical meaning behind meaningless background variations.
the PCA scores, we scrutinize the PCA biplot (Figure 3C, D). To quantitatively evaluate the predictive capability of our
The biplot features key receptor spectral regions that drive the SERS taster, we constructed confusion matrices using SVM-DA
separation of each flavor data cluster to different quadrants of the (Figure 3E). SVM-DA is a supervised machine learning model
PCA score plot.19,20 We first deconstruct the biplot into four that allows us to predict the identity of flavor molecules by
quadrants (Figure 3C) and relate these spectral regions to examining their SERS superprofiles with a high degree of
specific receptor−flavor interactions causing the variation. flexibility and robustness.21−23 In the first model, we introduced
First, PCA classifies MHA in the first quadrant of the score superprofiles used in the PCA earlier. In the second model, we
plot (Figure 3B(i), lower right). In this quadrant, electrostatic introduced SERS spectra derived by exposing flavor molecules
interactions of MHA with PY result in significant spectral only to bare Ag.
variations in PY’s pyridine ring breathing mode at the 1000− From the resulting confusion matrices, we affirmed that our
1008 cm−1 region (Figure 3C(i), Figure S5i).15 The proximity of SERS taster achieves 100% accuracy in the classification of all
MHA with the bare Ag surface also amplifies MHA’s πOCO and flavors, including the control (Figure 3E(i), Table S3).
δHCO at the 630−670 cm−1 region. Conversely, bare Ag shows a wide accuracy range of 33.3−
MH lies in the second quadrant of the score plot, at the 87.5% (Figure 3E(ii), Table S3). Notably, the Ag surface
opposing end of the PC1 axis compared to MHA (Figure 3B(i), classifies thiolated flavor molecules (MHA, MH) with higher
lower left). Two reasons contribute to this classification. First, accuracy because of the formation of the strong Ag−thiolate
hydrogen bonding between MH and PY influences PY’s bond bringing them close to the plasmonic surface for SERS
concurrent pyridine ring stretch and C−S stretching (νCS) enhancement. In contrast, nonthiolated flavor molecules
mode at the 416−424 cm−1 region (Figure 3C(ii), FIgure (menthol, linalool, limonene) do not interact well with the Ag
S5i)).15 Next, Ag−thiolate interactions between MH and Ag surface and are prone to misclassification. These prediction
magnifies MH’s νCS in the 630−670 cm−1 region. In this region, outcomes demonstrate that our SERS taster effectively predicts
the Ag-MH spectrum is different from the Ag-MHA spectrum the identity of an unknown flavor molecule with high accuracy
because of inherent differences in molecular structure and through analysis of its SERS superprofile.
interactivity with the Ag surface. These unique peak shape Multiplex Flavor Quantification. To enhance the
changes promote the classification of MH in the opposing applicability of our SERS taster in actual flavor analysis, we
quadrant of MHA. demonstrated the ability of our SERS taster to simultaneously
Menthol positions itself between the second and third quantify two flavor molecules in an artificial wine matrix as a
quadrant of the score plot (Figure 3B(i), upper left). Crucially, proof-of-concept (Figure 4A). We selected MHA and MH, as
hydrogen bonding between menthol and PY causes peak ratio both are sulfur-containing molecules that exhibit intense fruity
change for PY’s out-of-plane C−H deformation mode at the notes concurrently found in many wine types, such as sauvignon
702−710 cm−1 region (Figure 3C(iii), S5(iii)).15 In addition, blanc and cabernet sauvignon.12
van der Waals’ interactions between menthol and NT causes a Using SVM regression (SVM-R), we constructed calibration
subtle decrease in peak intensity of the C−H bending (δHCH) curves to compare the quantification accuracy of our SERS taster
mode of NT in the 630−638 cm−1 region.17 These variations and a single receptor platform (BA). The flavor concentrations
differentiate menthol from the flavorless control. range from 2−10 μM (approximately 0.2−2 ppm, Figure S6).
Finally, linalool and limonene emerge as separate clusters For our SERS taster, the calibration curves show near ideal
within the fourth quadrant of the score plot (Figure 3B(i), upper linearity with high prediction coefficients of 0.998 for both MHA
right). For linalool, hydrogen bonding between linalool and PY and MH (Figure 4B(i), Figure S7i). In contrast, using only BA
increases the peak intensity of PY’s in-plane C−H deformation yields prediction coefficients of 0.964 and 0.952 for MHA and
mode in the 1008−1016 cm−1 region (Figure 3C(iv), Figure MH respectively (Figure 4B(ii), Figure S7ii).
S5i)).15 Concurrently, hydrogen bonding between linalool and Next, we prepared six artificial wine samples with varying
BA results in a distinct peak ratio change involving BA’s νOCO‑ in concentrations of both flavors and expose them to both
the 1350−1400 cm−1 region.16 For limonene, π−π interactions platforms. Using the calibration curves, our SERS taster showed
between limonene and BA result in a red shift of BA’s νCC at the excellent quantification accuracies for both flavors, ranging from
1584−1592 cm−1 region (Figure 3C(iv), S5(iv)).16 π−π 95 to 100% (Figure 4C). Notably, the difference between actual
interactions between limonene and NT also result in a decrease and predicted flavor concentrations in the artificial wine samples
in peak intensity of NT’s ring breathing mode in the 764−770 was 0−0.3 μM. In contrast, we observed lower quantification
cm−1 region.17 Collectively, these differences drive the cluster accuracies of 80−90% using only BA. The large range also
2646 https://dx.doi.org/10.1021/acs.nanolett.1c00416
Nano Lett. 2021, 21, 2642−2649
Nano Letters pubs.acs.org/NanoLett Letter
obtained using SVM-R for MHA using (i) our SERS taster and (ii) a The Supporting Information is available free of charge at
single receptor platform (BA). (C) Comparison of quantification https://pubs.acs.org/doi/10.1021/acs.nanolett.1c00416.
accuracy between our SERS taster and a single receptor (BA) platform Experimental methods; characterization of the synthe-
using six artificial wine samples with varying concentrations of MHA sized Ag nanocubes (Figure S1); in-depth spectral
and MH. For each sample, the predicted flavor concentration, and its
deviation from the actual concentration (% difference) is shown. analysis through peak deconvolution (Figure S2),
comparison of experimental SERS spectra and DFT
simulated spectra (Figure S3); PCA score plots for other
indicates higher inconsistencies arising from the model’s
receptor combinations (Figure S4); experimental SERS
inability to pick up minute spectral changes. The predicted
spectra showing key spectral regions which highlight
concentrations also exhibit larger deviations of 0.3−1.1 μM.
critical spectral variations (Figure S5); SERS spectra with
Crucially, we showed that our SERS taster exhibits enhanced
varying concentrations of MHA and MH within artificial
sensitivity to fluctuations in flavor concentrations as opposed to
wine (Figure S6); calibration curves for MH obtained
conventional single receptor platforms.
using SVM-R (Figure S7); SERS enhancement factor
Collectively, these results reiterate the capability of our
calculations (supplementary note 1), Discussion of DFT
multireceptor SERS taster to precisely quantify flavors in
simulated spectra in comparison to experimental results
samples, even in the presence of potential matrix interferences.
(supplementary note 2); molecular structures of
Notably, glycerol and tartaric acid do not skew our predictive
representative wine flavor molecules and their potential
outcomes, even though they have multiple hydroxyl groups that
interactions with receptor molecules (Table S1);
can potentially interfere with the chemical interactions that
comparison of experimental and DFT simulated peak
occur between a receptor−flavor pair (Table S4). Overall, we
intensities for the ring stretching modes of NT (Table
demonstrate the promising potential of our SERS taster in
S2); summary table for overall classification accuracy, rate
conducting multiplex quantification of flavor molecules, in
of false positives, and false negatives (Table S3);
addition to providing qualitative results.
■
molecular structure of potential interferences in an
CONCLUSIONS artificial wine matrix (Table S4) (PDF)
In conclusion, we have designed a machine-learning-driven
multireceptor SERS taster that enables multiplex profiling of five
wine flavors with 100% accuracy at the parts-per-million level.
■ AUTHOR INFORMATION
Corresponding Author
Notably, our two-pronged approach utilizes multiple molecular Xing Yi Ling − Division of Chemistry and Biological Chemistry,
receptors to generate rich SERS spectral variances and machine- School of Physical and Mathematical Sciences, Nanyang
learning-driven chemometric models to extract these variances Technological University, Singapore 637371, Singapore;
2647 https://dx.doi.org/10.1021/acs.nanolett.1c00416
Nano Lett. 2021, 21, 2642−2649
Nano Letters pubs.acs.org/NanoLett Letter
■
2015, 474, 29−35.
(17) Agarwal, N. R.; Lucotti, A.; Tommasini, M.; Neri, F.; Trusso, S.;
REFERENCES Ossi, P. M. SERS detection and DFT calculation of 2-naphthalene thiol
(1) Kao, Y.-C.; Han, X.; Lee, Y. H.; Lee, H. K.; Phan-Quang, G. C.; adsorbed on Ag and Au probes. Sens. Actuators, B 2016, 237, 545−555.
Lay, C. L.; Sim, H. Y. F.; Phua, V. J. X.; Ng, L. S.; Ku, C. W.; et al. (2020) (18) Polášková, P.; Herszage, J.; Ebeler, S. E. Wine flavor: chemistry in
Multiplex Surface-Enhanced Raman Scattering Identification and a glass. Chem. Soc. Rev. 2008, 37, 2478−2489.
Quantification of Urine Metabolites in Patient Samples within 30 (19) Elci, S. G.; Moyano, D. F.; Rana, S.; Tonga, G. Y.; Phillips, R. L.;
min. ACS Nano 2020, 14, 2542−2552. Bunz, U. H. F.; Rotello, V. M. Recognition of glycosaminoglycan
(2) Phan-Quang, G. C.; Yang, N.; Lee, H. K.; Sim, H. Y. F.; Koh, C. S. chemical patterns using an unbiased sensor array. Chem. Sci. 2013, 4,
L.; Kao, Y.-C.; Wong, Z. C.; Tan, E. K. M.; Miao, Y.-E.; Fan, W.; et al. 2076−2080.
(2019) Tracking Airborne Molecules from Afar: Three-Dimensional (20) Sipos, L.; Kovács, Z.; Sági-Kiss, V.; Csiki, T.; Kókai, Z.; Fekete,
Metal−Organic Framework-Surface-Enhanced Raman Scattering Plat- A.; Héberger, K. (2012) Discrimination of mineral waters by electronic
form for Stand-Off and Real-Time Atmospheric Monitoring. ACS Nano tongue, sensory evaluation and chemical analysis. Food Chem. 2012,
2019, 13, 12090−12099. 135, 2947−2953.
(3) Xu, M.-L.; Gao, Y.; Han, X. X.; Zhao, B. Detection of Pesticide (21) Belousov, A. I.; Verzakov, S. A.; von Frese, J. (2002) A flexible
Residues in Food Using Surface-Enhanced Raman Spectroscopy: A classification approach with optimal generalisation performance:
Review. J. Agric. Food Chem. 2017, 65, 6719−6726. support vector machines. Chemom. Intell. Lab. Syst. 2002, 64, 15−25.
(4) Lee, H. K.; Lee, Y. H.; Koh, C. S. L.; Phan-Quang, G. C.; Han, X.; (22) Alexandre Marcelo, M. C.; Martins, C. A.; Pozebon, D.; Ferrão,
Lay, C. L.; Sim, H. Y. F.; Kao, Y.-C.; An, Q.; Ling, X. Y. Designing M. F. Methods of multivariate analysis of NIR reflectance spectra for
surface-enhanced Raman scattering (SERS) platforms beyond hotspot classification of yerba mate. Anal. Methods 2014, 6, 7621−7627.
2648 https://dx.doi.org/10.1021/acs.nanolett.1c00416
Nano Lett. 2021, 21, 2642−2649
Nano Letters pubs.acs.org/NanoLett Letter
2649 https://dx.doi.org/10.1021/acs.nanolett.1c00416
Nano Lett. 2021, 21, 2642−2649