Professional Documents
Culture Documents
Contents
1. Introduction 83
2. The Effect of the Asymmetric Field 84
2.1 Differential Mobility in a Hard Sphere Environment (Type C Behaviour) 86
2.2 Introduction of Solvent Vapour (Type A and Type B Behaviour) 93
2.3 Exotic Dispersion Curves (Type D and Type E Behaviour) 104
3. Using DMS Clustering Behaviour to Assess Molecular Properties 106
3.1 Correlating Type B Critical Points With Molecular Properties 106
3.2 Application of Supervised ML 112
Acknowledgements 118
References 118
1. INTRODUCTION
The inception of differential mobility spectrometry (DMS) can be
traced back to the Gas Analyser of Ions reported by Gorshkov [1]. Over
the decade following this initial report, the development of differential
mobility devices proceeded along two different directions based on device
geometry. One direction, driven by Gorshkov and his research team, sought
to utilize cylindrical device geometry with curved electrodes to generate
inhomogeneous electric fields at the device output so as to focus the trans-
mitted ions into a detector. This implementation would later be referred to
as high-field asymmetric waveform ion mobility spectrometry (FAIMS)
[2–4]. The other direction of device development sought to utilize a planar
device geometry which could transmit all ions without discrimination when
the separation voltage (SV) was set to 0 V, and transmit cations and anions
simultaneously during operation [5,6]. This implementation became known
as DMS [7]. As these technologies matured, there arose an impetus towards
miniaturization, and the first microscale DMS (μDMS) device was reported
in the year 2000 [8]. While these different implementations of device geom-
etry impart different analytical properties, DMS (both macro- and microscale)
and FAIMS employ the same fundamental principle for physically separating
analytes—an ion’s coefficient of mobility has a very specific, nonlinear
electric field dependence typically described by:
E E
K ¼ K0 1 + α (1)
N N
where K0 is the mobility coefficient under low-field conditions and α is
a function that describes how the mobility depends on the ratio E/N
(expressed in Townsend, Td) where E is electric field and N is concentra-
tion of neutral particles.
Chapter “Tandem ion mobility spectrometry with drift tubes at ambient
pressure” by Eiceman, Section 2.2, provides an excellent overview of the
underlying theory for DMS, and the keen reader is directed to review
articles by Schneider et al. [9] and Kabir and Donald [10] for additional
perspectives. Of pertinence to the discussion which follows in here is the
observation that DMS separations can be substantially improved (in some
cases enabled) by the addition of a low partial pressure of polar solvent
vapour to the collision gas [11–24]. It is generally thought that this behaviour
is a result of dynamic ion–solvent clustering and declustering, which occurs
during the ion’s transit through the DMS cell. Consequently, it is useful to
revisit the principle of operation for DMS paying particular attention to
ion–molecule interactions. This is done in Section 2. We then describe
how one can use clustering propensities as measured by DMS to assess a
variety of gas and condensed-phase molecular properties in Section 3.
Gas flow
To detector
Unstable trajectory
Stable trajectory
SV CV
0 0
Time, t Time, t
Fig. 1 A schematic diagram showing ion trajectory in a DMS cell. The ion’s differential
mobility coefficient, α, is encoded in the compensation voltage (CV) required for optimal
transmission at a given separation voltage (SV). Adapted from W.S. Hopkins, Determining
the properties of gas-phase clusters, Mol. Phys. 113 (21) (2015) 3151–3158.
the SV waveform, then more slowly return to their original distance from
said electrode during the second half-cycle of the SV waveform. Upon
introducing a collision gas, the ion’s differential mobility between the
high- and low-field conditions results in migration towards one of the planar
electrodes as determined by the α parameter (see Eq. 1). To correct the ion’s
trajectory such that it may be transmitted through the DMS cell, one can
apply an appropriate CV. Thus, the ion’s differential mobility coefficient is
encoded in the CV required for optimal transmission at a given SV. Fig. 1
provides a schematic diagram describing this phenomenon. It is important
to note that at the molecular scale the curved electrodes of the FAIMS
implementation appear planar, and that the discussion that follows also
applies to FAIMS and μDMS devices.
By recording the optimal CV for ion transmission as a function of SV,
one can generate a dispersion plot. Dispersion plots may be thought of as a
visual representation of the α parameter and they provide insight as to the
behaviour of an analyte in a given gaseous environment. Fig. 2 shows the
dispersion plot for tributylammonium in a pure N2 environment at a pres-
sure of 1 atm (black curve), a N2 environment seeded with a 1.5% mole ratio
of methanol vapour (red curve), and a N2 environment seeded with a 1.5%
mole ratio of acetonitrile vapour (blue curve) [14]. Clearly, the differential
mobility exhibited by tributylammonium varies from one gaseous environ-
ment to another. The three dispersion plots shown in Fig. 2 are exemplary
of the three common behaviours observed for ions in DMS experiments.
86 W. Scott Hopkins
20 N2 Type C
MeOH
10 ACN
0
CV (V)
Type B
–10
–20
–30
Type A
–40
0 1000 2000 3000 4000
SV (V)
Fig. 2 Dispersion plots for tributylammonium as recorded in (black curve) a pure N2
environment, (red curve) a N2 environment seeded with a 1.5% mole ratio of methanol
vapour, and (blue curve) a N2 environment seeded with a 1.5% mole ratio of acetonitrile
vapour. The experiments were conducted at a pressure of P ¼ 1 atm and a temperature
of T ¼ 150°C. Adapted from J.L. Campbell, M. Zhu, W.S. Hopkins, Ion-molecule clustering in
differential mobility spectrometry: lessons learned from tetraalkylammonium cations and
their isomers, J. Am. Soc. Mass Spectrom. 25 (9) (2014) 1583–1591.
sin(2wt)
sin(wt)
Fig. 3 A DMS waveform produced by adding two sinusoidal functions. Note that the
shaded areas above and below SV ¼ 0 V are equal.
Fig. 4 The viscosity of fluids like honey is governed by the momentum transfer between
molecular layers. The force required to overcome the interactions between fluid layers is
proportional to the velocity gradient, ν0/d.
between the knife’s surface and the molecules in the fluids that are in a layer
directly adjacent to the knife drag the molecules of the fluid along in the
direction of the knife’s movement. This behaviour extends out into the fluid
away from the surface of the knife. Molecular collisions transfer momentum
between the fluid layers; faster molecules collide with slower molecules
causing the slower molecules to speed up; and the faster molecules to slow
down. This effect is often observed at the macroscopic level at ice skating
rinks and stock car ovals. At the microscopic level, this effect gives rise to
fluid viscosity. The friction between the fluid layers originates from the
interactions between molecules as they slide past one another.
In liquids, the proximity of the molecules gives rise to relatively strong
intermolecular interactions (compared to gases). Consequently, a relatively
large shear force is required to cause the layers of the liquid to move past one
another. In our honey example, the force required to overcome the inter-
actions between fluid layers is proportional to the velocity gradient, ν0/d, in
the region of the knife:
F ν0
¼η (3)
A d
where F/A is the force per unit area of plate (i.e. knife) and η is the coefficient
of viscosity [26]. For liquids, η decreases with increasing temperature—at
higher temperatures the high-kinetic energies of the molecules allow them
to overcome the intermolecular forces between the fluid layers allowing the
layers to slip past one another with greater ease. In contrast to the behaviour
Dynamic Clustering and Ion Microsolvation 89
N
z ¼ σ hvrel i (5)
V
where hvreli is the average relative velocity of the particles and N/V is the gas
number density. It is clear from this description that the increased collision
frequency is a local effect for the ion since the neutral, hard sphere collision
gas is affected to only a relatively minor extent by application of the electric
field. In other words, as the applied electric field increases, the instantaneous
relative velocity and collision frequency of an ion in the field also increases.
This increases the momentum transfer to the bath/collision gas and results in
an increased fluid viscosity in the region of the ion. This scenario is depicted
schematically in Fig. 5. Because local fluid viscosity increases with increasing
electric field, ion mobility decreases with increasing electric field, and a pos-
itive voltage offset (i.e. CV) is required to correct ion trajectory and ensure
ion transmission. This gives rise to the observed Type C behaviour.
Given the earlier qualitative model of differential mobility, one might
naı̈vely expect that a quantitative model should be forthcoming. However,
there are two major challenges that must be overcome en route to developing
a quantitative model of differential mobility: [1] How does one determine
the local temperature of the ions from first principles? [2] How does one
90 W. Scott Hopkins
Fig. 5 The local viscosity experienced by the ion (red hexagon) depends on the applied
electric field.
mv2
T¼ (6)
3kB
where m is the particle mass and v is the root mean square of the total veloc-
ity. That is,
frequency, CCS, gas number density, and the partitioning of total energy
between the internal energy states of the molecules and their kinetic energies
following collision. Consequently, the field-dependent temperature can
vary from one system to another.
The CCS of an ion can also potentially vary as a function of the applied
electric field. At relatively low temperature, an ion is constrained to sampling
the relatively few nuclear configurations that are accessible at the energy
available to the system. Generally, these configurations occupy a similar
region of the potential energy landscape and so exhibit similar structures.
Note that this does not discount the possibility of observing multiple isomers
or conformers in a sample since the same argument holds for high energy
species kinetically trapped in a different region of the potential energy sur-
face than the global minimum structure (provided that the barrier to isom-
erization to the ground state is relatively high) [29–31]. As the electric field
increases, the local temperature (viz. thermal energy) of the ion can increase
to the point where substantially different nuclear configurations are acces-
sible. If the ion can adopt a higher energy configuration on the timescale of
the DMS asymmetric waveform (100 s of nanoseconds), the orientationally
averaged CCS of the molecular ion can change. Ultimately, to address
the two major issues outlined earlier, accurate molecular dynamics (MD)
simulations and detailed knowledge molecular partition coefficients are
necessary.
It is also worth noting that the assumption of hard sphere behaviour for a,
e.g., N2 environment, is not necessarily valid. Because N2 has a polarizability of
1.710 Å3 [32], it can form relatively strongly bound charge/induced-dipole
complexes. For example, the dispersion- and counterpoise-corrected D0
value (i.e. dissociation from the vibrational ground state) for the (H2O•N2)+
complex is 78.1 kJmol1 at the B3LYP/6-311++G(d,p) level of theory. In
comparison, the neutral water dimer, (H2O)2, has a similarly calculated D0
value of 24.3 kJ mol1. Thus, ions can exhibit Type A or Type B behaviour
even for environments in which they might be expected to exhibit Type
C behaviour. For example, Fig. 6 shows the dispersion plots for tetramethyl-
ammonium, tetraethylammonium, and tetrapropylammonium in a pure N2
environment at P ¼ 1 atm and T ¼ 150°C [14]. While the tetraethyl and
tetrapropyl analogues exhibit Type C behaviour (as expected), the tetramethyl
analogue clearly exhibits Type B behaviour. Moreover, there are examples of
elemental ions (K+, Cs+) which exhibit Type A behaviour in N2 environments
[33]. However, it should be noted that the purity of the transport gas in these
92 W. Scott Hopkins
R4N+ in N2
R = CH3
= CH2CH3
= CH2CH2CH3
experiments was not stated, and it is possible that the clustering behaviour arose
from interactions between the analyte ions and trace levels of water vapour.
Given the apparent importance of ion–solvent clustering in DMS, a detailed
discussion is warranted and is presented in Section 2.2.
As a final point of consideration regarding ion CCSs in DMS, it is also
important to highlight the fact that the impact of dynamic field-dependent
dipole alignment (and therefore hindered molecular rotation) is not well
understood. Most relatively, small molecules are expected to experience
an essentially free rotation even under the high-field conditions of most
commercial DMS instruments. Consequently, the trajectories of these ions
are influenced by their orientationally averaged CCS, Ωave, with the buffer
gas molecules. However, Shvartsburg et al. showed that permanent electric
dipoles of >400D are oriented by the electric field under typical experimen-
tal conditions [34]. Dipole moments of this magnitude are found for many
macroions, such as proteins with masses in excess of c. 30 kDa. Rather than
Ωave, the ion trajectories of these dipole-aligned species instead depend on
directional CCSs, Ωdir. While direct evidence of dipole alignment in DMS
studies has yet to be reported, ions with masses above 29 kDa have been
shown to exhibit a substantial increase of mobility under high-field condi-
tions, a behaviour which is consistent with dipole alignment [34]. Eventu-
ally, spectroscopic studies and/or high-level MD simulations could shed
light on this phenomenon.
Dynamic Clustering and Ion Microsolvation 93
6-Chloro
7-Chloro
8-Chloro
Fig. 7 Dispersion plots for protonated (black) 6-chloro-, (blue) 7-chloro-, and (red)
8-chloro-2-methylquinolinine in (left) a pure N2 environment and (right) a N2 environ-
ment seeded with 1.5 mol% H2O vapour. Error bars show 2σ of the Gaussian mean
for the fitted ionogram peaks. Adapted from C. Liu, J.C.Y. Le Blanc, J. Shields,
J.S. Janiszewski, C. Ieritano, G.F. Ye, et al., Using differential mobility spectrometry to
measure ion solvation: an examination of the roles of solvents and ionic structures in
separating quinoline-based drugs, Analyst 14 (20) (2015) 6897–6903.
the site of protonation (the ring nitrogen) is also the site of the strongest inter-
action with the H2O solvent molecules. The microsolvation process of the
methylquinoline derivatives begins when a single H2O molecule forms an
ionic hydrogen bond with the protonated ring nitrogen atom. Subsequent
water molecules hydrogen bond to those previously bound to the ion to create
an extended H-bonding network near the site of protonation [13,35,36]. In the
case of the 8-substituted isomer of chloro-2-methylquinoline, the chlorine
atom sterically hinders the formation of the extended H2O hydrogen
bond network [35]. Consequently, the 8-substituted species are more weakly
bound and decluster at lower temperature (viz. electric field) than do the 6- and
7-substituted isomers. In other words, the 8-substituted isomer spends more
of its time as a bare ion and so it adopts a weak-clustering Type B dispersion
curve whereas the 6- and 7-substituted isomers exist as larger clusters further
into the high-field portion of the waveform leading to strong-clustering
Type B and Type A dispersion curves.
Dynamic Clustering and Ion Microsolvation 95
where ΔHform and ΔSform are the enthalpy and entropy of cluster formation,
respectively. Thus, it stands to reason that the dynamic clustering processes
(and therefore DMS behaviours) of ions should be temperature dependent.
Indeed, introducing temperature control to the DMS cell provides an addi-
tional means of fine-tuning ion trajectories (and therefore separations). Fig. 8
shows the dispersion curves for a drug candidate recorded at T ¼ 150, 225,
and 300°C in a N2 environment seeded with 1.5 mol% methanol vapour
[37]. As the temperature of the DMS cell is increased, the ion behaviour
changes from Type A (at 150°C) to Type B (at 225°C), and then to Type
C (at 300°C). These observations provide some insight into the dynamic
ion–solvent clustering process. At T ¼ 150°C, the Gibbs’ energy of the sys-
tem is such that relatively large ion–solvent clusters can form under the low-
field condition, leading to a relatively large difference in ion mobility
between low- and high-field (where the solvent clusters evaporate). When
the temperature is increased to T ¼ 225°C, there is enough thermal energy
in the system to prevent the formation of larger clusters, resulting in a
decrease in the observed differential mobility between low- and high field.
96 W. Scott Hopkins
O
H
N N
N N
O
CF3
300°C
225°C
150°C
Fig. 8 Dispersion plots for a drug candidate molecule (inset) recorded at (black) T ¼ 150°C,
(blue) T ¼ 225°C, and (red) T ¼ 300°C in a N2 environment seeded with 1.5 mol% MeOH
at P ¼ 1 atm. The site of protonation is highlighted in red. Adapted from S.W.C. Walker,
A. Anwar, J. Psutka, J. Crouse, C. Liu, J.C.Y. Le Blanc, J. Montgomery, G.H. Goetz,
J.S. Janiszewski, J.L. Campbell, W.S. Hopkins, Determining Molecular Properties with Differen-
tial Mobility Spectrometry and Machine Learning. Submitted (revision requested) (2018).
At T ¼ 300°C, few (or no) ion–solvent clusters form under the low-field
condition, and the collision gas behaves as a hard sphere environment.
Owing to the difficulties in determining the field-induced temperature
and dynamic CCSs of ions (discussed earlier), it is difficult to develop an
accurate model of dynamic clustering for DMS. However, some insight
can be gained by examining the clustering behaviour of ions under static,
equilibrium conditions. To do so, it is first beneficial to determine the
approximate structures of the ion–solvent clusters that are likely to be present
under the given conditions of the DMS experiment. This can be accom-
plished to a first approximation by conducting NVT MD studies wherein
the number of particles, volume, and temperature of the system are fixed.
Although several varieties of MD simulations/programmes are available
[38,39], the application of basin-hopping (BH) search algorithms has shown
a good deal of success in identifying candidate structures for low-energy iso-
mers of small-to-intermediate-sized ion–solvent clusters [16,30,31,40–43].
The BH–MD methodology is akin to simulated annealing or Monte Carlo
with minimization, and as such it is efficient at locating stationary points on
the cluster potential energy landscape. Furthermore, simulation efficiencies
Dynamic Clustering and Ion Microsolvation 97
n=7 n=8
Fig. 9 The lowest energy structures of protonated 4-aminobenzoic acid/methanol clus-
ters, (4-ABA+ H)+(MeOH)n (n ¼ 1–8), as calculated at the B3LYP/6-311++G(d,p) GD3
level of theory. The green highlighted regions indicate hydrogen bonds. Adapted from
J.L. Campbell, A.M.-C. Yang, L.R. Melo, W.S. Hopkins, Studying gas-phase interconversion
of tautomers using differential mobility spectrometry, J. Am. Soc. Mass Spectrom. 27 (7)
(2016) 1277–1284.
98 W. Scott Hopkins
N=2 N=3
N=4 N=5
Fig. 10 The calculated proton affinities (PAs), gas-phase basicities (GBs), and structures
for isopropyl alcohol clusters, (IPA)n (n ¼ 1–5), as calculated at the B3LYP/6-311 ++G(d,p)
GD3 level of theory.
100 W. Scott Hopkins
where GX•solv(n1), Gsolv, and GX•solv(n) are the Gibbs’ energies of the (n 1)
solvent cluster, the solvent molecule, and the n-solvent cluster, respectively.
ΔGform indicates the relative stability of an n-solvent cluster compared to the
bare ion and n-solvent molecules:
ΔGform ¼ GX + nGsolv GXsolvðnÞ (11)
where GX is the Gibbs’ energy of the bare ion. The cluster with the lowest
Gibbs’ energy of formation for a given pressure and temperature, is the spe-
cies with the highest fractional population within the cluster distribution at
equilibrium. The DMS cell is clearly a perturbed environment compared to
field-free conditions. However, the effect of the dynamic electric field
on ion alignment and polarization is currently an open question. Fortu-
nately, on the timescale of DMS separations, all degrees of freedom
are equilibrated [58–60], so this treatment does yield a coarse estimate
of relative cluster populations. Fig. 11 shows the Gibbs’ energies of forma-
tion for the (4-ABA + H)+(MeOH)n (n ¼ 1–8) species at a pressure of
P ¼ 0.127 atm and a temperature of T ¼ 450 K. This is the partial pressure
of methanol at room temperature (the temperature at which solvent
vapour is typically seeded into the N2 collision gas), and the lowest tem-
perature setting of the DMS cell. Note that the energies in Fig. 11 have
been reported relative to the most stable cluster, (4-ABA + H)+(MeOH)4.
Doing so facilitates estimation of the relative populations for the various
cluster sizes via [61,62]:
P = 0.127 atm
T = 450 K
Spontaneous
formation
0 1 2 3 4 5 6 7 8
N
Fig. 11 The relative Gibb’s energies of formation for (4-ABA +H)+(MeOH)n (n ¼ 0–8), as
calculated at the B3LYP/6-311 ++G(d,p) GD3 level of theory with P ¼ 0.127 atm and
T ¼ 450 K.
Dynamic Clustering and Ion Microsolvation 101
where Ni is the relative population of the ith cluster, N0 is the relative pop-
ulation of the lowest energy cluster (usually set to 1), ΔGrel_form is the Gibbs’
energy of formation relative to the lowest energy cluster, and kB is
Boltzmann’s constant. By calculating the relative populations of the clusters
as a function of temperature (at a constant pressure of P ¼ 0.127 atm), one
can produce a temperature-dependent relative population plot as constructed
for (4-ABA + H)+(MeOH)n (n ¼ 0–8) in Fig. 12. This plot shows that at the
low-temperature setting of the DMS cell (c. 150°C, 450 K), (4-ABA + H)+
ions favour a microsolvated state with c. four bound methanol molecules.
As temperature increases beyond c. 200°C (c. 500 K), the clusters containing
a single methanol molecule are favoured. At temperatures greater than
c. 700°C, the bare ion is the dominant charged species in the DMS cell.
The relative populations of (4-ABA + H)+(MeOH)n (n ¼ 0–8) species
shown in Fig. 12 again raises the question of the field-dependent tempera-
ture in the DMS cell (described earlier). The two-temperature theory sug-
gests that ion temperature, Tion, may be approximated as [58–60]:
M ðKEÞ2
Tion ¼ T0 + Tfield T0 + (13)
3kB
where E is the magnitude of the electric field, K is the ion mobility, and M is
the molecular mass of the collision gas. Ion mobility is related to the CCS of
the ion via the Mason–Schamp equation [58,60]:
1=2
N0 3 2π q
K ¼ K0 ¼ (14)
N 16 μkB Tion N σ ðTion Þ
N=4 N=0
N=1
N=2
N=3
Tmax (K+)
Tmax (Cs+)
Tmin = 450 K
Fig. 13 An estimate of ion temperatures for K+ and Cs+ based on the predictions of two-
temperature theory [59,60]. Calculations employed T0 ¼ 450 K, P ¼ 1 atm N2,
Emax ¼ 15 kV cm 1, K0(K+) ¼ 2.47 cm2 V1 s1, and K0(Cs+) ¼ 2.21 cm2 V1 s1.
where N is the gas number density, N0 is the gas number density at standard
temperature and pressure, μ is the reduced mass for the ion/collision gas
molecule system, and q is the ion charge. However, since the ion CCS
(and therefore ion mobility) is temperature dependent, the treatment pre-
scribed by Eq. 13 has an embedded circular reference. Robinson and
coworkers have demonstrated that one can circumvent this problem in stud-
ies of proteins by monitoring structural changes as a function of gas temper-
ature and electric field [58]. Their work showed that the field-dependent ion
temperature (which scales as E2) oscillates with the applied electric field. For
ubiquitin ions with charge states ranging from z ¼ 6–8, fields of up to
20 kV cm1 were studied and maximum field-induced temperature varia-
tions on the order of a few tens of degrees were observed during the SV
cycle. Of course, it is clear by inspection of Eq. 13 that the field-induced
temperature experienced by an ion varies from one species to another
due to changes in CCS and, therefore, mobility. For example, Fig. 13 shows
the field-dependent temperature of K+ and Cs+ in N2 as calculated via
Eq. 13. These calculations assume that the respective K0 values describe
the ion mobility under all E-field conditions. Since, under hard sphere col-
lision conditions, ion mobility decreases with increasing field, the maximum
calculated temperatures (Tmax) should be viewed as an upper bound on the
local temperature experienced by the ion in a DMS cell with P ¼ 1 atm N2
and Emax ¼ 15 kV cm1. Of course, one could arrive at a more accurate esti-
mate of temperature by taking α(E/N) into consideration. By tracking the
maximum temperature of the K+ and Cs+ ions as a function of field strength,
we arrive at the plot shown in Fig. 14. As expected, based on Eq. 13,
Dynamic Clustering and Ion Microsolvation 103
K+
Cs+
Fig. 14 The maximum temperatures for K+ and Cs+ for a given maximum DMS field
strength as predicted by two-temperature theory [59,60]. Calculations employed
T0 ¼ 450 K, P ¼ 1 atm N2, K0(K+) ¼ 2.47 cm2 V1 s1, and K0(Cs+) ¼ 2.21 cm2 V1 s1.
K+
Cs +
Fig. 15 The average temperatures for K+ and Cs+ for a given maximum DMS field
strength as predicted by two-temperature theory [59,60]. Calculations employed
T0 ¼ 450 K, P ¼ 1 atm N2, K0(K+) ¼ 2.47 cm2 V1 s1, and K0(Cs+) ¼ 2.21 cm2 V1 s1.
ion temperature increases quadratically with increasing field strength and the
temperature of K+ increases more rapidly with field than does the temper-
ature of Cs+ due to differences in ion mobility.
The ion temperature arguments outlined earlier have important bearing
on the dynamic ion–solvent clustering process. The most obvious impact is
that small ions can experience local temperatures of several hundred degrees
during the high-field portion of the SV waveform. It is also worth noting
that the secondary maxima in temperature (see Fig. 13) also reach relatively
high temperatures at high-field (c. 600 K for K+ at 120 Td). Consequently,
the average temperature that an ion experiences during the SV waveform
can also be substantial (see Fig. 15). In extreme cases, the thermal energies
104 W. Scott Hopkins
O
N O
N
O H
150°C
300°C
Fig. 16 Dispersion curves observed for a drug candidate at (blue) T ¼ 150°C and (red)
T ¼ 300°C. The Type D dispersion curve (T ¼ 300°C) occurs when a strongly bound
ion–solvent cluster survives the high-field portion of the SV waveform at low SV volt-
ages. The early part of the Type D dispersion curve is reminiscent of Type
C behaviour. Eventually a critical point is reached where the ion–solvent cluster disso-
ciates at high SV. At SV voltages above this critical point, the ion adopts Type A
behaviour. Error bars show the standard error of the Gaussian ionogram peak fits.
The site of protonation, as determined by calculations at the B3LYP/6-311 ++G(d,p) level
of theory, is highlighted on the inset molecule in red.
Dynamic Clustering and Ion Microsolvation 105
150°C
300°C
(as was the case for the Type D behaviour described earlier). However, in the
case of tricarbastannatrane (see Fig. 17, inset), the transannular nitrogen centre
also donates electron density to stabilize the positive charge on the tin. Con-
sequently, the nitrogen centre is weakly Lewis acidic and forms weakly bound
clusters with acetonitrile. Dynamic clustering associated with the secondary
nitrogen binding site results in Type B behaviour for the N(C3H6)3Sn+•
NCCH3 complex, which dissociates upon entering the mass spectrometer
and appears in the N(C3H6)3Sn+ ion channel. At high SV voltage, the binding
energy of the Sn-bound acetonitrile molecule is also overcome, and the bare
N(C3H6)3Sn+ ion exhibits Type A behaviour. It is this combination of
Type B and Type A behaviour that gives rise to the Type E curve.
When measuring ion behaviour in a modified environment, the forma-
tion of strongly bound ion–solvent complexes is an important consideration.
Not only can these species exhibit unexpected DMS trajectories but also, in
cases where they fragment prior to mass spectrometric detection, their
behaviours can be confused/convoluted with those of the bare ions. It is
therefore useful to examine ion DMS behaviour at multiple temperatures
and to introduce post-DMS probing techniques [23,24] to ensure the cor-
rect identification of the DMS signal carrier. For example, one can employ
in-source CID via application of a declustering potential (DP) to probe ions
post-DMS, but pre-MS detection. As the DP is ramped up, the signals of
bare ions will deplete due to fragmentation. In contrast, the signal intensity
of ions which are produced from fragmentation of larger weakly bound clus-
ters initially increase with increasing DP. Fig. 18 shows the ionogram for
protonated cytosine at SV ¼ 3500 V as recorded over a series of DP voltages.
Three signals corresponding to the three lowest energy isomers are observed
to depleted as the DP voltage is increased. A fourth peak in the ionogram (at
CV ¼ 4 V) increases in intensity with increasing DP, suggesting that the m/z
112 species associated with this feature is being produced via fragmentation
of a larger cluster/complex [23].
II
III
charge sites has been observed [12,14,65]. It is now becoming clear that the
dynamic clustering environment of the DMS cell provides a means of
statistically sampling molecular interaction potentials. Given that the SV
waveform cycles at a frequency on the order of 1 MHz, and that ion flight
times through the DMS cell are on the order of 10 ms [9], an ion in a solvent-
seeded environment undergoes c. 10,000 cycles of condensation and evap-
oration as it transits the DMS cell. Because of this, strong correlations can be
observed between an analyte’s behaviour in DMS experiments and other
molecular properties which are dependent on the interaction potential
between the analyte and its environment.
The most obvious physicochemical property which depends on the
interaction potential between an ion and a solvent molecule is, of course,
the solvent binding energy. There have been several systematic studies
showing that the clustering propensities of ions as observed by DMS corre-
late strongly with the binding energies of clusters which contain a single
108 W. Scott Hopkins
N2
6-Amino
8-Amino
6-Amino-2-methylquinolinium•H2O
H2O
6-Amino
8-Amino
8-Amino-2-methylquinolinium•H2O
Fig. 19 (Left) Molecular orbitals showing the interaction between a water molecule and
the protonation site of (left top) 6-amino-2-methylquinolinium and (left bottom)
8-amino-2-methylquinolinium. Calculations were conducted at the B3LYP/6-311
++G(d,p) level of theory. (Right) Dispersion plots recorded for 6- and 8-amino-2-
methylquinolinium in a pure N2 environment and a N2 environment seeded with
1.5 mol% H2O at P ¼ 1 atm and T ¼ 150°C. Adapted from C. Liu, J.C.Y. Le Blanc, J. Shields,
J.S. Janiszewski, C. Ieritano, G.F. Ye, et al., Using differential mobility spectrometry to mea-
sure ion solvation: an examination of the roles of solvents and ionic structures in separating
quinoline-based drugs, Analyst 14 (20) (2015) 6897–6903.
Dynamic Clustering and Ion Microsolvation 109
Fig. 20 The correlation between calculated solvent binding energies and SV@CVmin for
several derivatives of 2-methylquinolinium. Calculations were conducted at the
CCSD(T)/6-311 ++G(d,p) level of theory. Data were recorded in an N2 environment
seeded with 1.5 mol% H2O at P ¼ 1 atm and T ¼ 150°C. Adapted from C. Liu, J.C.Y. Le
Blanc, J. Shields, J.S. Janiszewski, C. Ieritano, G.F. Ye, et al., Using differential mobility spec-
trometry to measure ion solvation: an examination of the roles of solvents and ionic struc-
tures in separating quinoline-based drugs, Analyst 14 (20) (2015) 6897–6903.
5-Substituted
6-Substituted
7-Substituted
Fig. 21 The correlation between experimentally determined pKb values and SV@CVmin
for several derivatives of 2-methylquinolin-8-ol. (top panel) 5-substituted derivatives,
(middle panel) 6-substituted derivatives, (bottom panel) 7-substituted derivatives where
X ¼ H, F, Cl, Br, CH3, OCH3, NO2. DMS data were recorded in a N2 environment seeded
with 1.5 mol% H3COH at P ¼ 1 atm and T ¼ 150°C. Adapted from C. Liu, J.C.Y. Le Blanc,
B.B. Schneider, J. Shields, J.J. Federico, H. Zhang, et al., Assessing physicochemical properties
of drug molecules via microsolvation measurements with differential mobility spectrometry,
ACS Cent. Sci. 3 (2) (2017) 101–109.
Dynamic Clustering and Ion Microsolvation 111
5-Methoxy-2-methylquinlin-8-ol
Neutral Protonated
Fig. 22 The electrostatic potential mapped onto total electron density for the (left) neu-
tral and (right) protonated forms of 5-methoxy-2-methylquinolin-8-ol as calculated at
the B3LYP/6-311 ++G(d,p) level of theory. Blue regions indicate partial positive charge
and red regions indicate partial negative charge. Adapted from C. Liu, J.C.Y. Le Blanc,
B.B. Schneider, J. Shields, J.J. Federico, H. Zhang, et al., Assessing physicochemical properties
of drug molecules via microsolvation measurements with differential mobility spectrometry,
ACS Cent. Sci. 3 (2) (2017) 101–109.
112 W. Scott Hopkins
onto the total electron density for the neutral and protonated forms of
5-methoxy-2-methylquinolin-8-ol. In both cases, the darkest regions of
the potential map—blue indicating electron deficiency and red indicating
electron richness—are in the region of the NH and OH groups. While
the relative energetics of solvation for the ionized species are likely to differ
substantially from those expected for the neutral molecule, the mechanism/
mode of interaction is similar (i.e. H-bonding at the same site of adduction)
and relative trends for the neutral and ionized systems mirror one another
[36,37]. This interpretation is supported by the fact that intermolecular
hydrogen bonding has been identified as the first step in protonation
reactions, and these motifs closely resemble the early stages proton-transfer
processes [67–69]. It is also worth noting that the protonated forms of drugs
(like the quinoline derivatives discussed earlier) may play an important role
in absorption processes in the digestive tract where pH ranges from approx-
imately 6.5 (in the intestines) to approximately 1.7 (in the stomach) [70], and
that condensed-phase measurements conducted under physiological condi-
tions may be probing the ionized versions of these species.
Although there are strong correlations observed when plotting physico-
chemical properties against SV@CVmin, this methodology fails when
treating a large group of molecules which exhibit a wide range of structures
and chemistries. Even in the case of the methylquinoline-8-ol derivatives
discussed earlier, where strong correlations were observed between, e.g.,
SV@CVmin and cell permeability for a series of positional isomers (i.e.
5- vs 6- vs 7-substituted species), correlations across derivatives (e.g. CN
vs NO2 vs F) were poor [36]. The origin of these discrepancies lies in the
fact that DMS ion trajectories rely not only on ion interaction potentials
(viz. clustering with the surrounding environment) but also on the size of
the molecules (i.e. CCS). Thus, a proper treatment of the experimental data
requires fitting in a multidimensional parameter space. This can be accom-
plished through application of supervised ML.
By mapping the DMS parameter space of a large training set of ions, one
can then generate a predictive model to estimate, e.g., the water binding
energy of ions based on their DMS behaviours. There are, of course, caveats
of which one should be mindful when taking such an approach to DMS data
processing. First and foremost, the DMS dataset must be representative of
the output onto which the data are being mapped. To date, it has been dem-
onstrated that DMS data can reliably predict properties of small molecules
with relatively simple interaction potentials [37], but work must still be con-
ducted to determine the applicability to larger, more complex systems (e.g.
proteins, DNA). It is also necessary that the training set contains examples
that are representative of the species for which one seeks to estimate phys-
icochemical properties; if an ML model is trained on amino acid data, it is
unlike to perform well in predicting the properties of, e.g., fullerenes.
Finally, in the development of accurate ML models, data are key! The larger
the input space, the more likely the learning algorithm is to infer the correct
mapping function and provide accurate output.
The selection of a suitable learning algorithm is also an important con-
sideration as there are numerous methods available [71,72]. Appropriate ML
models must exhibit a balance between bias and variance while avoiding
overfitting. In an ML model, bias typically results by preselecting the func-
tional description of the data (i.e. the fit is biased to reproduce the model
114 W. Scott Hopkins
imposed by the user), and variance, which depends how well the model fits
the data, can depend on how data are partitioned into training sets and test
sets. Overfitting leads to poor applicability of a model outside the data range
used for training. Learning algorithms based on Decision Trees are a good
choice since they are exceptionally unbiased and are easily implemented
[73]. One such algorithm, known as the Random Forest method, employs
an ensemble of random decision trees for training and outputs the mode of
the classes (for classification) or the mean prediction (for regression) with
low variance [74]. Moreover, Random Forest exhibits a relatively low sus-
ceptibility to overfitting. Fig. 23 shows a schematic diagram of the Random
Forest regression algorithm.
Walker et al. have shown that by treating DMS data with the Random
Forest regression algorithm one can create highly accurate models to predict
a variety of molecular properties [37]. For example, Fig. 24 shows the results
of a Random Forest model which uses DMS data, {m/z, charge state, tem-
perature, modifier, SV, CV}, to predict the ion CCS as calculated using the
MobCal programme following geometry optimization at the B3LYP/
6-311 ++G(d,p) level of theory [45,46,75,76]. Note that 10 different
molecular topologies belonging to four different chemical classes were
included in the dataset: (blue) drug molecules exhibiting intramolecular
hydrogen bonding (IMHB), (red) 2-methylquinolin-8-ol derivatives,
(green) 2-methylquinoline derivatives, and (black) acrylamide covalent
xi
3 4 5
}
4±1
Fig. 23 A schematic example of the Random Forest regression algorithm. Each coloured
node on the tree represents a data splitting/binning procedure. For example, if at
SV ¼ 3500 V CV > 0 choose bin A, otherwise choose bin B. Each decision tree yields a
value for the target parameter, Y. By using an ensemble of random decision trees,
one is able to predict the mean value of the target parameter.
Dynamic Clustering and Ion Microsolvation 115
Fig. 24 A Random Forest regression fit of DMS data to calculated collision cross sections
(CCSs) for a training set consisting of 10 topological classes of molecules belonging to
four different chemical classes: (blue) drug molecules exhibiting intramolecular hydro-
gen bonding (IMHB), (red) 2-methylquinolin-8-ol derivatives (MQOH), (green)
2-methylquinoline derivatives (MQ), and (black) acrylamide covalent reactive
groups (CRGs). Adapted from S.W.C. Walker, A. Anwar, J. Psutka, J. Crouse, C. Liu, J.C.Y.
Le Blanc, J. Montgomery, G.H. Goetz, J.S. Janiszewski, J.L. Campbell, W.S. Hopkins, Determin-
ing Molecular Properties with Differential Mobility Spectrometry and Machine Learning.
Submitted (revision requested) (2018).
Fig. 25 A Random Forest regression fit of DMS data to experimentally determined cell
permeabilities for: (green) drug molecules exhibiting intramolecular hydrogen bonding
(IMHB), (blue) 2-methylquinolin-8-ol derivatives (MQOH), and (black) 2-methylquinoline
derivatives (MQ). Adapted from S.W.C. Walker, A. Anwar, J. Psutka, J. Crouse, C. Liu, J.C.Y. Le
Blanc, J. Montgomery, G.H. Goetz, J.S. Janiszewski, J.L. Campbell, W.S. Hopkins, Determining
Molecular Properties with Differential Mobility Spectrometry and Machine Learning. Sub-
mitted (revision requested) (2018).
Fig. 25 shows the results of a Random Forest fit of DMS data to condensed-
phase cell permeabilities. If only experimental DMS data are employed in
the ML treatment, the coefficient of determination is R2 ¼ 0.889 and the
mean average error of the fit is 1.73 106 cm s1 [37]. However, if one
also includes molecular CCS in the parameter vector, the fit improves to
R2 ¼ 0.989 with a mean average error, MAE ¼ 0.82 106 cm s1. Note
that employing SV@CVmin to predict cell permeability yields a very weak
correlation [36]. Since the rate of passive cell permeability is dependent on
molecular size and on the propensity of the molecule to desolvate and trans-
port across the hydrophobic lipid bilayer [78–80], it stands to reason that
DMS data, which encodes molecular size and interaction potential, should
correlate strongly with this molecular property.
Walker et al. have also shown that, when treated with the Random For-
est method, DMS data can be used to predict pKa, pKb, logD, solubility,
experimental polar surface area, and reaction rates for CRG-based drugs
with glutathione [37]. The strong correlation between DMS data and the
physicochemical properties of small molecules is appealing since it suggests
that the technique can be used to quickly and accurately determine a variety
Dynamic Clustering and Ion Microsolvation 117
Fig. 26 Application of the Random Forest ML model shown in Fig. 24. The 89-molecule
set was split into an 88-molecule training set and 1-molecule test set; the procedure was
iterated such that each molecule was left out-of-the-bag as a test (i.e. 89-fold cross val-
idation). The model was used to predict collision cross sections (CCSs) for 10 topological
classes of molecules belonging to four different chemical classes: (purple) drug molecules
exhibiting intramolecular hydrogen bonding (IMHB), (blue) 2-methylquinolin-8-ol deriva-
tives (MQOH), (black) 2-methylquinoline derivatives (MQ), and (green) acrylamide covalent
reactive groups (CRGs). Root mean square error (RMSE), mean average error (MAE), and fit
R2 are provided on the plot. Adapted from S.W.C. Walker, A. Anwar, J. Psutka, J. Crouse,
C. Liu, J.C.Y. Le Blanc, J. Montgomery, G.H. Goetz, J.S. Janiszewski, J.L. Campbell, W.S. Hopkins,
Determining Molecular Properties with Differential Mobility Spectrometry and Machine
Learning. Submitted (revision requested) (2018).
118 W. Scott Hopkins
ACKNOWLEDGEMENTS
W.S.H. would like to thank Prof. W. Alex Donald for his help in reviewing this document,
and Professors Pierre-Nicholas Roy and J. Larry Campbell, as well as Doctors J.C. Yves
LeBlanc and Brad B. Schneider for insightful conversations.
REFERENCES
[1] Gorshkov M. P. (Inventor), Gas Analyzer of ions. International Patent Number 966583,
G01N27/62, U.S.S.R., 1982.
[2] R. Guevremont, High-field asymmetric waveform ion mobility spectrometry: a new
tool for mass spectrometry, J. Chromatogr. A 1058 (1–2) (2004) 3–19.
[3] R. Guevremont, R.W. Purves, Atmospheric pressure ion focusing in a high-field asym-
metric waveform ion mobility spectrometer, Rev. Sci. Instrum. 70 (2) (1999) 1370–1383.
[4] R.W. Purves, R. Guevremont, Electrospray ionization high-field asymmetric waveform
ion mobility spectrometry–mass spectrometry, Anal. Chem. 71 (13) (1999) 2346–2357.
[5] I.A. Buryakov, E.V. Krylov, V.P. Soldatov, Analysis of ionic composition of solutions
using an ion gas analyzer, in: A.A. Malahkov (Ed.), Chemical analysis of environment,
Nauka, Novosibirsk, 1991, pp. 113–127.
[6] E.G. Nazarov, U.K.H. Rusalev, A Surface Ionization Detector of Amines, Hydrazines,
and Their Derivatives (SID), Science Technical Advances, vol. 2, All-Union Institute of
Scientific Information (VIMI), Moscow, 1991, pp. 53–57.
[7] E.G. Nazarov, R.A. Miller, G.A. Eiceman, E.V. Krylov, Monitoring of trace amounts
of sulfur compounds in air/hydrocarbon gas streams using a differential mobility spec-
trometer (DMS), Int. J. Ion Mobil. Spectrom. 5 (2002) 76–81.
[8] R.A. Miller, G.A. Eiceman, E.G. Nazarov, A.T. King, A novel micromachined high-
field asymmetric waveform-ion mobility spectrometer, Sensors Actuators B 67 (3)
(2000) 300–306.
[9] B.B. Schneider, E.G. Nazarov, F. Londry, P. Vouros, T.R. Covey, Differentail mobil-
ity spectrometry/mass spectrometry history, theory, design optimization, simulations,
and applications, Mass Spectrom. Rev. 35 (6) (2016) 687–737.
[10] K.M.M. Kabir, W.A. Donald, Microscale differential ion mobility spectrometry for
field deployable chemical analysis, Trends Anal. Chem. 97 (2017) 399–427.
[11] I.A. Buryakov, E.V. Krylov, A.L. Makas, E.G. Nazarov, V.V. Pervukhin,
U.K. Rasulev, Drift spectrometer for the control of amine traces in the atmosphere,
J. Anal. Chem. 48 (1) (1993) 114–121.
Dynamic Clustering and Ion Microsolvation 119
[12] J.L. Campbell, J.C.Y. Le Blanc, B.B. Schneider, Probing electrospray ionization
dynamics using differential mobility spectrometry: the curious case of 4-aminobenzoic
acid, Anal. Chem. 84 (18) (2012) 7857–7864.
[13] J.L. Campbell, A.M.-C. Yang, L.R. Melo, W.S. Hopkins, Studying gas-phase intercon-
version of tautomers using differential mobility spectrometry, J. Am. Soc. Mass
Spectrom. 27 (7) (2016) 1277–1284.
[14] J.L. Campbell, M. Zhu, W.S. Hopkins, Ion-molecule clustering in differential mobility
spectrometry: lessons learned from tetraalkylammonium cations and their isomers,
J. Am. Soc. Mass Spectrom. 25 (9) (2014) 1583–1591.
[15] G.A. Eiceman, J.A. Stone, Ion mobility spectrometers in national defense, Anal. Chem.
76 (21) (2004) 390A–397A.
[16] W.S. Hopkins, Determining the properties of gas-phase clusters, Mol. Phys. 113 (21)
(2015) 3151–3158.
[17] D.S. Levin, R.A. Miller, E.G. Nazarov, P. Vouros, Rapid separation and quantitative
analysis of peptides using a new nanoelectrospray-differential mobility spectrometer–mass
spectrometer system, Anal. Chem. 78 (15) (2006) 5443–5452.
[18] L.C. Rorrer III, R.A. Yost, Solvent vapor effects on planar high-field asymmetric wave-
form ion mobility spectrometry, Int. J. Mass Spectrom. 300 (2–3) (2011) 173–181.
[19] B.B. Schneider, V.I. Baranov, H. Javaheri, T.R. Covey, Particle discriminator interface
for nanoflow ESI–MS, J. Am. Soc. Mass Spectrom. 14 (11) (2003) 1236–1246.
[20] B.B. Schneider, T.R. Covey, S.L. Coy, E.V. Krylov, E.G. Nazarov, Chemical effects in
the separation process of a differential mobility/mass spectrometer system, Anal. Chem.
82 (5) (2010) 1867–1880.
[21] B.B. Schneider, T.R. Covey, S.L. Coy, E.V. Krylov, E.G. Nazarov, Planar differential
mobility spectrometer as a pre-filter for atmospheric pressure ionization mass spectrom-
etry, Int. J. Mass Spectrom. 298 (1–3) (2010) 45–54.
[22] B.B. Schneider, T.R. Covey, S.L. Coy, E.V. Krylov, E.G. Nazarov, Control of chem-
ical effects in the separation process of a differential mobility mass spectrometer system,
Eur. J. Mass Spectrom. 16 (1) (2010) 57–71.
[23] A. Anwar, J. Psutka, S.W.C. Walker, T. Dieckmann, J.S. Janizewski, J. Larry Campbell,
W.S. Hopkins, Separating and probing tautomers of protonated nucleobases using
differential mobility spectrometry, Int. J. Mass Spectrom. 429 (2018) 174–181.
[24] J.M. Psutka, A. Dion-Fortier, T. Dieckmann, J.L. Campbell, P.A. Segura,
W.S. Hopkins, Identifying Fenton-reacted trimethoprim transformation products using
differential mobility spectrometry, Anal. Chem. 90 (8) (2018) 5352–5357.
[25] E.V. Krylov, E.G. Nazarov, R.A. Miller, Differential mobility spectrometer: model of
operation, Int. J. Mass Spectrom. 266 (1–3) (2007) 76–85.
[26] S.E. Quiñones-Cisneros, C.K. Zeberg-Mikkelsen, E.H. Stenby, The friction theory
(f-theory) for viscosity modeling, Fluid Phase Equilib. 169 (2) (2000) 249–276.
[27] R.A. Alberty, R.J. Silbey, Physical Chemistry, second ed., John Wiley & Sons,
Toronto, 1997, pp. 619–621.
[28] P. Atkins, J. De Paula, Atkins’ Physical Chemistry, eighth ed., Oxford University Press,
Oxford, 2006, pp. 752–754.
[29] W.S. Hopkins, R.A. Marta, V. Steinmetz, T.B. McMahon, Mode-specific fragmentation
of amino acid-containing clusters, Phys. Chem. Chem. Phys. 17 (43) (2015) 28548–28555.
[30] M.J. Lecours, C. WCT, W.S. Hopkins, Density Functional Theory Study of RhnS0,+/-
and Rh(0,+/-)
n+1 (n ¼ 1-9), J. Phys. Chem. A 118 (24) (2014) 4278–4287.
[31] D. Wales, Energy Landscapes: Applications to Clusters, Biomolecules and Glasses,
Cambridge University Press, Cambridge, 2004.
[32] T.N. Olney, N.M. Cann, G. Cooper, C.E. Brion, Absolute scale determination
for photoabsorption spectra and the calculation of molecular properties using dipole
sum-rules, Chem. Phys. 223 (1) (1997) 59–98.
120 W. Scott Hopkins
[33] F.L. Sinatra, T. Wu, S. Manolakos, J. Wang, T.G. Evans-Nguyen, Differential mobility
spectrometry–mass spectrometry for atomic analysis, Anal. Chem. 87 (3) (2015)
1685–1693.
[34] A.A. Shvartsburg, T. Bryskiewicz, R.W. Purves, K. Tang, R. Guevremont,
R.D. Smith, Field asymmetric waveform ion mobility spectrometry studies of proteins:
dipole alignment in ion mobility spectrometry? J. Phys. Chem. B 110 (43) (2006)
21966–21980.
[35] C. Liu, J.C.Y. Le Blanc, J. Shields, J.S. Janiszewski, C. Ieritano, G.F. Ye, et al., Using
differential mobility spectrometry to measure ion solvation: an examination of the roles
of solvents and ionic structures in separating quinoline-based drugs, Analyst 14 (20)
(2015) 6897–6903.
[36] C. Liu, J.C.Y. Le Blanc, B.B. Schneider, J. Shields, J.J. Federico, H. Zhang, et al.,
Assessing physicochemical properties of drug molecules via microsolvation measure-
ments with differential mobility spectrometry, ACS Cent. Sci. 3 (2) (2017) 101–109.
[37] S.W.C. Walker, A. Anwar, J. Psutka, J. Crouse, C. Liu, J.C.Y. Le Blanc,
J. Montgomery, G.H. Goetz, J.S. Janiszewski, J.L. Campbell, W.S. Hopkins,
Determining Molecular Properties With Differential Mobility Spectrometry and
Machine Learning, Nature Communications, 2018. (submitted for publication).
[38] M.J. Abraham, T. Murtola, R. Schulz, S. Páll, J.C. Smith, B. Hess, et al., GROMACS:
high performance molecular simulations through multi-level parallelism from laptops to
supercomputers, SoftwareX 1-2 (2015) 19–25.
[39] D.A. Case, T.E. Cheatham, T. Darden, H. Gohlke, R. Luo, K.M. Merz, et al., The
amber biomolecular simulation programs, J. Comput. Chem. 26 (16) (2005) 1668–1688.
[40] W.S. Hopkins, R.A. Marta, T.B. McMahon, Proton-bound 3-cyanophenylalanine
trimethylamine clusters: isomer-specific fragmentation pathways and evidence of gas-
phase zwitterions, J. Phys. Chem. A 117 (41) (2013) 10714–10718.
[41] D.J. Wales, J.P.K. Doye, Global optimization by basin-hopping and the lowest energy
structures of Lennard–Jones clusters containing up to 110 atoms, J. Phys. Chem. A
101 (28) (1997) 5111–5116.
[42] D.J. Wales, M.A. Miller, T.R. Walsh, Archetypal energy landscapes, Nature 394 (6695)
(1998) 758–760.
[43] D.J. Wales, H.A. Scheraga, Review: chemistry—global optimization of clusters, crys-
tals, and biomolecules, Science 285 (5432) (1999) 1368–1372.
[44] S. Grimme, J. Antony, S. Ehrlich, H. Krieg, A consistent and accurate ab initio param-
etrization of density functional dispersion correction (DFT-D) for the 94 elements
H-Pu, J. Chem. Phys. 132 (15) (2010), 154104.
[45] A.D. Becke, Density-functional exchange-energy approximation with correct
asymptotic-behavior, Phys. Rev. A 38 (6) (1988) 3098–3100.
[46] A.D. Becke, Density-functional thermochemistry. 3. The role of exact exchange,
J. Chem. Phys. 98 (7) (1993) 5648–5652.
[47] J.B. Foresman, A.E. Firsch, Exploring Chemistry With Electronic Structure Methods,
Gaussian Inc., Wallingford, CT, 2015. ISBN 978-1-935522-03-4.
[48] J. Schmidt, M.M. Meyer, I. Spector, S.R. Kass, Infrared multiphoton dissociation spec-
troscopy study of protonated p-aminobenzoic acid: does electrospray ionization afford
the amino- or carboxy-protonated ion? J. Phys. Chem. A 115 (26) (2011) 7625–7632.
[49] Z.X. Tian, S.R. Kass, Gas-phase versus liquid-phase structures by electrospray ioniza-
tion mass spectrometry, Angew. Chem. Int. Ed. 48 (7) (2009) 1321–1323.
[50] T.M. Chang, S. Chakrabarty, E.R. Williams, Hydration of gaseous m-aminobenzoic
acid: ionic vs neutral hydrogen bonding and water bridges, J. Am. Chem. Soc.
136 (29) (2014) 10440–10449.
[51] T.M. Chang, J.S. Prell, E.R. Warrick, E.R. Williams, Where’s the charge? Protonation
sites in gaseous ions change with hydration, J. Am. Chem. Soc. 134 (38) (2012)
15805–15813.
Dynamic Clustering and Ion Microsolvation 121
[72] I. Guyon, A. Saffari, G. Dror, G. Cawley, Model selection: beyond the Bayesian/
frequentist divide, J. Mach. Learn. Res. 11 (2010) 61–87.
[73] J.R. Quinlan, Induction of decision trees, Mach. Learn. 1 (1) (1986) 81–106.
[74] L. Breiman, Random forests, Mach. Learn. 45 (1) (2001) 5–32.
[75] A.A. Shvartsburg, M.F. Jarrold, An exact hard-spheres scattering model for the mobil-
ities of polyatomic ions, Chem. Phys. Lett. 261 (1–2) (1996) 86–91.
[76] M. Mesleh, J. Hunter, A. Shvartsburg, G. Schatz, M. Jarrold, Structural information
from ion mobility measurements: effects of the long-range potential, J. Phys. Chem.
100 (40) (1996) 16082–16086.
[77] S. Arlot, A. Celisse, A survey of cross-validation procedures for model selection, Statist.
Surv. 4 (2010) 40–79.
[78] P. Artursson, K. Palm, K. Luthman, Caco-2 monolayers in experimental and theoretical
predictions of drug transport, Adv. Drug Deliv. Rev. 46 (1–3) (2001) 27–43.
[79] P.S. Burton, R.A. Conradi, A.R. Hilgers, N.F.H. Ho, L.L. Maggiora, The relationship
between peptide structure and transport across epithelial-cell monolayers, J. Control.
Release 19 (1–3) (1992) 87–97.
[80] E.G. Chikhale, K.Y. Ng, P.S. Burton, R.T. Borchardt, Hydrogen-bonding potential as
a determinant of the in-vitro and in-situ blood-brain-barrier permeability of peptides,
Pharm. Res. 11 (3) (1994) 412–419.
[81] I. Campuzano, M.F. Bush, C.V. Robinson, C. Beaumont, K. Richardson, H. Kim,
et al., Structural characterization of drug-like compounds by ion mobility mass spec-
trometry: comparison of theoretical and experimentally derived nitrogen collision cross
sections, Anal. Chem. 84 (2) (2012) 1026–1033.