Chemometric Methods in FS 1

Forensic Science International 302 (2019) 109911
Contents lists available at ScienceDirect
Forensic Science International

journal homepage: www.elsevier.com/locate/forsciint
Review of the most common chemometric techniques in

illicit drug profiling
Ana Popovica , Marie Morelatoa , Claude Rouxa , Alison Beavisa,b,*
a
Centre for Forensic Science, University of Technology Sydney, P.O. Box 123 Broadway NSW 2007 Australia
b
Faculty of Transdisciplinary Innovation, University of Technology Sydney, P.O. Box 123 Broadway NSW 2007 Australia
A R T I C L E I N F O A B S T R A C T
Article history: The information generated through drug profiling can be used to infer a common source between one or
Received 22 January 2019 several seizures as well as drug trafficking routes to provide insights into drug markets. Although well
Received in revised form 19 July 2019 established, it is time-consuming and ineffective to compare all drug profiles manually. In recent years,
Accepted 29 July 2019
there has been a push to automate processes to enable a more efficient comparison of illicit drug
Available online 1 August 2019
specimens. Various chemometric methods have been employed to compare and interpret forensic case
data promptly. The intelligence that is produced can be used by decision-makers to disrupt or reduce the
Keywords:
impact of illicit drug markets. This review highlights the most common chemometric techniques used in
Pattern recognition
Comparison metrics
drug profiling and more specifically, the most efficient comparison metrics and pattern recognition
Forensic intelligence techniques outlined in the literature.
© 2019 Elsevier B.V. All rights reserved.
Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2. Drug profiling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.1. Profiling of chemical characteristics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.1.1. ATS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.1.2. Cocaine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.1.3. Heroin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.2. Profiling of physical characteristics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
3. Chemometric techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3.1. Pre-treatments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3.2. Comparison metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3.2.1. Square cosine function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2.2. Pearson correlation coefficient . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2.3. Euclidean distance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2.4. Manhattan distance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2.5. Canberra distance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.3. Unsupervised pattern recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3.1. Principal component analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3.2. Hierarchical clustering analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3.3. K-means clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.4. Supervised pattern recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3.4.1. Partial least squares – discriminant analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3.4.2. Linear discriminant analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
3.4.3. Artificial neural networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4. Evaluation of chemometric techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
4.1. Evaluation of CMs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
* Corresponding author at: Centre for Forensic Science, University of Technology, P.O. Box 123 Broadway, Sydney, NSW 2007, Australia.
E-mail address: Alison.Beavis@uts.edu.au (A. Beavis).
http://dx.doi.org/10.1016/j.forsciint.2019.109911
0379-0738/© 2019 Elsevier B.V. All rights reserved.
2 A. Popovic et al. / Forensic Science International 302 (2019) 109911
4.1.1. Discrimination power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

4.1.2. Confusion matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
4.1.3. Receiver operating characteristic curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
4.2. Evaluation of UPR techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4.2.1. PCA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4.2.2. Cluster validation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4.3. Evaluation of SPR techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
5. Application and added value of chemometric techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5.1. Applications of CMs to illicit drugs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5.1.1. ATS (AM & MA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5.1.2. ATS (MDMA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5.1.3. Cocaine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
5.1.4. Heroin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
5.2. Applications of UPR to illicit drugs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
5.2.1. ATS (AM, MA & MDMA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
5.2.2. Cocaine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
5.2.3. Heroin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
5.3. Applications of SPR to illicit drugs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
5.3.1. ATS (AM, MA & MDMA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
5.3.2. Cocaine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
5.3.3. Heroin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
5.4. Added value of chemometric techniques applied to illicit drugs for ILP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
5.4.1. Added value for active investigations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
5.4.2. Added value for understanding drug markets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
6. Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
CRediT authorship contribution statement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1. Introduction performed on illicit drugs to produce chemical and physical

profiles. Section 3 will explore common chemometrics techniques
Illicit drug markets are continually evolving, so it is essential to used in drug profiling and discuss how they work. Section 4
understand them in order to disrupt or reduce their impact. reviews possible evaluation tools for the chemometric techniques
Decisions by law enforcement agencies regarding crime disrup- outlined in Section 3. While Section 5 will look at the application of
tion, prevention and reduction rely on masses of information from these chemometric techniques to drug profiling data and their
various sources, which is often minimally exploited. The concept of added value for intelligence-led policing (ILP).
forensic intelligence focuses on generating information products
for proactive policing, supporting an intelligence-led model rather 2. Drug profiling
than focusing on individual offenders or cases.
A variety of analyses (qualitative and quantitative) are currently A drug profile contains information about the chemical or/and
performed on illicit drugs to generate profiles [1]. These profiles, in physical characteristics of a specimen. The chemical profile yields
combination with chemometric techniques, can be used proac- data about the illicit substance along with cutting agents
tively and generate intelligence products at tactical, operational (adulterants and diluents), precursors, by-products, impurities
and strategic levels [2,3]. and solvents in different ratios. The physical profile (packaging and
Drug intelligence has multiple aims [4–7], for example they are appearance) of a drug can be just as complex, creating a particular
to (1) identify dealer–user networks; (2) determine the geograph- picture of a specimen when all measurements are considered [16].
ical origin of the seizure; (3) monitor the length of time the It is important to note that the selection of target variables for
clandestine laboratory has been in operation; (4) gather sufficient the analytical method will profoundly influence the results
information to create national and international drug databases; obtained from the chemometric methods applied [17]. There are
(5) monitor the extent of national and international drug criteria that need to be satisfied when choosing target variables:
trafficking; (6) obtain a better understanding of drug networks (1) they must be present in most specimens and have sufficient
and (7) ultimately develop sound strategies to reduce the harm concentration; (2) they must be sufficiently numerous to group
caused to the society. specimens with similar profiles and differentiate unrelated speci-
To maximise the potential of drug profiling, extensive efforts mens; (3) their measurement must be reproducible and repeat-
have been invested in harmonising analytical methods to able; and (4) they should be representative of the specimen to be
facilitate the comparison of data generated from different described [8].
laboratories [8–13]. Visual comparison of profiles is still routinely
used; however, the task becomes laborious and inefficient when 2.1. Profiling of chemical characteristics
applied to a database of specimens [14,15]. Additionally, the
visual analysis does not allow for an overall view of the drug The subsequent sections outline the most common analytical
problem; hence, it is of limited value for intelligence purposes. techniques used to generate chemical profiles for amphetamine-
For this reason, drug analysis laboratories have sought out to type stimulants (ATS), cocaine and heroin specimens.
automate the comparison processes via mathematical and
statistical methods. 2.1.1. ATS
This article aims to review the various chemometric methods As many ATS exist as optical isomers, and they differ in
used in drug profiling. Section 2 will explore the current analyses biological activity, it is essential to determine the concentration of
A. Popovic et al. / Forensic Science International 302 (2019) 109911 3
each enantiomer. This can be achieved using capillary electropho- Table 2

Common analytical methods used in cocaine profiling.
resis (CE) [18]. An elemental profile can be generated using
inductively coupled plasma mass spectrometry (ICP-MS). As most Profile type Analytical method References
of the synthetic pathways to illicit drugs employ catalysts, a Adulterants & diluents ATR-FTIR [52,53]
knowledge of the trace elements present in either 3,4-methyl- Alkaloids GC-MS [17,54,55]
enedioxy-methamphetamine (MDMA) or methylamphetamine Alkaloids GC-NPD [45]
Elements ICP-MS [56]
(MA) can potentially reveal the route of synthesis [19–21].
Isotope ratios IRMS [50,51]
Manufacturing by-products can be determined through a tech- Solvents HS GC-MS [47,49]
nique involving a liquid-liquid extraction (LLE) or headspace solid-
phase microextraction (HS-SPME) followed by gas chromatogra-
phy–mass spectrometry (GC–MS) or gas chromatography-flame
ionisation detection (GC-FID) analysis [22–25]. The presence of Table 3
impurities or by-products provides information on trafficking Common analytical methods used in heroin profiling.
routes, supply origin and optimally link seizures [20].
Profile type Analytical method References
Additionally, The Collaborative Harmonisation of Methods for
Alkaloids CE [63]
Profiling of Amphetamine-Type Stimulants (CHAMP) project
By-products GC-MS [13,17,62,63,66,67]
conducted a significant study on developing a harmonised By-products GC-FID [60,61,68,69,70,71,72,73]
profiling method for ATS [26–31]. Adulterants and diluents are Elements ICP-MS [74,75,76]
also frequently encountered in ATS specimens. Conventional Solvents HS GC-MS [49,63]
analysis techniques for these profile types are capillary electro-
phoresis diode array detector (CE-DAD) and liquid chromatogra-
phy evaporative light scattering detector (LC-ELSD) [32,33]. Stable
isotope ratio – mass spectrometry (IRMS) analysis also forms an which more than a hundred have been identified and characterised
integral part of ATS profiling. The 13C/12C, 15N/14N and D/H ratios of [62]. This type of profile once again aims to determine the origin of
a specimen can infer the route of synthesis and subsequently links the plant [62–65] and is an adjunct of the significant alkaloid
to other specimens [18]. The mentioned ATS profile types and their profiling. As with cocaine, during the manufacturing of heroin, a
relevant references have been summarised in Table 1. range of solvents are employed which are determined using HS
GC–MS [49]. This information can provide insights into where
2.1.2. Cocaine processing occurred [46]. This is because solvent combinations
Determining the presence of coca leaf alkaloids is typically the often differ between major drug production regions.
first step when profiling cocaine, see Table 2 for a list of common
cocaine profile types and relevant references. This information is 2.2. Profiling of physical characteristics
useful for determining the origin of the plant. There have been
several articles that have quantified cocaine alkaloids using GC–MS Physical profiling looks at the appearance and packaging of
or a gas chromatography – nitrogen phosphorous detector (GC- illicit drugs and provides complementary information to respec-
NPD) [42–45]. Another profile for cocaine involves analysing the tive chemical profiles. In terms of collection, physical profiles are
residual solvents present, indicating the process used to convert timelier and more resource friendly than their chemical counter-
the drug base to the salt form [46]. Solvent traces can be easily parts. Furthermore, in an instance where chemical and physical
detected using GC–MS or headspace (HS) GC-FID [47–49]. (including packaging) profiles were evaluated, the latter proved
Further information regarding location can be determined by to be pivotal for supporting ILP [77]. This is especially true for
the ratio of carbon and nitrogen stable isotopes [50,51]. For optimal specimens in a pill form. In the literature, the analysis and
linkage capacity, the mentioned method should be used with a evaluation of physical profiles, appearance in particular, is mainly
significant component (alkaloid) analysis. As cocaine is often cut applied to MDMA tablets. A visual inspection of a specimen’s
with various compounds, adulterant and diluent analysis can be physical characteristics can allow seizures to be subcategorised
performed via vibrational spectroscopy, namely attenuated total based on colour, texture, score/logo presence, measurements, or
reflection Fourier transform infrared (ATR-FTIR) spectroscopy general appearance [11,72,78]. The score/logo and tablet measure-
[52,53]. ments are machine dependent and tend to remain unaltered
between batches, allowing for the potential to link seizures [79].
2.1.3. Heroin Furthermore, a closer examination of packaging material may infer
As with cocaine, the initial analyses performed aim to identify links between chemically unrelated seizures. Table 4 indicates the
the alkaloids present, see Table 3 for a list of common heroin profile current articles focusing on profiling of physical characteristics for
types and relevant references. Remaining analytes, if present, are drug intelligence purposes.
normalised to the heroin response. These types of methods The visual inspection of a specimen is usually performed before
are relatively fast, but they lack sensitivity [57–61]. Throughout the any chemical analyses. This is due to the ease and accessibility
heroin production process, many by-products are generated, of of visual characteristics. For MDMA, physical characteristics (i.e.
Table 1 Table 4
Common analytical methods used in ATS profiling. Current articles focusing on physical profiling for drug intelligence purposes.
Profile type Analytical method References Sample type Analytical method References
By-products GC-MS [10,12,25,31,32,33,34,35,36,37,38,39] MDMA Visual [11,79,80]
By-products GC-FID [40,41] Heroin Visual [72]
Adulterants CE-DAD [32,33] Adhesive tape Py-GC-MS [81]
Diluents LC-ELSD [32,33] Adhesive tape VSC [82]
Elements ICP-MS [32,33] Adhesive tape SFE-HPLC [83]
Enantiomers CE [18] Adhesive tape MSP [84]
Isotope ratios IRMS [18] Heroin Packaging General measurements [72]
post-tabletting characteristics) may yield preliminary profiling datasets are subjected to various mathematical and statistical
data while chemical characteristics (i.e. pre-tabletting character- techniques, which include comparison metrics (CM) for determining
istics) provide complementary data. Together they allow speci- links between objects, pattern recognition for the classification of
mens to be linked at multiple stages of production [80]. objects and the class prediction of new, unclassified objects.
As the two sets of characteristics (physical and chemical) are In the case of illicit drug profiles, it is common to ‘pre-treat’ the
attributed at different stages of the MDMA production process, data resultant from qualitative or quantitative analyses [76].
the individual profiles are complementary [11]. Any similarity in Subsequently, metrics which calculate the similarity between
physical characteristics only goes so far as to suggest MDMA objects are applied to drug data to determine the level of
specimens were tabletted in the same laboratory; common origin association between them.
at a pre-tabletting stage should not be inferred [78]. On the other
hand, specimens with differences in physical characteristics may 3.1. Pre-treatments
still be linked at a chemical level. Manufacturers of drug powders
may tablet the same batch at different times, altering moulds in Analyses obtained from two different instruments may not
between or the tabletting step may be conducted in a different exhibit the same sensitivity. Pre-treatments (PT) are employed to
location [78]. reduce the instrumental influence, and therefore represent an
Chan et al. [72] hypothesised that, for heroin, the distributor's essential part of the overall profiling process [31,86,87]. Also, PTs
trademark was the colour of the specimen. The addition of are used to minimise the effect of significant differences in the
adulterants and food dyes added during cutting would allow the concentration of compounds present in a specimen [76]. The most
distributor and an easy way to differentiate between different common PTs are outlined inTable 5. This is not an exhaustive list of all
batches and manufacturers. possible PTs, for a more thorough list, please refer to Lociciro et al. [9].
Although the use of packaging material in a drug intelligence In these various PT approaches, xi is the untreated peak area of
perspective may be constrained, it can still provide some insight compound i; Ni and Si are the normalised and standardised peak
into how a criminal organisation prepares its goods for transport areas of compound i, respectively.
[72,81,85]. The primary constraint is the fact that the packaging PTs are typically used in combination with CMs, which are
profile may be short-lived, as the materials are readily available. discussed below. Various PTs have been documented. For instance,
The benefit of packaging profiling is that it can be used to link Perkal [88] normalised GC data before comparison of MA
seizures where the drug profile is not useful [81]. Significant drug specimens; Klemenc [66] employed normalisation and weighting
seizures often contain subgroups with different impurity profiles, when comparing different heroin seizures; Jonson [89] weighted
either due to batch intra-variability or sporadic manufacturing and normalised data for comparison of amphetamine specimens.
[78]. Subsequently, linking objects based on chemical character-
istics may prove challenging. In this instance, linking seizures may 3.2. Comparison metrics
only be possible with physical characteristics (appearance and
packaging). The packaging profile can include the dimensions, To determine the level of association between profiles, CMs
colour or general appearance of the package. Furthermore, any need to be used. The metrics that will be outlined in this article are
markings on the package, such as stamps, can infer the geographic the Square Cosine Function (including modifications), Pearson
location of the sender [85]. Rhumorbarbe et al. [85] also evaluated Correlation (including modifications), Euclidean distance, Man-
the use of physical profiling for determining the accuracy of online hattan distance and the Canberra distance. There are alternative
purchase information. It was found that the digital and physical techniques, but the five CMs explored in this article (see
profiling information were complementary and aided the evalua- Sections 3.2.1–3.2.5) are utilised most frequently [31,69].
tion of cryptomarket activity. These CMs produce a score which represents the similarity/
Additionally, research conducted in the field of physical distance between objects or in this context, illicit drug profiles.
profiling also includes the development of analytical techniques After the drug profiles are separated into two populations, i.e.
for profiling common packaging materials [78,81–83]. In a three- linked and unlinked, scores are calculated for each pair of profiles.
part study, Huttunen et al. investigated whether pyrolysis GC–MS The ideal statistical combination of PT and CM must be robust in
(Py-GC–MS) [81], a Video Spectral Comparator (VSC) [82] and differentiating between linked and unlinked populations.
supercritical fluid extraction with high-performance liquid chro- To create the linked population, all specimens in a seizure are
matography (SFE-HPLC) [83] could be used to differentiate compared against each other. This process is repeated for every
adhesive tapes. Although these analytical techniques are chemical, seizure containing more than one specimen. Doing this allows for
the resultant profiles ultimately describe a physical entity of the the assessment of the intra-variability of a seizure, establishing the
seized specimen. level of similarity between specimens of an assumed common
Some limitations need to be taken into account when analysing origin. Similarly, to create the unlinked population, specimens
and evaluating drug packaging [81]. It is entirely possible that from different seizures are compared to each other. This
two independent laboratories might be using the same packing determines the inter-variability between seizures or the level of
material, hence introducing FPs. Furthermore, changes noticed in similarity between specimens of an assumed different origin [8].
the chemical composition of packaging may be due to a change in
production by the manufacturer. For this reason, the focus of
seizure comparison made through physical profiling should be Table 5
Most common PT methods.
relational. That is, rather than aiming to identify the manufacturer
of the packaging and thereby inferring links, analysts should aim to PT (abbreviation) Formula
compare physical profiles to existing physical profiles from Normalisation (N) Ni ¼ Pxn i
xi
previous seizures. Standardisation (S) xi
i¼1
Si ¼ S:D:i
Logarithm (L) logðxi Þ
3. Chemometric techniques p ffiffiffiffi
Fourth root (4R) 4
xi
Normalisation + standardisation (N + S) Ni
S:D:i
Forensic scientists utilise a multitude of instruments for the Normalisation + fourth root (N + 4R) p
4
ffiffiffiffiffi
Ni
analysis of traces which generate large and complex datasets. These
score of 100 means that the specimen pair has a low correlation.
The formula for the modified SCF (MCF) is:
MCF = 100 SCF
3.2.2. Pearson correlation coefficient

The Pearson correlation coefficient (PCC) is another popular and
well-documented CM used in drug intelligence [9,10,12,13,17,31–
37,39,54,69]. As with the SCF, the PCC correlation is a measure of
the angle between the two vectors a and b. Hence, the smaller the
angle, the more correlated the values are, with the correlation
determined by:
Fig. 1. Schematic of most common linked vs unlinked population distribution, Xn
adapted from Janzen et al. [45]. ½ðai aÞðbi bÞ
rab ¼ qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
Xn
i¼1
Xn
i¼1 i
ða aÞ2 ðb bÞ2
i¼1 i
The frequency of scores in both populations are often plotted as a P
histogram. Ideally, there should be no crossover, meaning that the where a ¼ 1n ni¼1 ai ; and analogously for b. The PCC correlation
true positive (TP) and true negative (TN) rates should be 100%. function is quite often modified to scale the results between 0 and
However, it is very common to find that the two populations overlap, 100, this is achieved through the formula:
see Fig. 1. Once the scores of the two populations are elucidated, a ð1 rab Þ
threshold value (THV) is set. This THV allows for the determination of PCC ¼ 100
2
linkage for new specimens, e.g. it is possible to assess the level of
similarity between a new specimen and the existing specimen in the
database. Based on the position of the THV, links between the new 3.2.3. Euclidean distance
seizure and the specimens in the existing database can be uncovered. The Euclidean distance (EUC) is another measure of specimen
Information of this nature is beneficial for intelligence-led policing as similarity and is one of the simplest methods used. When two or
it uncovers previously unknown links. more variables are measured, each specimen can be represented by
This position of the THV is dependent on the decision makers a point in an n-dimensional space. When considering points a and
needs and can be altered for different purposes [33]. If a law b, the EUC is the length of the line segment connecting them [90].
enforcement agency is limited to specific resources, the decision The distance between the two points is written as:
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
maker might choose to set the THV to minimise false positives u n
uX
EUC ¼ t
(FPs). If the aim is to maximise the intelligence value of the data, 2
ðai bi Þ
then the value can be shifted to minimise false negatives (FNs). i¼1
It is often the case that large seizures exhibit high intra-
variability [78]. This becomes a significant issue when linked and This method has been utilised extensively for the comparison of
unlinked populations need to be defined. Significant variability in various drug types [11,25,31–33,36,38,40,41,45,68].
the linked population will increase the rate of FNs in subsequent
statistical analyses. For this reason, seizure membership needs to 3.2.4. Manhattan distance
be clearly defined before CMs are attempted. The Manhattan distance (MAN) or taxi-cab geometry gets its
The information generated from applying chemometric tech- name from the perpendicular streets of Manhattan [91]. This grid
niques to forensic case data is often stored in a structured memory layout means that the shortest distance a car could take between
[5]. As the memory is updated, the optimum combination of CM two intersections is equal to the distance calculated by the MAN
and PT needs to be periodically re-evaluated to ensure the most distance. This distance can also be used in cluster analysis (CA) [92]
significant separation between populations is achieved. and is given by the formula:
The subsequent sections outline the theory behind the most Xn
frequently used CMs. In the equations below, ai and bi are the pre- MAN ¼ i¼1
j ai b i j
treated responses of impurity i, in specimens a and  b,
where the distance between a and b is the sum of the absolute
respectively; a and b are the mean pre-treated peak areas of all difference of their coordinates. Unlike the EUC, which is composed
compounds in specimens a and b, respectively; n is the maximum of one-line segment, the MAN distance contains multiple line
number of compounds present in a specimen; z is the maximum segments.
number of compounds that are zero for specimens a and b.
3.2.1. Square cosine function 3.2.5. Canberra distance

The Squared cosine function (SCF) is a widely utilised and well- The Canberra distance (CAN) is a metric often used for
established method for differentiating between linked and compounds with values close to zero [93]. It first appeared in
unlinked specimen drug populations [9,10,12,31–33,36,68,69]. the literature over 50 years ago and the metric was modified by the
To estimate the similarity of two vectors, the angle between them same authors to present the more commonly used Adkins form of
is calculated. The correlation (SCF) value between the two the Canberra distance [94], its formula is given by:
chromatograms is given by: 1 Xn ðjai bi jÞ
2 3 CAN ¼
2 nz i¼1 ða þ b Þ
i i
ða b þ a b þ þ a b Þ
SCF ¼ 100 4 5
1 1 2 2 n n
2 2 2 The distance is not affected if one of the values in the equation is
a21 þ a22 þ þ a2n b1 þ b2 þ þ bn
zero, i.e. if ai or bi are zero the fraction becomes 1. However, this
In many cases, the modified cosine function is used so that a distance is very sensitive to a small change when both values are
score of 0 indicates a high correlation between specimens and a close to zero. While the MAN and Canberra distances are similar,
the benefit of the latter is that it is not as influenced by compounds dendrogram. The y-axis of the dendrogram represents the
with large values [95]. similarity/distance between objects, which are plotted on the x-
axis. Different CMs are used to create a similarity or distance
3.3. Unsupervised pattern recognition matrix which is used to form the dendrogram. In an ILP context,
HCA is often used to create classes which cluster specimens based
The main goal of these mathematical methods is to correctly on similarity [5]. For example, if chemical characteristics are used,
group objects of similar architecture together and different objects specimens are grouped into chemical classes (CCs) if their
in discrete classes. The concept of class knowledge is what differs similarity exceeds the chosen THV.
supervised pattern analysis from unsupervised techniques. In HCA dendrograms can be elucidated through agglomerative or
unsupervised pattern recognition (UPR), natural clusters can be divisive clustering. Agglomerative clustering fuses individual objects
identified in the dataset. The classes are determined through the into small groups which are subsequently fused into larger groups
application of unsupervised techniques and are not required until all objects belong to one cluster. Divisive clustering begins with
initially. In contrast, supervised techniques require the knowledge a single cluster (containing all specimens) which is divided into
of specimen origin to develop models. smaller and smaller groups until each object is its own group. There
Conventional UPR techniques in the field of drug intelligence are many linkage methods which are used to calculate the distance
include principal component analysis (PCA) and CA. Although between clusters and thus influence the structure of the dendro-
clustering research has its origins in the late 1800s, no widely gram, (see Fig. 3). These linkage methods include:
accepted theories for clustering exist [96,97]. For this reason,
hundreds of clustering algorithms have been developed over time a Nearest neighbour, which uses the distance between the two
(see Fig. 2 for an outline of types of clustering methods). It is not nearest objects belonging to separate clusters.
feasible to outline and explain each algorithm in the scope of this b Furthest neighbour, which uses the distance between the two
article. For that reason, clustering algorithms used extensively in the furthest objects belonging to separate clusters.
drug profiling field will be mentioned, i.e. hierarchical clustering c Centroid method, which uses the distance between the centres
and k-means clustering (KMC). A more exhaustive collection and of two separate clusters.
explanation of clustering algorithms can be found in [98,99] d Averaged linkage or group average clustering method, which
calculates the mean distance between all pairs of objects
3.3.1. Principal component analysis belonging to separate clusters. The method is intermediate
One limitation of multivariate data is that visualisation of data between the single and complete linkage strategies, thus
with many variables is not feasible. To overcome this issue, some attempting to compensate deficiencies of one strategy by the
variables can be excluded through dimensionality reduction. PCA advantages of the other [102].
can achieve this and has been widely utilised in the field of drug e Ward’s method, which computes the distance between clusters
intelligence/profiling [10,11,41,52,53,60,100]. Data dimensionality as a minimum of within-cluster variance. The two clusters that
is reduced as a result of PCA rotating the coordinate system of the are characterised by the smallest variance between their objects
dataset [101]. The rotation creates uncorrelated variables called are merged to build a new cluster.
principal components (PC), which are orthogonal and uncorrelated
to each other. This process aims to eliminate high variable HCA does not indicate the variables, which contribute most to
correlation, which in turn would reduce the amount of irrelevant the classification of objects in the dataset, unlike PCA, where the
data from the visualisation or further analysis. loadings indicate the variation contained within each variable.
Eigenvalues are a measure of the overall variance of the dataset There may be a loss of information in the dendrogram generated by
for a particular PC. PC scores represent the relationship between HCA, especially if clusters are not well resolved. However, HCA
specimens, while PC loadings represent the relationship between does present all the variation in the dataset, in contrast to PCA,
variables and their associated PC. where only a percentage of the variation is typically presented.
3.3.2. Hierarchical clustering analysis 3.3.3. K-means clustering

One of the most popular ways of grouping objects is hierarchical Another popular method used for finding natural clusters in a
cluster analysis (HCA), which presents the clusters in the form of a dataset is KMC. The goal of this algorithm is to partition all objects
Fig. 2. Different types of clustering algorithms.

PLS-DA can be applied in many cases when classical DA cannot

be applied. For example, when the number of observations is low,
and when the number of explanatory variables is high. When there
are missing values, PLS-DA can be applied to the available data.
Additionally, PLS-DA is a type of parametric technique and is
applied to data which is known to have a normal distribution [7].
3.4.2. Linear discriminant analysis

Dimensionality reduction and classification are two common
reasons LDA is applied to a dataset. It is an SPR technique as it
requires knowledge about group membership. The distance
between a specimen and the class centre is calculated and a
new set of axes are calculated. The goal of LDA is to minimise the
distance between specimens of the same class and to maximise the
distance between individual classes.
As LDA requires that the number of specimens exceeds the
number of variables, it is often preceded by PCA [110]. It should
be mentioned that LDA assumes normally distributed data and
Fig. 3. Visualisation of different HCA linkage methods; (a) nearest neighbour, common class covariance matrices [111]. However, this is more so
(b) furthest neighbour, (c) centroid, (d) average and (e) Ward. relevant for LDA used as a classifier and not as a dimensionality
reduction technique. In saying that, the technique is still capable of
into k clusters so that the within-cluster sum of squares (WCSS) is providing good results even if the assumptions are violated [112].
minimised. In contrast to HCA, the number of clusters, k, needs
to be specified before analysis. Additionally, it is possible that 3.4.3. Artificial neural networks
the lowest WCSS may not be achieved for each cluster as the There have been few articles documenting the successful
technique is optimisation based. This optimisation is achieved classification of illicit drugs using ANNs [60,61,100]. The benefit
through varying the centroids of the k clusters until no movement of this technique is that ANNs can learn and adapt as opposed to
of an object from one cluster to another will minimise the WCSS other chemometric techniques which strive to find the correct
[103]. answer [113]. In contrast to PLS-DA, ANNs, which are a type of non-
In comparison to HCA, KMC is better suited to larger datasets as parametric method are applied to datasets where the distribution
visualisation via dendrogram can get challenging to interpret with is unknown or known to be not-normal [7]. ANNs are quite useful
many specimens. One drawback of KMC is that results are for solving non-linear relationships, even in the most complex
influenced by choice of k. KMC results may differ if the algorithm datasets [88].
is re-run, whereas with HCA, the results are reproducible. In The most common structure consists of neurons arranged into
contrast to HCA, KMC does not produce a graphical representation three different layer types, i.e. input (I), hidden (H), and output (O)
of the clustering result. Instead, KMC produces a list of objects that layers, (see Fig. 4). Various types of ANN structures exist with the
belong to each cluster. KMC is often paired with PCA to visualise multi-layer perceptron (MLP) method featuring most regularly in
results or reduce dimensionality before analysis [104]. the literature [114]. Other types of ANN structures include the
radial basis function (RBF) [115]. MLP usually has one or more
3.4. Supervised pattern recognition hidden layers, while RBF usually has one. This single hidden layer
allows for faster computation when compared to other methods
In supervised pattern recognition (SPR), the number of clusters [116]. Other ANN structures include probabilistic neural networks
is known in advance. Rather than locating natural clusters, SPR (PNN), which have an additional layer (summation layer) between
aims to assign unclassified objects to an existing cluster. SPR is the input and hidden layers [117]. Additionally, some articles have
often referred to as classification or discriminant analysis (DA). looked at clustering techniques coupled with ANN, for various
Conventional SPR techniques in the field of drug intelligence applications [118,119].
include partial least squares – discriminant analysis (PLS-DA) ANNs tend to learn through the application of training data
[12,31,52,53,105], linear discriminant analysis (LDA) [10,71,106– which contains defined inputs (e.g. target variables) and outputs
109] and artificial neural networks (ANN) [60,61,100]. (e.g. chemical classes). Each neuron in the hidden layer receives
its inputs from the training data and summates the product of
3.4.1. Partial least squares – discriminant analysis inputs and their associated weights. An activation function is
Partial least squares regression (PLSR) is a regression method applied to the weighted sum, generating an appropriate output
based on covariance. PLS-DA is the result of fitting PLSR to DA, value [103]. This output serves as the input for the final layer, i.e.
which in turn is a supervised classification method. The PLS the output layer. The most common activation function is the
algorithm is used to explain and predict the membership of sigmoid function, which is used to solve non-linear patterns [103].
specimens to several classes using quantitative or qualitative As well as the ability to differ the activation function, different
explanatory variables. The class membership vector is populated training algorithms can be applied to the ANN. The most common
with “dummy” variables. These variables usually have a value of training algorithm is back-propagation [103]. In this algorithm,
either 0 or 1 if they do or do not belong to a class, respectively [52]. data is fed-forward through the ANN to optimise the weights of
During the prediction stage, the closer a specimen is to 1, the more each neuron in the hidden and output layer. This is done to
likely it is that it belongs to that particular class. minimise the error between the actual output and the training
In contrast to PCA, which focuses on dimensionality reduction, output values [60]. The error in the output layer is then ‘back-
PLS-DA is a technique for identifying and measuring variables that propagated’ to the other neuron layers in the model and weight
cannot be measured directly (latent variables). Thus, PLS-DA scores values are continuously altered to arrive at the minimum error
describe the position of each specimen in each determined latent [120]. An alternative training algorithm is the conjugated gradient
variable. descent, which is more sophisticated [60].
Fig. 5. Sections of a confusion matrix; (a) two-class, (b) three-class.
4.1.2. Confusion matrices

Confusion matrices are a popular evaluation tool for several
chemometric techniques [52]. In the case of combination
evaluation, confusion matrices evaluate the extent of TPs, FPs,
FNs and TNs of the chosen model, (see Fig. 5). The layout of the
matrix is quite intuitive as the instances in the predicted and true
classes are represented in the rows and columns of the matrix.
Using the formulas provided in Table 6, the sensitivity (true
positive rate) and specificity (true negative rate) of a model can be
evaluated. Sensitivity is defined as the probability of getting a
positive result if the specimens were in fact linked. Similarly,
specificity is defined as the probability of getting a negative result if
the specimens were, in fact, unlinked [121]. The closer to one each
of those parameters is the better the model is at classification [53].
This model works just as well for models with more than two
classes (e.g. ANN model), although the structure of the matrix is
slightly altered, (see Fig. 5 for a three-class example).
4.1.3. Receiver operating characteristic curves

Receiver operating characteristic (ROC) curves are another
common way for visualising and assessing the overall efficiency of
statistical combinations [28,122]. This type of curve elucidates the
ideal statistical combination, i.e. the combination with the smallest
Fig. 4. Visualisation of different ANNs; (a) radial basis function, (b) multi-layer intra-variability and greatest inter-variability. As mentioned in
perceptron, (c) probability ANN.
Section 3.2, there will almost always be some overlap between
linked and unlinked populations (see Fig. 1).
For every THV there will be some level of FPs and FNs. ROC
curves are a plot of the sensitivity (true positive rate) as a function
of 1 – specificity (FP rate), see Table 6 for formulas. A test with
4. Evaluation of chemometric techniques perfect separation between the linked and unlinked populations
would have a 100% true positive rate and a 0% FP rate. Therefore,
4.1. Evaluation of CMs the closer to 1 the area under the ROC curve (AUC) is the more
efficient the statistical combination [9]. Any model that produces a
4.1.1. Discrimination power curve close to or under the reference line (i.e. 0.5) is a poor
It is vital that statistical methodologies can discriminate combination of PT and CM. That is, the combination produces a
between specimens of differing origins. Thus, calculating the large overlap between the linked and unlinked populations. A
discriminating power (DP) of the CM is essential. It is through the typical ROC curve would resemble Fig. 6.
calculation of mean and standard deviations of the comparison Either confusion matrices alone or in combination with ROC
scores that the discrimination is calculated [9]. Hence, discrimi- curves can be used to evaluate the ideal statistical combination of
nation can be estimated by: PT and CM.
mðunlinkedÞ s ðunlinkedÞ
¼
mðlinkedÞ s ðlinkedÞ
where m is the average and s is the standard deviation. This Table 6
essentially measures the ability of the CM to differentiate between Formulas of certain evaluation technique measures.
linked and unlinked specimens. The higher the discrimination, the Measure Formula
better the method is at differentiating unlinked specimens [31]. It TPþTN
Accuracy
should be noted that the ratio is only an estimation as it assumes Sensitivity
TPþFPþFNþTN
TP
TPþFN
the populations follow a normal distribution, which is often not the Specificity TN
FPþTN
case with casework specimens.
within-cluster sum of squares (WCSS) as a function of the number

of clusters to provide an accurate estimate. More specifically, the
method works by inserting a range of k values in clustering
algorithms, computing a WCSS for each k value. These values are
used to plot the WCSS curve. The bend in the curve (elbow point)
indicates where the rate of decrease sharply shifts and as such
correlates with an optimal number of clusters [126].
One thing to note with k means clustering is that decreasing the
WCSS increases the distance between clusters. The essence of
partitioning methods, like k means clustering, is to minimise the
WCSS as much as possible as it measures the compactness of
clustering.
4.2.2.2. Average silhouette method. The average silhouette method

is an alternative to the somewhat ambiguous elbow method. It
essentially measures how well objects fit in a cluster and can be
used with any clustering technique [126].
After clustering has been performed, each object is scored a
value from 1 to 1. The closer to 1 the more similar the object is to
its own cluster compared to other clusters. This is repeated for a
Fig. 6. Ideal ROC curve example and reference line.
range of k values. The estimate of the optimal number of clusters is
found by locating which configuration contains the highest
average silhouette score [127].
4.2. Evaluation of UPR techniques
4.2.2.3. Gap statistic method. The gap statistic is an approach that
4.2.1. PCA can be applied to any clustering method [128]. It is a formalised
It is possible to determine an ideal number of PCs using several version of the elbow method and employs the standardised
approaches. These approaches can indicate a different number of logarithm of the WCSS to calculate the optimum number of
PCs for the same data. clusters [126].
The more variables that load onto a particular PC, the more The idea behind this method was to compare the logarithm of
critical the PC is in summarising the data. Eigenvalues with a value the WCSS against a distribution with no apparent clustering [129].
of 1.0 indicate that the PC contains the same amount of information Once again, this process is repeated for a range of k values. Ideally,
as a single variable. Therefore, the Kaiser method suggests only to the clustered specimens will be distant from the randomly
keep PC with eigenvalues above 1.0 [123]. However, there has been distributed points. The clustering configuration which maximises
some criticism that this method may allow for too many the gap statistic is a good estimation of the optimal number of
components [124]. clusters.
The Cattell scree test [125] is another method of determining
an ideal amount of PCs to keep. Scree plots are a graphical 4.3. Evaluation of SPR techniques
representation of eigenvalue magnitude as a function of their
associated PC. They often show a steep curve, followed by a bend Although DA and ANNs differ slightly in functionality, their
and then a shallow gradient (scree). The ideal number of resultant models can be evaluated in similar ways. In saying that,
eigenvalues will be before the scree, where the line starts to the methods mentioned in this section can evaluate most SPR
decrease smoothly. techniques. Once an SPR technique has classified new specimens,
Each PC accounts for a certain percentage of the variance in a cross-validation is often performed to test the classification
dataset. The first PC will account for the most variability, followed accuracy. There are several methods available for cross-validation,
by the second and so on. It has been suggested to only retain the which include resubstitution, hold-out method and the leave-one-
PCs that have a cumulative explanation over a certain percentage. out method [110]. As mentioned in Sections 4.1.2 and 4.1.3,
The percentage chosen is pre-determined and case specific. The confusion matrices and ROC curves are not just suited to CMs. They
cumulative percent explained method is more subjective than have been used in many cases to evaluate SPR techniques including
the Kaiser or scree plot methods. PLS-DA, LDA and ANN models.
Additionally, “predicted vs measured” plots can be used to
4.2.2. Cluster validation evaluate the quality of the regression model fitted to the data [31].
Determining the optimal number of clusters in a dataset is As the name suggests, these plots assess the correlation between
a fundamental issue in partitioning clustering, such as KMC, the predicted values and actual values of a specimen. The closer the
requiring the user to specify the number of clusters (k) to be correlation between the two axes the better the PLS-DA or ANN
generated. The number of clusters is somewhat subjective and model is at predicting classes. This type of plot visualises the
depends on the technique and parameters used for clustering. prediction error for calibration and validation datasets. Other
There are three main methods (elbow, silhouette and gap) methods of evaluating prediction error have been outlined in
which can be used to determine the optimum number of clusters [130,131].
[126]. These methods can be used for most clustering techniques. Apart from the methods mentioned above, PLS-DA and LDA
They will be explored in the following subsections. models can be evaluated via several graphical methods. These
include score plots or loading plots [12].
4.2.2.1. Elbow method. The elbow method is employed when DA score plots vary to that of PCA due to the supervised nature
attempting to estimate an optimal number of clusters for k means of the former technique. While PCA transforms a dataset based on
clustering. Existing methods cannot currently determine the exact the greatest orthogonal variation in the data. DA, on the other
value of k; the elbow method remedies this by looking at the total hand, produces a similar transformation but is informed by
between-group variability to better reveal group structure 5.1.1. ATS (AM & MA)
[132,133]. DA loading plots can be interpreted similarly to PCA For amphetamine (AM) and MA, the EUC is the most widely
loading plots. Highly correlated variables have similar weights in utilised metric for determining the linkage between specimens of
the loading vectors and appear close together in the loading plots AM and MA. This may reflect the ease of computation and familiar
of all dimensions. nature of this method. However, in the cases where authors have
experimented with metrics other than the EUC, it generally does
5. Application and added value of chemometric techniques not produce the greatest separation between populations. In terms
of statistical combinations, N + 4R/PCC and N + 4R/SCF consistently
5.1. Applications of CMs to illicit drugs produced lower FP rates for a variety of studies.
Four articles focused their efforts on using only EUC as a
The combination of PT and CM will influence the rate of FPs and distance measure [25,38,40,41]. This CM showed satisfactory
FNs obtained for a particular dataset. Subsequently, the selection of discrimination between the linked and unlinked populations.
the THV can be modified to reflect the appetite for risk, resource Dujourdy et al. [12] evaluated EUC, SCF and PCC and determined
availability or other operational factors. that the two latter metrics provided exceptional discrimination
Table 7 outlines various statistical combinations found in the between the two populations, with SCF producing a slightly lower
literature, referring to a variety of illicit drug types. As mentioned FP rate. This result was also supported by further studies [32,36]
in previous sections, the optimum combination will ensure the with SCF the most effective statistical combination when
greatest separation between linked and unlinked populations, compared against other metrics.
which in turn minimises FP and FN specimen linkages. In some When multiple PTs were compared, N + 4R or N + 2R produced
cases, authors have chosen to focus on one statistical combination the best discrimination between populations [12,31,32]. Ander-
rather than comparing modifications. For simplicity, it should be sson et al. [31] theorised that these PTs are more discriminatory
noted that the optimum or only statistical methodology in the than N as they minimise closure effects. Closure, or the constant
respective reference is in bold. The remainder of the references sum, introduces dependence between normalised variables so that
present in Table 7 is a combination of lower ranked options, which if one large variable goes up, the others automatically go down
the reader should not be discouraged to try as the effectiveness of because their sum is fixed [134]. Using a treatment such as N + 4R
the lower ranked options is not significantly different. The reduces the influence of large peaks and classify data with more
combinations have been ranked based on the DP or the AUC for accuracy, thus minimise closure effects.
the ROC curves stated in the article, either-or is present in the
referenced articles. 5.1.2. ATS (MDMA)
The following sections break down the preferred and optimum In many cases involving MDMA, PCC with data PT using
statistical combinations for each illicit drug type, as per the normalisation and its modifications, namely N + 4R, has been found
available literature. to produce the greatest separation between linked and unlinked
Table 7
Statistical combinations of CMs and PTs used for drug profiling, references shown in cells. Where applicable, the optimal CM for the article is shown in bold. The second and
third ranked combinations in the relevant articles are included as well. Any article that has evaluated only one combination is italicised.
CM PT ATS (AM & MA) ATS (MDMA) Cocaine Heroin

SCF L [36]
N [37] [9] h , [47], [136] [69], [68] c
N+L [56]
N + 2R [12] [10]
N + 4R [31], [12], [32]a , [32]b [33]a , [33]b
N+S [9]g , [9] h
N + S+L [56]
PCC 4R [56]
L [36]
N [10], [35] [54] [69]
N+L [35]
N + 2R [12] [39], [10], [34] e [17] [17]
N + 4R [31], [32]a,b [33] , [33]a , [35], [34] f
b
h g
N+S [9] , [9] [13]
EUC 2R [36]
4R [11]
N [45] [68] d
N + 4R [31], [32] b, [32]a [33]a,b
S [41], [25], [40], [38] [11]
CAN N [69]
N+L [9]g
N + 4R [31]
MAN 4R [11]
S [11]
a
Organic impurities.
b
Inorganic impurities.
c
Major impurities.
d
Minor impurities.
e
LLE method.
f
HS-SPME method.
g
Within-laboratory study.
h
Between-laboratory study.
populations [10,33–35,37,39]. It is always good, if time permits, to profiling method to specimens which have not properly crystal-
explore all combinations of CMs (including their modifications) lised and thus half base half HCl.
and PTs. As [17,45,47,54,136] only looked at one PT it is difficult to report
Morelato et al. [33] applied PCC, SCF and EUC metrics to which is commonly the optimum. Lociciro et al. [9] determined
inorganic impurity data, with PCC found to produce the greatest that out of 66 statistical combinations, N + S paired with SCF and
separation of linked and unlinked populations and the lowest FP PCC provided optimum results. Liu et al. [56] evaluated 50
rates. This outcome was also observed for additional studies, combinations and reported that N + S + L proved to have the
although for organic impurities, where multiple metrics were greatest DP. Interestingly, S + N tended to have a very low DP for all
compared [34,35]. In another study by van Deursen et al. [39], PCC CMs.
was evaluated as the only method. Although a good separation of
populations was achieved, the authors concluded that a larger 5.1.4. Heroin
specimen set was needed to increase the accuracy of their analyses. Critically evaluating an optimum statistical combination is
Additionally, in their final remarks, it was suggested that difficult at this stage due to the lack of diversity in combinations
alternative statistical combinations should also be trialed. This is applied in the few articles that exist for heroin profiling.
unsurprising as different metrics can have varying effects on the In general, few studies apply all five metrics mentioned in this
structure of the linked and unlinked score distributions. review article to a single dataset. One such study by Esseiva et al.
In contrast, two studies [10,33] utilising MDMA organic [69] found that there was an overlap between linked and unlinked
impurities concluded that the use of SCF, rather than other populations, no matter which statistical combination was used.
metrics, produced slightly lower FP values for both datasets. An Their findings showed that the EUC and MAN, in comparison to
additional study by Gimeno et al. [37] presented the SCF as a means CAN, SCF and PCC, showed a larger distribution of linked scores and
of possible identification of batch-level relationships based on were therefore excluded from further analysis. Although only one
MDMA tablet impurities. However, the profiling technique needs PT was explored, N, its combination with SCF proved to produce the
to be applied to more MDMA specimens to set an accurate THV for lowest rate of FPs [69]. Dufey et al. [68] investigated the use of both
distinguishing populations. In the sequential study to Weyermann minor and major impurities for determining links between heroin
et al. [10], Marquis et al. [11] evaluated EUC and MAN distances for profiles. In another interesting article, Broséus et al. [13] compared
the diameter, thickness, weight and score of seized MDMA tablets. the performance of PCC on its ability to separate linked from
In this case, it was found that EUC resulted in a separation of linked unlinked specimens within each analytical method and between
and unlinked populations. One reason for this might be that the analytical methods. Although the methodology was applicable for
EUC tends to perform better than MAN when the dataset contains a their aims, the authors suggest that further exploration of
low number of outliers [135]. statistical combinations should be conducted. This type of research
The greatest separation between populations in most cases is at the forefront of developing solutions to the issue of method
was achieved when using the N + 2R and N + 4R PT method [10,33– harmonisation.
35,39]. It should be noted that studies [11] and [37] did not In contrast to other drug types, a search of the literature shows a
evaluate the performance of N + 2R or N + 4R. small selection of PTs utilised for heroin data. It is highly
recommended that other PTs are explored for a thorough
5.1.3. Cocaine evaluation of the optimum statistical combination.
Although cocaine profiling and the subsequent generation of
intelligence products have been documented in the literature 5.2. Applications of UPR to illicit drugs
[9,17,45,47,54,56], the summary presented in Table 7 depicts the
lack of versatility of statistical methods used in the articles above. 5.2.1. ATS (AM, MA & MDMA)
Only articles [9] and [56] explored more than one CM, and in The visualisation and identification of clusters in ATS datasets
both cases, SCF and PCC outperformed other metrics. In the case of has been achieved in several studies which employed PCA and
[9] where a cross-border inter-laboratory specimen comparison HCA, see Table 8. Overall, PCA provided suboptimal results when
was attempted, it was found that the same metrics produced the utilised for pattern recognition of ATS specimens. Although this
greatest discrimination between populations. Two articles [17,54] was the case in several articles [10,11,41,100], the method should
evaluating PCC only, reported that a small overlap of the linked and not be overlooked as it was successful for other drug types, as
unlinked score distributions was observed. Broséus et al. [54] explained in the subsequent sections. Furthermore, except for one
noted that this degree of overlap is reliant on sampling and the study [100] by Waddell et al., HCA successfully recognised known
statistical combination should be evaluated using a larger and identified new patterns in several datasets [12,25,36,41].
specimen set. Nonetheless, PCC has proven to be a viable CM for Several articles utilised PCA to group specimens according to
the differentiation of cocaine specimens. In contrast [136], and [47] their original seizure [41,100] and country of origin [10,11].
reported minimal FP and FN rates when using SCF with N. It is Suboptimal results were reported for all four articles as the
interesting to note that Dujourdy and Besacier [47] tested whether structure of the groups did not coincide with additional specimen
the addition of cocaine base would influence the similarity information. It is not guaranteed that specimens coming from the
measurement. This was done to test the applicability of the same seizure are linked; this is only an assumption. For this reason,
Table 8
Common unsupervised and supervised pattern recognition techniques split by drug type.
Pattern recognition technique ATS Cocaine Heroin
UPR PCA [10,11,41,100] [52,53,107,139,141,142] [60,61,66,71–76,143]

HCA [12,25,36,41,100] [47,52,53,56,136,138,140,142] [66,67,71,73,75,144]
KMC [61,67,71,75,143]
SPR PLS-DA [12,31] [47,52,53,105,138,139] [74]
LDA [10] [106,107] [71,108,109]
ANN [100] [60,61,66]
specimen membership to pre-classified groups needs to be not as definitive but indicated a separation between cocaine HCl
carefully evaluated via visual inspection of score plots. That is, and base, with the purity and therefore potential adulterant
specimens logged as belonging to the same seizure should be presence to be inferred by the wide range of each group. The latter
carefully evaluated before attempting a pattern recognition article [139] elucidated that cocaine specimens of various purities
technique. could be differentiated via PCA. Additionally, using loading plots,
Furthermore, PCA was unable to detect patterns between authors were able to infer which impurity peaks were responsible
countries, likely due to the production of MDMA utilising for the discrimination between specimens.
predominantly the same route (reductive amination) [10,11]. Monfreda et al. [107] and Nielsen et al. [141] obtained alkaloid
The variation in MDMA tablets can be attributed to the high seizure and residual solvent profiles for cocaine seizures. In both cases, the
intra-variability in both studies. In both cases, the authors alkaloid profile was more descriptive of the seizure subgroups than
recommended that statistical analysis should be applied to the solvent profile. It was hypothesised that due to the lack of
specimens collected in similar periods to arrive at more reliable variation in latter profiles, multiple batches were manufactured
conclusions about patterns in the data. using the same organic solvent [141]. Combining the two profiles
In contrast, most articles involving HCA reported successful did not yield more information than the alkaloid PCA alone. Rather
results [12,25,36,41] except [100]. Despite indicating the potential than combining them, it was suggested that the two profiles could
grouping of tablets, not one original seizure was correctly clustered be complementary. Production batches, identified through alka-
by HCA in the latter article [100]. The ability of the technique to loid profiles, could be linked to the same manufacturer by the
correctly group seizures was limited by the high intra-variability of presence of common residual solvents [141].
the specimens. It is difficult to overcome this issue as MDMA The HCA analysis seen in [52,53,138] shows that specimens
tablets produced clandestinely will almost always exhibit such were clustered by chemical form and purity level. This confirms the
variability [100]. analysis conducted by PCA, where applicable. HCA performed by
However, in the case of studies [41] and [36], HCA was able to Rodrigues et al. [52] and Perez et al. [138] predominantly grouped
group specimens into their original seizures except for a few the specimens by high and low purity. Marcelo et al. [53] opted to
misclassifications. Krawczyk et al. [41] used EUC and PCC as group specimens into more definitive clusters. That is, HCA results
distance measures, and the single linkage method to determine indicated that specimens were split into two major groups (HCl
cluster structure. Upon viewing the resultant dendrograms, it was and base), and subsequent groups were formed based on the type
elucidated that HCA was an adequate technique to evaluate class of adulterant present. With some misclassification, HCA separated
membership. The visual nature of the dendrogram allows analysts all specimens based on assumed geographic location
to easily set a THV as to whether specimens belong to the same [47,56,140,142]. This analysis allowed for the identification of
group or not. The score on the y-axis of the dendrogram can be previously unknown links between specimens. Additionally, the
linked to certain FP and FN values, which can then be used to set an purity level could be attributed to certain locations, highlighting
appropriate THV [56]. Although the analysis performed by the extent of adulterants used in various regions [140]. Further-
Kuwayama et al. [36] also contained minimal error, it is important more [136], used HCA as a preliminary clustering technique to
to factor in the small specimen size. To ensure the accuracy of the group similar specimens prior to PT/CM analysis. This use of the
technique, the specimen size should be increased. technique does not feature greatly in the literature but is useful
Both Dujourdy et al. [12] and Kuwayama et al. [25] identified when the specimen to seizure relationship is not clear or known.
interesting trends in their respective MA datasets using HCA. The
former study performed HCA on a group of MA specimens 5.2.3. Heroin
previously determined as being produced via ephedrine (EPH), A range of UPR methods has been applied to heroin sets with
(achieved through PLS-DA, see Section 5.3.1). However, MA the purpose of batch comparisons [60,66,71,73,143,144] and
synthesised via EPH can be achieved in a variety of ways, e.g. geographical origin determination [61,67,72,74–76], using either
through Birch reduction, Nagai and Moscow routes [137]. Distance organic or inorganic components of the heroin production
calculation between specimens via EUC paired with the Ward process.
linkage method identified the presence of six clusters indicative of A clear trend across the range of heroin profiling projects was
different synthesis routes. Furthermore, it was possible to indicate the exploration of different combinations and sequencing of
which target compounds were characteristic of each cluster, techniques. Overall, PCA was demonstrated to be effective as a
yielding information about the applicability of the chosen target preliminary screening technique [60,61,66,73,75,76,143]. Howev-
compounds. In the latter study by Kuwayama et al. [25], authors er, the technique does not perform as well for low purity specimens
clustered MA crystals from different countries based on charac- [71]. Some studies identified that combinations of variables are
teristic impurity peaks. These clusters provided information about present in specific concentrations in heroin coming from different
tendencies of specimens in different countries, in particular, the origins [74]. This notion allowed for unsupervised classification of
lack of naphthalene derivatives present in Thai MA specimens. heroin specimens using PCA score and loading plots [72,74].
However, the specimens were split into two broad origins.
5.2.2. Cocaine Additionally, it is not guaranteed that the variables were present
As cocaine mainly exists in two forms, i.e. as the salt and base, it at the point of plant cultivation, they could have been added at any
is essential to be able to differentiate between these two chemical point of the manufacturing process. It would, however, be
forms as they indicate the extent of the cutting process interesting to assess the applicability of PCA to classify specimens
[52,53,138,139]. An alternate use of UPR can be seen in into sub-groups and therefore more specific origins.
[47,56,107,140] and [141,142]. The aim of these articles was the As mentioned, many authors reported a sequential application of
elucidation of common geographical origin and batch trends, an UPR clustering technique such as HCA or KMC following PCA. HCA
respectively. was used to evaluate the clustering of specimens [66,67,71,73,75,144],
In [52,53,139], score plots of the first two PCs graphically with some authors also conducting a thorough comparison of different
display the separation between different cocaine forms. In the case combinations of linkage methods and CMs [67,71]. These studies
of Rodrigues et al. [52] three main groups were observed; high identified that the Ward linkage method consistently outperformed all
purity cocaine HCl, high purity cocaine base and low purity cocaine other options including average, centroid and median; when used in
(both HCl and base). The PCA conducted by Marcelo et al. [53] was combination with either squared EUC or squared PCC CMs.
The final UPR method to be featured in research involving cocaine specimens of low and high purity and specimens of HCl
heroin profiling data is KMC [61,67,75,143], which can also be and base form. Although score plots were not presented in the
applied as a semi-supervised pattern recognition method, as article, authors presented confusion matrices, which summarised
demonstrated by [71]. It is interesting to note that contrary to the the results. The models assessing purity and chemical form
articles mentioned above, one article did not use KMC as a produced sensitivity and specificity in the range of 83–97%. The
confirmatory technique [67]. Instead, it was used as a dimension- models developed by Marcelo et al. [53] and Grobério et al. [105]
ality reduction technique, with HCA confirming the results. each correctly identified 100% of specimens as being either cocaine
HCl or base. In the case of [105] by Groberio et al., the lack of error
5.3. Applications of SPR to illicit drugs was achieved after outliers (4.4% of the dataset) were removed.
Two articles utilised LDA for the classification of cocaine
5.3.1. ATS (AM, MA & MDMA) specimens [106,107]. In the former study [106], specimens were
As a plethora of possible manufacturing routes exists for synthetic subjected to different storage conditions and their alkaloid and
ATS [137], two studies [12,31] sought the use of PLS-DA to extract residual solvent profiles were generated over time. LDA predicted
information regarding synthesis routes for their respective datasets. the level of association between specimens. The authors suggested
Andersson et al. [31] used PLS-DA to evaluate the optimum PT for AM improving the statistical comparison of cocaine profiling by
route discrimination, in particular, the Leuckart, nitrostyrene and combining the two profiles. The stability of residual solvent
reductive amination routes. While Dujourdy et al. [12] used the same profiles over time, as well as the specificity of alkaloid profiles,
technique to discriminate between MA manufactured via benzyl proved to lower prediction error rates. In the latter article [107],
methyl ketone (BMK) and EPH. LDA was modelled on pure cocaine specimens. Test “cut” speci-
Evaluation of the PLS-DA models was carried out using score plots mens (containing cutting agents) were classified according to
and “predicted vs measured” plots. Score plots, in both cases, gave an clusters observed through PCA. In both cases, prediction error only
indication of the model with optimum discrimination. In the case of started occurring when specimens contained more than one
one article by Dujourdy et al. [12], score plots graphically illustrated cutting agent and a cocaine percentage of less than 60%.
that the model successfully differentiated specimens synthesised
using BMK and EPH routes. It also gave an indication of the large 5.3.3. Heroin
extent of EPH manufacturing, shown by the sizable difference A limited range of SPR methods has been applied to heroin
between the two groups. In the case of [31] by Andersson et al., visual profiling data, with the typical objective of achieving geographical
inspection of the score plots indicated that the N + 4R PT provided the origin determination [61,71,74,109]. For heroin profiling, ANN
greatest spread between synthetic route groups. architectures such as RBF and MLP have been featured several
As visual inspection can be quite subjective, the results of the times in the literature [60,61], with Ratle et al. [61] evaluating
score plots were confirmed through “predicted vs. measured” plots. alternative ANN such as PNN and an additional non-parametric
Dujourdy et al. [12] elucidated that their model produced a very high method k-ANN. PNN and k-ANN outperformed the MLP and RBF
correlation coefficient for both calibration and validation, demon- methods, but this outcome may be attributable to the application
strating the applicability of their model in discriminating between of a dimensionality reduction technique such as PCA. However,
MA synthesised via BMK and EPH. The authors also used their when discriminating between heroin batches, Klemenc [66] found
validated model to make predictions with new unclassified speci- that PCA and HCA were less erroneous than k-ANN. It should be
mens, which resulted in a 5% misclassification rate. Andersson et al. noted that this misclassification by the SPR technique could be
[31], on the other hand, reported that N + 4R produced the highest attributed to the small specimen size.
correlation coefficient for the “predicted vs. measured” plots. Another SPR technique applied to heroin profiling data is DA.
An alternative DA applied to ATS data was explored by The technique can be used to discriminate specimens based on
Weyermann et al. [10]. In this study, LDA was applied to MDMA geographic origin [71,74,109] or batch trends [108]. Additionally,
data to ascertain whether a reduction of target variables offered DA was evaluated in comparison to other chemometrics [71,74]
the same level of discrimination. It was found that the model where it was identified as the superior method. In all four cases,
employing the reduced set of variables presented minor classifi- DA performed well with some misclassification. Amongst other
cation. Furthermore, the model was also able to identify the things, the error could be due to the small specimen size,
variables mainly responsible for specimen discrimination. uncertainty of the country of origin or period of the specimens.
With the limited literature on ANN development for ATS, As DA learns from available specimens, a small dataset could
Waddell et al. [100] provided an interesting insight into the topic. potentially overtrain the model and misclassify specimens
The authors classified ecstasy tablets into their original seizures [71,108]. Furthermore, if there is a high inaccuracy in the original
using various MLP models and retained the six optimum ANN assumption of the country of origin, this could increase the
configurations. These configurations produced an accurate classi- misclassification rate during the training of a model [109].
fication rate of 96–99%. Misclassification can become a major issue With regards to spatiotemporal trend analysis, it would be
if seizure intra-variability is high. However, this type of variation beneficial to build SPR models that can discriminate between
will always be present in clandestinely produced illicit substances different batches with the same geographic origin. For this to be
[100]. It is a true test of the ANN to be able to discriminate between achieved the outputs of models should not be restricted to the
linked and unlinked specimens. assumed country but should include subgroups to account for
the change in the manufacture of heroin specimens [109]. Overall,
5.3.2. Cocaine there would appear to be a significant opportunity to explore SPR
As mentioned in Section 5.2.2, cocaine can exist in several as the bulk of research in this space is occurred over a decade ago.
forms. For this reason, it is useful to be able to differentiate
between these forms. Several articles [47,52,53,105,138,139] have 5.4. Added value of chemometric techniques applied to illicit drugs
evaluated the use of PLS-DA for classifying cocaine specimens. for ILP
Dujourdy et al. [47] attributed certain solvents to three
presumed specimen origins (Peru, Bolivia and Columbia) with The sections above provide a consolidated review on the
the aid of a PLS-DA loading plot. Rodrigues et al. [52] developed application of the most common chemometric techniques con-
two PLS-DA models which successfully discriminated between cerning drug profiling. The subsequent sections go through the
added value of chemometric techniques applied to illicit drugs for The creation of CCs is quite a useful feature in an ILP context.
ILP, in particular, their added value towards active investigations Their creation allows us to infer that specimens part of the same
and understanding drug markets. In an ILP perspective, forensic class were once part of the same physical unit, when major
case data is usually stored in an organised memory [5]. This impurities were profiled [16]. Taking the example of Broséus et al.
includes profile(s), scores, link values relating to specimens [77]. [16], a drug profiling method coupled with chemometric
Previously unknown connections between specimens can be un- techniques was used to study the evolution of cocaine and heroin
earthed through applying chemometric techniques [36,41]. This is drug markets as well as devise measures for promptly disrupting
often achieved through a combination of CMs and HCA to create developing segments of the network. The findings of such analysis
classes of specimens based on similarity, which are subsequently included the degree of drug homogeneity, spatiotemporal market
integrated into the memory [60]. Classes that are produced trends and expected outcomes of preventative measures.
through clustering can be chemical or physical, depending on the In more detail, the degree of drug homogeneity can be inferred
selected characteristics. The memory needs to be updated conti- from mapping the connections between specimens. A network
nuously to provide accurate information in a forensic intelligence with a low number of CCs and a high number of connections
application, this includes CCs as well as any investigative indicates consistency amongst specimens [16]. Additionally, hubs
information. in the network can be identified by observing seizures that link
previously unconnected cases. These instances can be confirmed
5.4.1. Added value for active investigations through quantitative measures of social network analysis, such as
If the subset of specimens to be explored is restricted to a centrality scores [149]. Based on the centrality scores and number
chosen event, chemometric techniques can be used to support a of CCs per seizure, Broseus et al. [16] showed that cocaine multi-
hypothesised link based on other investigative information. This profile seizures are commonly observed at an upper level of the
approach is rather reactive as specimens to be compared are distribution chain while this is not the case for heroin. This may
restricted to certain cases where links are inferred by other highlight how physical units with different chemical profiles are
information. Granted, the hypothesised links may be confirmed or managed and possibly blended along the distribution chain, an
refuted by the chosen statistical measure, but the potential to information difficult to obtain through traditional market analysis.
uncover links that are not indicated by investigative information is Furthermore, spatiotemporal analysis can be performed on
severely limited. This process is somewhat restrictive as it severely the network to identify points of interest or a need for trans-
confines the potential of drug profiling to support ILP. jurisdictional collaboration. Using static visualisations may limit the
Another approach would be to systematically compare new intelligence that can be generated from network evaluation.
specimens against existing cases with the aid of chemometric Therefore, it is recommended that the network be studied over a
techniques. The physical result of systematically applying CMs to chosen time (e.g. trimester) to understand the evolution of the drug
drug profiling data could take the form of a record of linked cases market fully. Temporal analysis can elucidate information about CCs
(including previously unknown links). Further applying CA could life and infer the emergence or extinction of criminal networks.
index specimens based on their similarity, in turn creating classes. Classes deemed significant in size and observed to be emerging
This would produce a form of intelligence that generates a clearer deserve more focus from investigators and should be neutralised
insight into the connectivity of the specimens in question to the promptly. As all information regarding a specimen is not always
drug market. This insight could allow investigators to focus their available, SPR can be used to classify entities to gain a better
resources on exploring instances where specimens are seemingly understanding of markets. There have been many instances of
linked to unrelated cases. Ideally, these instances should be successful classification, covered in Section 5.3. These articles mainly
supported by other types of information. Furthermore, the process classified specimens and highlighted patterns based on geographic
used to arrive at the link needs to be continuously evaluated to location, route of synthesis or batch trends. As well as temporal
ensure an appropriate level of FPs and FNs. analysis and enrichment of the memory, intelligence can be gathered
An example of both approach was demonstrated by Esseiva from evaluating spatial data. Trans-jurisdictional connections can
et al. [60]. In this article, several heroin seizures were chemically identify problematic areas and identify instances where collabora-
profiled for the purpose of confirming and potentially elucidating tion between jurisdictions should be increased [16]. Additionally, at
further links. Four clusters of specimens, seemingly linked through an international level, network analysis could identify countries that
investigative information, were further examined through chemi- are pivotal to drug markets. This could aid the suggestion of strategic
cal profiling and chemometrics. The hypothesised links were all targets for disrupting international drug trade [145].
chemically confirmed. If the analysis had stopped there (as it is It is possible to receive an indicator of how well implemented
traditionally the case), investigators would have missed links with strategies are disrupting drug markets by analysing the extent
other seizures not initially connected to the particular investiga- of certain CCs at a given time. To expand on this idea, law
tion. However, the chemical profiles were compared to all other enforcement could focus their resources on CCs deemed to be
specimens in the memory and new connections were revealed that important. This would initially result in an increase in connections
were previously hidden when consulting investigative information within the network as more seizures are made, then ideally a
only [60]. This approach made visible unsuspected links and decrease in connections as the criminal activity has been quenched
provided a more complete picture of the extent of the market. due to the focused efforts of law enforcement. However, this isn’t
always the case, criminal organisations adapt to police efforts and
5.4.2. Added value for understanding drug markets not all attempts at market disruption are successful. That is not to
Chemometric techniques are a valuable tool for supporting ILP; say that something cannot be gained from this. These unsuccessful
the approach outlined above could not be achieved by presenting efforts can guide law enforcement in understanding flexibility of
raw data to investigators. So, as well as systematically comparing criminal organisations and help further develop proactive strate-
new specimens against the memory, it can be explored retrospec- gies for ILP [16].
tively to understand drug markets and ultimately predict trends
that could prevent future events from happening [145,146]. To 6. Conclusions
have further confidence in the links and CCs identified through
chemometric techniques, they should be validated with other The illicit drug trade is a widely recognised social and judicial
sources of information [147,148]. problem. However, there are opportunities to aid decision makers
in disrupting the problem through the collection and appropriate through studying the evolution of drug markets, devising
exploration of forensic case data. Chemical and physical profiles, measures for prompt disruption and thorough evaluation of such
coupled with chemometric techniques, can be used to highlight measures to ensure success of police efforts.
various patterns in the forensic case data. In the case of illicit drug
data, these patterns are related to batch trends, links between CRediT authorship contribution statement
specimens, geographic location or route of synthesis.
The most common chemometric techniques outlined in the Ana Popovic: Conceptualization, Data curation, Writing -
literature include CMs, unsupervised and supervised pattern original draft, Writing - review & editing. Marie Morelato: Writing
recognition. - original draft, Writing - review & editing. Claude Roux: Writing -
CMs can be very useful in determining the level of association review & editing. Alison Beavis: Conceptualization, Writing -
between profiles. Multiple metrics should be applied to any dataset original draft, Writing - review & editing.
to identify the optimum. The EUC distance has been widely used
for determining linkages as it produces excellent results. However, Acknowledgements
when other metrics are explored alongside EUC, it proves to be
suboptimal. These cases show that either SCF or PCC provide the This work is supported by the Australian Research Council grant
lowest rate of FPs and FNs. In all the articles included in this review (LP160100352). The authors would like to acknowledge the project
either SCF, PCC or EUC were superior to other metrics. The optimal partners: NSW Forensic & Analytical Science Services, Australian
combination of CM and PT needs to be periodically re-evaluated to Federal Police, United Nations Office on Drugs and Crime,
accommodate the addition of new specimens to the database. This Univertity of Lausanne.
will ensure that maximum separation is achieved between linked
and unlinked populations.
References
The UPR techniques featured in this article include PCA, HCA
and KMC. The occasional error encountered with PCA was most [1] H.A. Visser, M. Visser-van Leeuwen, H. Huizer, Residual solvents in
likely due to incorrect assumptions relating to specimen member- methylenedioxymethamphetamine tablets as a source of strategic informa-
ship. Otherwise, the technique was successful in preliminary tion and as a tool for comparative analysis: the development and application
of a static headspace gas chromatography/mass spectrometry method, Bull.
screening and dimensionality reduction. The two clustering Narc. 57 (1–2) (2005) 167–182.
techniques (HCA and KMC) explored in this article highlighted [2] M.M. Houck, Professional Issues in Forensic Science, Academic Press, 2015.
interesting patterns between specimens. The UPR techniques are [3] T. Raymond, R. Julian, Forensic intelligence in policing: organisational and
cultural change, Aust. J. Forensic Sci. 47 (4) (2015) 371–385.
commonly used sequentially and often to confirm PCA results. The [4] O. Ribaux, Forensic intelligence and crime analysis, Law Probab. Risk 2 (1)
visual nature of the HCA dendrogram could explain the popularity (2003) 47–60.
of the technique. Significantly fewer articles feature KMC. The two [5] P. Esseiva, S. Ioset, F. Anglada, L. Gaste, O. Ribaux, P. Margot, A. Gallusser, A.
Biedermann, Y. Specht, E. Ottinger, Forensic drug intelligence: an important
clustering techniques are not without their drawbacks and the goal tool in law enforcement, Forensic Sci. Int. 167 (2–3) (2007) 247–254.
of the data analysis should be considered before choosing one or [6] M. Morelato, A. Beavis, M. Tahtouh, O. Ribaux, P. Kirkbride, C. Roux, The use of
the other. Furthermore, one limitation of the clustering techniques forensic case data in intelligence-led policing: the example of drug profiling,
Forensic Sci. Int. 226 (1–3) (2013) 1–9.
is the requirement of choosing a THV, whether that be a score
[7] N. Nic Daeid, R.J. Waddell, The analytical and chemometric procedures used
relating to a CM or the number of desired clusters. This THV might to profile illicit drug seizures, Talanta 67 (2) (2005) 280–285.
not always be an obvious choice and if chosen incorrectly, could [8] S. Ioset, P. Esseiva, O. Ribaux, C. Weyermann, F. Anglada, S. Lociciro, P. Hayoz, I.
severely limit the information acquired from the clustering Baer, L. Gasté, A.L. Terrettaz-Zufferey, C. Delaporte, P. Margot, Establishment
of an operational system for drug profiling: a Swiss experience, Bull. Narc. 57
techniques. Additionally, unlike PCA, HCA does not indicate which (1–2) (2007) 121–147.
variables contribute most to the classification of objects in the [9] S. Lociciro, P. Esseiva, P. Hayoz, L. Dujourdy, F. Besacier, P. Margot, Cocaine
dataset. profiling for strategic intelligence, a cross-border project between France and
Switzerland: part II. Validation of the statistical methodology for the
When complementary specimen information is available, SPR profiling of cocaine, Forensic Sci. Int. 177 (2–3) (2008) 199–206.
techniques can be used to generate prediction models. When [10] C. Weyermann, R. Marquis, C. Delaporte, P. Esseiva, E. Lock, L. Aalberg, J.S.
compared to UPR, superior results were often observed for DA. Bozenko Jr., S. Dieckmann, L. Dujourdy, F. Zrcek, Drug intelligence based on
MDMA tablets data I. Organic impurities profiling, Forensic Sci. Int. 177 (1)
However, misclassification was encountered in some cases. This (2008) 11–16.
error was usually due to small specimen size, the uncertainty of the [11] R. Marquis, C. Weyermann, C. Delaporte, P. Esseiva, L. Aalberg, F. Besacier, J.S.
country of origin or lengthy time span of specimens. Several ANNs Bozenko Jr., R. Dahlenburg, C. Kopper, F. Zrcek, Drug intelligence based on
MDMA tablets data: 2. Physical characteristics profiling, Forensic Sci. Int. 178
have been applied to illicit drug data in order to classify specimens.
(1) (2008) 34–39.
In most cases, the ANN produce sufficient results, but the efficacy [12] L. Dujourdy, V. Dufey, F. Besacier, N. Miano, R. Marquis, E. Lock, L. Aalberg, S.
may vary if unsupervised techniques are applied first. Dieckmann, F. Zrcek, J.S. Bozenko Jr., Drug intelligence based on organic
impurities in illicit MA samples, Forensic Sci. Int. 177 (2–3) (2008) 153–161.
The optimum ordering of chemometric methods will depend on
[13] J. Broseus, B. Debrus, O. Delemont, S. Rudaz, P. Esseiva, Study of common
the desired goal. In many cases, PCA is a good starting point. It database feeding with results coming from different analytical methods in
can be followed by CA or SPR depending on the availability of the framework of illicit drugs chemical profiling, Forensic Sci. Int. 230 (1–3)
additional information. A prediction model can be generated (2013) 16–28.
[14] C.S. Jonson, L. Stromberg, Computer aided retrieval of common-batch
without additional class information if CA is performed ahead of members in Leuckart amphetamine profiling, J. Forensic Sci. 38 (6) (1993)
SPR. However, CA can also follow SPR if a known class is thought to 1472–1477.
contain further clusters. [15] J.F. Casale, J.W. Watterson, A computerized neural network method for pattern
recognition of cocaine signatures, J. Forensic Sci. 38 (2) (1993) 292–301.
Various configurations of chemometric techniques can aid the [16] J. Broseus, S. Baechler, N. Gentile, P. Esseiva, Chemical profiling: a tool to
interpretation of forensic case data, in particular, their ability to decipher the structure and organisation of illicit drug markets: an 8-year
add value towards active investigations and understanding drug study in Western Switzerland, Forensic Sci. Int. 266 (2016) 18–28.
[17] P. Esseiva, L. Gaste, D. Alvarez, F. Anglada, Illicit drug profiling, reflection on
markets. Drug profiling has proven useful in confirming links statistical comparisons, Forensic Sci. Int. 207 (1–3) (2011) 27–34.
originally hypothesised by investigators and has also demonstrat- [18] M. Collins, Illicit drug profiling: the Australian experience – revisited, Aust. J.
ed the potential of going a step further and identifying previously Forensic Sci. 49 (6) (2017) 591–604.
[19] N. NicDaéid, S. Jayaram, W.J. Kerr, Elemental profiling using ICPMS of
unconnected entities. Furthermore, several articles have demon-
methylamphetamine hydrochloride prepared from proprietary medication
strated the ability of forensic case data, coupled with chemometric using the Moscow and hypophosphorous synthesis, Sci. Justice 53 (3) (2013)
techniques, to support ILP in a strategic context. This was achieved 278–285.
[20] R.J. Waddell-Smith, A review of recent advances in impurity profiling of illicit [45] K.E. Janzen, A.R. Fernando, L. Walter, A database for comparison analysis of
MDMA samples, J. Forensic Sci. 52 (6) (2007) 1297–1304. illicit cocaine samples, Forensic Sci. Int. 69 (1) (1994) 23–29.
[21] Y. Marumo, T. Inoue, S. Seta, Analysis of inorganic impurities in seized [46] M. Collins, J. Huttunen, I. Evans, J. Robertson, Illicit drug profiling: the
methamphetamine samples, Forensic Sci. Int. 69 (1) (1994) 89–95. Australian experience, Aust. J. Forensic Sci. 39 (1) (2007) 25–32.
[22] A. Kruegel, T. Kram, The identification of impurities in illicit methamphet- [47] L. Dujourdy, F. Besacier, Headspace profiling of cocaine samples for
amine exhibits by gas chromatography/mass spectrometry and nuclear intelligence purposes, Forensic Sci. Int. 179 (2-3) (2008) 111–122.
magnetic resonance spectroscopy, J. Forensic Sci. 22 (1) (1977) 40–52. [48] M. Cole, Occluded solvent analysis as a basis for heroin and cocaine sample
[23] A.M. Verweij, Impurities in illicit drug preparations: amphetamine and differentiation, Forensic Sci. Rev. 10 (2) (1998) 113–120.
methamphetamine, Forensic Sci. Rev. 1 (1) (1989) 1–11. [49] D.R. Morello, R.P. Meyers, Qualitative and quantitative determination of
[24] V. Kunalan, N. Nic Dae’id, W.J. Kerr, H.A. Buchanan, A.R. McPherson, residual solvents in illicit cocaine HCl and heroin HCl, J. Forensic Sci. 40 (6)
Characterization of route specific impurities found in methamphetamine (1995) 957–963.
synthesized by the Leuckart and reductive amination methods, Anal. Chem. [50] J.R. Ehleringer, D.A. Cooper, M.J. Lott, C.S. Cook, Geo-location of heroin and
81 (17) (2009) 7342–7348. cocaine by stable isotope ratios, Forensic Sci. Int. 106 (1) (1999) 27–35.
[25] K. Kuwayama, H. Inoue, J. Phorachata, K. Kongpatnitiroj, V. Puthaviriyakorn, [51] J.R. Ehleringer, J.F. Casale, M.J. Lott, V.L. Ford, Tracing the geographical origin
K. Tsujikawa, H. Miyaguchi, T. Kanamori, Y.T. Iwata, N. Kamo, Comparison and of cocaine, Nature 408 (6810) (2000) 311–312.
classification of methamphetamine seized in Japan and Thailand using gas [52] N.V.S. Rodrigues, E.M. Cardoso, M.V.O. Andrade, C.L. Donnici, M.M. Sena,
chromatography with liquid–liquid extraction and solid-phase microex- Analysis of seized cocaine samples by using chemometric methods and FTIR
traction, Forensic Sci. Int. 175 (2) (2008) 85–92. spectroscopy, J. Braz. Chem. Soc. 24 (3) (2013) 507–517.
[26] L. Aalberg, K. Andersson, C. Bertler, H. Boren, M.D. Cole, J. Dahlen, Y. Finnon, H. [53] M.C. Marcelo, K.C. Mariotti, M.F. Ferrao, R.S. Ortiz, Profiling cocaine by ATR-
Huizer, K. Jalava, E. Kaa, E. Lock, A. Lopes, A. Poortman-van der Meer, E. FTIR, Forensic Sci. Int. 246 (2015) 65–71.
Sippola, Development of a harmonized method for the profiling of [54] J. Broseus, S. Huhtala, P. Esseiva, First systematic chemical profiling of cocaine
amphetamines. I. Synthesis of standards and compilation of analytical data, police seizures in Finland in the framework of an intelligence-led approach,
Forensic Sci. Int. 149 (2–3) (2005) 219–229. Forensic Sci. Int. 251 (2015) 87–94.
[27] L. Aalberg, K. Andersson, C. Bertler, M.D. Cole, Y. Finnon, H. Huizer, K. Jalava, E. [55] S. Lociciro, P. Hayoz, P. Esseiva, L. Dujourdy, F. Besacier, P. Margot, Cocaine
Kaa, E. Lock, A. Lopes, A. Poortman-van der Meer, E. Sippola, J. Dahlen, profiling for strategic intelligence purposes, a cross-border project between
Development of a harmonized method for the profiling of amphetamines. II. France and Switzerland. Part I. Optimisation and harmonisation of the
Stability of impurities in organic solvents, Forensic Sci. Int. 149 (2-3) (2005) profiling method, Forensic Sci. Int. 167 (2-3) (2007) 220–228.
231–241. [56] C. Liu, Z. Hua, X. Meng, Profiling of illicit cocaine seized in China by ICP-MS
[28] K. Andersson, K. Jalava, E. Lock, Y. Finnon, H. Huizer, E. Kaa, A. Lopes, A. analysis of inorganic elements, Forensic Sci. Int. 276 (2017) 77–84.
Poortman-van der Meer, M.D. Cole, J. Dahlen, E. Sippola, Development of a [57] I.S. Lurie, P.A. Hays, A.E. Garcia, S. Panicker, Use of dynamically coated
harmonised method for the profiling of amphetamines: III. Development of capillaries for the determination of heroin, basic impurities and adulterants
the gas chromatographic method, Forensic Sci. Int. 169 (1) (2007) 50–63. with capillary electrophoresis, J. Chromatogr. A 1034 (1) (2004) 227–235.
[29] K. Andersson, K. Jalava, E. Lock, H. Huizer, E. Kaa, A. Lopes, A. Poortman-van [58] M. Gloger, H. Neumann, Analysis of heroin samples with capillary gas
der Meer, M.D. Cole, J. Dahlen, E. Sippola, Development of a harmonised chromatography. Comparison of glass capillary column and packed column,
method for the profiling of amphetamines: IV. Optimisation of sample Forensic Sci. Int. 22 (1) (1983) 63–74.
preparation, Forensic Sci. Int. 169 (1) (2007) 64–76. [59] C. Barnfield, S. Burns, D.L. Byrom, A.V. Kemmenoe, The routine profiling of
[30] E. Lock, L. Aalberg, K. Andersson, J. Dahlen, M.D. Cole, Y. Finnon, H. Huizer, K. forensic heroin samples, Forensic Sci. Int. 39 (2) (1988) 107–117.
Jalava, E. Kaa, A. Lopes, A. Poortman-van der Meer, E. Sippola, Development of a [60] P. Esseiva, F. Anglada, L. Dujourdy, F. Taroni, P. Margot, E.D. Pasquier, M.
harmonised method for the profiling of amphetamines V: determination of the Dawson, C. Roux, P. Doble, Chemical profiling and classification of illicit
variability of the optimised method, Forensic Sci. Int. 169 (1) (2007) 77–85. heroin by principal component analysis, calculation of inter sample
[31] K. Andersson, E. Lock, K. Jalava, H. Huizer, S. Jonson, E. Kaa, A. Lopes, A. correlation and artificial neural networks, Talanta 67 (2) (2005) 360–367.
Poortman-van der Meer, E. Sippola, L. Dujourdy, J. Dahlen, Development of a [61] F. Ratle, A.-L. Terrettaz, M. Kanevski, P. Esseiva, O. Ribaux, Pattern analysis in
harmonised method for the profiling of amphetamines VI: evaluation of illicit heroin seizures: a novel application of machine learning algorithms,
methods for comparison of amphetamine, Forensic Sci. Int. 169 (1) (2007) Bruges (2006) 665–670.
86–99. [62] A.C. Allen, D.A. Cooper, J.M. Moore, M. Gloger, H. Neumann, Illicit heroin
[32] M. Morelato, A. Beavis, M. Tahtouh, O. Ribaux, K.P. Kirkbride, C. Roux, The use manufacturing by-products: capillary gas chromatographic determination
of methylamphetamine chemical profiling in an intelligence-led perspective and structural elucidation of narcotine-and norlaudanosine-related com-
and the observation of inhomogeneity within seizures, Forensic Sci. Int. 246 pounds, Anal. Chem. 56 (14) (1984) 2940–2947.
(2015) 55–64. [63] M. Collins, E. Casale, D.B. Hibbert, S. Panicker, J. Robertson, S. Vujic, Chemical
[33] M. Morelato, A. Beavis, M. Tahtouh, O. Ribaux, P. Kirkbride, C. Roux, The use of profiling of heroin recovered from the North Korean merchant vessel Pong
organic and inorganic impurities found in MDMA police seizures in a drug Su, J. Forensic Sci. 51 (3) (2006) 597–602.
intelligence perspective, Sci. Justice 54 (1) (2014) 32–41. [64] H. Neumann, M. Gloger, Profiling of illicit heroin samples by high-resolution
[34] F. Bonadio, P. Margot, O. Delémont, P. Esseiva, Headspace solid-phase capillary gas chromatography for forensic application, Chromatographia 16
microextraction (HS-SPME) and liquid–liquid extraction (LLE): comparison (1) (1982) 261–264.
of the performance in classification of ecstasy tablets (Part 2), Forensic Sci. [65] Ø. Næss, K.E. Rasmussen, Micellar electrokinetic chromatography of
Int. 182 (1) (2008) 52–56. charged and neutral drugs in acidic running buffers containing a
[35] F. Bonadio, P. Margot, O. Delemont, P. Esseiva, Optimization of HS-SPME/GC- zwitterionic surfactant, sulfonic acids or sodium dodecyl sulphate separation
MS analysis and its use in the profiling of illicit ecstasy tablets (Part 1), of heroin, basic by-products and adulterants, J. Chromatogr. A 760 (2) (1997)
Forensic Sci. Int. 187 (1-3) (2009) 73–80. 245–251.
[36] K. Kuwayama, K. Tsujikawa, H. Miyaguchi, T. Kanamori, Y. Iwata, H. Inoue, S. [66] S. Klemenc, In common batch searching of illicit heroin samples—evaluation
Saitoh, T. Kishi, Identification of impurities and the statistical classification of of data by chemometrics methods, Forensic Sci. Int. 115 (1) (2001) 43–52.
methamphetamine using headspace solid phase microextraction and gas [67] R.B. Myors, S.V. Skopec, R.J. Wells, P.T. Crisp, Investigation of heroin profiling
chromatography-mass spectrometry, Forensic Sci. Int. 160 (1) (2006) 44–52. using trace organic impurities, Analyst 126 (5) (2001) 679–689.
[37] P. Gimeno, F. Besacier, H. Chaudron-Thozet, J. Girard, A. Lamotte, A [68] V. Dufey, L. Dujourdy, F. Besacier, H. Chaudron, A quick and automated
contribution to the chemical profiling of 3,4-methylenedioxymethamphet- method for profiling heroin samples for tactical intelligence purposes,
amine (MDMA) tablets, Forensic Sci. Int. 127 (1–2) (2002) 1–44. Forensic Sci. Int. 169 (2-3) (2007) 108–117.
[38] K. Tanaka, S. Seta, T. Inoue, T. Ohmori, Impurity profiling analysis of illicit [69] P. Esseiva, L. Dujourdy, F. Anglada, F. Taroni, P. Margot, A methodology for
methamphetamine by capillary gas chromatography, Impurity Profiling illicit heroin seizures comparison in a drug intelligence perspective using
Analysis of Illicit Methamphetamine by Capillary Gas Chromatography, large databases, Forensic Sci. Int. 132 (2) (2003) 139–152.
(1994) . [70] L. Strömberg, L. Lundberg, H. Neumann, B. Bobon, H. Huizer, N.W. van der
[39] M.M. van Deursen, E.R. Lock, A.J. Poortman-van der Meere, Organic impurity Stelt, Heroin impurity profiling, Forensic Sci. Int. 114 (2) (2000) 67–88.
profiling of 3,4-methylenedioxymethamphetamine (MDMA) tablets seized [71] K.W. Chan, G.H. Tan, R.C.S. Wong, Chemometric procedures for analyzing
in the Netherlands, Sci. Justice J. Forensic Sci. Soc. 46 (3) (2006) 135–152. trace organic impurities present in street doses of heroin via a constant
[40] H. Inoue, T. Kanamori, Y.T. Iwata, Y. Ohmae, K. Tsujikawa, S. Saitoh, T. Kishi, weight approach, Aust. J. Forensic Sci. 44 (3) (2012) 299–309.
Methamphetamine impurity profiling using a 0.32 mm i.d. Nonpolar [72] K.-W. Chan, G.-H. Tan, R.C.S. Wong, Investigation of illicit heroin seized in
capillary column, Forensic Sci. Int. 135 (1) (2003) 42–47. Malaysia: physical characteristics and chemical profiling, Aust. J. Forensic Sci.
[41] W. Krawczyk, A. Parczewski, Application of chemometric methods in 44 (4) (2012) 353–369.
searching for illicit Leuckart amphetamine sources, Anal. Chim. Acta 446 [73] K.W. Chan, G.H. Tan, R.C.S. Wong, Harmonizing a chemometric procedure for
(1-2) (2001) 107–114. the classification of illicit heroin using manufacturing impurities: prelimi-
[42] J.F. Casale, R.W. Waggoner, A chromatographic impurity signature profile nary investigation of heroin simulated links, Aust. J. Forensic Sci. 45 (2) (2013)
analysis for cocaine using capillary gas chromatography, J. Forensic Sci. 36 (5) 153–163.
(1991) 1312–1330. [74] C. Liu, Z. Hua, Y. Bai, Y. Liu, Profiling and classification of illicit heroin by ICP-
[43] A.R. Fernando, K.E. Janzen, L. Walter, Comparison analysis of illicit cocaine MS analysis of inorganic elements, Forensic Sci. Int. 239 (2014) 37–43.
samples, Journal of Forensic Science 37 (2) (1992) 436–445. [75] R. Myors, R.J. Wells, S.V. Skopec, R. Iavetz, Z. Skopec, P. Crisp, A. Ekangaki, J.
[44] K.E. Janzen, Cross-matching of cocaine samples. A case study, Can. Soc. Robertson, Preliminary investigation of heroin fingerprinting using trace
Forensic Sci. J. 20 (2) (1987) 77–81. element concentrations, Anal. Commun. 35 (12) (1998) 403–410.
[76] M.Z. Yusoff, K.H. Chang, A.F.L. Abdullah, Trace elemental profile investigation [110] A. Mendlein, C. Szkudlarek, J.V. Goodpaster, Chemometrics (2012) 646–651.
of illicit heroin for forensic intelligence, Aust. J. Forensic Sci. 0618 (May) [111] G. Govaert, Data Analysis, Wiley, 2013.
(2018) 1–11. [112] T. Li, S. Zhu, M. Ogihara, Using discriminant analysis for multi-class
[77] M. Morelato, Forensic Drug Profiling: a Tool for Intelligence-led Policing, (2015) . classification: an experimental investigation, Knowl. Inf. Syst. 10 (4)
[78] Drug Characterization/Impurity Profiling-Background and Concepts, United (2006) 453–472.
Nations Office on Drugs and Crime, 2001. [113] R. Webb, P. Doble, M. Dawson, Optimisation of HPLC gradient separations
[79] A. Bolck, C. Weyermann, L. Dujourdy, P. Esseiva, J. van den Berg, Different using artificial neural networks (ANNs): application to benzodiazepines in
likelihood ratio approaches to evaluate the strength of evidence of MDMA post-mortem samples, J. Chromatogr. B Analyt. Technol. Biomed. Life Sci. 877
tablet comparisons, Forensic Sci. Int. 191 (1-3) (2009) 42–51. (7) (2009) 615–620.
[80] Q. Milliet, C. Weyermann, P. Esseiva, The profiling of MDMA tablets: a study of [114] C. Kingston, Neural networks in forensic science, J. Forensic Sci. 37 (1) (1992)
the combination of physical characteristics and organic impurities as sources 252–264.
of information, Forensic Sci. Int. 187 (1) (2009) 58–65. [115] R.S. Govindaraju, B. Zhang, R.S. Govindaraju, Radial-basis function networks,
[81] J. Huttunen, C. Austin, M. Dawson, C. Roux, J. Robertson, Physical evidence in in: A.R. Rao (Ed.), Artificial Neural Networks in Hydrology, Springer,
drug intelligence Part 1: rationale based on hierarchic distribution of drugs Dordrecht, Netherlands, 2000, pp. 93–109.
using pyrolysis gas chromatography –mass spectrometry as an example, [116] H. de Leon-Delgado, R.J. Praga-Alejo, D.S. Gonzalez-Gonzalez, M. Cantú-
Aust. J. Forensic Sci. 39 (2) (2007) 93–106. Sifuentes, Multivariate statistical inference in a radial basis function neural
[82] J. Huttunen, P. Doble, M. Dawson, C. Roux, J. Robertson, Physical evidence in network, Expert Syst. Appl. 93 (2018) 313–321.
drug intelligence Part 2: discrimination of packaging tapes by colour, Aust. J. [117] D.F. Specht, Probabilistic neural networks and the polynomial Adaline as
Forensic Sci. 40 (1) (2008) 73–83. complementary techniques for classification, IEEE Trans. Neural Netw. 1 (1)
[83] J. Huttunen, M. Dawson, C. Roux, J. Robertson, Physical evidence in (1990) 111–121.
drug intelligence Part 3: supercritical fluid extraction–high performance [118] Y. Lee, Handwritten digit recognition using K nearest-neighbor, radial-basis
liquid chromatography of packaging tapes, Aust. J. Forensic Sci. 41 (1) (2009) function, and backpropagation neural networks, Neural Comput. 3 (3) (1991)
63–72. 440–449.
[84] P. Maynard, K. Gates, C. Roux, C. Lennard, Adhesive tape analysis: establishing [119] Z. Wang, W. Hamza, L. Song, k-Nearest Neighbor Augmented Neural
the evidential value of specific techniques, J. Forensic Sci. 46 (2) (2001) 280– Networks for Text Classification, (2017) arXiv preprint arXiv:1708.07863.
287. [120] D.E. Rumelhart, J.L. McClelland, Parallel Distributed Processing: Explorations
[85] D. Rhumorbarbe, L. Staehli, J. Broséus, Q. Rossy, P. Esseiva, Buying drugs on a in the Microstructure of Cognition, Vol. 1: Foundations, MIT Press, 1986 p.
Darknet market: a better deal? Studying the online illicit drug market 547.
through the analysis of digital, physical and chemical data, Forensic Sci. Int. [121] M.H. Zweig, G. Campbell, Receiver-operating characteristic (ROC) plots: a
267 (2016) 173–182. fundamental evaluation tool in clinical medicine, Clin. Chem. 39 (4) (1993)
[86] K.R. Beebe, R.J. Pell, M.B. Seasholtz, Chemometrics: A Practical Guide, Wiley- 561–577.
Interscience, 1998. [122] J.R. Beck, E.K. Shultz, The use of relative operating characteristic (ROC)
[87] D.L. Massart, B.G. Vandeginste, L. Buydens, P. Lewi, J. Smeyers-Verbeke, curves in test performance evaluation, Arch. Pathol. Lab. Med. 110 (1) (1986)
Handbook of Chemometrics and Qualimetrics: Part A, Elsevier Science Inc., 13–20.
1997. [123] H.F. Kaiser, The application of electronic computers to factor analysis, Educ.
[88] M. Perkal, Y.L. Ng, J.R. Pearson, Impurity profiling of methylamphetamine in Psychol. Meas. 20 (1) (2016) 141–151.
Australia and the development of a national drugs database, Forensic Sci. Int. [124] J.C. Nunnally, I.H. Bernstein, Psychometric Theory, McGraw-Hill, 1994.
69 (1) (1994) 77–87. [125] R.B. Cattell, The scree test for the number of factors, Multivariate Behav. Res. 1
[89] C.S.L. Jonson, L. Strömberg, Two-level classification of Leuckart amphet- (2) (1966) 245–276.
amine, Forensic Sci. Int. 69 (1994) 31–44. [126] A. Kassambara, Practical Guide to Cluster Analysis in R: Unsupervised
[90] G. Zadora, A. Martyna, D. Ramos, C. Aitken, Statistical Analysis in Forensic Machine Learning, STHDA, 2017.
Science: Evidential Values of Multivariate Physicochemical Data, John Wiley [127] L. Kaufman, P.J. Rousseeuw, Finding Groups in Data: an Introduction to
& Sons, 2013. Cluster Analysis, John Wiley & Sons, 2009.
[91] P.E. Black, Manhattan Distance, (2006) https://xlinux.nist.gov/dads/HTML/ [128] R. Tibshirani, G. Walther, T. Hastie, Estimating the number of clusters in a data
manhattanDistance.html (accessed 22/09/17). set via the gap statistic, J. R. Stat. Soc. 63 (2) (2001) 411–423.
[92] B.S. Everitt, City-Block Distance, in: B.S. Everitt (Ed.), Cambridge Dictionary of [129] A.E. Zambelli, A data-driven approach to estimating the number of clusters in
Statistics, Cambridge University Press, Cambridge, UK, 2002, pp. 73. hierarchical clustering, F1000Res 5 (2016) ISCB Comm J-2809.
[93] G.N. Lance, W.T. Williams, Computer programs for hierarchical polythetic [130] J.A. Westerhuis, H.C.J. Hoefsloot, S. Smit, D.J. Vis, A.K. Smilde, E.J.J. van Velzen,
classification (“Similarity analyses”), Comput. J. 9 (1) (1966) 60–64. J.P.M. van Duijnhoven, F.A. van Dorsten, Assessment of PLSDA cross
[94] G.N. Lance, W.T. Williams, Mixed-data classificatory programs, I.) agglom- validation, Metabolomics 4 (1) (2008) 81–89.
erative systems, Aust. Comput. J. 1 (1967) 15–20. [131] C.W. Dawson, R.L. Wilby, Hydrological modelling using artificial neural
[95] C.J. Krebs, Ecological Methodology, Benjamin/Cummings, 1999. networks, Prog. Phys. Geogr. 25 (1) (2016) 80–108.
[96] G.W. Milligan, Clustering Validation: Results and Implications for Applied [132] S. Wold, M. Sjöström, L. Eriksson, PLS-regression: a basic tool of chemo-
Analyses, Max M. Fisher College of Business, Ohio State University, 1996. metrics, Chemom. Intell. Lab. Syst. 58 (2) (2001) 109–130.
[97] K. Pearson III, Contributions to the mathematical theory of evolution, Philos. [133] M. Barker, W. Rayens, Partial least squares for discrimination, J. Chemom. 17
Trans. R. Soc. Lond. A 185 (1894) 71. (3) (2003) 166–173.
[98] M. Halkidi, Y. Batistakis, M. Vazirgiannis, On clustering validation techniques, [134] K. Sjoedin, Minimizing effects of closure on analytical data, Anal. Chem. 56 (9)
J. Intell. Inf. Syst. 17 (2-3) (2001) 107–145. (2002) 1685–1688.
[99] C.C. Aggarwal, C.K. Reddy, Data Clustering: Algorithms and Applications, CRC [135] A. Géron, Hands-on Machine Learning With Scikit-learn and TensorFlow:
Press, 2016. Concepts, Tools, and Techniques to Build Intelligent Systems, O’Reilly Media,
[100] R.J. Waddell, N. NicDaeid, D. Littlejohn, Classification of ecstasy tablets using Inc., 2017.
trace metal analysis with the application of chemometric procedures and [136] J.J. Zacca, T.S. Groberio, A.O. Maldaner, M.L. Vieira, J.W. Braga, Correlation of
artificial neural network algorithms, Analyst 129 (3) (2004) 235–240. cocaine hydrochloride samples seized in Brazil based on determination
[101] G. Zadora, A. Martyna, D. Ramos, C.G. Aitken, Statistical Analysis in Forensic of residual solvents: an innovative chemometric method for determination of
Science: Evidential Values of Multivariate Physicochemical Data, John Wiley linkage thresholds, Anal. Chem. 85 (4) (2013) 2457–2464.
& Sons, Incorporated, New York, UNITED KINGDOM, 2013. [137] N. Stojanovska, S. Fu, M. Tahtouh, T. Kelly, A. Beavis, K.P. Kirkbride, A review
[102] J. Podani, Hierarchical Clustering, Introduction to the Exploration of of impurity profiling and synthetic route of manufacture of methylamphet-
Multivariate Biological Data, Backhuys Publishers, 2000. amine, 3,4-methylenedioxymethylamphetamine, amphetamine, dimethy-
[103] M.J. Adams, Chemometrics in Analytical Spectroscopy, Royal Society of lamphetamine and p-methoxyamphetamine, Forensic Sci. Int. 224 (1-3)
Chemistry, 2004. (2013) 8–26.
[104] C. Ding, X. He, K-means clustering via principal component analysis, [138] C. Perez-Alfonso, N. Galipienso, S. Garrigues, M. de la Guardia, A green
Proceedings of the Twenty-first International Conference on Machine method for the determination of cocaine in illicit samples, Forensic Sci. Int.
Learning, ACM, 2004 29. 237 (2014) 70–77.
[105] T.S. Groberio, J.J. Zacca, E.D. Botelho, M. Talhavini, J.W.B. Braga, Discrimina- [139] S. Materazzi, A. Gregori, L. Ripani, A. Apriceno, R. Risoluti, Cocaine profiling:
tion and quantification of cocaine and adulterants in seized drug samples by implementation of a predictive model by ATR-FTIR coupled with chemo-
infrared spectroscopy and PLSR, Forensic Sci. Int. 257 (2015) 297–306. metrics in forensic chemistry, Talanta 166 (2017) 328–335.
[106] L. Stride Nielsen, C. Lindholst, P. Villesen, Cocaine classification using alkaloid [140] E.J. Magalhaes, C.C. Nascentes, L.S. Pereira, M.L. Guedes, R.A. Lordeiro, L.M.
and residual solvent profiling, Forensic Sci. Int. 269 (2016) 42–49. Auler, R. Augusti, M.E. de Queiroz, Evaluation of the composition of street
[107] M. Monfreda, F. Varani, F. Cattaruzza, S. Ciambrone, A. Proposito, Fast cocaine seized in two regions of Brazil, Sci. Justice J. Forensic Sci. Soc. 53 (4)
profiling of cocaine seizures by FTIR spectroscopy and GC-MS analysis of (2013) 425–432.
minor alkaloids and residual solvents, Sci. Justice J. Forensic Sci. Soc. 55 (6) [141] L. Stride Nielsen, P. Villesen, C. Lindholst, Variation in chemical profiles
(2015) 456–466. within large seizures of cocaine bricks, Forensic Sci. Int. 280 (2017) 194–
[108] K. Janhunen, M.D. Cole, Development of a predictive model for batch 199.
membership of street samples of heroin, Forensic Sci. Int. 102 (1) (1999) 1–11. [142] B. Pagano, I. Lauri, S. De Tito, G. Persico, M.G. Chini, A. Malmendal, E.
[109] A. Johnston, L.A. King, Heroin profiling: predicting the country of origin of Novellino, A. Randazzo, Use of NMR in profiling of cocaine seizures, Forensic
seized heroin, Forensic Sci. Int. 95 (1) (1998) 47–55. Sci. Int. 231 (1-3) (2013) 120–124.
[143] F. Besacier, H. Chaudron-Thozet, M. Rousseau-Tsangaris, J. Girard, A. Lamotte, [147] F. Been, L. Benaglia, S. Lucia, J.-P. Gervasoni, P. Esseiva, O. Delémont, Data
Comparative chemical analyses of drug samples: general approach and triangulation in the context of opioids monitoring via wastewater analyses,
application to heroin, Forensic Sci. Int. 85 (2) (1997) 113–125. Drug Alcohol Depend. 151 (2015) 203–210.
[144] H. Neumann, Comparison of heroin by capillary gas chromatography in [148] M.J. Reid, K.H. Langford, M. Grung, H. Gjerde, E.J. Amundsen, J. Morland, K.V.
Germany, Forensic Sci. Int. 69 (1) (1994) 7–16. Thomas, Estimation of cocaine consumption in the community: a critical
[145] S. Chandra, J. Joba, Transnational cocaine and heroin flow networks in comparison of the results from three complimentary techniques, BMJ Open 2
Western Europe: a comparison, Int. J. Drug Policy 26 (8) (2015) 772–780. (6) (2012)e001637.
[146] P. Margot, Traceology: the trace as the fundamental vector of police science/ [149] D.A. Bright, C.E. Hughes, J. Chalmers, Illuminating dark networks: a social
forensic science, Revue Internationale de Criminologie et de Police Technique network analysis of an Australian drug trafficking syndicate, Crime, Rev. Law
et Scientifique 67 (1) (2014) 72–97. Soc. Change 57 (2) (2012) 151–176.

Chemometric Methods in FS 1

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Chemometric Methods in FS 1

Uploaded by

Copyright:

Available Formats

Forensic Science International 302 (2019) 109911

Contents lists available at ScienceDirect

Forensic Science International

Review of the most common chemometric techniques in

4.1.1. Discrimination power . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

1. Introduction performed on illicit drugs to produce chemical and physical

each enantiomer. This can be achieved using capillary electropho- Table 2

3.2.2. Pearson correlation coefﬁcient

3.2.1. Square cosine function 3.2.5. Canberra distance

3.3.2. Hierarchical clustering analysis 3.3.3. K-means clustering

Fig. 2. Different types of clustering algorithms.

PLS-DA can be applied in many cases when classical DA cannot

3.4.2. Linear discriminant analysis

Fig. 5. Sections of a confusion matrix; (a) two-class, (b) three-class.

4.1.2. Confusion matrices

4.1.3. Receiver operating characteristic curves

within-cluster sum of squares (WCSS) as a function of the number

4.2.2.2. Average silhouette method. The average silhouette method

CM PT ATS (AM & MA) ATS (MDMA) Cocaine Heroin

Pattern recognition technique ATS Cocaine Heroin

UPR PCA [10,11,41,100] [52,53,107,139,141,142] [60,61,66,71–76,143]

You might also like