Professional Documents
Culture Documents
(Download PDF) Chemometrics Statistics and Computer Application in Analytical Chemistry Matthias Otto Online Ebook All Chapter PDF
(Download PDF) Chemometrics Statistics and Computer Application in Analytical Chemistry Matthias Otto Online Ebook All Chapter PDF
https://textbookfull.com/product/handbook-on-miniaturization-in-
analytical-chemistry-application-of-nanotechnology-1st-edition-
chaudhery-mustansar-hussain/
https://textbookfull.com/product/challenges-in-green-analytical-
chemistry-2nd-edition-salvador-garrigues/
https://textbookfull.com/product/analytical-methods-in-
statistics-amistat-prague-november-2015-1st-edition-jaromir-
antoch/
https://textbookfull.com/product/chemometrics-in-
chromatography-1st-edition-lukasz-komsta/
Analytical Chemistry for Technicians Fourth Edition
Kenkel
https://textbookfull.com/product/analytical-chemistry-for-
technicians-fourth-edition-kenkel/
https://textbookfull.com/product/luminescence-in-
electrochemistry-applications-in-analytical-chemistry-physics-
and-biology-1st-edition-fabien-miomandre/
https://textbookfull.com/product/computational-network-analysis-
with-r-applications-in-biology-medicine-and-chemistry-1st-
edition-matthias-dehmer/
https://textbookfull.com/product/ionic-equilibria-in-analytical-
chemistry-1st-edition-jean-louis-burgot-auth/
https://textbookfull.com/product/contemporary-topics-in-
analytical-and-clinical-chemistry-volume-3-1st-edition-david-
hercules/
Matthias Otto
Chemometrics
Matthias Otto
Chemometrics
Third Edition
Author All books published by Wiley-VCH are
carefully produced. Nevertheless, authors,
Matthias Otto editors, and publisher do not warrant the
TU Bergakademie Freiberg information contained in these books,
Inst. für Analytische Chemie including this book, to be free of errors.
Leipziger Str. 29 Readers are advised to keep in mind that
09599 Freiberg statements, data, illustrations, procedural
Germany details or other items may inadvertently be
inaccurate.
Contents
1 What is Chemometrics? 1
1.1 The Computer-Based Laboratory 2
1.2 Statistics and Data Interpretation 10
1.3 Computer-Based Information Systems/Artificial
Intelligence 11
General Reading 12
Questions and Problems 13
2 Basic Statistics 15
2.1 Descriptive Statistics 16
2.2 Statistical Tests 28
2.3 Analysis of Variance 44
General Reading 50
Questions and Problems 52
6 Modeling 213
6.1 Univariate Linear Regression 214
6.2 Multiple Linear Regression 231
6.3 Nonlinear Methods 258
General Reading 269
Questions and Problems 271
Appendix 355
Index 371
VII
List of Abbreviations
Symbols
I Identity matrix
I 50 Interquartile range
J Jacobian matrix
k Kurtosis
k Capacity factor
KA Protolysis (acid) constant
𝜆 Eigenvalue, Poisson parameter
L Loading matrix
𝜇 Population mean
m Membership function
mr Moment of distribution
nf Neighborhood function
N Analytical Resolution, plate number
N(𝜈) Noise
p portion
P Probability
Q Quartile, Dixon statistics
r Correlation coefficient, radius
R Correlation matrix
RS Chromatographic resolution
R2 Coefficient of determination
𝜎 Standard deviation
s Estimation of standard deviation, skewness
sr Estimation of relative standard deviation
S Similarity measure
𝜏 Time lag
t Quantile of Student distribution
T Test quantity of Grubbs’ test
T Matrix of principal components scores, transformation
matrix
U Matrix of left eigenvectors
R Range, region in CART
V Matrix of right eigenvectors
w Singular value, weight
W Warping path
x (Independent) variable
x Vector of independent variables
X Matrix of dependent variables
x Arithmetic mean
𝜉 Slack variable
y (Dependent) variable
y* Transformed (filtered) value
z Standard normal deviate, signal position, objective
function
1
1
What is Chemometrics?
Learning objectives
• To define chemometrics
• To learn how to count with bits and how to perform arith-
metic or logical operations in a computer
• To understand the principal terminology for computer sys-
tems and the meaning of robotics and automation.
Chemometrics: Statistics and Computer Application in Analytical Chemistry, Third Edition. Matthias Otto.
© 2017 Wiley-VCH Verlag GmbH & Co. KGaA. Published 2017 by Wiley-VCH Verlag GmbH & Co. KGaA.
2 1 What is Chemometrics?
1.1
The Computer-Based Laboratory
are considered here, that is, coding and processing of digital infor-
mation, the main components of the computer, programming
languages, computer networking, and automation processes.
The use of digital data provides several advantages over the use of
analog data. Digital data are less noise sensitive. The only noise
arises from round-off errors due to finite representation of the
digits of a number. They are less prone to, for instance, electrical
interferences, and they are compatible with digital computers.
As a rule, primary data are generated as analog signals either
in a discrete or a continuous mode (Figure 1.1). For example,
monitoring the intensity of optical radiation by means of a
photocell provides a continuous signal. Weak radiation, however,
could be monitored by detecting individual photons by a
photomultiplier.
Usually, the analog signals generated are converted into digi-
tal data by an analog-to-digital converter (ADC) as explained as
follows.
Signal
(b) Time
4 1 What is Chemometrics?
1 0 0 1 1 0 1
0 0
1 1
1 × 26 0 × 25 0 × 24 1 × 23 1 × 22 0 × 21 1×20 =
10 2 64 +0 +0 +8 +4 +0 +1 = 77
11 3
100 4
101 5 Table 1.1 summarizes further relationships between binary and
110 6 decimal numbers. Every binary number is composed of individual
111 7 bits (binary digits). The digit lying farthest to the right is termed
1000 8
the least significant digit and the one on the left is the most signif-
1001 9
1010 10
icant digit.
1101 13 How are calculations done using binary numbers? Arithmetic
10000 16 operations are similar but simpler than those for decimal num-
100000 32 bers. In addition, for example, four combinations are feasible:
1000000 64
0 0 1 1
+0 +1 +0 +1
0 1 1 10
a. 21 b. 10101
+5 101
26 11010
1 1 1 1 1 0
1 0 0 1 0 —
0 1 0 1 1 1
0 0 0 0 1 —
3
2
1
0
Time
6 1 What is Chemometrics?
Computer Terminology
Components of Computers
Memory
The microcomputer or microprocessor contains typically two
kinds of memory: random access memory (RAM) and read-
only memory (ROM). The term RAM is somewhat misleading
and historically reasoned, since random access is feasible for
RAM and ROM alike. The RAM can be used to read and write
1.1 The Computer-Based Laboratory 7
Input/Output Systems
Communication with the computer is carried out by
input–output (I/O) operations. Typical input devices are
keyboard, magnetic tapes and disks, and the signals of an analyti-
cal instrument. Output devices are screens, printers, and plotters,
as well as tapes and disks. To convert analog information into
digital or vice versa, the aforementioned ADCs or DACs are used.
Programs
Networking
Other Distributions
Apart from the distributions used for hypothesis testing, that is,
F-, t-, and 𝜒 2 -distributions, to be considered in Section 2.2, there
are further models for the distribution of random numbers. Those
are the lognormal, uniform, binomial, and Poisson distributions.
The Poisson distribution is quite important for the analyst to
characterize countable but rare events. Such events are typical for
procedures based on counting rates, for example, if a photomulti-
plier is applied in optical spectrometry or a proportional counter
in X-ray analysis.
The Poisson distribution is based on the probability density
A variate specifies a function for discrete values of a variate. This is termed a proba-
random variable. bility function. For each value of this function, f (x), a probability
for the realization of the event, x, can be defined. It is calculated
according to a Poisson distribution by
𝜆x e−𝜆
f (x) = (2.10)
x!
The parameter 𝜆 represents the mean and the variance of the
distribution similarly, that is, 𝜆 = 𝜇 = 𝜎 2 .
Figure 2.3 demonstrates the Poisson distribution for 𝜆 values
of 1 and 5, that is, the distributions for the cases that on aver-
age 1 event or 5 events are observed, respectively. The breadth
of curves, also determined by 𝜆, is only dependent on the total
number of events or the counting rate. The higher the counting
rate, the larger will be the variance or standard deviation of the
counting process. For x = n events, the variance 𝜎 2 = n, or for the
standard deviation, the Poisson distribution becomes
√
𝜎= n (2.11)
0.4 0.18
f(x) f(x) 0.15
0.3 λ=1 λ=5
0.12
0.2 0.09
0.06
0.1
0.03
0 0
0 1 2 3 4 5 6 0 3 6 9 12 15
Variable, x Variable, x
Vaaka-puulle laitto
Afroditen lapsen;
Palmikosta hapsen,
Vasta-painoks', taitto;
Haapsi maahan sousi,
Amor ylös nousi.
»Lauloin v. 1815.
Painatti Sjögren
Mnemos v, 1820.»
(Kasku on katolilainen
Muinon Munkkien tekemä).
[Ennen painamaton.]