Professional Documents
Culture Documents
An Effective and Rapid Approach To Predict Molecular Composition of Naphtha Based On Raw NIR Spectra
An Effective and Rapid Approach To Predict Molecular Composition of Naphtha Based On Raw NIR Spectra
Vibrational Spectroscopy
journal homepage: www.elsevier.com/locate/vibspec
A R T I C LE I N FO A B S T R A C T
Keywords: Molecular management has become an important trend in petroleum refining, which relies on the information of
Near infrared (NIR) petroleum composition. In this contribution, a simple and effective analytical approach is proposed for the rapid
Tchebichef curve moment (TCM) prediction of the more detailed molecular composition of naphtha samples based on raw near infrared (NIR)
Molecular composition spectroscopy for the first time. The 101 samples of reformed naphtha were collected and determined, and
Naphtha
Tchebichef curve moments (TCMs) were calculated directly from the raw NIR spectra and employed to establish
PIONA
Pretreatment
linear models for the quantitative analysis of 26 hydrocarbons (PIONA) with different carbon numbers and
components. For the obtained models, the average of RMSE of prediction is 0.10. According to the ratio of
performance to deviation (SD/RMSEp), the 23 obtained TCM models achieved “excellent” predictive quality. By
means of the conventional PLS method with spectral pretreatment, there were only 15 models with “excellent”
predictive quality, which indicated that TCM method without any spectral preprocessing could provide more
simple, accurate and reliable analytical results, and meet the requirements of fast assessment. This work suggests
the feasibility of the proposed method for the rapid and non-destructive analysis of molecular composition in
naphtha, which is significant in the determination of refinery operating conditions.
⁎
Corresponding author.
E-mail address: zhaihl@163.com (H.L. Zhai).
https://doi.org/10.1016/j.vibspec.2020.103071
Received 5 March 2020; Received in revised form 30 March 2020; Accepted 13 April 2020
Available online 25 April 2020
0924-2031/ © 2020 Elsevier B.V. All rights reserved.
L. Zhu, et al. Vibrational Spectroscopy 109 (2020) 103071
PIONA is employed in the study of the composition of light oils, comparisons, classical PLS methods with spectral pretreatment were
which generate distillates with lower boiling points [11]. As detailed also carried out.
hydrocarbon analysis of reformed naphtha samples, PIONA has great
importance in the petrochemical industry. It allows a correct char- 2. Experimental
acterization of samples to predict their behavior in further processing
and applications [8]. Now there is well-developed ASTM method 2.1. Samples and their NIR spectra
(ASTM D-5134-r03) for the parameter. The obtained gas chromato-
graphic data was processed by the PONA software. If the chromato- A total number of 101 naphtha samples were collected over a period
graphic peak has shift caused by other chemical bonds, make appro- of one year from the refining unit.
priate correction by comparing with standard chromatogram. NIR spectra were recorded on a Fourier-transform near infrared
Quantitative analysis can be carried out when the chromatographic spectrometer (SupNIR-4510, Focused Photonics Inc, China) from 1600
peak matching degree with the standard spectrum reaches more than to 1000 nm (6400 to 4000 cm−1). For each sample, the measurements
90 %. Then the percentages content of PIONA were obtained. But the were made every 30 s, then choosing 10 spectra with a deviation of less
procedures need a long analysis time (it may cost 2 h for one sample) than 5 parts per 10,000. The 10 scans were made with a spectral re-
[12] and the rich experience of technicians, require expensive and solution of 6 cm−1 and then averaged to minimize interferences.
maintenance–intensive equipment, and are not well suitable for on-line Samples were temperature equilibrated at 35 °C in the constant tem-
determination [13]. perature drying oven before scanning.
The combination of the near infrared spectroscopy (NIR) and mul- The samples were divided into two groups randomly, training set
tivariate calibration methods has many advantages in petroleum ana- (80 % of the entire data set) and test set (20 % of the entire data set).
lysis [14], such as simple, high speed, cheapness, efficiency, eco- The quantitative models were established by chemometrics methods
friendly, non-destruction and so on [15]. In recent years, the NIR based based on the training set, and their predictive ability were evaluated by
on chemometric algorithms has been widely applied to the in-line or the test set as external independent samples that were excluded from
on-line monitoring and controlling of refinery processes, such as crude the modeling procedure.
distillation, catalytic cracking and gasoline blending [16]. Many im-
portant parameters of petroleum and petroleum products have been 2.2. The percentages content of PIONA
predicted accurately and precisely [17,18]. Balabin et al. [19] built an
effective calibration model to predict four properties (fractional com- The PIONA analysis including the estimation of hydrocarbons was
position, iodine value, and cold filter plugging point) of biodiesel fuel, carried out on the GC-FID (ASTM D5134-98 r03). The GC system con-
based on near-infrared (NIR) spectroscopy data and an artificial neural sisted of a 6890 GC system (Agilent Technologies, USA). GC analysis
network (ANN). By comparison, the ANN approach was superior to the was performed on a 19091S-001 HP-PONA capillary chromatographic
other calibration methods, such as PLS. Li et al. [13] based on NIR column (50 m × 20 μm × 0.5 μm; Agilent Technologies, USA). The in-
spectroscopy and Monte Carlo virtual spectrum to determine the hy- jection volume was 1 μL. The programmed temperature was adopted.
drocarbon groups of gasoline samples. The results meet the require- The initial temperature was 35 °C–200 °C in 1 °C/min.
ments of fast analysis. But PLS, ANN, Monte Carlo and other methods Gas chromatographic data was processed by the software PONA
need spectra preprocessing, which may loss some useful information. At (E.05.02). The data are analyzed qualitatively and quantitatively by
the same time, the hydrocarbon information is not detailed enough to comparing the correction table. If the chromatographic peak has shift
guide refinery operating conditions in petroleum processing. caused by other chemical bonds, make appropriate correction. The re-
Partial least square (PLS) is the most popular method to build the producibility of the method was evaluated by analyzing standard
quantitative model. However, due to the weak absorption and the sample of naphtha. The method showed satisfactory results for quan-
broad peaks of NIR spectra, the quantitative or qualitative results ob- titative purposes with the relative standard deviations (RSDs) ranging
tained by PLS method generally yields low sensitivity and low se- from 0.02 % to 0.03 %.
lectivity. At the same time, especially in complex samples, the inter- The 26 PIONA percentages contents in 101 naphtha samples were
ferences of irrelevant information, overlapping and shifts of peaks listed in Table 1.
appear mostly in NIR spectra, which seriously affected the accuracy of
the PLS models. Therefore, in order to cover these shortages before 2.3. Modeling and evaluation
modeling several, spectral pretreatment techniques were developed,
such as the orthogonal signal correction [20], standard normal variate 2.3.1. Tchebichef curve moments
[21], multiplicative scattering correction [22], continuous wavelet Tchebichef curve moment (TCM) had been defined in the reference
transform [23] and iPLS [24], which have been removed the noise and [26]. Here, a brief description is provided as follows:
background in the raw spectra so as to enhance the reliability of NIR
N −1
spectral analysis. Nevertheless, these pretreatment methods are still the 1
risk that might result in loss of useful information. Thus, it is necessary
Tn =
ρ˜ (n, N )
∑ t͠n (x ) f (x )
x=0
to develop some simple and effective chemometric methods to realize n = 0, 1, 2, … N − 1 (1)
on-line monitoring of the NIR spectroscopy.
As one of the discrete orthogonal moments, the Tchebichef image When the values of n are increased to a certain point (nN), the
moment (TIM, also called Chebyshev image moment) has an excellent significant information in the curve can be extracted adequately by the
description ability with multiresolution and invariance properties and set of calculated TCMs according to the reconstruction error.
has been widely applied in image processing. Although TIM has been The calculated TCMs were regard as independent variables and the
employed successfully in analytical chemistry to determine multi- concentrations of target compounds were denoted by response vari-
compounds in complex samples based on the several chemical 3D ables, stepwise regression was employed to establish linear quantitative
spectra [25], for the analysis of chemical curve spectra, image moments models.
are not convenient. Then, the Tchebichef curve moment (TCM) method
was extended [26]. 2.3.2. PLS algorithm
In this work, TCM was applied to the simultaneous quantitative With regression by partial least squares, the arrays X and y are de-
analysis of the 26 components (PIONA) with different carbon numbers composed into smaller matrices that contain relevant information about
and components in naphtha based on the raw NIR spectra. As the the samples (called scores) and the original variables (loadings).
2
L. Zhu, et al. Vibrational Spectroscopy 109 (2020) 103071
Table 1
Mean, range, and standard deviation (SD) of percentages concentrations of the
PIONA.
No. Chemical Mean Range Standard deviation
3
L. Zhu, et al. Vibrational Spectroscopy 109 (2020) 103071
Table 2
The parameters of TCM and PLS (with spectral pretreatment) models.
Chemical TCM PLS with spectral pretreatment
R2p RMSEp RPD Predictive Quality Method R2p RMSEp RPD Predictive Quality
Note: (1) Values listed are related only to the optimal results for each pretreatment method.
(2) Abbreviation: Normalization (NOR), Standardization (SD), Detrending (DT), Meancenter (MC), Savitzky-Golay smoothing (SG), Moving window smoothing
(MW), Multiplicative scatter correcction (MSC), Standard normal variate transformation (SNV), First derivative (1D), Second derivative (2D).
The spectral preprocessing methods were also used to TCM mod- suitable preprocessing methods can effectively improve the perfor-
eling. The results of preprocessing don't make too much difference ex- mance of PLS models, because there are various interference factors on
cept for i8 and i9 (Details are listed in supplementary materials). For raw NIR spectra.
TCM models, the RPD of i8 and i9 are less than 2.0 (RPDi8 = 1.83,
RPDi9 = 0.69), so the results present a little worse prediction ability
with respect to the other studied targets. Compared with other hydro- 3.4. Comparison of TCM method and traditional PLS method
carbon compounds, the NIR spectra of isoparaffins has more inter-
ference such as overlapping peaks, unknown interfering substances, From Table 2, the most R2p values of the obtained TCM models are
noise and drifts. So for excellent model (RPD>2.5) of TCM, there is no higher than or near to the PLS models. All the values of RMSEp calcu-
need to do the spectral preprocessing. When RPD<2.5, proper pre- lated by TCM are lower than PLS. At the same time, all the values of
treatment methods can be done to improve the performance and pre- RPD calculated by TCM models are higher than that of PLS models. For
dictive quality of the established models. TCM, 23 models of the predictive quality meet “excellent”. But for PLS
modeling with spectral pretreatment, only 15 models of the predictive
quality meet “excellent”. In supplementary data, the value of pre-
3.3. Performance of PLS models
processing methods doesn't make too much difference for TCM models,
but spectral pretreatment before PLS modeling can greatly improve the
PLS regression was carried out based on the same dataset, and the
accuracy of the models. Therefore, all the results mean that the per-
26 PLS models for the target components were established, respec-
formance of the TCM models are superior to that of PLS models with
tively. PLS models without preprocessing data are listed in supple-
spectral pretreatment.
mentary data, most of the PLS models without spectral pretreatment are
The main reasons are that the different unknown interferences in
not ideal. Only four models of the predictive quality meet “excellent”.
real samples severely affect the distributions in data, which leads to the
Four models are “very poor”, and seven models are “poor”.
poorer performance of the PLS models established based on standard
The optimal results of the obtained PLS models by pretreatment
samples. However, the TCM with different orders can separate the
methods are listed in Tables 2. Models with good performance should
target and interference components. The important features of target
have higher R2p and lower RMSEP values, then choose respectively the
components can be extracted effectively to establish models, mean-
more optimal model. In Table 2, if the subtractive value before and
while, the interference components can be eliminated.
after processing is greater than 0.1, the parameters are bold.
The TCM can be used entire raw NIR spectra without spectral pre-
For most of PLS models, spectral preprocessing is commonly used to
treatment to predict molecular composition of naphtha. It can be con-
correct measurements and noisy spectra. We use multiple processing
sidered an excellent model and an effective, rapid approach to build the
methods to enhance the more chemically relevant peaks in the spectra.
model to predict molecular composition. Therefore, the proposed TCM
There has been a lot of literature which reported that each processing
method is expected to be applied for the quantitative analysis of mul-
method has its own advantages and disadvantages, so selection of the
tiple target components based on conventional NIR spectra.
most optimal model process by relevant parameters is needed. After
preprocessing, the results have marked improvement that 15 models of
the predictive quality meet “excellent”. It can be concluded that
4
L. Zhu, et al. Vibrational Spectroscopy 109 (2020) 103071