Professional Documents
Culture Documents
ABSTRACT Increasingly energy and environmental crises put forward higher request on diesel engine.
It promotes the development of diesel engine, while the complexity of structure is much higher, which leads
to higher probability of faults. In order to recognize the states of engine in harsh environments effectively,
variational mode decomposition (VMD) and expectation maximization (EM) are introduced into this paper
to analyze multi-channel vibration signals. To select the decomposition level of VMD adaptively, a novel
power spectrum segmentation based on scale-space representation is proposed for the optimization of VMD
and results show this approach can discriminate different frequency components in high noise circumstance
accurately and efficiently. To improve the adaptability and accuracy of EM, a feature selection approach
based on genetic algorithm (GA) is introduced to preprocess original data and a cross validation method
is used for selecting cluster number adaptively. Combined with these approaches, a diesel engine state
recognition scheme based on multi-channel vibration signals using optimized VMD and EM is proposed.
Compared with existing method, this scheme shows great advantages in accuracy and efficiency, and could
be applied in actual engineering.
INDEX TERMS Diesel engine, vibration, pattern recognition, variational mode decomposition (VMD),
expectation maximization (EM).
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/
VOLUME 8, 2020 33545
X. Bi et al.: Engine Working State Recognition Based on Optimized VMD and EM Algorithm
and vibration, is being a promising method, in which the combinations may not identify the working state effectively in
vibration signal is a research focus because of easy measure- certain case [17], [18]. For these reasons, multi-sensor infor-
ment, low cost and strong robustness. For example, Jing et al mation fusion technology is gradually attached the weight of
analyzed vibration signals from cylinder head and detected researchers. Multi-sensors information resources can be fully
valve clearance fault by it [5], Ftoutou et al used vibration and effectively utilized through data fusion method to gain
signals to make research on fuel injection faults [6]. the greatest resources of system under test in different view
Owing to complex structure, so many vibration sources angles [19]. However, the increase of features has a negative
exist in engine that there is much noise in vibration sig- impact, such as information redundancy and complex data
nals. A number of signal processing algorithms have been processing on classification, and even reduce the recognition
researched to overcome this problem such as wavelet trans- rate [20]. Accordingly, it is necessary to reduce the feature
form and empirical mode decomposition (EMD). Moosa- dimension by removing redundant and irrelevant features in
vian et al analyzed the performances of different mother order to improve classification efficiency and accuracy. Fea-
wavelets and used wavelet transform to denoise vibration ture selection is a challenging task, especially for hundreds
signals [7], Ma et al mixed wavelet transform and EMD or thousands of features sub sets, in which genetic algo-
to analyze vibration signals in order to diagnose abnormal rithm (GA) is a classic method to improve the searching effi-
combustion in engine [8], Li et al detected abnormal clear- ciency. Stefano et al proposed an improved filter-based GA
ance between contacting components using EMD [9]. How- to search feature subsets with high discriminative power [21].
ever, the wavelet transform used for signal decomposition is Mohammadi et al presented a methodology which can search
generally binary discrete wavelet transform (DWT), which and select a set of closure relationships for given experimen-
just decomposes signals in Fourier spectrum mechanically. tal based on GA to minimize the error between measured
Moreover, wavelet basis has a great influence on decom- and predicted pressure gradient [22]. Gangavarapu et al,
position while there is no definite conclusion on how to used GA to optimize the subspace ensembling process of
choose it. As an adaptable method, the EMD is built based high-dimensional biomedical datasets feature subspaces [23].
on recursive model, which results in low robustness and mode The performance of GA highly depends on the selection of
mixing [10], [11]. In 2013, Gilles proposed empirical wavelet fitness function. However, various subsets of dataset may give
transform (EWT), which can build adaptable wavelet filter best results with a different function [24].
to extract components [12]. For the problem that meaningful Feature selection is a preprocessing technique for classi-
modes should be pre-detected to provide necessary param- fication algorithm [25], [26]. X-means clustering algorithm
eters for the wavelet filter, Gilles et al proposed a param- is an extending K-means with efficient estimation of the
eterless scale-space approach to overcome it [13], which number of clusters. It goes into action after each run of
improved the EWT and promoted its application in faults K-means, making local decision about which subset of cur-
detection such as Wang et al used EWT analyzed the fault rent centroids should split themselves in order to better fit
features of industrial bearing through vibration signals [14]. the data. The splitting decision is done by the Bayesian
However, the problem of wavelet basis is still unresolved. information criterion (BIC) [27], and the numbers of clusters
Besides, the process of building wavelet filter is complex are computed dynamically within preset lower and upper
and the performance of filter is no guarantee of the best. bound [28]. X-means algorithm has a good effect on process-
In 2014, Dragomiretskiy et al proposed variational mode ing low-dimensional dataset, while its performance on high-
decomposition (VMD) based on variational model, which is dimensional dataset is unstable. Expectation maximization
an essentially adaptable Wiener filter bank and it is equating (EM) algorithm based on Gaussian mixture model has been
to the best filter under certain conditions [15]. The VMD being widely used due to its stability and simplicity [29].
has been used in faults detection for rotating machinery and Zhao et al proposed an improved EM algorithm for fault
showed great potential [16]. However, decomposition level detection of air conditioning system [30]. Chen et al put
in VMD should be determined manually, which increases the forward an adaptive Gaussian mixture model to deal with
complexity and uncertainty. Fortunately, the parameterless the dynamic working process of rotating turbine engine disk,
scale-space approach also has ability to provide parameters which improved the adaptability of fault detection model to
for VMD. So the method is optimized to combine with VMD turbine engine disk [31]. Tapana et al evaluated the perfor-
in order to build a better filter. mance of various clustering techniques and results showed
Although a lot of successes have been achieved in that the EM clustering algorithm is the most effective and
the diesel engine condition recognition and fault diagno- robust method for unlabeled data classification [32]. How-
sis based on time-frequency analysis, feature extraction ever, the conventional EM algorithm cannot choose the num-
and pattern recognition of single channel vibration signal, ber of clusters adaptively and need to specify the number of
there are still some problems need solving. Firstly, in some clusters as an input parameter.
harsh environments, unpredictable signal interference may The paper is organized as follows: section 1 introduces
lead to the misinterpretation of recognition rate. Secondly, the research background and significance, section 2 gives
every feature parameter has different sensitivity to different algorithms details, the experiment is described in section
working states, which results in some features or feature 3, the optimized VMD is proposed in section 4, analysis
and contrast of experiment result by a novel approach based Based on Parseval / Plancherel Fourier isometry, it can be
on EM are shown in section 5 and conclusion is given in solved as:
section 6. _ _
P _ λ (ω)
_ n+1 f (ω) − i6 =k u i (ω) + 2
u k (ω) = (5)
II. ALGORITHM THEORIES 1 + 2α (ω − ωk )2
A. VMD THEORIES R∞ _ 2
The goal of VMD is to decompose input signal f into several ω u i (ω) dω
0
modes, uk , i.e. intrinsic mode functions (IMFs), which are ωkn+1 (ω) = (6)
R∞ _ 2
supposed to have specific sparsity properties and limited u i (ω) dω
bandwidths. The IMFs are also assumed to be mostly compact 0
around a center frequency, ωk [15].
!
_n+1 _n _ X _n+1
Firstly, the uk is processed by Hilbert transform to get λ (ω) = λ (ω) + τ f (ω) − u k (ω) (7)
unilateral frequency spectrum: k
Z where n is the number of iterations, τ is time-step of the dual
1 uk (v) j
Huk = p.v. dv = δ(t) + ∗ uk (t) (1) ascent and it is set as 0, i.e. taking no account of λ in this
π t −v πt paper.
R
Based on that, IMFs can be obtained by alternate direction
where p.v is Cauchy principal value, δ is Dirac distribution, method of multipliers (ADMM). The process of VMD is:
∗ represents convolution and j2 = −1. k ∈ [1, 2, . . . , K ], n 1 o n 1 o _1
_ _
where K is the decomposition level in VMD, i.e. the number ¬ Initialize u k , ωk , λ and n.
of IMFs. Compute uk , ωk and λ based on equation(5), (6) and (7)
Next, an estimated center frequency, ejωk t , is mixed to shift through ADMM.
the unilateral frequency spectrum into baseband: ® During the iteration in step , results can be obtained
P _n+1 _n _n
when stopping condition k u k − u k / u k < ε is
j
B = δ(t) + ∗ uk (t) e−jωk t (2) met, where ε = 10− 7 in general.
πt
Bandwidth can be computed by squared L 2 -norm of the B. GENETIC ALGORITHM BASED FEATURES SELECTION
gradient. Meanwhile, all of the IMFs can restructure the input GA is a method to solve optimization inspired by biolog-
signal f . Based on that, a constrained variational model can ical processes of mutation, natural selection and genetic
be obtained: crossover, which is also a powerful feature selection tool,
( 2
) especially for feature set with large dimensions
X j
min ∂t δ(t) + ∗ uk (t) e−jωk t
, The basic genetic operators of GA are selection, crossover,
{uk },{ωk } πt 2 and mutation [33].
k
X
s.t. uk = f (3)
1) SELECTION
k
Selection is based on individual fitness and influences its abil-
where {uk } := {u1 , u2 , · · · , uK } and {ωk } := ity to reproduce into the next generation [34]. The probability
{ω1 , ω2 , · · · , ωK } are all IMFs and their center frequencies, of selecting individual hi is determined by (8):
P PK
respectively. := is the summation of all IMFs. F(hi )
k k=1 p(hi ) = Pp (8)
In order to solve the constrained variational model, i=1 F(hi )
Lagrangian multipliers, λ, and quadratic penalty α, are intro- where F(hi ) is the fitness value of hi . The probability that an
duced to shift the model into unconstrained. λ and α are individual will be selected is proportional to its own fitness
two classic ways to solve variational problem. The α is used and is inversely proportional to the fitness of the other com-
to enforce constraints strictly and it can be ignored at low peting hypothesis in the current population.
requirements for constraint. The α is used to balance the data-
fidelity constraint especially in high Gaussian noise condition 2) CROSSOVER
and it is inversely proportional to the noise level in data based Select two chromosomes than have high fitness values from
on Bayesian prior. The unconstrained variational model is: the current population, exchange some bits of them and copy
them into the new population. The location of these bits are
L ({uk } , {ωk } , λ) :
2 random.
X j
=α ∂t δ (t) + ∗ uk (t) e−jωk t
πt 2 3) MUTATION
k
X
2 *
X
+ Select one chromosome that has a high fitness value from the
+ f (t)− uk (t) + λ (t) , f (t) − uk (t) (4) current population, alter some bits of it and copy it into the
k 2 k new population [35].
TABLE 2. Main parameters of testing equipment. their length and obtain Lsort = {L1 , L2 , · · · , LM }, where M
is the number of local minima. Next, suppose 1 = LML−L M
1
C. OPTIMIZED VMD BASED ON POWER SPECTRUM where f n is the adjoint of fn , n is the length of input signal s.
SEGMENTATION Finally, divide the input signal in scale-space by the power
As shown in last section, the Fourier spectrum segmentation spectrum.
can provide accurate and stable parameters for building VMD As shown in the description, power spectrum can remain
in order to reduce subjectivity and complexity. However, with the similar waveform as well as make characteristics stand
in-depth research, author team finds this approach is strug- out. In order to verify this approach, the new simulated signal
gling in dividing approximating frequencies. The f2 simulated with f2 = 4000 is divided by it and shown in Fig. 10.
FIGURE 19. X-means with feature selection, number of clusters is 2. FIGURE 22. X-means without feature selection, number of clusters is 4.
Classification accuracy: 1000bar: 87.5% 1100bar: 82.5% 1200bar:
45% 1340bar: 80% Average accuracy: 73.75%.
[12] J. Gilles, ‘‘Empirical wavelet transform,’’ IEEE Trans. Signal Process., [34] V. Kozeny, ‘‘Genetic algorithms for credit scoring: Alternative fitness
vol. 60, no. 16, pp. 3999–4010, May 2013. function performance comparison,’’ Expert Syst. Appl., vol. 42, no. 6,
[13] J. Gilles and K. Heal, ‘‘A parameterless scale-space approach to find pp. 2998–3004, Apr. 2015.
meaningful modes in histograms—Application to image and spectrum [35] Q. Chen, K. Worden, P. Peng, and A. Y. T. Leung, ‘‘Genetic algorithm with
segmentation,’’ Int. J. Wavelets, Multiresolution Inf. Process., vol. 12, an improved fitness function for (N)ARX modelling,’’ Mech. Syst. Signal
no. 06, Jan. 2015, Art. no. 1450044. Process., vol. 21, no. 2, pp. 994–1007, Feb. 2007.
[14] D. Wang, K.-L. Tsui, and Y. Qin, ‘‘Optimization of segmentation frag- [36] M. Jordan, J. Kleinberg and B. Scholkopf, Pattern Recognition and
ments in empirical wavelet transform and its applications to extracting Machine Learning. New York, NY, USA: Springer, 2006, pp. 423–460.
industrial bearing fault features,’’ Measurement, vol. 133, pp. 328–340, [37] M. A. Hall, ‘‘Correlation-based feature selection for machine learn-
Feb. 2019. ing,’’ Ph.D. dissertation, Dept.Comput. Sci., Waikato Univ., Hamilton,
[15] K. Dragomiretskiy and D. Zosso, ‘‘Variational mode decomposi- New Zealand, 1999.
tion,’’ IEEE Trans. Signal Process., vol. 62, no. 3, pp. 531–544, [38] D. Ruppert, ‘‘The elements of statistical learning: Data mining, inference,
Feb. 2014. and prediction,’’ J. Amer. Stat. Assoc., vol. 99, no. 466, pp. 567–567,
Jun. 2004.
[16] M. F. Isham, M. S. Leong, M. H. Lim, and Z. A. Ahmad, ‘‘Variational
[39] W. Fu and P. O. Perry, ‘‘Estimating the number of clusters using
mode decomposition: Mode determination method for rotating machinery
cross-validation,’’ J. Comput. Graph. Statist., to be published,
diagnosis,’’ J. Vibroeng., vol. 20, no. 7, pp. 2604–2621, Nov. 2018.
doi: 10.1080/10618600.2019.1647846.
[17] B. L. Boada, M. J. L. Boada, and V. Diaz, ‘‘Vehicle sideslip angle
[40] WEKA-Documentation, Univ. Waikato, Hamilton, New Zealand, 2018.
measurement based on sensor data fusion using an integrated ANFIS
and an unscented Kalman filter algorithm,’’ Mech. Syst. Signal Process.,
vols. 72–73, pp. 832–845, May 2016.
[18] L. Yi-Bo, WangNing, and ZhouChang, ‘‘Based on D-S evidence theory of
information fusion improved method,’’ in Proc. Int. Conf. Comput. Appl. XIAOBO BI received the master’s degree in vehi-
Syst. Modeling (ICCASM), Taiyuan, China, Oct. 2010, pp. 416–419. cle engineering from the Hebei University of Tech-
[19] Y. Xu, T. Xie, Z. Zhou, Y. Lu, J. Jun, C. Shu, J. Li, Y. Ma, and C. Jiang, nology, Tianjin, China. He is currently pursuing
‘‘Research of practical expert system for transformer fault diagnosis based the Ph.D. degree in power machine and engineer-
on multidimensional data fusion,’’ in Proc. IEEE PES Asia–Pacific Power ing with the State Key Laboratory of Engines,
Energy Eng. Conf. (APPEEC), Oct. 2016, pp. 300–304. Tianjin University, Tianjin. He is also a Senior
[20] C. Melendez-Pastor, R. Ruiz-Gonzalez, and J. Gomez-Gil, ‘‘A data fusion Lecturer with the Hebei University of Technology.
system of GNSS data and on-vehicle sensors data for improving car His research interests include multiinformation
positioning precision in urban environments,’’ Expert Syst. Appl., vol. 80, fusion fault diagnosis of ICE and NVH control of
pp. 28–38, Sep. 2017. vehicle.
[21] C. D. Stefano, F. Fontanella, and A. S. Freca, ‘‘Feature selection in
high dimension data by a filter-based genetic algorithm,’’ in Applica-
tions of Evolutionary Computation (Lecture Notes in Computer Science),
vol. 10199. Cham, Switzerland: Springer, Mar. 2017, pp. 506–521.
[22] S. Mohammadi, M. Papa, E. Pereyra, and C. Sarica, ‘‘Genetic algo- JIANSHENG LIN is currently a Ph.D. Super-
rithm to select a set of closure relationships in multiphase flow models,’’ visor and a Professor with the Tianjin Univer-
J. Petroleum Sci. Eng., vol. 181, Oct. 2019, Art. no. 106224. sity of Technology. His current research interests
[23] T. Gangavarapu and N. Patil, ‘‘A novel filter–wrapper hybrid greedy include multibody dynamics simulation and anal-
ensemble approach optimized using the genetic algorithm to reduce the ysis and internal combustion engine working pro-
dimensionality of high-dimensional biomedical datasets,’’ Appl. Soft Com- cess research.
put., vol. 81, Aug. 2019, Art. no. 105538.
[24] R. Malhotra and M. Khanna, ‘‘Dynamic selection of fitness function for
software change prediction using particle swarm optimization,’’ Inf. Softw.
Technol., vol. 112, pp. 51–67, Aug. 2019.
[25] P. A. Devijver and J. Kittler, ‘‘Pattern recognition: A statistical approach,’’
Image Vis. Comput., vol. 3, no. 2, pp. 87–88, May 1985.
[26] A. K. Das, S. Sengupta, and S. Bhattacharyya, ‘‘A group incremental
FENGRONG BI received the Ph.D. degree in
feature selection for classification using rough set theory based genetic
power machinery and engineering from Tianjin
algorithm,’’ Appl. Soft Comput., vol. 65, pp. 400–411, Apr. 2018.
University, Tianjin, China. He is currently a Ph.D.
[27] D. Pelleg and A. Moore, ‘‘X-menas: Extending K-means with efficient
estimation of number of clusters,’’ in Intelligent Data Engineering and
Supervisor and a Professor with Tianjin Univer-
Automated Learning. Berlin, Germany: Springer, 2000. sity. His current research interests include the con-
[28] P. Kumar and S. K. Wasan, ‘‘Analysis of X-means and global k-means trol of vibration and noise and the fault diagnosis
using TUMOR classification,’’ in Proc. 2nd Int. Conf. Comput. Automat. of vehicles and power machineries, with specific
Eng. (ICCAE), Feb. 2010, pp. 832–835. investigation on excitation mechanism, transfer
[29] H. Jung and B. Seo, ‘‘A fast EM algorithm for Gassian mixtures,’’ Com- characteristics, assessment, and control methods
mun. Korean Stat. Soc., vol. 19, no. 1, pp. 157–168, 2012. of NVH.
[30] P. Zhao, D. Li, Z. Feng, L. Yao, and G. Jia, ‘‘Identification of unknown
modes for air-conditioning based on hybrid cluster algorithm,’’ in Proc.
12th World Congr. Intell. Control Automat. (WCICA), Guilin, China,
Jun. 2016, pp. 1147–1151.
[31] J. Chen, X. Zhang, N. Zhang, and K. Guo, ‘‘Fault detection for turbine XIN LI received the master’s degree in power
engine disk using adaptive Gaussian mixture model,’’ Proc. Inst. Mech. machinery and engineering from Tianjin Univer-
Eng., I, J. Syst. Control Eng., vol. 231, no. 10, pp. 827–835, Sep. 2017. sity, Tianjin, China, in 2017, where he is currently
[32] T. Mekaroonkamon and S. Wongsa, ‘‘A comparative investigation of the pursuing the Ph.D. degree. His main research
robustness of unsupervised clustering techniques for rotating machine fault interests include reliability and weak power fault
diagnosis with poorly-separated data,’’ in Proc. 8th Int. Conf. Adv. Comput. diagnosis.
Intell., Chiang Mai, Thailand, Feb. 2016, pp. 165–172.
[33] F. Tan, X. Fu, Y. Zhang, and A. G. Bourgeois, ‘‘A genetic algorithm-
based method for feature subset selection,’’ Soft Comput., vol. 12, no. 2,
pp. 111–120, May 2007.
DAIJIE TANG received the B.S. degree in energy XIAO YANG graduated from Tianjin University,
and power engineering and the M.S. degree in Tianjin, China. He is currently pursuing the Ph.D.
power engineering from Tianjin University, Tian- degree in power machine and engineering with the
jin, China, in 2016 and 2019, respectively, where State Key Laboratory of Engines, Tianjin Univer-
he is currently pursuing the Ph.D. degree in power sity. His current research interests include fault
machinery and engineering. His current research diagnosis and noise reduction of internal combus-
interests include fault diagnosis of power machin- tion engines.
ery, vibration, and noise control.
YOUXI WU (Member, IEEE) received the Ph.D. PENGFEI SHEN received the B.S. degree in
degree in theory and new technology of electrical energy and power engineering and the M.S. degree
engineering from the Hebei University of Technol- in power engineering from Tianjin University,
ogy, Tianjin, China. He is currently a Ph.D. Super- Tianjin, China, in 2016 and 2019, respectively. His
visor and a Professor with the Hebei University of main research interest includes the power machin-
Technology. His current research interests include ery’s vibration fault diagnosis.
data mining and machine learning. He is a Senior
Member of CCF.