Professional Documents
Culture Documents
Applied Acoustics
journal homepage: www.elsevier.com/locate/apacoust
a r t i c l e i n f o a b s t r a c t
Article history: When the rolling bearing fails, the fault features contained in bearing vibration signal are easily sub-
Received 10 July 2022 merged by fortissimo noise interference signals, and have obvious non-stationary and nonlinear proper-
Received in revised form 24 October 2022 ties. This means that it is extremely challenging to acquire useful bearing fault features and identify
Accepted 20 November 2022
bearing fault patterns effectively by traditional diagnosis methods. To more efficiently learn bearing fault
Available online 5 December 2022
information and improve bearing fault diagnosis accuracy, this research proposes a new intelligent fault
diagnosis method for rolling bearing based on variational mode extraction (VME) and an improved one-
Keywords:
dimensional convolutional neural network (I-1DCNN). Firstly, a new adaptive signal processing method
Variational mode extraction
Improved one-dimensional convolutional
named VME is employed to handle the collected bearing vibration signals with the aim of obtaining the
neural network desired mode component and removing the noise interference information. Meanwhile, the extracted
Rolling bearing mode components are randomly divided into the training set, validation set and test set. Then,
Fault diagnosis the training set and validation set are input into the proposed I-1DCNN model for training, where the
proposed I-1DCNN model may not only learn the discriminant features intelligently, but also boost the
computational efficiency and alleviate the problem of over-fitting by incorporating the early stopping
method and self-attention mechanism into the traditional one-dimensional convolutional neural net-
work (1DCNN). Finally, the test set is input into the well-trained I-1DCNN to realize the automatic iden-
tification of different fault types of rolling bearing. The effectiveness of the suggested method is
illustrated by analyzing two experimental data sets. In addition, by comparing with other representative
methods, the superiority of the proposed method is testified in bearing health condition identification.
Ó 2022 Elsevier Ltd. All rights reserved.
https://doi.org/10.1016/j.apacoust.2022.109143
0003-682X/Ó 2022 Elsevier Ltd. All rights reserved.
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Nomenclature
bearing faults. Wang et al. [13] combined VMD and deep autoen- the above problems, the convolutional neural network (CNN) [25]
coder to achieve an unsupervised fault diagnosis. According to is exploited for fault diagnosis. As CNN is originally used for image
the VMD decomposition principle, Jiang et al. [14] proposed a cen- processing, some scholars convert the collected bearing signals
tral frequency mode decomposition method (CFMD), and verified into two-dimensional images and input them into CNN for
the effectiveness of CFMD method in fault diagnosis through sim- automatic fault identification. Zhang et al. [26] converted
ulation and experiment. These methods have made some out- one-dimensional time-domain signals into two-dimensional
standing achievements in mechanical fault diagnosis. time–frequency grayscale images through continuous wavelet
Nonetheless, LMD has some problems in signal processing, such transform (CWT) and put them into the multi-mode CNN
as end effect and mode mixing. Although WTD can achieve noise (MMCNN) model for fault diagnosis. Hasan et al. [27] fused
reduction, the noise reduction effect depends on the reasonable multi-domain information of the original vibration signals into
choice of threshold. Similar to LMD, EMD also suffers from the two-dimensional composite images, and then fed the composite
drawbacks of end effect and mode mixing. VMD has fortissimo sig- images into CNN model based on multitask learning (MTL) for fault
nal decomposition performance, which is extensively applied in diagnosis. Experiments testify that this method can acquire good
fault diagnosis. Nevertheless, VMD also has two problems, one is results in fault diagnosis. Although two-dimensional CNN can be
the long decomposition time, and the other is that the decomposi- applied to identify-one-dimensional vibration signals, in the pro-
tion capacity of VMD depends on the number of decomposition cess of signal dimension transformation, the correlation between
layers [15]. To overcome these problems, Nazari et al. [16] recently one-dimensional data is destroyed, which can beget some momen-
suggested VME on the basis of VMD, which is a new adaptivity sig- tous fault information to be mislaid. Therefore, one-dimensional
nal processing method. This method has been employed by some convolutional neural network (1DCNN) is proposed to settle these
academics for health condition monitoring of rotating machinery. issues. Zhang et al. [28] input original one-dimensional vibration
Guo et al. [17] used the improved VME method to process the gear- signals into 1DCNN model for fault identification. Wang et al.
box vibration signals collected in the noise environment, and ver- [29] fused acoustic signals and vibration signals as the input of
ified the strong noise reduction performance of the method 1DCNN for bearing fault status identification. Habbouche et al.
through experiments. In order to overbear the weaknesses of set- [30] firstly used VMD to handle the original bearing vibration sig-
ting parameters manually, Yan et al. [18] adopted whale optimiza- nals, and then input the processed one-dimensional signals into
tion algorithm (WOA) to select the optimal parameters for VME, 1DCNN for fault diagnosis, which can accurately recognize fault
which gave full play to the potential of VME. Compared with patterns. According to the above analysis, 1DCNN can be consid-
VMD, VME mainly focuses on extracting a desired mode compo- ered as a promising tool to identify bearing fault patterns.
nent, which has a better noise reduction performance and higher However, for complex noisy vibration signals, the traditional
computational efficiency than the acknowledged VMD [19]. There- 1DCNN model sometimes cannot achieve a high identification
fore, for the purpose of reducing the noise interference of the col- accuracy due to its over-fitting problem. Besides, the computa-
lected raw bearing vibration signal, VME method is employed to tional burden of traditional 1DCNN is large when the training
preprocess bearing vibration signals in this paper. times are set too large. Therefore, to boost the identification accu-
Intelligent fault diagnosis of rolling bearing is widely regarded racy and computational efficiency of the 1DCNN model, this
as a pattern recognition problem. Common pattern recognition research proposes the improved one-dimensional convolutional
algorithms include support vector machine (SVM) [20], K-nearest neural network (I-1DCNN). Specifically, batch normalization (BN)
neighbor algorithm (KNN) [21], decision tree (DT) [22], support is introduced between the convolution layer and pooling layer in
matrix machine (SMM) [23] and artificial neural network (ANN) traditional 1DCNN to escalate the convergence speed of the model.
[24]. These algorithms are able to accurately identify bearing Following the pooling layer, self-attention mechanism (SAM) is
health states, but they take a lot of time to manually extract useful introduced to highlight main features and suppress interference
features. In addition, when the above methods are faced with vast features, which is able to enhance the robustness of network
sample data and vibration signals containing noise, they can not model. Additionally, to minimize the parameters of entire model
achieve satisfactory identification accuracy and efficiency. To solve and increase the computational speed of 1DCNN, global average
2
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
2. Related works
3
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
xi lB
traditional 1DCNN model, which contains a convolution layer, ^xi ¼ p ffiffiffiffiffiffiffiffi
2
rB þe
pooling layer, full connected layer and classification layer [35], is ð11Þ
displayed in Fig. 2. yi ¼ c^xi þ b
In the convolution layer, the input signals are convolved by fil-
P
m
ters, followed by an activation function to generate an output lB ¼ m1 xi
eigenmatrix. The convolution operation process is: i¼1
ð12Þ
Pm 2
l l r2B ¼ m1 xi l B
i ðjÞ ¼ ReLUðK i x ðjÞ þ bi Þ
ylþ1 ð8Þ
l
i¼1
where ReLUðÞ and represent activation function and convolution where xi is the data in small batch B; m is the batch size; e is a small
operation respectively; xl ðjÞ is defined as the j-th local region in constant and can be ignored; c and b respectively represent the
l
the convolutional layer l; K li and bi are the weight and bias of the scale and shift which need to be learned.
i-th convolutional kernel in the l layer respectively.
The pooling layer comes after the convolution layer. Its main 3.2. Self-attention mechanism
function is to extract main features and reduce feature dimensions.
Maximum and average pooling are two typical pooling algorithms. In 2017, Vaswani et al. [38] proposed SAM that was initially
The following is the maximum pooling expression. applied to machine translation. Many scholars have combined
SAM and deep learning to handle the problem of natural language
Plþ1
i ðjÞ ¼ max fylþ1
i ðtÞg ð9Þ [39,40] and fault diagnosis. SAM performs adaptive weighted pro-
ðj1ÞWþ16t6jW
cessing of main information by mimicking human attention. For an
where W is the width of the pooling area, P lþ1
i ðjÞ is the output fea- input sequencex ¼ fx1 ; x2 ; . . . ; xn g; xi 2 Rk , the operation steps of the
ture after the maximum pooling operation. self-attention mechanism are as follows [41]:
The fundamental capacity of the full connected layer is to splice
First, calculate the vectors qi ; ki ; v i 2 Rdk of keys, queries, and
two-dimensional features obtained by convolution and pooling
values by a linear transformation.
operations into a one-dimensional vector. The following is the
equation of the full connected layer: qi ¼ W q xi ; ki ¼ W k xi ; v i ¼ W v xi ð13Þ
!
X
m X
n
l
q
here W ; W ; W 2 R k v dk k
are parameter matrices, which need to
zlþ1 ðjÞ ¼ ReLU W litj ali ðt Þ þ bj ð10Þ
i¼1 t¼1
be learned.
Secondly, the similarity between qi and kj is calculated to obtain
l
where W litj
and are the weight and bias; ali ðtÞ represents the value
bj an attention scoreai;j .
of the t-th neuron in the i-th feature of layer l.
q k
ai;j ¼ p
i
ffiffiffiffiffij ð14Þ
dk
3. The proposed method
where dk represents the dimension ofkj .
3.1. Batch normalization Then, the softmax function is used to normalizeai;j , and the cor-
responding weight coefficient of v i is obtained.
Szegedy et al. [36] proposed BN in 2015. The idea of BN is to
expðai;j Þ
standardize each batch of training data and reduce the difference a^ i;j ¼ softmaxðai;j Þ ¼ P
n
ð15Þ
from the input feature distribution of the previous layer. BN can expðai;t Þ
not only escalate the convergence rate, but also enhance the gen- t¼1
eralization capability of the model. For a small batch of Finally, multiply v i and the corresponding weight coefficient
dataB ¼ fx1 ; x2 ; ; xm g, the following is the computational proce- and sum to obtain the attention valuezi .
dure of BN [37]:
4
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
X
n
dimensions. Repeat the above operation three times with a narrow
zi ¼ a^ i;j v j ð16Þ
convolution kernel to improve the learned features.
j¼1
Secondly, the features obtained from the last pooling layer are
weighted adaptively to highlight important features and suppress
where zi is the attention ofxi . The attention value corresponding to
invalid features. Thus, the feature extraction capability and robust-
each value in x can be calculated simultaneously to form a new fea-
ness of the model are enhanced.
ture sequencez ¼ fz1 ; z2 ; . . . ; zn g. A schematic representation of self-
Thirdly, GAP is utilized to process the features obtained from
attention is presented in Fig. 3.
the self-attention (SA) layer, which can lessen the total parameters
and shorten the computation time compared to the full connected
3.3. Improved one-dimensional convolutional neural network layer. The GAP is expressed as follows:
1 Xn
The fault features contained in bearing vibration signals are yk ¼ xk ð17Þ
n i¼1 i
facilely inundated by intense noise interference signals, and pos-
sess strong non-stationary and nonlinear properties. When the tra-
where xki denotes the value corresponding to the i-th neuron of the
ditional 1DCNN is adopted to process these vibration signals, it is
k-th feature in the SA layer, n is the feature dimension.
unable to effectually extract useful feature information, thus weak-
Fourthly, the one-dimensional vector obtained from global
ening greatly feature learning competence of 1DCNN. To overcome
average pooling is input to softmax classifier. The following is
the above problems, this research proposes the I-1DCNN model.
the softmax expression:
Fig. 4 shows the structure of I-1DCNN, which consists of four con-
volutional layers, four BN layers, four pooling layers, one GAP layer, exp zo ðjÞ
one self-attention layer and one softmax layer. The following is an QðjÞ ¼ softmaxðzo ðjÞÞ ¼ ð18Þ
P
M
elaborate illustration of I-1DCNN. exp zo ðkÞ
k¼1
Firstly, a wide convolution kernel is used to convolve the input
features, which can effectively filter out the noise. The features here z ðjÞ represents logits of the j-th neuron, and M represents
o
after convolution are normalized in batches to reduce the distribu- the number of bearing fault categories.
tion differences between layers and speed up convergence. Maxi- Finally, the goal function is selected to be the cross-entropy loss
mum pooling is used to extract main features and reduce feature function, and its equation is provided in Eq. (19). The adam opti-
5
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
7
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
In order to more efficiently learn bearing fault information and 4. Experimental validation
improve bearing fault diagnosis accuracy, this research proposes a
new intelligent fault diagnosis method for rolling bearing based on In this section, two cases are governed to validate the availabil-
VME and I-1DCNN. Firstly, bearing vibration signals are acquired ity of the suggested method in bearing fault diagnosis. Besides,
by acceleration sensors. Secondly, VME is carried out on the comparisons among a variety of methods are executed to illustrate
acquired original vibration signals with the aim of extracting the the meliority of the suggested method in identifying bearing fault
main mode component and eliminating some unserviceable inter- patterns under noise environments. Of particular note is, the
ference information. Then, the extracted mode components are experiment is operated on a PC with Intel(R) Core(TM) I7-9700
divided into the training set, validation set and test set. Next, the CPU and 16G RAM. Software is based on MATLAB language and
training set and validation set are input into the I-1DCNN model Python language TensorFlow framework to achieve.
for training and the well-trained I-1DCNN model is saved. Finally,
the test set is fed into the well-trained I-1DCNN model for testing 4.1. Bearing benchmark data from CWRU
to obtain the fault diagnosis results. Fig. 5 is overall framework of
the proposed fault diagnosis method. Its particular steps are 4.1.1. Description of experimental equipment and bearing data
explained below: The normal and fault data of type 6205-2RS bearing offered by
the electrical engineering laboratory of Case Western Reserve
Step 1: Collect bearing vibration data. Acceleration sensors are University (CWRU) are selected as diagnostic objects for case 1
installed on the testing bearing of the experimental device to [42]. Fig. 6 presents the data acquisition system of case 1. The
acquire the original bearing vibration data. whole data set contains one normal state and three damaged
Step 2: Variational modal extraction. VME method is carried out states. Each damage state contains three different sizes: early fault
on the collected original bearing vibration data to extract the (damage diameter is 0.1778 mm), middle fault (damage diameter
8
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Table 2
Center frequency of bearing vibration signals of different fault types.
Fault type N IRF1 IRF2 IRF3 ORF1 ORF2 ORF3 BF1 BF2 BF3
Center frequency xd /Hz 1000 3500 3500 2800 2800 3400 3400 3400 3400 3400
9
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Fig. 10. Time-domain waveform and spectrum of IRF1 vibration signal before and after variational mode extraction.
Table 3
Specific parameters of the I-1DCNN model in case 1.
Layer type Filter number Kernel size Stride Activation function padding Has BN Output shape
C1 16 1024*1 16 ReLU Same Yes 128*16
P1 16 2*1 2 – Valid No 64*16
C2 32 8*1 1 ReLU Same Yes 64*32
P2 32 2*1 2 – Valid No 32*32
C3 32 4*1 1 ReLU Same Yes 32*32
P3 32 2*1 2 – Valid No 16*32
C4 64 3*1 1 ReLU Same Yes 16*64
P4 64 2*1 2 – Valid No 8*64
SA – – – – – No 8*64
GAP – – – – – No 1*64
Softmax – – – Softmax – No 10
where L represents the total length of the vibration signal, Lsample small batch size and Adam optimizer initial learning rate are set
represents the length of a sample, and S represents the sliding win- to 64 and 0.001, respectively. The maximum training iteration is
dow size. ½ indicates the downward-rounded function. set to 500. The training result of the proposed method is revealed
in Fig. 11. As can be clearly observed in Fig. 11, the convergence
speed of the I-1DCNN model is very fast. After about 10 times of
4.1.2. Diagnosis results of the proposed method training, the loss value and accuracy curve of the I-1DCNN model
In case 1, to verify the effectiveness of the suggested method, reaches a steady state. The loss value gradually tends to 0, the
the experimental data is analyzed according to the flowchart in accuracy gradually tends to 1. Finally, the test set is fed into the
Fig. 5. Firstly, VME is utilized to extract main mode component well-trained I-1DCNN for testing, and the result of fault diagnosis
from raw vibration signals. In VME, two key parameters (i.e., the is obtained. The diagnostic result for the test sample is given in
penalty parameter a and the initial center frequency xd ) are Fig. 12. From the confusion matrix in Fig. 12, it can be found that
required to determine in advance. If the penalty parameter a is the true labels and predicted labels of 300 test samples are com-
huge, the extracted mode component will lose some useful infor- pletely consistent, and the identification accuracy reaches 100 %.
mation. If the penalty parameter a is too small, the extracted mode The above analysis illustrates that the suggested method possesses
component will contain some interference information. Therefore, excellent viability in bearing health condition identification.
according to the literature [13], this paper choosesa ¼ 1000.
Besides, the initial center frequency xd is determined based on fre-
4.1.3. Performance analysis under different noise levels
quency value corresponding to the highest spectral peak of bearing
To simulate the collected vibration signal in the noisy environ-
vibration signal. Specifically, the IRF1 vibration signal is used as an
ment, Gaussian white noise is mixed into the acquired raw bearing
instance. Fig. 9 shows the amplitude spectrum of IRF1 vibration
vibration signals. Signal-to-Noise Ratio (SNR) is employed to
signal. By observing Fig. 9, it may be found that the frequency cor-
appraise the level of noise contained in bearing signals, and its
responding to the highest spectral peak is around 3500 Hz. There-
expression is presented below:
fore, the initial center frequency xd is selected 3500 Hz. The center
frequency of bearing vibration signals of different fault types is Psignal
exhibited in Table 2. Fig. 10 displays the extracted main mode
SNRdB ¼ 10 logð Þ ð21Þ
Pnoise
component. Secondly, bearing vibration data processed by VME
are divided into the training set, validation set and test set accord- Z T Z T
1 1
ing to 7:2:1. Then, the training set and validation set are input into P¼ pðtÞdt ¼ jxðtÞj2 dt ð22Þ
T 0 T 0
the I-1DCNN model for training. Table 3 displays the configuration
of parameters of the I-1DCNN model. Concretely, the 1024*1 wide where P is the average power of the signal xðtÞ in period T.
convolution kernel is used in the first convolution layer to effec- Take the IRF1 vibration signal as an example. Fig. 13(a)–(c) are
tively suppress noise. Other layers use narrow convolution kernels the original IRF1 vibration signal, the added noise signal and the
to improve the learned features. The convolution kernels of other mixed IRF1 vibration signal containing noise, respectively. To illus-
layers are 8*1, 4*1, 3*1 and 3*1, respectively. Set dropout to 0.2. trate anti-noise capacity, the proposed method is applied to bear-
The parameter of the softmax classification layer is set to 10. The ing vibration signals with different SNR values. In order to assure
10
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Fig. 13. Figures for IRF1 original vibration signal, noise signal and the IRF1 vibration signal containing noise.
Table 4
Identification results with different SNR values in case 1.
Fig. 14. The identification results with different convolution kernel sizes in case 1.
about 10 percentage points higher than VMD-I-1DCNN and about IMFs k. Number of iterations m and number of IMFs k are the main
30 percentage points higher than WTD-I-1DCNN. The results factors that influence EMD performance. In order to avoid impro-
demonstrate that it is very necessary to employ VME to handle per parameter selection leading to insufficient performance of sig-
raw bearing vibration signals in advance. nal preprocessing method and affecting final diagnosis results,
The preprocessing effect of VME, VMD and EMD depends on different parameters are handpicked to analyze bearing vibration
several parameters. The main parameters that affect VME perfor- signal with SNR = 10 dB. Table 6 parades the preprocessing
mance are penalty factor a and center frequencyx. VMD perfor- results of VME, VME and EMD under different parameters. The
mance is primarily influenced by penalty factor a and number of recognition accuracy of employing VME may be increased to
12
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Fig. 15. Identification accuracy and time with different mini-batch sizes in case 1.
Fig. 16. The identification results of the I-1DCNN model with different signal preprocessing methods.
13
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
14
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
15
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Table 7
A specific description of the experimental bearing in case 2.
Bearing type Inside diameter Roll diameter Outside diameter Number of the roller Contact angle (°)
HRB 6205 25 mm 7.94 mm 52 mm 9 0
Table 8
The elaborate depiction of the sample set in case 2. suggested method is revealed in Fig. 24. The I-1DCNN model does
not converge in the early stage of training, but after 30 sessions of
Fault type Number Category
training, the loss value and accuracy curve of the model tended to
of samples label
be stable. The loss value gradually tends to 0, the accuracy gradu-
Normal(N) 300 1
ally tends to 1. Finally, the test sample is fed into the well-trained
Outer ring fault (ORF) 300 2
Inner ring fault (IRF) 300 3
I-1DCNN for testing, and the identification result of bearing data in
Ball fault (BF) 300 4 case 2 is obtained. The identification consequence of test sample
Inner and outer ring compound fault (OIF) 300 5 set is illustrated in Fig. 25. The real labels of the 210 test samples
Outer ring and ball compound fault (OBF) 300 6 exactly matched the predicted labels with 100 % identification
Inner and outer ring and ball compound fault (OIBF) 300 7
Fig. 21. Time-domain waveform and spectrum of 7 original vibration signals in case 2.
16
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Table 9
Center frequency of bearing vibration signals of different fault types.
Fig. 23. Time-domain waveform and spectrum of IRF vibration signals before and after variational mode extraction.
Table 10
Specific parameters of the I-1DCNN model in case 2.
Layer type Filter number Kernel size Stride Activation function padding Has BN Output shape
C1 16 512*1 16 ReLU same Yes 128*16
P1 16 2*1 2 – valid No 64*16
C2 32 8*1 1 ReLU same Yes 64*32
P2 32 2*1 2 – valid No 32*32
C3 32 4*1 1 ReLU same Yes 32*32
P3 32 2*1 2 – valid No 16*32
C4 64 3*1 1 ReLU same Yes 16*64
P4 64 2*1 2 – valid No 8*64
SA – – – – – No 8*64
GAP – – – – – No 1*64
Softmax – – – Softmax – No 10
17
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Table 11
Identification accuracy with different SNR values.
Fig. 27. Identification accuracy and identification time with different mini-batch sizes.
Table 12 5. Conclusions
The identification results with early stopping method and without early stopping
method.
This research proposes a new intelligent fault diagnosis method
Identification method Number of Accuracy Time for rolling bearing based on variational mode extraction and
iterations (%) (s) improved one-dimensional convolutional neural network, which
I-1DCNN without early stopping method 10 13.33 16.41 can effectually wipe noises in the original vibration signals and
30 89.52 25.26 boost the identification accuracy of rolling bearing under
50 83.80 35.50
70 94.76 42.11
noise environments. The effectiveness and superiority of the
100 97.14 55.03 suggested method are verified by experiments and comparative
200 96.66 69.08 analysis. The following main innovations and contributions are
I-1DCNN with early stopping method 61 97.61 38.24 emphasized:
19
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Fig. 28. Identification results of the I-1DCNN model with different signal preprocessing methods.
Table 13
Preprocessing results of VME, VME and EMD under different parameters. (1) A new model named improved one-dimensional convolu-
Preprocessing Key parameters Parameter Accuracy
tional neural network is proposed by introducing batch nor-
method selection (%) malization, self-attention mechanism and an early stopping
VME Penalty factor and center (500, 500) 83.33
method in traditional one-dimensional convolutional neural
frequency (a; x) (2000, 3000) 62.01 network, which can more efficiently learn fault features and
(4000, 3000) 82.72 alleviate the over-fitting problem of traditional one-
(1500, 4000) 61.31 dimensional convolutional neural network.
(1000, xd ) 97.13
(2) A bearing intelligent fault diagnosis method is proposed by
VMD Penalty factor and number of (1000, 9) 90.18 fusing variational mode extraction with improved one-
IMFs (2000, 6) 91.24
dimensional convolutional neural network. The proposed
(a; k) (3200, 14) 87.42
(800, 4) 92.13 method may acquire a satisfactory consequence in two
(6000,20) 83.71 groups of experimental data, even in a strong noise
EMD Number of iterations and (80, 4) 82.94 environment.
number of IMFs (m, k) (200, 5) 83.14 (3) Compared with other representative methods, the suggested
(140, 8) 82.09 fault diagnosis method may identify the fault pattern of roll-
(450, 6) 84.66 ing bearing more accurately, which substantiates the superi-
(50, 7) 82.37
ority of the suggested method in bearing health monitoring.
* The specific value of xd is presented in Table 9.
20
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
Moreover, when the SNR value ranges from 10 dB to 0 dB, Data availability
the proposed method may acquire identification accuracy of
more than 95 %, indicating that the suggested method pos- The data that has been used is confidential.
sesses strong anti-noise performance.
CRediT authorship contribution statement This research was supported by the National Natural Science
Foundation of China (No.: 52005265), the Natural Science Fund
Maoyou Ye: Conceptualization, Methodology, Software, Writ- for Colleges and Universities in Jiangsu Province (No.:
ing – original draft, Writing – review & editing. Xiaoan Yan: Fund- 20KJB460002), Macau Young Scholars Program (No.:
ing acquisition, Software, Writing – review & editing. Ning Chen: AM2021002). Meanwhile, the authors would like to thank the edi-
Supervision. Minping Jia: Project administration, Resources. tors and the anonymous reviewers for their valuable comments.
21
M. Ye, X. Yan, N. Chen et al. Applied Acoustics 202 (2023) 109143
References [22] Menezes AGC, Araujo MM, Almeida OM, et al. Induction of decision trees to
diagnose incipient faults in power transformers. IEEE Trans Dielectr Electr
Insul 2022;29:279–86.
[1] Shao H, Xia M, Wan J, et al. Modified stacked auto-encoder using adaptive
[23] Li X, Yang Y, Shao H, et al. Symplectic weighted sparse support matrix machine
morlet wavelet for intelligent fault diagnosis of rotating machinery. IEEE-
for gear fault diagnosis. Measurement 2021;168:108392.
ASME Trans. Mechatron. 2021; PP: 1-1.
[24] Hameed SS, Muralidharan V, Ane BK. Comparative analysis of fuzzy classifier
[2] Hoang D, Kang H. A survey on deep learning based bearing fault diagnosis.
and ANN with histogram features for defect detection and classification in
Neurocomputing 2019;335:327–35.
planetary gearbox. Appl Soft Comput 2021;106:107306.
[3] Yan X, Xu Y, She D, et al. Reliable fault diagnosis of bearings using an optimized
[25] Cao H, Shao H, Zhong X, et al. Unsupervised domain-share CNN for machine
stacked variational denoising auto-encoder. Entropy 2022;24:36.
fault transfer diagnosis from steady speeds to time-varying speeds. J Manuf
[4] Li Y, Zuo MJ, Chen Z, et al. Railway bearing and cardan shaft fault diagnosis via
Syst 2022;62:186–98.
an improved morphological filter. Struct Health Monit 2020;19:1471–86.
[26] Zhang K, Wang J, Shi H, et al. A fault diagnosis method based on improved
[5] Dragomiretskiy K, Zosso D. Variational mode decomposition. IEEE Trans Signal
convolutional neural network for bearings under variable working conditions.
Process 2014;62:531–44.
Measurement 2021;182:109749.
[6] Jiang X, Wang J, Shen C, et al. An adaptive and efficient variational mode
[27] Hasan MJ, Islam MMM, Kim J. Bearing fault diagnosis using multidomain
decomposition and its application for bearing fault diagnosis. Struct. Health
fusion-based vibration imaging and multitask learning. Sensors 2022;22:56.
Monit. 2020: 1475921720970856.
[28] Zhang W, Li C, Peng G, et al. A deep convolutional neural network with new
[7] Liu Z, Jin Y, Zuo MJ, et al. Time-frequency representation based on robust local
training methods for bearing fault diagnosis under noisy environment and
mean decomposition for multicomponent AM-FM signal analysis. Mech Syst
different working load. Mech Syst Signal Proc 2018;100:439–53.
Signal Proc 2017;95:468–87.
[29] Wang X, Mao D, Li X. Bearing fault diagnosis based on vibro-acoustic data
[8] Su M, Zheng J, Yang Y, et al. A new multipath mitigation method based on
fusion and 1D-CNN network. Measurement 2021;173:108518.
adaptive thresholding wavelet denoising and double reference shift strategy.
[30] Habbouche H, Amirat Y, Benkedjouh T, et al. Bearing fault event-triggered
GPS Solut 2018;22:40.
diagnosis using a variational mode decomposition-based machine learning
[9] Oh B, Guo X, Wan F, et al. Micro-doppler mini-UAV classification using
approach. IEEE Trans Energy Convers 2022;37:466–74.
empirical-mode decomposition features. IEEE Geosci Remote Sens Lett
[31] Pang B, Nazari M, Tang G. Recursive variational mode extraction and its
2018;15:227–31.
application in rolling bearing fault diagnosis. Mech Syst Signal Proc
[10] Liu H, Han M. A fault diagnosis method based on local mean decomposition
2022;165:108321.
and multi-scale entropy for roller bearings. Mech Mach Theory
[32] Jun TJ, Eom Y, Kim D, et al. TRk-CNN: transferable ranking-CNN for image
2014;75:67–78.
classification of glaucoma, glaucoma suspect, and normal eyes. Expert Syst
[11] Ying W, Zheng J, Pan H, et al. Permutation entropy-based improved uniform
Appl 2021;182:115211.
phase empirical mode decomposition for mechanical fault diagnosis. Digit
[33] Kumar A, Gandhi CP, Zhou Y, et al. Improved deep convolution neural network
Signal Prog 2021;117:103167.
(CNN) for the identification of defects in the centrifugal pump using acoustic
[12] Chen W, Li J, Wang Q, et al. Fault feature extraction and diagnosis of rolling
images. Appl Acoust 2020;167.
bearings based on wavelet thresholding denoising with CEEMDAN energy
[34] Yang J, Liu J, Xie J, et al. Conditional GAN and 2-D CNN for bearing fault
entropy and PSO-LSSVM. Measurement 2021;172:108901.
diagnosis with small samples. IEEE Trans Instrum Meas 2021;70:3525712.
[13] Wang H, Liu X, Ma L, et al. Anomaly detection for hydropower turbine unit
[35] Wang Y, Yang F. A fractional-order CNN hyperchaotic system for image
based on variational modal decomposition and deep autoencoder. Energy Rep
encryption algorithm. Phys Scr 2021;96:035209.
2021;7:938–46.
[36] Ioffe S and Szegedy C. Batch normalization: accelerating deep network training
[14] Jiang X, Song Q, Wang H, et al. Central frequency mode decomposition and its
by reducing internal covariate shift. In: 32nd International Conference on
applications to the fault diagnosis of rotating machines. Mech Mach Theory
Machine Learning, Lille, France, 07-09 Jul 2015.
2022;174:104919.
[37] Wang C, Li H, Zhang K, et al. Intelligent fault diagnosis of planetary gearbox
[15] Wang Z, Yang N, Li N, et al. A new fault diagnosis method based on adaptive
based on adaptive normalized CNN under complex variable working
spectrum mode extraction. Struct Health Monit 2021;20:3354–70.
conditions and data imbalance. Measurement 2021;180:109565.
[16] Nazari M, Sakhaei SM. Variational mode extraction: a new efficient method to
[38] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. In: 31st
derive respiratory signals from ECG. IEEE J Biomed Health Inform
Annual Conference on Neural Information Processing Systems (NIPS), Long
2018;22:1059–67.
Beach, CA, 04-09 Dec 2017.
[17] Guo Y, Jiang S, Yang Y, et al. Gearbox fault diagnosis based on improved
[39] Zhang X, Qiu X, Pang J, et al. Dual-axial self-attention network for text
variational mode extraction. Sensors 2022;22:1779.
classification. Sci China-Inf Sci 2021;64:222102.
[18] Yan X, Xu Y, She D, et al. A bearing fault diagnosis method based on PAVME
[40] Xiao X, Xiao W, Zhang D, et al. Phishing websites detection via CNN and multi-
and MEDE. Entropy 2021;23:1402.
head self-attention on imbalanced datasets. Comput Secur 2021;108:102372.
[19] Pang B, Nazari M, Sun Z, et al. An optimized variational mode extraction
[41] Bi X, Zhao J. A novel orthogonal self-attentive variational autoencoder method
method for rolling bearing fault diagnosis. Struct Health Monit
for interpretable chemical process fault detection and identification. Process
2022;21:558–70.
Saf Environ Prot 2021;156:581–97.
[20] Yang C, Hou J. Fed-batch fermentation penicillin process fault diagnosis and
[42] Su H, Xiang L, Hu A, et al. A novel hybrid method based on KELM with SAPSO
detection based on support vector machine. Neurocomputing
for fault diagnosis of rolling bearing under variable operating conditions.
2016;190:117–23.
Measurement 2021;177:109276.
[21] Song B, Tan S, Shi H, et al. Fault detection and diagnosis via standardized k
[43] Ye M, Yan X, Jia M. Rolling bearing fault diagnosis based on VMD-MPE and
nearest neighbor for multimode process. J Taiwan Inst Chem Eng
PSO-SVM. Entropy 2021;23:762.
2020;106:1–8.
22