You are on page 1of 5

Remaining useful life prediction of lithium battery

using convolutional neural network with optimized


parameters
1st Dongdong Li 2nd Lin Yang
School of Mechanical Engineering School of Mechanical Engineering
Shanghai Jiao Tong University Shanghai Jiao Tong University
Shanghai, China Shanghai, China
lee_auto@163.com yanglinsjtu@126.com

Abstract—Accurately predicting remaining useful life (RUL) Among the data-driven methods for RUL prediction, the
of lithium battery with nonlinear characters is essential for neural network (NN) methods rely on the system complexity to
ensuring safety of applications. However, the diverse aging discover the relations among neural nodes. Due to the characters
mechanism gives the challenge for present technologies. In this of accurate generalization and learning of nonlinear relation
paper, a convolutional neural network (CNN) model is between data and output, the NN model is widely used in the
constructed for RUL prediction of lithium battery. For reducing RUL prediction of lithium-ion battery. However, for feature
the training time of CNN model, the orthogonal method is applied learning, not only enough raw data is needed, but also many
for optimizing model parameters. Then, the proposed is validated parameters are optimized for an accurate NN model. Therefore,
by a large dataset. And the accuracy of RUL prediction exceeds
proper method is essential for selecting the main parameters and
90.9 percent while root mean square error and mean absolute
error are limited to 35.1 and 13.7, respectively. The proposed
modifying parameters to reduce the training time of NN model.
method is suitable for RUL prediction of lithium battery applied The contributions of this paper are twofold: a convolutional
in electric vehicles and energy storage devices. neural network (CNN) model is constructed for RUL prediction
of lithium battery. Then the orthogonal method is applied for
Keywords—remaining useful life, lithium battery, convolutional optimizing CNN model parameters. And the proposed method
neural network is validated with high accuracy.
I. INTRODUCTION II. RUL PREDICTION BY CONVOLUTIONAL NEURAL NETWORKS
Lithium-ion batteries are widely used in electric vehicles
(EVs) for high energy density, long lifetimes and low costs [1, A. Structure of RUL Prediction
2]. However, the system faults and safety accidents of vehicles CNN can maximize the map relations between the input and
caused by aging failure of battery pose a great threat to the output by extracting feature matrix. When the CNN model
occupants and properties. Thus, the real-time monitoring and is training, it can fully learn the feature matrix of input without
management of battery states is needed to avoid the occurrence exacting mathematical expression between the input and the
of safety accidents. And the remaining useful life (RUL) of output. After model trained, CNN has high performance of
battery is the key parameter to accurately predict and evaluate features recognition for target prediction. Therefore, the CNN is
the battery states [3, 4]. Because of dynamic loading and selected for RUL prediction. Comparing to other neural
variable working conditions, the non-linear characters of networks, CNN benefits from three core ideas: local receptive
lithium-ion battery pose a great challenge to the accurate field, weight sharing mechanism, and pooling sample. The
prediction of RUL. framework of the RUL prediction by CNN is shown in Fig. 1.
At present, there are mainly two methods of RUL prediction Firstly, CNN uses sparse connection of local areas (local
for lithium-ion batteries: model-based prediction [5-8] and data- area of filter) to reduce the number of parameters in the model
driven prediction [9-13]. The model-based approach requires the and the training time, and improve the performance of CNN.
construction of a mechanism model that describes the Then, the convolution kernel of convolutional layer adopts
degradation of the electrochemical reaction inside the battery kernel parameter (weight of parameter) to finish convolution
[14, 15]. However, the battery mechanism model has a operations. The weight matrix and the offset vector are shared,
complicate structure which is difficult to completely consider which not only greatly reduces the number of network
the influences of environment and loading conditions on battery parameters, but also improves the training efficiency of CNN.
life. Thus, accurate mechanism modeling of battery is difficult. Finally, pooling sample is formed as the pooling layer that
The data-driven prediction method avoids the research on the further reduces the output dimensions of feature matrix
complex mechanism and electrochemical reaction process extracted from convolutional layer. The local feature scaling of
inside the battery, and directly mines degradation information in the input feature matrix from previous layer is performed by
monitoring data of battery performances [16, 17]. Therefore, mean function of the pooling layer (maximum pooling, mean
data-driven method is currently the primary method for accurate pooling, etc.). So, the dimension of the feature matrix and
RUL prediction of lithium-ion battery. computational burden can be further reduced. On the other hand,
This work is financially supported by the National Natural Science
Foundation of China (NSFC, Grant No. 51875339 and No. 51741707).

978-1-7281-5281-3/20/$31.00 ©2020 IEEE 840

Authorized licensed use limited to: Universidad de Antioquia. Downloaded on May 25,2021 at 16:41:10 UTC from IEEE Xplore. Restrictions apply.
pooling method can maximize the original features of input. The operation of convolution is shown as (2).
These three core ideas have greatly enhanced the performance C −1
of CNN. =  CKil ( j ) X l ( j + j )
l ( rj ) ∗ ∗
yl (i , j ) = CKil ⊕ X (2)
J ∗ =0
*

where, CKil(j ) the j*th weight of ith kernel at lth convolution layer;
Xl(r ) the jth convolution area of lth convolution layer; Xl(j+j*) the
j

character value of jth convolution area of lth convolution layer; C


the kernel width.
The pooling layer connects adjacent convolutional layers
and consists of multiple feature maps. Thus, the convolutional
layer can be considered as the input layer of the pooling layer.
Similarly, the convolution kernel is used to perform sliding
calculation on the convolution layer feature map to obtain the
feature map of the layer. The pooling layer can scale the feature
data of the convolution layer, thereby reducing the feature data
dimension and reducing the computational complexity. At the
same time, the pooling layer can also perform secondary feature
extraction on its feature data. Commonly used pooling
calculation methods include max pooling and average pooling.
In this paper, the method of max pooling is adopted because
Fig. 1. The framework of the RUL prediction by CNN it can obtain the eigenvalues that have no positional relationship
B. Convolutional Neural Networks with the input feature matrix, and enhance the robustness, which
is defined as follows:
The CNN used in this paper contains a multi-layer neural
network, which consists of input layer, hidden layer, full- pl ( i , j ) = max ( f l (i ,t ) ) (3)
connected layer, and output layer. The hidden layer is composed [( j −i )C +1]≤t ≤ jC

of many convolution layers and pooling layers. Each layer of the where, f l(i,t) the tth activation value of ith area at lth convolution
network consists of a two-dimensional plane containing a set of layer (the character value in the local area connecting the input
neurons. The implementation process of the CNN is shown in matrix and pooling layer); C the width of pooling layer; pl(i, j) the
Fig. 2. output of pooling.
The convolutional layer is composed of various feature The fully connected layer is connected after multiple
matrices, which can be regarded as a feature extraction layer. convolutional and pooling layers in the CNN. Each neuron in
Assume that the original data matrix of the input layer is X, and the fully connected layer and the neurons in the upper layer are
the ith layer is the feature map (character matrix) corresponding fully connected, and the characteristic matrix of the
to the convolutional layer FMi. The convolution process can be convolutional layer or the pooled layer is spread to obtain the
defined as: input feature vector of the fully connected layer, after several
FM i = f ( FM i −1 ⊕ w i + bi ) (1) fully connected layers and the entire volume. The output layers
of the neural network are connected to obtain the final CNN
where, FMi the feature matrix corresponding to the ith output value. Each neuron between the fully connected layers
convolutional layer; FMi-1 the feature matrix corresponding to also uses an activation function (ReLU, tanh function, etc.) to
the (i-1)th convolutional layer; wi the weight matrix improve the performance of CNN.
corresponding to the ith convolutional layer; the operator of
convolution; f the activate function.

Fig. 2. The training and prediction process of CNN

841

Authorized licensed use limited to: Universidad de Antioquia. Downloaded on May 25,2021 at 16:41:10 UTC from IEEE Xplore. Restrictions apply.
Mean absolute error (MAE) and root mean square error the limitation of computational burden and training time, model
(RMSE) are adopted as error indexes of RUL prediction. parameters should be quickly and properly adjusted to get the
optimized model. Orthogonal experiment is one of the effective
ˆ
1 n l (i ) − l (i ) (4) methods in searching for main factors without conducting full-
MAE =  l (i )
n i =1 parameter experiments. As expected, the main parameters of
CNN model can be settled.
1 n
RMSE = (l (i) − lˆ(i))2
n i=1
(5) When the structure of CNN is defined, the most influenced
parameters include the segment length, time distance, filter size,
filter number, pooling size, dropout rate and activate function.
C. Optimization of Model Parameters For the data sheet for RUL prediction, the orthogonal table for
The CNN model contains various parameters which have three levels of seven factors is shown in TABLE 1.
influence on the model precision in the training process. Due to
TABLE 1. The plan and range of orthogonal experiments
Activat
Experiment Segmen Time Filter Filter Pooling Dropout Prediction
e
No. t length distance size number size rate accuracy
function
1 200 20 2 50 2 0 ReLU 83.52
2 200 60 6 100 3 0.25 ELU 86.43
3 200 100 10 150 4 0.5 tanh 76.91
4 400 20 2 100 3 0.5 tanh 55.55
5 400 60 6 150 4 0 ReLU 81.51
6 400 100 10 50 2 0.25 ELU 90.20
7 600 20 6 50 4 0.25 tanh 79.17
8 600 60 10 100 2 0.5 ReLU 89.20
9 600 100 2 150 3 0 ELU 87.42
10 200 20 10 150 3 0.25 ReLU 81.62
11 200 60 2 50 4 0.5 ELU 54.18
12 200 100 6 100 2 0 tanh 82.83
13 400 20 6 150 2 0.5 ELU 69.16
14 400 60 10 50 3 0 tanh 88.74
15 400 100 2 100 4 0.25 ReLU 73.16
16 600 20 10 100 4 0 ELU 86.53
17 600 60 2 150 2 0.25 tanh 60.38
18 600 100 6 50 3 0.5 ReLU 90.76
A1 77.58 75.93 69.04 81.10 79.22 85.09 83.30
A2 76.39 76.74 81.64 78.95 81.75 78.49 78.99
A3 82.24 83.55 85.53 76.17 75.24 72.63 72.93
Range 5.86 7.62 16.50 4.93 6.51 12.47 10.37
Influence dropout rate > activate function > filter size (main factors)
ratio > filter number > pooling size > segment length > time distance (secondary factors)

1 n factors should be adjusted firstly, then the secondary factors can


Ak =  Si
k i (6)
be adapted. Form the results in Table 1, the segment length, time
distance, filter size, filter number, pooling size, dropout rate and
activate function should be selected as 600, 100, 6, 50, 3, 0 and
R = max { Ak } − min { Ak } ReLU, respectively.
where Sn (n = 1~6) and Ak (k = 1, 2, 3) are the sum value and III. VALIDATION OF CNN MODEL
mean value of prediction accuracy corresponding to the k level
of factor, respectively; R is the range of Ak. Due to the internal relation between discharge curve and
remaining useful life, six characters (Qske, Qdif,tot, Vdis, Qdis,
The range represents the influence ratio of each factor among dQ/dV and dT/dV ) of discharge curve are selected for the data
all factors. So, the main factors of CNN model are dropout rate, mining. All the characters are illustrated as: Qske the skewness of
activate function and filter size. In the training process, the main dQ/dV, Qdif,tot the capacity difference between present battery

842

Authorized licensed use limited to: Universidad de Antioquia. Downloaded on May 25,2021 at 16:41:10 UTC from IEEE Xplore. Restrictions apply.
and new battery, Vdis the voltage of discharge, Qdis the
discharged capacity, dQ/dV the differential of discharged
capacity to voltage, and dT/dV the differential of battery
temperature to voltage.
The raw data is derived from the reference [1] and reshaped
for the model training of CNN. Details of raw data and
characters are listed in TABLE 2. Battery degrades smoothly
with the increased using cycles, sometimes sharply as shown in
Fig. 3.

Fig. 4. RUL prediction of CNN model

IV. CONCLUSION
In this paper, a convolutional neural network with optimized
Fig. 3. Capacity fading of raw batteries model parameters is proposed for RUL prediction of lithium
TABLE 2. Data frame of CNN model batteries. The main parameters of model are evaluated as
dropout rate, activate function and filter size. The accuracy of
Parameters RUL prediction exceeds 90.9 percent while RMSE and MAE
Battery ID 41~83 are limited to 35.1 and 13.7, respectively. The proposed method
is suitable for predicting RUL of lithium battery used in EVs and
Voltage range 3.5~2.0V
energy storage device.
Life cycles 400~598
Segment Discharge process Future research interests exist in finding more characters in
battery degradation trajectory, e.g. the character among
Characters Qske, Qdif,tot, Vdis, Qdis, dQ/dV, dT/dV
degradation cycles, and constructing high efficiency and easy-
Linear range 1000 embed model.
Training data 41-69
Training samples 388970 ACKNOWLEDGMENT
Testing data 70-83 This work is financially supported by the National Natural
Testing samples 38570 Science Foundation of China (NSFC, Grant No. 51875339 and
No. 51741707).
The prediction results of RUL are presented in Fig. 4. The
remaining capacity of each battery (battery ID: 41~83) varies REFERENCES
from 0 to 550 which gives big challenge for accurately [1] Severson K, Attia Pm, Jin N, et. al. Data-driven prediction of battery cycle
predicting the RUL. However, with the feature mining and life before capacity degradation, Nature Energy. 2019;4:383-391.
relation fitting of nonlinear character, the CNN model can [2] Xia G, Cao L, Bi G. A review on battery thermal management in electric
predict the RUL only by characters data of 10 discharge cycles. vehicle application, Journal of Power Sources. 2017;367:90-105.
The prediction accuracy of total segments is higher than 90 [3] Xiong R, Sun F, Chen Z, et. al. A data-driven multi-scale extended
percent, and the highest one is 97.7 percent when RUL is less Kalman filtering based parameter and state estimation approach of
than 100 cycles. Furthermore, all predicted RUL derivates slight lithium-ion polymer battery in electric vehicles, Applied Energy.
2014;113:463-476.
from the true RUL with no more than 10 percent. The RMSE
[4] He H, Xiong R, Peng J. Real-time estimation of battery state-of-charge
and MAE between prediction and actual are 35.1 and 13.7, with unscented Kalman filter and RTOS mu COS-II platform, Applied
respectively. All the above results show that the proposed CNN Energy. 2016;162:1410-1418.
model can fully predict the RUL of lithium battery. [5] Li Y, Liu K, Foley AM, et. al. Data-driven health estimation and lifetime
prediction of lithium-ion batteries: A review, Renewable & Sustainable
Energy Reviews. 2019;113.
[6] Wu B, Yufit V, Merla Y, et. al. Differential thermal voltammetry for
tracking of degradation in lithium-ion batteries, Journal of Power Sources.
2015;273:495-501.
[7] Shibagaki T, Merla Y, Offer GJ. Tracking degradation in lithium iron
phosphate batteries using differential thermal voltammetry, Journal of
Power Sources. 2018;374:188-195.
[8] Miao Q, Xie L, Cui H, et. al. Remaining useful life prediction of lithium-
ion battery with unscented particle filter technique, Microelectronics
Reliability. 2013;53:805-810.
[9] Zhang Y, Xiong R, He H, et. al. Long Short-Term Memory Recurrent
Neural Network for Remaining Useful Life Prediction of Lithium-Ion

843

Authorized licensed use limited to: Universidad de Antioquia. Downloaded on May 25,2021 at 16:41:10 UTC from IEEE Xplore. Restrictions apply.
Batteries, Ieee Transactions on Vehicular Technology. 2018;67:5695- [14] Han X, Ouyang M, Lu L, et. al. A comparative study of commercial
5705. lithium ion battery cycle life in electric vehicle: Capacity loss estimation,
[10] Berecibar M. Accurate predictions of lithium-ion battery life, Nature. Journal of Power Sources. 2014;268:658-669.
2019;568:325-326. [15] Ecker M, Gerschler JB, Vogel J, et. al. Development of a lifetime
[11] Huang C, Huang H, Li Y. A Bidirectional LSTM Prognostics Method prediction model for lithium-ion batteries based on extended accelerated
Under Multiple Operational Conditions, Ieee Transactions on Industrial aging test data, Journal of Power Sources. 2012;215:248-257.
Electronics. 2019;66:8792-8802. [16] Wu Y, Yuan M, Dong S, et. al. Remaining useful life estimation of
[12] Wu J, Zhang C, Chen Z. An online method for lithium-ion battery engineered systems using vanilla LSTM neural networks,
remaining useful life estimation using importance sampling and neural Neurocomputing. 2018;275:167-179.
networks, Applied Energy. 2016;173:134-140. [17] Li W, Jiao Z, Du L, et. al. An indirect RUL prognosis for lithium-ion
[13] Si X, Wang W, Hu C, et. al. Remaining useful life estimation - A review battery under vibration stress using Elman neural network, International
on the statistical data driven approaches, European Journal of Operational Journal of Hydrogen Energy. 2019;44:12270-12276.
Research. 2011;213:1-14.

844

Authorized licensed use limited to: Universidad de Antioquia. Downloaded on May 25,2021 at 16:41:10 UTC from IEEE Xplore. Restrictions apply.

You might also like