You are on page 1of 9

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/363350797

Turbofan exhaust gas temperature forecasting and performance monitoring


with a neural network model

Conference Paper · August 2022


DOI: 10.3850/978-981-18-5183-4_S30-03-504-cd

CITATIONS READS

2 138

2 authors:

Raphaël Langhendries Jérôme Lacaille


Université de Paris 1 Panthéon-Sorbonne Safran Aircraft Engines
7 PUBLICATIONS   6 CITATIONS    131 PUBLICATIONS   643 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

EHM (Engine Health Monitoring) View project

Defectivity Analysis View project

All content following this page was uploaded by Raphaël Langhendries on 07 September 2022.

The user has requested enhancement of the downloaded file.


Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022)
Edited by Maria Chiara Leva, Edoardo Patelli, Luca Podofillini, and Simon Wilson
©2022 ESREL2022 Organizers. Published by Research Publishing, Singapore.
doi: 10.3850/978-981-18-5183-4_S30-03-504-cd

Turbofan exhaust gas temperature forecasting and performance monitoring with a


neural network model

LANGHENDRIES Raphaël1,2 and LACAILLE Jérôme2


1
Université Paris I - SAMM, 90 rue de tolbiac, Paris, France.
2
Safran Aircraft Engines, 77550 Moissy-Cramayel, France.
E-mail: r.langhendries@gmail.com and jerome.lacaille@safrangroup.com

Exhaust Gas Temperature (EGT) denotes the temperature of the exhaust gas when it leaves the turbine. EGT is an
important parameter for measuring the energy efficiency of a turbofan engine. Indeed, the heat energy produced by
an aircraft engine corresponds to a loss of power. Therefore, forecasting the exhaust gas temperature is a key task to
monitor the engine performance and schedule maintenance operations. In this paper, we propose a new method for
forecasting EGT throughout engine life. The EGT is regarded as a time series, for each flight we aim to predict the
EGT during the cruise. Our model is a neural network that leans on recurrent networks and attention mechanisms to
compute a state vector that represents the wear of the engine. Moreover, we show that this state vector can be used
to monitor the engine’s energy efficiency over time.

Keywords: Artificial Neural Network (ANN), Attention Mechanisms, Prognostic Health Management, Exhaust Gas
Temperature, Turbofan Engine.

1. Introduction a result, the temperature of the exhaust gas is an


Energy efficiency of aircraft engines is a ma- indicator of the amount of energy lost and thus of
jor economic and environmental issue closely the overall efficiency of the engine.
watched by airlines and the aeronautics industry, A normal aircraft flight can be divided into sev-
indeed the fuel consumption of the aircraft di- eral phases: TakeOff, Climb, Cruise, Descent, Ap-
rectly depends on the engine’s efficiency. Con- proach etc. The cruise is usually the longest phase
sequently, much research has been undertaken to during which the plane reaches its maximum dis-
evaluate the engine’s efficiency, monitor it across tance. During this phase altitude and speed remain
engine operations and forecast its evolution. stable. For those reasons, the efficiency during
Moreover, it has been established that engine the cruise phase is capital to ensure the optimal
wear negatively impacts energy efficiency (La- functioning of the engine and that’s why we focus
caille et al. (2013)). The link between wear and on the EGT measured during the cruise.
energy efficiency is therefore an important field of 2.2. Real flight data
research.
We describe below our dataset. We use two kinds
of data.
2. Background
2.1. EGT during cruise and turbofan Snapshot. For each flight phase, the engine com-
efficiency puter records the measurements of sensors at-
The efficiency of a turbofan can be expressed tached to the engine or to the aircraft. Such records
as the mechanical energy provided by the turbo- are called snapshots. We use snapshots from the
fan (the thrust) divided by the energy supplied takeoff, climb and cruise phases.
to the turbofan (the heating value of the fuel).
Apart from mechanical energy, heat energy is also Environmental data. We also use data describ-
released by the engines. But heat energy is not ing the environment where aircraft are operated
used and is mostly dissipated in exhaust gas. As (i.e. departure and arrival airports). We have data

3095
3096 Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022)

 
related to the weather (temperature, pressure, etc.) Zsi s<t and Xti ). Consequently, abrupt changes
and data describing the air pollution (dust, salt can sometimes occur in the EGT.
concentration etc.). We provide below an example of a cruise EGT.
An abrupt dynamic change can be observed near
2.3. Notations and main goal flight 800. This change is caused by a maintenance
For each engine and for each flight we retrieve operation that has decreased the EGT.
data from the takeoff, climb and cruise snapshots
and environmental data (weather and air pollution
at the departure and arrival airport). For engine i
and flight t, we denote by Zti this vector. The EGT
recorded during the cruise phase for the engine i

Temperature (°C)
and flight t is denoted Eti .
We emphasize that the EGT Eti is included in
the vector Zti . Moreover, other features strongly
correlated with the EGT are also included in Zti
(for example internal temperature in the engine).
Therefore, we also define the vector Xti equal to
Zti into which we remove the EGT Eti and any
features directly related to the engine operation
Cycles Since New
except the rotational speed of the turbine engine
(denoted N1 which represents the thrust delivered
by the engine). Fig. 1. EGT during cruise phase, the abscissa cor-
Indeed, our problem is to predict the EGT using responds to the number of operating cycles and the
past data Zsi s<t and data describing the flight ordinate to the measured EGT (degree Celsius). For
number t (the vector Xti ). confidentiality matters, the temperature scale on the y-
axis is not provided.

3. Model
3.1. Towards a probabilistic model Due to such operations, instead of investigating
deterministic models to predict EGT, it appears
The cruise EGT during flight depends on condi-
more promising to learn a probabilistic model that
tions at flight t (thrust delivered by the engine,
fits the EGT distribution for each flight.
altitude, external temperature, etc.) and the state
In this way, we aim to take account of the
of the engine at time t (which depends on past
uncertainty linked to maintenance policy.
data). However, other events can impact the per-
formance of the aircraft. In particular, water-wash 3.2. Gaussian Mixture Model (GMM)
are common maintenance operations that consist
in washing the engine in order to remove airborne A gaussian mixture is a probability distribution
particles that can build up in the engine. Such consisting of a superposition of m gaussian densi-
maintenance operation are regularly carried out ties. An introduction to gaussian mixtures can be
by airline companies and aim to restore engine found in Bishop (2006) Chapter 2.3.9).
performance (Chen and Sun (2018)). Probability density for a m components gaus-
Besides water-wash, many other events can im- sian mixture is defined by
pact aircraft performance: it includes every main- m

tenance operations and component replacements. pi,j
t (x) = πti,j N (μi,j i,j
t , σt ). (1)
j=1
Such events mainly depend on the mainte-
nance policy of airline companies and are not To simplify we will now imply indices i (the
always possible to predict with available data (i.e. engine) and t (the flight cycle) when necessary.
Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022) 3097

(π j )j∈[1,m] are called mixture coefficients and The use of gated recurrent networks to com-
m
their sum must be equal to 1 (i.e. j=1 π j = pute the physical state of a device has become
1). Coefficients (μj , σ j )j∈[1,m] are respectively a common idea in the PHM field. It has already
expectations and standard deviations of gaussian been proposed for example in Wu et al. (2018)
random variables. for RUL (Remaining Useful Life) estimation and
The gaussian mixture model seems pertinent to in Langhendries and Lacaille (2021) for survival
use in our context. Indeed, we expect to model analysis in the aerospace field. Moreover, recent
the uncertainty due to the several possible main- works (for example Zhong et al. (2018)) use an-
tenance operations using several gaussian com- other type of gated recurrent network (i.e. GRU)
ponents. In this way, our problem is to compute for EGT forecasting and show that such meth-
 each engine i and each flight t parameters
for ods achieve state of the art performance. We also
πti,j , μi,j i,j
t , σt with j ∈ [1, m]. The number of emphasize that recurrent networks can be stacked
gaussian components (m) is a hyper-parameter. into layers (hidden states computed by first re-
The gaussian mixture model is commonly used current network are used as inputs by a second
in the field of prognostic health management. For recurrent network). It allows the capture of more
example Wei et al. (2022) uses GMM to predict complex information in the final layer of hidden
the RUL (Remaining Useful Life) of batteries. In states. In this work, we used two stacked LSTM
the aerospace field, gaussian mixture models are but to simplify notations we will continue to de-
also used in the context of unsupervised learning note and draw the recurrent layer as there is only
for fault detection (for example Lacaille et al. one LSTM.
(2014)).
3.5. Attention mechanisms

3.3. State vector Attention mechanisms are a recent advance in


the deep learning field (Vaswani, Shazeer, Parmar,
For each engine and flight, we aim to compute
Uszkoreit, Jones, Gomez, Kaiser, and Polosukhin
the gaussian mixture parameters based on current
(Vaswani et al.)). The key idea is to allow a neural
flight condition data and past data. In particular,
network to learn which part of the input to fo-
we rely on data Zsi s<t to compute a state vector
cus on. Initially, attention based neural network
describing the health state of the engine i after
demonstrate their capabilities in the NLP field.
flight t − 1.
Recent works also demonstrate the pertinence
of such architectures for the PHM. For example
3.4. Recurrent network
Muneer et al. (2021) and Liu and Wang (2021)
To compute the state vector, we chose to use a use attention mechanisms to estimate RUL for
recurrent neural network. The most simple type of turbofan (based on a simulated dataset). A review
recurrent networks is vanilla recurrent networks, on different attention mechanisms can be found in
they have been previously used for RUL estima- Galassi et al. (2019).
tion (for example Liu et al. (2010)). However In this work, two different attention mecha-
those network are exposed to the vanishing and nisms are used. The first one is the short term
exploding gradient issue. attention it aims to find among recent past flights
Gated recurrent network is another type of of the engine those more pertinent to focus on in
recurrent network that has been introduced to order to predict EGT at time t. The second one is
deal with those issues. It includes Long Short the long term attention. It concerns state vector.
Term Memory (LSTM) networks (Hochreiter and
3.5.1. Short term attention
Schmidhuber (1997)).  
In our context, inputs are vectors Zsi s<t rep- This first attention mechanism focuses on flights.
resenting the flight and the hidden state hit is the We compute attention weights according to the
state vector for the engine i after flight t. following equations.
3098 Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022)

of past state vectors. Thus, tb can be much smaller


than ta .
as,t = (W2 ) σ (W1 [Xs , Xt ] + w0 ) . (2)
The state context is computed as follows
exp(as,t )
αs,t = t−1 . (3)
r=ta exp(ar,t )
t−1

Then, the flight context is computed as follows Cs (t) = βs,t hs . (7)
s=tb
t−1

Cf (t) = αs,t Zs . (4) 3.5.3. Gaussian mixture network
s=ta
Finally, parameters of the gaussian mixture can
The notations Xt and Zt are defined in Section be computed using the two contexts Cs (t) and
2.3, we omit the engine index i to simplify the Cf (t) and data Xt concerning flight t. Vectors
notations. σ is a sigmoid function, w0 , W1 and Cs (t), Cf (t) and Xt are concatenated and fed to
W2 are parameters. t − 1 − ta is the number of a dense neural network. The output is separated
flights considered, smaller is ta and more flights into means μ, standards deviations σ and mixture
are considered. coefficients π. Function softplus (resp softmax) is
The compatibility function used to compute the used to ensure that standard deviations are positive
attention weights for the flight context is the con- (resp the sum of mixture coefficients is equal to
catenation function (see Table 3 from Galassi et al. one).
(2019)). The purpose of this compatibility func- A figure representing the full model is provided
tion is to compute a degree of similarity between (Figure 2).
flight s and t (because more flights are similar,
more we would like to pay attention to flight s
3.6. Loss function
to compute EGT for flight s). We chose to make
this compatibility function learnable in order to Several parameters occurred in the model (in the
let the model learn by itself a suitable measure of recurrent networks, the short term attention model
similarity. However, this represents a cost in terms and the gaussian mixture network). These param-
of computing power during the training phase of eters need to be optimized and thus we need to
the model. Thus, the ta cannot be too small. define a loss function to minimize. The purpose of
this section is to define this function.
3.5.2. Long term attention A common choice to fit parameters of a proba-
The second attention mechanism concerns state bilistic model is to maximize the likelihood func-
vectors. Attention weights are computed accord- tion. Denoting i the engine index, ni the number
ing to the following equations. of flights available for engine i, m the number of
gaussian components, θ parameters of the model
and fμ the probability density function. The like-
bs,t = sim (hs , ht−1 ) . (5) lihood is defined by
exp(bs,t )
βs,t = t−1 . (6)
r=tb exp(br,t )
  i∈[1,n]   n   
Where sim denotes the cosine similarity func- L θ| Zti t∈[1,ni ] = fμ Zti t∈[1,n ] |θ ,
i
i=1
tion.
(8)
Cosine similarity plays the role of compatibility
function for long term attention. Cosine similarity assuming engines are pairwise independent.
does not involve learnable parameters. Therefore,
we are not limited by computing power issues We now focus on one engine and thus omit the
during training phase and are able to consider a lot index i of this engine. Then, it holds
Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022) 3099

Fig. 2. Neural network model for EGT forecasting

   
  L θ| Zti t∈[1,ni ]
fμ (Zt )t∈[1,ni ] |θ (9) ⎛ ⎛ ⎞⎞
ni m j j 2
ni
 ni
 π 1 Z t − μ
=− ln ⎝ t
exp ⎝− t ⎠⎠
= fμ (Xt ) fμ (Zt |Xt , Zt−1 . . . Z1 , θ) . j 2 σtj
t=1 j=1 σt
t=1 t=1
(10) (12)

ni 3.7. Implementation
The density t=1 fμ (Xt ) does not depend on
model parameters θ. Thus this term can be omitted We implemented the model using PyTorch. The
in the loss function. dense network for the short term attention mech-
We denote πtj , μjt , σtj parameters of the gaus- anism is composed of two layers. The dense net-
sian mixture. πtj (same for μjt and σtj ) denotes work that computes the gaussian mixture coeffi-
the mixture coefficient for the flight t and the cients is composed of four layers. Three gaussian
gaussian mixture j (therefore πtj depends on components are used. The model is trained using
Xt , Zt−1 . . . Z1 , θ). a dataset of 800 engines (all Leap1A) and validate
on another dataset of 22 engines. Throughout the
rest of the article, all given results are obtained on
fμ (Zt |Xt , Zt−1 . . . Z1 , θ) this validation dataset.
⎛ ⎞
m j 2
πtj 1 Z t − μ 4. Results and Applications
= j√
exp ⎝− t ⎠.
j=1 σt 2π
2 σtj 4.1. Comparison with a flight by flight
(11) (FbF) dense neural network
In this section, we compare results obtained using
Maximizing the likelihood is equivalent to min- our model to results obtained using a baseline
imizing the negative log likelihood. Therefore, we model. The chosen baseline model is a neural
can choose the following loss function network taking for each engine and each flight
3100 Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022)

the vector Xti and computing a ẽit . To train the

Temperature Difference (Δ °C)


baseline model, we minimized theMSE loss 2(i.e.
 ni
for one engine LM i
SE
= n1i t=1 ẽit − Eti ).
Concerning our model, for each engine and
each flight we use a Monte Carlo methods to
estimate the average MSE. It consists in drawing
several EGT from the given gaussian mixture (we
practice we draw 1000 values) and computing the
mean loss.
Table 1 gives the results and is provided in the
Appendix.
Engines 4 and 5 stand apart. Engine 5 is the
only one for which the baseline performs better Cycles Since New
and the two models perform poorly for engine 4

Temperature Difference (Δ °C)


and 5 compared to all other.
The case of engine number 4 is investigated
below. EGT and model error are provided in the
Appendix (Figure 5). We notice three flights with
an EGT in the cruise phase around 100 degrees
Celsius. such outliers cannot be real data (the EGT
cannot be so low).
The poor performance of the two models is
directly due to these three outliers (Figure 5).

4.2. Performance monitoring: brandnew


engine vs worn engine Cycles Since New
The health state of each engine is represented by
the state vector. This state vector is computed ac- Fig. 3. For engines number 16 and 21 of the validation
cording to past flights of the engine and initialized set. We draw the difference between EGT predicted
at zero for the first flight. We suggest to compare by the model throughout engine’s operation and EGT
the EGT predicted by the model for the engine predicted by the model throughout engine’s operation
to the EGT predicted by the model for the same for the same engine but without aging.
engine but with state vector corresponding to an
unused condition. This technique allows to moni- This approach allows to spot brutal degradation
tor the engine’s performance deterioration across (such that for engine 16 around flight 1400). It also
time. That’s as we did in Figure 3. allows to evaluate the impact of maintenance op-
We emphasize that initially all engine are as- erations (for engine 21 around flight 1500 heavy
sumed equivalent (same state vector). It is not operations are achieved).
the case in practice. Indeed we know from tests
4.3. Engines compared with each other
before delivery that engines can show different
performance. Thus it takes some flights to get the It is also possible to register the state vector during
state vector representative to the real engine state. the engine’s life and to compute an average EGT
That is why, to represent the unused condition, we computed on a sample of flights. If this sample is
fixed the state vector after thirty flights and not sufficiently representative, the average EGT will
zero. It also explains the negative value and high be representative of the overall performance of the
volatility of the indicator at the beginning of the engine. This technique allow to compare engines
engine life. with each other over their lifetime.
Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022) 3101

state vectors calculated from past flights to de-


scribe the health status of the engine at any time.
Moreover, we presented possible applications to
performance monitoring. It includes estimation of
Temperature (°C)

the EGT overheating due to engine wearing and


performance comparisons between engines.
The model is implemented and tested on real
flight data from the Leap1A engine.

Acknowledgement

Cycles Since New This work has been support by a CIFRE contract be-
tween Safran Aircraft Engines and University Paris 1
Pantheon Sorbonne.
Temperature (°C)

Appendix A. Results: comparison with the


baseline.

Table 1. MSE (Mean Squared Error) comparison between


flight by flight neural model and the proposed attention based
model. MSE for attention model is obtained with a Monte
Cycles Since New Carlo method.
Engine number FbF model Attention based model
0 410.30 171.66
Fig. 4. For engine number 8 and 15, we displayed 1 348.65 60.60
the average EGT predicted by the model on the same 2 248.66 159.10
sample of flights (10000 flights drawn randomly). 3 194.72 27.81
4 1274.73 783.82
5 542.94 757.67
We present results on average EGT in Figures 6 166.31 75.89
4. We can observe that engines number 8 and 15 7 316.89 160.93
display very different performances. At the begin- 8 373.70 138.89
ning of their operation, engine 8 seems more ef- 9 175.69 113.50
10 180.19 74.88
ficient than engine 15 (around 30 degrees Celsius
11 156.19 42.35
colder). However, a heavy maintenance operation 12 378.34 50.73
has been performed on engine 15 around flight 13 119.87 111.22
800 thus engine 15 is now colder. 14 216.32 22.56
15 338.66 117.11
5. Conclusion 16 135.21 85.04
17 203.22 142.65
In this article, we introduced a new model based 18 100.02 52.89
on neural network (it includes recurrent networks 19 126.15 109.23
and attention mechanisms) and gaussian mixture 20 44.24 28.19
model in order to estimates the EGT (Exhaust 21 188.26 44.14
Gas Temperature) of engines across time. We used 22 217.84 15.97
3102 Proceedings of the 32nd European Safety and Reliability Conference (ESREL 2022)

Appendix B. Results: EGT for engine Neural Attention Models in Natural Language
number 4. Processing. arXiv:1902.02181 [cs, stat]. arXiv:
1902.02181.
Hochreiter, S. and J. Schmidhuber (1997, 11).
Long Short-Term Memory. Neural Computa-
tion 9(8), 1735–1780.
Temperature (°C)

Lacaille, J., A. Bellas, C. Bouveyron, and M. Cot-


trell (2014). Online Normalization Algorithm
for Engine Turbofan Monitoring.
Lacaille, J., A. Gouby, and O. Piol (2013). Wear
Prognostic on Turbofan Engines. pp. 8.
Langhendries, R. and J. Lacaille (2021). Deep
Learning Model for Context-Dependent Sur-
vival Analysis. Computational Intelligence, 6.
Cycles Since New Liu, J., A. Saxena, K. Goebel, B. Saha, and
W. Wang (2010). An Adaptive Recurrent Neu-
Temperature Difference (Δ °C)

ral Network for Remaining Useful Life Predic-


tion of Lithium-ion Batteries. pp. 10.
Liu, Y. and X. Wang (2021). Deep amp; attention:
A self-attention based neural network for re-
maining useful lifetime predictions. In 2021 7th
International Conference on Mechatronics and
Robotics Engineering (ICMRE), pp. 98–105.
Muneer, A., S. M. Taib, S. Naseer, R. F. Ali, and
I. A. Aziz (2021). Data-driven deep learning-
based attention mechanism for remaining use-
Cycles Since New ful life prediction: Case study application to
turbofan engine analysis. Electronics 10(20).
Fig. 5. On the first figure, we displayed the EGT for Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit,
engine number 4 during cruise phase. On the second L. Jones, A. N. Gomez, L. Kaiser, and I. Polo-
figure, we displayed the difference between real EGT sukhin. Attention is All you Need. pp. 11.
and EGT predicted with our model (estimated with
Wei, M., M. Ye, Q. Wang, Xinxin-Xu, and J. P.
Monte Carlo method) for engine 4 during cruise phase.
Twajamahoro (2022, March). Remaining useful
life prediction of lithium-ion batteries based
on stacked autoencoder and gaussian mixture
regression. Journal of Energy Storage 47,
References 103558.
Bishop, C. M. (2006). Pattern recognition and Wu, Y., M. Yuan, S. Dong, L. Lin, and Y. Liu
machine learning. Information science and (2018). Remaining useful life estimation of
statistics. New York: Springer. engineered systems using vanilla lstm neural
Chen, D. and J. Sun (2018, December). Fuel and networks. Neurocomputing 275, 167–179.
emission reduction assessment for civil aircraft Zhong, S., Z. Li, L. Lin, and Y. Zhang (2018,
engine fleet on-wing washing. Transporta- June). Aero-Engine Exhaust Gas Temperature
tion Research Part D: Transport and Environ- Prognostic Model Based on Gated Recurrent
ment 65, 324–331. Unit Network. In 2018 IEEE International
Galassi, A., M. Lippi, and P. Torroni (2019, Febru- Conference on Prognostics and Health Man-
ary). Attention, please! A Critical Review of agement (ICPHM), Seattle, WA, pp. 1–5. IEEE.

View publication stats

You might also like