You are on page 1of 16

Reliability Engineering and System Safety 211 (2021) 107530

Contents lists available at ScienceDirect

Reliability Engineering and System Safety


journal homepage: www.elsevier.com/locate/ress

Review

Machine learning for reliability engineering and safety applications:


Review of current status and future opportunities
Zhaoyi Xu, Joseph Homer Saleh *
School of Aerospace Engineering, Georgia Institute of Technology, USA

A R T I C L E I N F O A B S T R A C T

Keywords: Machine learning (ML) pervades an increasing number of academic disciplines and industries. Its impact is
Machine learning profound, and several fields have been fundamentally altered by it, autonomy and computer vision for example;
Reliability reliability engineering and safety will undoubtedly follow suit. There is already a large but fragmented literature
Safety
on ML for reliability and safety applications, and it can be overwhelming to navigate and integrate into a
Prognostic and health management
coherent whole. In this work, we facilitate this task by providing a synthesis of, and a roadmap to this ever-
Deep learning
expanding analytical landscape and highlighting its major landmarks and pathways. We first provide an over­
view of the different ML categories and sub-categories or tasks, and we note several of the corresponding models
and algorithms. We then look back and review the use of ML in reliability and safety applications. We examine
several publications in each category/sub-category, and we include a short discussion on the use of Deep
Learning to highlight its growing popularity and distinctive advantages. Finally, we look ahead and outline
several promising future opportunities for leveraging ML in service of advancing reliability and safety consid­
erations. Overall, we argue that ML is capable of providing novel insights and opportunities to solve important
challenges in reliability and safety applications. It is also capable of teasing out more accurate insights from
accident datasets than with traditional analysis tools, and this in turn can lead to better informed decision-
making and more effective accident prevention.

1. Introduction emphasis on the use of computers to statistically estimate complicated


functions and a decreased emphasis on proving confidence intervals
This work provides a synthesis of and a roadmap to the growing and around these functions” [17]. Several academic fields have been
diverse literature on Machine Learning (ML) for reliability engineering fundamentally altered by machine learning, controls and autonomy or
and safety applications. It also outlines future opportunities and chal­ computer vision for example, reliability engineering and safety analysis
lenges for ML in these areas of applications. will undoubtedly follow suit. There is already a large but fragmented
Machine learning pervades an increasing number of fields, from literature on ML for reliability and safety applications, and it can be
banking and finance [1–3] to healthcare [4–6], robotics [7–9], trans­ overwhelming to navigate. We propose to facilitate this task in this work
portation [10–12], e-commerce [13–15], and social networks to by describing this ever-expanding analytical landscape and highlighting
mention a few. Few academic disciplines are likely to remain immune its major landmarks and pathways.
from its influence. Its impact is profound, and it will continue to upend In a previous review work of reliability engineering [18], the author
traditional academic disciplines and industries. This quiet but relentless summarized the progress to date in the field and highlighted some of its
wave of creative disruption was enabled in part by the advent of big data, challenges. He noted, for example, that new reliability analysis tools are
the collection and storage of massive amount of data, and the develop­ required to model the ever-increasing complexity of systems being
ment of powerful models and algorithms to probe it. Roughly speaking, designed. He also highlighted the need for better predictions and un­
“machine learning [is] a set of methods that can detect patterns in data, certainty propagation in reliability and safety analysis. In support of this
and then use the uncovered patterns to predict future data, or to perform observation, Li et al. [19] pointed out that traditional tools for lifetime
other kinds of decision making under uncertainty” [16]. Machine predictions of engineering components are becoming less capable of
learning is “essentially a form of applied statistics with increased meeting the industry’s increasing demand for precision and accuracy.

* Corresponding author.
E-mail address: jsaleh@gatech.edu (J.H. Saleh).

https://doi.org/10.1016/j.ress.2021.107530
Received 9 August 2020; Received in revised form 15 December 2020; Accepted 3 February 2021
Available online 5 February 2021
0951-8320/© 2021 Elsevier Ltd. All rights reserved.
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Fig. 2. Ingredients of machine learning.

and engineering, and it is becoming increasing more so to all other


fields. The different categories of machine learning offer ways for un­
derstanding different aspects of what data is collected. Consider, for
example, a dataset with thousands of observations about the main rotor
diameters of helicopters, their Maximum Takeoff Weight (MTOW), the
number of engines (single or twin), and other features. Assume we want
Fig. 1. Machine learning categories and sub-categories or tasks, the latter are to determine, for instance, if there is a relationship between the main
not meant to be exhaustive. Anomaly detection algorithms exist in supervised
rotor diameter and the other helicopter features. And if so, how can we
and semi-supervised learning modes; the more widely used are in unsupervised
quantify it, and how accurately can we predict the former from the
mode. Semi-supervised regression is a smaller area of research than its classi­
fication counterpart. Not shown here is Active learning for example, in which latter? In this case, we have paired observations with the output of in­
the learning algorithm can query the user to provide labels to carefully selected terest Yi, here the main rotor diameter, and a vector of features Xi, here
previously unlabeled data. MTOW and number of engines (Xi, Yi). In machine learning, the output
of interest Yi is referred to as the label, and it is also known as the
Although these topics will be discussed in the next sections, we simply response or dependent variable. The inputs available Xi are known as
note here that ML is well suited to address these challenges. ML is features, covariates, or predictors. One way of understanding the
capable of providing new insights and opportunities to solve important different categories of ML is in relation to Fig. 2. The unknown function
challenges in reliability engineering and safety analysis. It is also f(X) represents the information that the feature vector X provides about
capable of teasing out more accurate insights from accident datasets, or the label Y. Roughly speaking, machine learning is about estimating this
degradation and survival data for example that were beyond the capa­ function with ̂f .
bility of traditional analysis tools. This, in turn, can lead to better What is “learned” from the data is the model Y ̂ = ̂f (X). In this
decision-making for the design and operation of engineering systems expression, Y is the predicted response variable given the predictors X,
̂
and more effective accident prevention. along with in some cases the uncertainty quantification associated with
Our first objective in this work is to provide a synthesis of, and a
the prediction. Supervised learning consists in estimating ̂f when the
roadmap to current ML use in reliability engineering and safety appli­
dataset includes paired features and labels, that is Xi and Yi. Unsuper­
cations. Our second objective is to outline some future opportunities for
ML in reliability and safety applications. Overall, we hope this work vised learning consists in estimating ̂f when the dataset is unlabeled,
encourages the reliability and safety communities to upgrade their that is, there is no Yi. The dataset in this case consists of only input
traditional analytical toolkits to include (different aspects of) ML and variables. This is akin to searching blindly for patterns in a dataset
leverage its significant potential. without a guide to supervise the learning, hence the qualifier unsuper­
The remainder of the article is organized as following. In Section 2, vised. For this reason, unsupervised learning is also known as knowledge
we provide a brief overview of ML and its main categories and sub- discovery, and it is more challenging than supervised learning. Semi-
categories or tasks. In Section 3, we review the current status of ML supervised learning, as can be surmised, consists in estimating ̂f when
use in reliability and safety, and we provide a synthesis of and a road­ only partial labels are available, that is, some paired observations are
map to this diverse literature. In Section 4, we discuss future opportu­ available, but not all the features have labels associated with them (some
nities for ML in reliability and safety applications. Finally, we conclude Xi have no Yi). Reinforcement learning does not directly fit into this
this work in Section 5. framework, but it roughly consists in exploring for X and searching for
the optimal X∗ given a desired output Y. More details about each cate­
2. Machine learning: A brief overview gory are discussed in the next subsections.

The reader familiar with ML may skip this section and jump ahead to
2.1. Supervised learning (SL)
the applications in Section 3. ML as noted previously, is a set of methods
for learning from data and uncovering patterns in it. This learning, in
Supervised learning, as noted previously, consists in estimating a
turn, can be put to use for inferential and prediction purposes, or to
function that maps an input vector to an output given a dataset of paired
perform some decision-making under uncertainty. Given the type of
observations:
data available and the kind of questions being asked for understanding
it, different categories of ML are available for the task at hand. ML can be ̂f : X→Y
classified into three major categories, with a fourth one straddling the
first two: (i) supervised learning; (ii) unsupervised learning; (iii) semi- Supervised learning is done for inferential or prediction purposes. In
supervised learning; and (iv) reinforcement learning. A tree diagram the inferential case, we seek to understand for example, the strength of
of this categorization along with the common sub-categories is shown in the association between the predictors and the response variable, or how
Fig. 1. A brief introduction to these categories is provided next. the latter varies with the former. For prediction purposes, we seek to
The process of generating and collecting data is essential to science predict the responses for new observed features not in the dataset. Given
the nature of the response variable Y, a quantitative or qualitative

2
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Fig. 3. Illustrations of two types of anomalies: point anomalies (left panel; A1 and A2 are point anomalies), and contextual anomalies (right panel; y1 is a contextual
anomaly, its value is within the range of normal values except it is anomalous in context, here the context is derived from the time-series data). Adapted from Ref. [44].

variable, two major sub-categories of supervised learning are available: predefined, and the observations are assigned to different classes before
regression and classification. the analysis starts (i.e., labeled data). Whereas in clustering, neither the
Regression problems consist in estimating ̂f for which the response groups nor their numbers are known beforehand, and the assignment of
variable is quantitative, such as blood pressure, temperature, weight, or the unlabeled data to specific meaningful clusters is the main outcome of
cost for example. To carry out this estimation, several machine learning the analysis. Choosing the number of clusters, deciding what constitute
models and algorithms are available, from the widely used basic linear similarity in the Xi space (similarity measures), and selecting the eval­
regression (LR) [20] and polynomial response surface (PRS) [21], to uation criteria to assess the quality of the output (e.g., intra-cluster
more advanced techniques, such as support vector regression (SVR) homogeneity and inter-cluster separability) are typical challenges of
[22], decision tree regression (DTR) [23], and random forest regression clustering. Clustering methods are widely used in a host of applications,
(RFR) [24]. Some ML models, such as Gaussian Process Regression from image segmentation and object recognition, to document retrieval/
(GPR) and Bayesian network, offer a distinctive advantage by providing data mining, genomics, and countless e-commerce applications. There is
accurate uncertainty quantification in regression problems [25,26]. a significant number of clustering algorithms to choose from, and it can
Furthermore, with the recent development of deep learning techniques, be overwhelming for the user to navigate the thicket of methods avail­
deep neural network (DNN) models are becoming increasingly popular able for this task. Two excellent reviews of clustering methods can be
with regression for high-dimensional, nonlinear problems [27,28]. found in Jain et al. [37] and Saxena et al. [38]. Clustering methods
The second sub-category of supervised learning shown in Fig. 1, include the widely used K-means algorithm [39], a variety of hierar­
classification, involves a response variable Y that is qualitative, for chical clustering methods [40], density-based clustering (DBC) [41],
example 0/1 or eye colors. Classification problems consists again in and the Gaussian mixing model (GMM) [16]. With the recent develop­
ment of DNN, there is a growing focus on combining deep learning
estimating ̂f given a labeled dataset, then using this learned knowledge,
feature extraction models such as deep auto-encoder, with clustering
for example, to predict the classification probability of new observed
algorithms to handle problems with high-dimensional noisy data [42,
features. The problem may involve an output with only two classes, such
43].
as spam email identification (spam/not spam) [29], or multi-class
The second sub-category of unsupervised learning shown in Fig. 1,
output, such as speech and handwriting recognition [30]. Commonly
anomaly detection, consists in identifying unexpected observations in a
used ML methods for classification problems include logistic regression
dataset. The term anomaly in this context is used in a broader sense than
[31], k-nearest neighbor (KNN) [32], support vector machine (SVM)
how it is understood in the reliability and safety communities. For
[16], decision tree (DT) [16], boosted tree [33], and random forest (RF)
example, a person who steals your credit card will likely make purchases
[34]. An excellent reference for a wide range of classification models can
that derive from a different probability distribution of purchases than
be found in Ref. [35]. As with regression tasks, DNNs models are also
your own. Anomaly detection, as understood in machine learning, will
used for classification [16,36]. We will provide more details and
flag these purchases as likely fraudulent. More generally, anomaly
examine the use of supervised learning in reliability engineering and
detection in ML refers to “the problem of finding patterns in data that do
safety application in Section 3.
not conform to expected normal behavior” [44]. Anomaly detection
algorithms have found a vast range of applications in many domains
2.2. Unsupervised learning (USL) because they often produce critical information that can be acted upon
and prompt meaningful intervention. For example, anomaly detection is
Unsupervised learning consists in examining datasets with only input used in cyber-security and intrusion detection [45], in banking and in­
variables Xi and no corresponding label or response Yi. This problem surance fraud detection, in medical applications [46], and increasingly
setup can be disconcerting at first, and it is fair to inquire about its in reliability and safety applications as will be discussed in Section 3.
objective and what can be “learned” in this situation. We first note that Two generically different types of anomalies are worth distinguishing,
unlike supervised learning, unsupervised learning is not concerned with and they lead to different types of algorithms to detect them: (i) point
predicting an output variable since none is available. Its objective anomalies, and (ii) contextual anomalies. The simplest are point anoma­
instead is to explore the feature space Xi and find patterns in the dataset. lies, and they are defined as individual observations that can be assessed
Two major sub-categories or tasks of unsupervised learning are available as anomalous with respect to other data points. The left panel in Fig. 3
based on the nature of the patterns sought: clustering and anomaly illustrates point anomalies within a 2D feature space. Non-anomalous
detection. data occur in dense neighborhoods, whereas point anomalies occur in
Clustering, also known as unsupervised classification, consists in low density regions far from their closest neighbors or established
dividing the observations into clusters that share some similarities in the clusters, as seen in Fig. 3. Contextual anomalies are more interesting and
feature space. Clustering and (supervised) classification share on the more difficult to handle than point anomalies. If a data point is anom­
surface some similarities in that they both deal with observations in alous in a given context, but not so when evaluated in isolation, it is
groups; except in supervised classification the groups or classes are termed a contextual anomaly. The context can be broadly defined, and it

3
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Fig. 4. Illustration of classification under supervised (left panel) and semi-supervised learning. The ‘x’ and ‘o’ belong to two separate classes, but only the bold ones
are labeled. If the learning algorithm only makes use of the labeled data (supervised mode), the decision boundary will pass through high-density regions in the
feature space and misclassify many data points (poor accuracy). A semi-supervised learning algorithm will leverage the low-density, cluster assumption to produce
the decision boundary in the right panel, which improves the classification accuracy. Adapted from Ref. [53].

is often specified in terms of spatial data or time-series data. For used in speech recognition, internet content classification, bioinformatic
example, the right panel in Fig. 3 illustrates a contextual anomaly in a [50], and a host of other applications. These methods leverage, for
time-series. example, generative models [51] and graph-based algorithms [52] to
Data for anomaly detection can come in streaming fashion, in which learn from the mixed labeled–unlabeled data (they differ by how the
case it requires online analysis, or the entire dataset can be available at unlabeled data is used, and by the choice of the loss function and reg­
the onset, and the analysis carried out offline. These lead to different ularizer). Two excellent surveys of semi-supervised learning can be
types of algorithms for which computational efficiency is paramount (in found in Ref. [53] and the more recent Ref. [54]. We will provide more
the streaming case) or not. Anomaly detection algorithms have lever­ details and examine the use of semi-supervised learning in reliability
aged different tools, for example, nearest-neighborhood and clustering- engineering and safety application in Section 3.
based or spectral-based anomaly detection algorithms. Some of the
commonly used machine learning anomaly detection algorithms include
2.4. Reinforcement learning (RL)
self-organizing maps (SOM) [47], K-mean [39], adaptive resonance
theory (ART) [48], and one-class support vector machine (OC-SVM)
Reinforcement learning is a distinctive set of techniques, which,
[49]. An excellent survey of anomaly detection can be found in
unlike the previous ML categories, does not experience or operate with a
Ref. [44]. We will provide more details and examine the use of unsu­
fixed dataset. Instead, reinforcement learning algorithms interact with
pervised learning in reliability engineering and safety application in
an environment, and as a result, their experience becomes part of the
Section 3.
dataset from which they learn. They explore by trial-and-error in­
teractions with the environment during the training process. A pre-
2.3. Semi-supervised learning (SSL)
requisite for this feedback loop between experience and learning is the
presence of a reward function. Reinforcement learning is therefore used
In some problems, such as image recognition, photo categorization,
in different contexts, by software and machines, to make decisions
and autonomous driving, there can be a limited labeled dataset, i.e.,
(sequentially) and find a course of action (policy) that maximizes the
paired observations (Xi,Yi), along with a much larger unlabeled dataset
agent’s cumulative reward. The reward can be defined for a specific
of features (Xj with no Yj). Obtaining unlabeled data is often easy or
outcome or with a specific function of the state of the agent. A funda­
cheap, whereas expanding the labeled dataset can be time-consuming or
mental issue in reinforcement learning algorithms is setting parameters
expensive. In these situations, neither supervised nor unsupervised are
that achieve a proper balance between agent exploration and exploita­
suitable for handling these problems. Semi-supervised learning is a set of
tion to balance the stability and efficiency of the learning process [55].
methods for these types of problems, and which bridge the gap between
An over-exploration agent may have difficulties converging to an
supervised and unsupervised approaches. They use the labeled dataset
optimal course of actions, whereas an over-exploitation agent is likely to
to estimate ̂f and leverage the much larger unlabeled dataset, which get trapped in a local optimum and fail to find the global optimal so­
under certain assumptions1 can help improve the model’s performance lution. Reinforcement learning has very broad applications, including
and provide more accurate predictions, as shown in Fig. 4 in the context robotics and autonomy [56], gaming [57], and online advertisement
of a classification problem. Semi-supervised learning methods have been [58]. The major reinforcement learning algorithm includes policy
gradient [59], Q-learning [60], deep Q network (DQN) [61], and
actor-critic algorithm [62]. We will examine the use of reinforcement
1
Generally described as the smoothness assumption, the cluster assumption, learning in reliability engineering and safety application in the next
and the manifold assumption for classification problems. These assumptions are section.
related, for example the smoothness assumption states that if two points are in a
high-density region, then so should their corresponding labels. The cluster
assumption states that if points are in the same cluster (unsupervised learning), 3. Machine learning applications in reliability engineering and
then they are likely to be in the same class as well (supervised learning). This is safety applications: Overview and current status
also known as the low-density separation assumption, which states that the
decision boundary should lie in a low-density region [53,54]. See Fig. 4 for an In this section, we review the current use of ML in reliability engi­
illustration. neering and safety applications. We also introduce some of the

4
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Table 1
Selection from the literature on ML regression models in reliability and safety applications.
ML model Reported advantage Model disadvantage Application Ref.

Vector machine:

Support vector regression (SVR) 1. Superior prediction accuracy than the 1. Not suitable for sparse and high 1. Failure and reliability prediction of [65]
traditional method dimensional data time series data [67]
2. Robust to data noise 2. Requires prior knowledge for kernel 2. RUL estimation [71]
3. Superior efficiency than traditional simulation selection 3. RUL estimation for aircraft engines [72]
models. 4. Approximations of fire hazard model
at nuclear power plants
relevance vector machine 1. Superior prognostic accuracy than traditional 1. Not suitable for sparse and high 1. System degradation prognostic [63]
(RVM) methods dimensional data

Neural network:

Deep neural network (DNN) 1. Excellent prediction accuracy and training 1. Computationally expensive 1. RUL of aircraft engine prediction [64]
efficiency 2. Hard to interpret the ‘black box’ 2. Human errors prediction [68]
2. Excellent long- and mid-term prediction model 3. Component reliability and [73]
accuracy degradation level prediction
Convolutional neural network 1. Excellent prediction accuracy to highly 1. Computationally expensive 1. RUL estimation [74]
(CNN) nonlinear, complex, multidimensional system 2. Requires large training dataset
Recurrent neural network 1. No need for prior knowledge and analysis of the 1. Suffers from the vanishing gradient 1. RUL estimation [75]
(RNN) dataset problem
Deep bidirectional long short- 1. No need for prior knowledge and assumptions 1. Computationally expensive 1. RUL estimation [76]
term memory (LSTM) 2. More accurate estimation than traditional
methods
CNN based LSTM 1. Suitable for more complex modern engineering 1. Computational expensive 1. Multi-scale feature selection and RUL [66]
system, and high dimensional input 2. Prone to overfitting estimation
LSTM and gated recurrent unit 1. Excellent efficiency and accuracy for nonlinear 1. Computational expensive 1. RUL prediction and PHM [77]
complex system

Gaussian process:

Gaussian process regression 1. Superior accuracy and efficiency than traditional 1. Requres prior knowledge for kernel 1. Time-dependent failure probability [69]
(GPR) methods selection prediction [78]
2. Suitable for nonlinear high dimensional system 2. Inferior computational scalability 2. Structural reliability analysis and [79]
analysis with training dataset size failure probability estimation [80]
3. Dynamic updating of the model parameter 3. System reliability analysis [81]
4. Active learning and excellent performance in 4. Failure-pursuing sampling [82]
complicated real-world applications framework
5. Reliability-based importance analysis
of structural system

algorithms and provide a short guide to the growing literature in these corresponding ML models.
areas. While no claim of exhaustiveness can be made when undertaking
such a task, we have attempted to sample major publications in each 3.1.1. Regression
category and subcategories of ML discussed previously. In addition, we Supervised regression is widely used for remaining useful life (RUL)
devote the last subsection to the use of Deep Learning (DL) in reliability estimations and degradation predictions. The literature in this area in­
and safety applications, as separate from the rest despite its overlap with cludes applications of ML to different engineering items, for example, Li-
the previous categories to highlight its growing popularity and empha­ ion batteries [63], railway tracks [64], turbine cutting tools [65], rolling
size its distinctive advantages. bearings [66], and aircraft engine [67,68]. In addition to these different
The basic ingredients for a machine learning algorithm are the domains of application, researchers have explored different ML models,
following: (i) datasets for training and testing purposes, (ii) an objective for example, support vector machine/regression [67], Gaussian Process
function or loss function to optimize, for example, a sum of squared Regression (GPR) or kriging [69], and Deep Learning for structural
errors or a likelihood function, and (iii) an optimization algorithm and a reliability problems, RUL prediction, fire hazard simulation, and more
model for the data (e.g., linear, nonlinear, nonparametric). By varying generally for engineering surrogate modeling applications [70]. A se­
any one of these ingredients, it is easy to conceive of the very wide range lection from this ML regression literature is provided in Table 1.
of applications of ML for reliability and safety applications, for example An accurate prediction of an equipment RUL and degradation level is
by examining different datasets in various industries, by applying essential for a safety-critical system, and it is important for condition-
different ML models to various systems or components, and by modi­ based maintenance to minimize system downtime and other adverse
fying some models and algorithms to better suit the task at hand. The consequences of a run-to-failure approach. RUL prediction is an
objective is generally to tease out novel, more accurate results from important element of prognostics and health management (PHM), and
datasets for better reliability and safety-informed decision-making, and the demand for better PHM requires, among other things, a more ac­
more effective accident prevention. Some of the recurrent themes in the curate estimation of the RUL. Approaches to RUL and degradation level
literature we examine next include ML use for the estimation of an asset predictions can be grouped into two broad categories, with a third
remaining useful life, anomaly and fault detection, health monitoring, hybrid one straddling the first two: (i) model-based approaches, which
maintenance planning, and degradation assessment. build failure models based on the detailed analysis of the physical nature
of the failure mechanism under consideration [83,84]. These models
require extensive prior knowledge and subject-matter expertise; (ii)
3.1. Supervised learning applications data-driven approaches, which build degradation models from historical
sensor data, and therefore require no prior knowledge of the system.
In this subsection, we review some of the regression and classifica­ These models can be built with various ML approaches, and they can
tion applications, and we provide brief introductions to some of the

5
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Table 2
Selection from the literature on ML classification models in reliability and safety applications.
Technique Reported advantage Model disadvantage Application Ref.

Conditional models:

Decision tree (DT) 1. Excellent accuracy with great training efficiency 1. Not robust to data noise 1. Assessing stakeholders corporate [94]
governance [96]
2. Dirt and mud detection on a wind [99]
turbine blade
3. Forest fire risk assessment
Random Forest (RF) 1. Suitable for discrete classification 1. More complex than DT model 1. Rank the importance of each [100]
2. Excellent estimation accuracy 2. Hard to interpret the ‘black box’ component of an engineering system
model
Boosted Tree 1. Excellent accuracy and overfitting free 1. Over-sensitive to outliers in the 1. Risk-based inspection screening [98]
training dataset assessment

Nearest neighborhood

K-nearest neighborhood 1. Excellent accuracy and efficiency 1. Not suitable for high dimensional 1. Risk-based inspection screening [94]
(KNN) data assessment [98]
2. Not robust to data noise 2. Dirt and mud detection on a wind
turbine blade

Support vector:

Support vector 1. Highly efficient with up to two orders of magnitude 1. Not suitable for sparse and high 1. Risk-based inspection screening [92]
classification (SVC) time saving compared with traditional methods dimensional data assessment [94]
2. Robust to appraiser-to-appraiser output variation, 2. Requires prior knowledge for kernel 2. Dirt and mud detection on a wind [97]
and better prediction accuracy selection turbine blade [98]
3. Robust to dataset noise 3. Bearings fault diagnosis [101]
4. Early detection of gradual concept [102]
drifts
4. Reliability analysis of network
connectivity
5. Structure health monitoring
Relevance vector 1. High classification accuracy 1. Not robust to data noise 1. Structural damage detection [103]
machine (RVM)

Bayesian method:

Linear discriminant 1. Excellent prediction accuracy 1. Not suitable for nonlinear 1. Dirt and mud detection on a wind [94]
analysis (LDA) applications turbine blade
Bayesian net 1. Available for a complex system with excellent 1. Requires prior knowledge of the 1. Water distribution system pipe [91]
accuracy and efficiency application failures analysis [100]
2. Suitable for discrete classification and satisfying 2. Rank the importance of components
estimation of the engineering system
3. Seismic loss analysis of spatially
distributed infrastructure system
Gaussian process 1. Suitable for complex system 1. Requres prior knowledge for kernel 1. Reliability evaluation of complex [104]
classification (GPC) 2. Excellent computational efficiency selection system
2. Inferior computational scalability
with training dataset size

Neural network:

Deep neural network 1. Accurate classification accuracy in high dimensional 1. Computationally expensive for 1. Structural reliability analysis and [105]
(DNN) problem model training failure probability estimation
Stacked autoencoder 1. Robust to the input noise 1. Computationally expensive 1. Bearing fault diagnosis [93]
2. Excellent accuracy
Recurrent neural 1. Superior prediction accuracy than traditional 1. Suffers from the vanishing gradient 1. Software reliability prediction [95]
network (RNN) parametric models problem
Deep belief network 1. Efficient inference 1. Computationally expensive 1. Health state classification [90]
2. Able to encode richer and higher-order complex
systems
Long short-term memory 1. Capable of the safety analysis of time-varying systems 1. Computational expensive 1. Dynamic predictive maintenance [106]
(LSTM) 2. No need for prior assumption and knowledge 2. Prone to overfitting framework for failure prognostics

vary in accuracy and computational intensity given the quality and maximum distance from the nearest training data points in order to
quantity of data available. We briefly introduce next two powerful and reduce the classification or regression error. The original SVM model
commonly used ML regression models in reliability and safety applica­ was designed to handle problems that are linearly separable in the
tion, namely support vector regression (SVR) and Gaussian Process design space (features). However, real-world applications often exhibit
Regression (GPR). Deep Learning regression models are discussed in nonlinearities in their features (Xi) and their relationship to the output
Section 3.5. variable (Yi). To overcome this drawback, the kernel method is used
Support vector machine (SVM), also known as Support Vector with SVM, which maps the original low dimensional nonlinear problem
Regression (SVR) when applied to regression problems, is a supervised into a higher dimensional space where the data is linearly separable. For
ML approach that can be used for both regression and classification example, the Gaussian radial basis function, which maps the original
tasks. The principle of SVM is to construct a set of hyperplanes in the data to infinite-dimensional space [22], is a popular kernel for SVM
feature space (Xi) to conduct regression or classification with the input applications. The reader is referred to Ref. [22] for more details on the
data. The parameters are calculated such that the hyperplanes are at a SVM method. Variants of SVM, such as the relevance vector machine

6
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

(RVM) [85] have also been developed for equipment degradation and case, the DT classifier identifies the feature region in which the new case
RUL prediction applications. resides. For this reason, DTs are described as conditional control-based
GPR, also known as Kriging, is a Bayesian statistical learning algorithms in which each node of the tree structure represents a “test”
approach to regression problems. In GPR, the response function is on a feature, and each branch a possible outcome of the test [107]. In
defined as a stochastic posterior distribution in the inference equation, order to improve the accuracy and avoid potential overfitting with DT,
with the Gaussian process prior conditioned on the input dataset. The ensemble DTs such as random forest (RF), have been developed. The DT
GPR infers a probability distribution with mean and standard deviation and its variants are suitable for classification problems where the deci­
estimations of the normally distributed response function. With a proper sion boundaries between different classes are sharp. Other classifiers
selection of Gaussian process kernel and corresponding hyper­ may be better suited for handling smooth boundaries, for example,
parameters, GPR and its variants such as active learning Kriging (ALK), support vector classification (SVC) and Bayesian classification models
can provide accurate response function estimation and uncertainty which are also popular in fault detection and identification and operate
quantification [86], and they are increasingly used in engineering sur­ with somewhat similar mechanisms to SVR and GPR regression models
rogate modeling and reliability analysis, [87,88]. The reader is referred discussed previously. More details can be found in Ref. [35].
to Ref. [89] for more details on the GPR. Which models are better? Given the extent of supervised machine
learning models available, it is fair to ask which ones are better or best.
3.1.2. Classification The answer derives from what are known as the “No Free Lunch” The­
Supervised classification is widely used in fault detection and iden­ orems [108], which roughly state that no single classification model is
tification. This includes both online monitoring of the degradation states universally better than any other algorithm over all applications. The
of equipment and diagnosing different types of faults (binary and multi- theorems further demonstrate that all classification models have the
class). Classification in this context is at the nexus of two broader con­ same error rates when averaged over all possible data-generating dis­
siderations, PHM and predictive maintenance, and it provides critical tributions. In other words, all classification (and regression) models are
information that helps inform the conduct of both. The literature in this created equal, and their performance is similar when averaged across all
area includes the use of classification ML tools for fault detection and possible applications. While this absence of universally better/best
identification in a wide range of reliability and safety applications, for model might be disappointing at first, it is in fact what makes ML
example, aircraft engine and electric power transformers [90], water research and use in different applications rich in possibilities: it prompts
distribution and pipe failures [91], bearings or rotary machines [92,93], analytical creativity to explore different models and tailor the investi­
wind turbine blades [94], software reliability [95], and forest fires [96]. gation to the specific application at hand. Which model is better/best is
In addition to these different domains of application, researchers have highly dependent on the specific application under consideration. For
also developed or adapted ML classifiers for specific PHM purposes. For example, the linear discriminator analysis (LDA) classifier can achieve
example, Islam and Kim [92] developed an improved one-against-all excellent accuracy and outperforms other models, such as DL methods,
multiclass support vector machine classifier and used it with acoustic for computational efficiency when the application is fundamentally
signals to diagnose eight types of fault classes in bearings (seven linear. However, this algorithm is totally off for complicated applica­
different fault types and one nominal operational conditions). Stern tions when the underlying data-generation process is nonlinear. This
et al. [97] used ML classifiers (SVM and logistic regression) to develop makes the model selection. The practical implication is for researchers
surrogate models for the reliability of a complex two-terminal network. and data scientists to search for the best model(s) given their particular
They demonstrated high prediction accuracy and order of magnitude domain of applications, and to pay careful attention to the model se­
smaller computational effort than traditional Monte Carlo Simulation. lection phase discussed previously.
Rachman and Ratnayake [98] benchmarked the performance of
different ML classifiers for equipment screening and risk-based inspec­ 3.2. Unsupervised learning applications
tion assessment (with application to oil and gas pressure vessels and
piping lines). They demonstrated better accuracy and precision with ML In this subsection, we review some of the unsupervised clustering
classifiers than conventional methods of screening for inspection and a and anomaly detection applications, and we provide brief introductions
reduction of output variability between human appraisers. A selection to some of the corresponding ML models.
from this ML classification literature is provided in Table 2.
ML classification is enabled by a set of models, from the simple k- 3.2.1. Clustering
nearest neighbor (KNN) and logistic regression to more advanced De­ Clustering or unsupervised classification is not as widely used in
cision Trees (DT), linear discriminant analysis (LDA), and support vector reliability and safety applications as its supervised counterpart. It has
classification (SVC). In addition to these single-classifier methods, significant untapped potential, which other fields mentioned previously,
ensemble classifiers have been developed that incorporate multiple such as image recognition, genomics, and e-commerce leverage to a
single classifiers for better performance. These include, for example, fuller extent. Some applications of clustering include degradation
Random Forest (RF), and AdaBoost (AB). As with regression models, the analysis in railway point machines [109], wind turbine failures [110],
availability of a wide range of ML classifiers makes it important for fault detection in the nuclear industry [111], damage classification in
research in reliability and safety applications to benchmark and assess structural components [112], bearing faults identification in rotating
the performance of different classifiers, and then select the most machines [113], and blasting operations in mining activities [114].
appropriate one given the datasets used. This model selection phase Beyond these specific areas of applications, some authors have also
makes a work more convincing and lends its findings more weight. leveraged clustering tools for broader, methodological problems in
Unfortunately, it is missing in some publications, and we recommend reliability and safety. For example, Fang and Zio [115] adapted a form
that authors, reviewers, and editors be more mindful of this expectation clustering for network reliability and criticality analysis. They leveraged
of transparency for comparative model analysis and selection when clustering to demonstrate a novel hierarchical modeling framework and
writing or reviewing manuscripts in this area of applications. to extract structural properties of the network (in a reliability sense).
We briefly introduce next one popular ML classifier in reliability and Soualhi et al. [116] developed a novel unsupervised classification
safety application, namely decision tree (DT) to illustrate some aspects technique called Artificial Ant Clustering (AAC). They successfully
of ML classification. DT uses a tree-like structure in which each node applied it to identify operational modes and diagnose two types of faults
leads to different branches based on a threshold value for a covariate or in an induction motor (broken rotor bars and bearing failures). The
feature variable [107]. In essence, DT slices the feature space into authors demonstrated significant improvements in clustering error rates
multiple regions. Then, for predicting the response variable of a new test compared with other methods for their particular application, even

7
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Table 3 Table 4
Selection from the literature on ML clustering (unsupervised classification) Selection from the literature on ML anomaly detection models in reliability and
models in reliability and safety applications. safety applications.
Technique Reported Model Application Ref. Technique Reported Model Application Ref.
advantage disadvantage advantage disadvantage

Self-organizing map: Support vector:

Self-organizing 1. Capable of 1. Hard to 1. Fault [109] Support vector 1. Excellent 1. Not suitable for 1. Real-time [126]
map (SOM) detecting interpret the detection and [111] machine accuracy and sparse and high Motor machine [128]
various levels ‘black box’ model isolation [117] (SVM) efficiency for dimensional data failure
of fault with scheme for feature 2. Requires prior identification
high accuracy pneumatic extraction knowledge for and early fault
2. Promising actuator 2. Accurate kernel selection diagnosis
result in early 2. Early Fault in detecting 2. Spacecraft
fault detection detection of the early health
of real the nuclear signs of monitoring
application industry system
with excellent anomalies
visualization
Neural net:
analysis
Principle 1. High 1. Computatinally 1. Damage [112] Convolutional 1. Excellent 1. 1. Real-time [128]
component efficiency to expensive classification neural accuracy and Computationally Motor machine
analysis-self- high 2. Hard to in structural network efficiency for expensive failure
organizing dimension interpret the health (CNN) feature 2. Requires large identification
map (PCA- system with ‘black box’ model monitoring extraction training dataset and early fault
SOM) excellent diagnosis
accuracy Deep neural 1. General for 1. 1. Anomaly [125]
network different Computationally detection for gas [130]
Distance metric learning:
(DNN) applications expensive turbine [131]
K-mean 1. High 1. Requires prior 1. Associating [110] and prior 2. Hard to combustors [132]
detection knowledge for k weather assumption interpret the 2. Wind turbine [133]
accuracy and selection condition and free. ‘black box’ model fault detection [134]
efficiency wind turbine 2. Superior and monitoring
failures anomaly
Neighborhood 1. Efficient for 1. Requires 1. Fault [113] detection
component high manual selection detection to accuracy
analysis dimensional of the class bearings 3. Suitable
input data number for detecting
with multiple
outstanding anomalies
accuracy Deep auto- 1. Accurate 1. 1. Anomaly [124]
Spectral 1. Capable of 1. 1. Network [115] encoder and efficient Computationally detection and
clustering identifying the Computationally component- fault expensive fault
most relevant expensive with a level safety detection in disambiguation
clusters large dataset and support aircraft flight in large flight
criticality data data
analysis Long short- 1. Capable of 1. Computational 1. Detecting [127]
term supervising a expensive spacecraft
Bayesian method:
memory large amount 2. Prone to anomalies
Bayesian net 1. Accurate 1. Requires prior 1. Analyze the [114] (LSTM) of high overfitting
prediction knowledge of the risk of dimensional
with limited application accident in spacecraft
information complex telemetry
blasting data with
operations good
accuracy
Deep neural network:
2. Capable of
Long short- 1. Excellent 1. Computational 1. Estimate [118] performing
term memory efficiency and expensive the health anomaly
(LSTM) accuracy 2. Prone to index of a detection
overfitting system without
costly expert
knowledge
when information about the different operating modes of the motors was
limited. A selection from this ML clustering literature is provided in
space based on some similarity measures such as Euclidean distance
Table 3.
[119,120]. Details about SOM can be found in Ref. [121]. Examples of
ML clustering is enabled by a set of models such as the popular K-
the use of SOM in reliability and safety applications are provided in
means algorithm and principal component analysis (PCA), as well as a
Table 3.
wide range of hierarchical clustering methods. We briefly introduce next
Distance metric learning is a category of machine learning algo­
two popular clustering models in reliability and other applications,
rithms that extracts similarity information from the input features
namely the self-organizing map (SOM) and the distance metric learning.
themselves. It can be applied with both supervised classification tools
SOM is an unsupervised learning method based on neural networks. It is
such as KNN, and unsupervised clustering tools such as K-mean. Several
a popular dimension reduction tool that maps the high-dimensional
ML algorithms rely on being provided a good similarity metric over their
feature space to a smaller, typically two-dimensional space, and it pro­
input data to perform effectively. The quality of their output is highly
duces a low dimensional representation of the input data. Following this
dependent on the similarity metric used, and distance metric learning
dimensionality reduction, SOM identifies clusters in the new feature

8
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

can considerably improve their classification or clustering performance. compare them with the observed ground truth [130]. The NBM has been
Details about distance metric learning can be found in Ref. [122,123]. applied to wind turbine monitoring [130-134] and showed good per­
Examples of its use in reliability and safety applications are provided in formance with excellent fault detection accuracy in this application.
Table 3. One challenging domain of application in this area which will likely
witness a significant growth in the coming years is anomaly detection for
3.2.2. Anomaly detection spacecraft operation and health monitoring. Spacecraft can be particu­
Anomaly detection is particularly well suited for and used in early larly complex and expensive systems with thousands of telemetry data
fault/damage detection of engineering equipment and structures. It is points for monitoring the health and performance of all its subsystems
intrinsically related to sensor data, and for industrial machinery and and payload. Since physical access to the craft is limited or impossible,
equipment, the data typically comes in a streaming fashion. Early detecting anomalous behavior as early as possible is particularly
detection of anomalies is essential in some contexts to prevent further important in order to provide satellite operators with sufficient lead-
damages and preempt catastrophic failures. For structures such as time to prevent the incipient fault from further developing, or to
beams, airframes, or bridges, the data for anomaly detection has both develop a workaround that limits its damage and consequences. Some
spatial and temporal dimensions. Although we have listed anomaly spacecraft missions can downlink gigabytes to terabytes of data per day,
detection under unsupervised learning, algorithms for anomaly detec­ and as a result, there is a critical need for: (i) automated anomaly
tion also exist in supervised and semi-supervised mode. That being said, detection methods that can support/augment limited engineering flight
the more widely used though are in the unsupervised mode, often resources for manual monitoring of spacecraft health; (ii) highly accu­
because unlabeled data is widely available and labeled data is expensive rate and robust detection algorithms, since on the one hand a missed
(and often rare) to obtain. In reliability and safety applications, data is detection can lead to dramatic consequences such as complete loss of
often available for (labeled) nominal operational conditions only. When mission, and on the other hand given the high volume of telemetry data
that is the case, anomaly detection can leverage tools of semi-supervised provided, even a small false alarm rate will lead to a voluminous number
learning since partial labels are available. of false alarms and overwhelm the operators, thus defeating the purpose
The literature includes applications of anomaly detection in support for which the anomaly detection system was designed in the first place,
of PHM for different engineering systems, for example, aircraft flight namely to alleviate the operators’ workload; and (iii) highly scalable
data recorders [124], industrial gas turbines [125], spacecraft operation anomaly detection algorithms that can operate efficiently with large
and health monitoring [126,127], and induction motors with a focus on multivariate streaming data. These challenges are far from being
ball-bearing faults [128]. Applications of anomaly detection algorithms resolved. They are likely to become more critical in the near future, and
for structural damage detection also abound, a discussion of which can there are significant opportunities for progress and meaningful contri­
be found in [44]. A selection from this ML anomaly detection literature butions in this area.
is provided in Table 4. To date, spacecraft anomaly detection is mostly based on fixed-
Many of the same tools of clustering are also used or adapted for threshold alarms or out-of-limit (OOL) method, thus treating the prob­
anomaly detection. Some of the recent works on anomaly detection in lem as a detection of point anomalies. This is appropriate in some cases,
reliability and safety applications also made important methodological but it misses the richness of the context and the signatures of many
contributions to the field. For example, Ince et al. [128] leveraged contextual anomalies in times series data for early detection. A more
Convolutional Neural Networks (CNN) to develop a highly accurate and insidious limitation of this approach is that it assumes that all anomalous
robust detection method of faults in bearings of induction motors. behaviors are known beforehand, and the corresponding thresholds are
Feature extraction is typically computationally expensive, and the ac­ set and programmed to be detected. This of course is rarely the case, and
curacy of anomaly detection algorithms often hinges on the careful se­ as a result, current spacecraft anomaly detection systems only target a
lection of features. The authors addressed both limitations by small subset of possible anomalies, and those that are likely to be
developing a novel CNN model that merges feature extraction and detected are not flagged early enough (or at their earliest possible
(unsupervised) classification into a single learner. They validated their detection). Two recent contributions that tackled these problems and
method by directly using raw sensor data to detect anomalies (no pre­ worth noting. Fuertes et al. [126] proposed a One-Class Support Vector
processing or input transformation). This is a meaningful contribution Machine (OC-SVM) anomaly detection algorithm with some
that can enable real-time anomaly detection capability with streaming pre-processing of the input telemetry data. Their algorithm showed
data. Yan and Yu [125] used Deep Learning tools for detecting abnormal promising results and detected anomalies that had not been detected by
behavior and incipient faults in industrial turbines where failures can be current monitoring systems. But it came at the price of a high false alarm
catastrophic and extremely costly. The authors recognized that rate, and the authors recognized that this challenge should be addressed
“advanced technologies that can improve detection performance are in “before this type of surveillance can be accepted and trusted by
great need” in their domain of application. Their model architecture [spacecraft] operational teams.” Hundman et al. [127] examined
integrated a stacked denoising autoencoder [129] with a neural network (expert-labeled) anomalies from the Soil Moisture Active Passive
known as Extreme Learning Machine (ELM) to significantly improve on (SMAP) satellite and the Mars Science Laboratory (MSL) Curiosity rover,
the sensitivity and specificity of other anomaly detection methods for and they reported that 41% were context anomalies. Thus, a significant
gas turbines. The SDAE+ELM model uses exhaust gas temperature proportion of spacecraft anomalies are not suitable for or cannot be
measurement to infer the operational condition of the combustor in the handled effectively with the prevalent OOL approaches and detection
gas turbine and detect anomalies. Just like Ref. [128], Yan and Yu [125] algorithms for point anomalies. The authors then leveraged advances in
demonstrated that their ML model can automatically extract the right Deep Learning [17] to develop a type of recurrent neural network (RNN)
features—a critical and challenging task—from the raw time-series known as Long Short-Term Memory (LSTM) for spacecraft anomaly
temperature measurements to improve the anomaly detection without detection that is capable of flagging context anomalies. LSTM is
the time-consuming “handcrafted” feature extraction phase. well-suited for modeling temporal data and prioritizing historical in­
A relatively recent anomaly detection approach is known as the formation for future predictions [135,136]. The authors demonstrated
Normal Behavior Model (NBM) [130-134] (NBM) and is reported to the viability of LSTM for detecting anomalies in near real-time space­
have several advantages, including being (i) prior assumptions free, and craft telemetry with over 700 channels. However, one major obstacle for
(ii) capable of ‘learning’ multiple normal conditions and detect different deployment they reported was the same as the one faced by Fuertes et al.
types of anomalies. The NBM is defined as a ML model, normally a DNN, [126], namely high rates of false positives. “High demands are placed on
trained based on the sensor signals during the normal operating condi­ operations engineers, and they are hesitant to alter [existing] proced­
tions of the system. The trained model is then used to predict signals and ures. Adopting new technologies [for spacecraft anomaly detection]

9
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

Table 5 algorithm for the RUL prediction of bearing and cooling fans. They
Selection from the literature on semi-supervised learning in reliability and safety demonstrated better accuracy in RUL prediction, along with a smaller
applications. scatter of the results, compared with a SL learning algorithm. They used
Technique Reported Model Application Ref. nonetheless a high ratio of labeled to total data (50%), which is much
advantage disadvantage larger than what is typically found in SSL applications. He et al. [138]
Graph based model: used ladder network (LN), a form of deep denoising auto-encoder to
estimate the RUL of centrifugal pumps, with an unsupervised feature
Graph based 1. Excellent 1. Require prior 1. Fault [142]
model self-learning knowledge for detection [143] extraction operating offline, and the prediction performed online with
(GBM) ability GBM model 2. Health streaming data. Yoon et al. [140] used a different strategy to estimate
2. Excellent building prognostic and the same quantity, the RUL of turbofan engines, but using a variational
data maintenance auto-encoder (VAE) trained on both the unlabeled and labeled data
visualization activity
3. Excellent identification
simultaneously, and with a fraction of the labels down to 1% of the
accuracy and entire dataset. Both works demonstrated good prediction accuracy even
great when the available label information was highly limited. A selection
efficiency from this SSL literature is provided in Table 5.
Support vector machine: Co-training is an example of “wrapper methods” in SSL, which are
One class 1. Excenlent 1. Not suitable for 1. Near-miss [144]
simple approaches to extending supervised algorithms to the semi-
support near miss high dimensional fall detection supervised mode by performing the following steps: first, a regressor
vector prediction data or classifier is trained on the labeled data, and predictions are performed
machine on the unlabeled data. Second, the most confident predictions are
(OCSVM)
pseudo-labeled and added to the training dataset. The regressor or
Semi- 1. Accurate for 1. Not robust to 1. Fault [141]
supervised complex high- data noise diagnostic classifier is then retrained on the expanded dataset in a supervised
support dimensional mode, “unaware of the distinction between originally labeled and
vector system pseudo-labeled data” [54]. The process is repeated until a termination
machine criterion is reached and the regressor or classifier cannot be further
(S3VM)
improved.
Neural network: Although we mentioned previously examples of semi-supervised
Deep 1. Superior 1. Unstable 1. Asset failure [140] regression problems, the majority of applications of semi-supervised
generative accuracy than generative model prediction learning are in classification, and in reliability and safety applications,
models traditional training
these revolve around fault detection and identification. We briefly
methods
Ladder 1. Superior 1. 1. RUL [138]
discuss next one interesting work in this area. Zhao et al. [142] devel­
network efficiency and Computationally prediction for oped a semi-supervised method for fault detection and classification in
accuracy than expensive centrifugal solar photovoltaic (PV) arrays. The authors noted that “without proper
traditional pumps fault detection, unnoticed faults in PV arrays might lead to safety issues
methods
and fire hazards. Conventional fault detection and protection usually
Long short- 1. Reliable 1. Prone to 1. RUL [139]
term RUL overfitting prediction for add overcurrent protection devices (OCPD)”, but certain faults have
memory- prediction problem turbofan been shown to be unnoticeable by these devices. Furthermore, fault
restricted with 2. Not robust to the engine classification (identification of the types of fault) can be particularly
Boltzmann insufficient initialization
helpful in helping to identify the maintenance needs to expedite the
machine labeled data setting
(LSTM-
recovery of the system. The authors developed a graph-based method
RBM) (GMB) that uses existing measurements such as PV voltage, current, and
Radial basis 1. Excellent 1. Hard to 1. Data-driven [137] operating temperature, with a low training cost leveraging a large pool
network accuracy and interpret the ‘black accident of unlabeled data (less than 2% of the data is labeled) and self-learning
(RBN) robustness. box’ model prognostics,
to achieve a remarkable detection and classification accuracy (99%)
RUL prediction
under real working conditions. GBMs are the focus of one of the most
active areas of research in semi-supervised learning. Their common
means an increased risk of wasting valuable time and attention. Inves­ denominator is that the data are represented by the nodes (V) of the
tigation of even a couple of false alarms can deter users, and therefore graph, and the edges (E) are labeled by a measure of similarity between
achieving high precision […] is essential for adoption.” two nodes, the Euclidean distance for example or some other (geodesic)
In short, many challenges remain but also significant opportunities distance. GBMs are based on the manifold assumption, which roughly
exist for leveraging anomaly detection algorithms in a host of reliability states that high dimensional data lie on a low dimensional manifold, and
and safety applications. We will discuss some of them in Section 4. nearby data in the feature space have similar label prediction. In most
instances, the predictions of GBM consist in labeling the unlabeled
3.3. Semi-supervised learning applications dataset based on local similarities between nodes or a weighted distance
of the edges. “A parallel can be drawn between [graph-based] methods
Semi-supervised learning algorithms are used in fault detection and and supervised nearest-neighbor methods. The latter predicts the label
identification, in prognostics and RUL prediction, which are important of an unlabeled data point by looking at the labels of similar (i.e.,
for maintenance planning. These methods have significant, and in our nearby) labeled data-points; graph-based methods also consider the
opinion, barely tapped potential for reliability and safety applications similarity between pair of unlabeled data points. Using that information,
where unlabeled data is abundant but labeled (e.g., failure) data is labels can be propagated transitively form a labeled data point to an
scarce and expensive to come by, or it takes a long time to collect. The unlabeled data point” [54]. The reader is referred to Ref. [145] for more
literature in this area includes applications of SSL in support of fault details on GBM. Other approaches to SSL include SVM-based methods
detection and PHM or RUL prediction for different engineering systems, such as semi-supervised support vector machine (S3VM) and one-class
for example cooling fans [137], centrifugal pumps [138], turbofan en­ support vector machine (OC-SVM) [146,147].
gines [139,140], bearing defects in induction motors [141], and solar One last note of caution regarding SSL methods the reader should be
arrays [142]. Hu, Youn, and Kim [137] used a co-training regression SSL aware of is the publication bias and the importance of safe semi-

10
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

supervised learning. While the vast majority of articles on SSL report at identifying near-optimal policies for traditionally intractable systems
improved accuracy and better prediction than their SL counterparts, this and other computationally intractable problems in reliability and safety
needs not be the case in real-world applications. The performance of a applications.
regressor or classifier can degrade when unlabeled data is exploited One last important subfield of research within RL is worth
alongside labeled data if the underlying assumptions of SSL are violated. mentioning as the literature and applications in this area are likely to
Such findings would rarely be reported given the publication bias to­ experience significant growth in the coming years, that is, Safe Rein­
ward positive results, thus skewing the perception that SSL methods are forcement Learning (SRL). In all the previous discussions, we noted how
unconditionally successful. The theme of safe semi-supervised learning is a ML algorithms have been used to address a reliability or safety appli­
growing research area with meaningful recent progress, and it examines cation; in SRL the reverse relationship holds, namely safety consider­
the circumstances under which unlabeled data can confidently be inte­ ations are brought to bear on the learning algorithm to avoid exploring
grated into the learner without degrading its performance.2 The reader hazardous or unsafe states (known more generally as error states) during
is referred to Li and Liang [148] and the references therein for an the learning process. SRL is defined as “the process of learning policies
introduction to these issues. that maximize the expectation of return in problems in which it is
important to respect safety constraints” during the learning and
3.4. Reinforcement learning applications deployment phases3 [157]. This is crucial in some situations, for
example when the learning phase occurs online with hardware in the
Reinforcement learning is not yet as broadly used in reliability and loop, not through simulators. Reinforcement Learning, as noted in Sec­
safety applications as supervised or unsupervised learning. Yet, it offers tion 2.4, involves an agent perceiving the state of the environment and
significant opportunities for making important contributions to these learning a course of action (policy) that maximizes its cumulative
areas. We briefly some recent publications from 2019 to 2020 in this reward. In many robotics applications, whether the algorithm is learning
area. to fly a drone or perform surgery for example, safety is particularly
Some recent applications include reliable handover in cellular important, and avoiding damage to the agent or the environment is
network operations for mobility robustness optimization (MRO). This is essential (e.g., it is not an efficient learning algorithm if it crashes a
a challenging problem for traditional rule-based methods, the objectives drone repeatedly before it is capable of skillfully flying it). Safety con­
of which are to minimize the number of dropped calls/unsatisfied cus­ straints or preferences have been incorporated into RL algorithms either
tomers, increase each cell throughput, and ensure a more balanced by modifying the optimization criterion, or by modifying the exploration
network using cell load-sharing [149]. The authors developed a Deep process through the use of risk metrics, or by incorporating external
Reinforcement Learning (DRL) solution that outperformed (on user QoS) knowledge about undesirable states [157]. With the proliferation of
and required fewer parameters to tune than traditional methods for social robots, Safe Reinforcement Learning will remain a vital research
reliably handling wireless user handover across cells. Reinforcement area, and its applications will extend to incorporate different forms of
learning has also been used for condition-based maintenance planning risks. For more details on Safe Reinforcement Learning, the reader is
with multi-component systems subject to competing risks [150]. referred to Garcia and Fernandez [157] and Munos et al. [158] and the
Condition-based maintenance (CBM) has traditionally been examined references therein.
with Markov Decision Processes (MDP) [151] and its variant the
partially observable Markov decision processes (POMDP) [152,153,
3.5. A note on deep learning
154]. MDP and POMDP can efficiently handle small systems with a
limited number of states and possible actions for decision-making. They
This last subsection overlaps with the previous ones since in dis­
strain, however, at handling large (real) systems in which the number of
cussing earlier applications leveraging for example CNN or LSTM, we
states and actions grows exponentially with the number of components.
were in fact presenting some of the use of deep learning (DL) in reli­
For example, a system with “20 components, 5 states and 5 actions per
ability and safety applications. We include this subsection nonetheless to
component is described by nearly 1014 states and actions! This renders
highlight DL’s growing popularity and emphasize its distinctive ad­
the problem practically intractable by any conventional scheme or
vantages over other types of ML. DL is a subset of machine learning
advanced MDP or POMDP algorithm” [155]. With the increasing
(DL⊂ML), and it is capable of handling all categories of ML shown in
complexity of engineering systems, optimization of CBM becomes
Fig. 1: supervised, unsupervised, semi-supervised, and reinforcement
challenging if not computationally intractable with MDP and other
learning. DL consists of a collection of connected computational neu­
conventional decision-making tools. Andriotis and Papakonstantinou
rons, organized in multiple layers,4 between inputs and outputs, the
[155] leveraged Deep Reinforcement Learning to identify efficient in­
whole capable of learning more complex functions than a single neuron5
spection and maintenance policies for large-scale infrastructure systems.
or layer. Each layer extracts some features from its inputs, and each
Xiang et al. [156] developed a Deep Reinforcement Learning-based
subsequent layer extracts features out of the previous, lower-level fea­
sampling method for structural reliability assessment. The sampling
tures. In this sense, DL extracts high-level latent features (e.g., a human
space is treated as the state of the DRL method, and the sample selection
face, a cat, a sentiment) from lower-level features and data (e.g., corners,
as its action (coupled with a reward function to guide the deep neural
contours, pixels, letters). The idea of a nested hierarchy of features, each
network in selecting the sampling points). The proposed method ach­
one defined in terms of simpler ones is the essential pillar from which DL
ieved a higher prediction accuracy than competing sampling methods
derives its great power and flexibility. The depth in DL refers to the
such as Monte Carlo Simulation and Latin hypercube sampling. Rein­
presence of hidden layers in the network between the inputs and the
forcement Learning techniques are well suited to alleviate the curse of
dimensionality in large state and action space systems, and as such they
are ideal for handling the scheduling of inspection and maintenance of 3
There are other definitions of safe reinforcement learning, see for example
large engineering systems across their life cycle. The recent advances in Munos et al. [158].
Deep Reinforcement Learning [61] make these tools even more effective 4
With different network architectures, feedforward (DNN) or with feedback
looks (recurrent neural networks or RNN).
5
Typically executing the sigmoid function, the tanh, the softmax function, or
2
Safe in this context is not used in its traditional sense of free from danger or more recently the rectified linear unit (ReLU) and the leaky ReLU.
harm, but to “indicate that the performance [of the SSL learner] is never worse
than methods using only labeled data” [148]. The corresponding noun in this
context is safeness, not safety.

11
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

outputs, the results of their computations are not seen by the user or at 4. Future opportunities: Promising reliability and safety
the output. It was the challenges faced by shallow machine learning al­ applications of ML
gorithms6 and their failure to address central tasks in AI, such as object
recognition in computer vision or speech recognition (high-dimensional The previous section provided an overview of the large but frag­
data), that led to the developments and breakthroughs in DL. To date, DL mented literature on ML in reliability and safety applications. This ever-
technology is ubiquitous in smart phones for example, in natural lan­ expanding analytical landscape can be overwhelming to navigate and
guage processing, machine translation, image processing, and recom­ integrate into a coherent whole. We sought to facilitate this task by
mender systems used by Netflix, Amazon, and YouTube for example. “looking back” and providing a scaffolding for, and a wide panoramic
More generally, problems with high-dimensional data in which the curse view of this field of applications and its major themes. In this section, we
of dimensionality is a major impediment—and there are scores of them “look ahead” and outline some future opportunities for ML in reliability
in reliability and safety applications, for example in prognostics and and safety applications. We have already touched on this in the previous
health management—are candidates for being effectively handled by DL section, for example when discussing Safe Reinforcement Learning and
algorithms: variational autoencoders. Some themes or domain of applications dis­
cussed next are likely to touch on ongoing efforts by researchers, others
“Machine learning problems become exceedingly difficult when the
are more speculative in nature and are included here because of their
number of dimensions in the data is high. This phenomenon is known as
potential to help advance our common reliability and safety agenda:
the curse of dimensionality. Of particular concern is that the number of
to engineer more reliable and safer systems, cost effectively, to
possible distinct configurations of a set of variables increases exponen­
advance accident and injury prevention, and to make the world a
tially as the number of variables increases. [Making the mild assumption
safer place.7 The tone hereafter is conversational, and the discussion
that the data was generated by] a composition of factors, possibly at
high-level. We restrict the discussion to our (subjective) Top 5 list of
multiple levels in a hierarchy allows an exponential gain […]. The
most promising opportunities and themes for leveraging ML in service of
exponential advantages conferred by the use of [deep learning] counter
this reliability and safety agenda. This is by design a short, non-
the exponential challenges posed by the curse of dimensionality.” [17]
exhaustive list intended not to further prolong this work and to invite
A high-level overview of DL can be found in Ref. [159]. The appli­ other researchers to reflect on our selection and complement it with
cations of DL in reliability and safety are steadily growing, in anomaly theirs:
detection for example, fault classification, RUL estimation, maintenance
planning, and more broadly in PHM. Some of these applications were 1. Uncertainty quantification and reduction, and deployment
discussed in the previous subsections and will not be repeated here. A considerations: in all the previous discussions of ML applications,
few others are included here for illustrative purposes. For example, whether RUL prediction, anomaly detection, fault classification, or
Tamilselvan and Wang [90] developed a deep belief network (DBN) to more generally any type of PHM activity, improving the accuracy of
perform system health diagnostic and classify different states of the the prediction was a main driver for the use of ML. For anomaly
system. The authors benchmarked the DBN classification performance detection or classification, reducing false alarm rates (or misclassi­
against other shallow ML methods such as SVM, and they reported su­ fication) and missed detection is paramount for a broader adoption
perior classification accuracy. For unsupervised learning, using ap­ of ML. These considerations will continue to be central to ML ap­
proaches such as SOM and autoencoder, DL is highly capable of handling plications. For meaningful impact, the literature on ML for reliability
high-dimensional and time sequential clustering and anomaly detection. and safety applications would be well served to address, even if
For example, Reddy, et al. [124] developed a multi-modal deep briefly, deployment considerations, which will inevitably lead to
auto-encoder approach to realize unsupervised anomaly detection of issues of accuracy and uncertainty quantification for more informed
flight raw time series data. The authors validated their deep autoencoder decision-making. Predicting a single value for an asset RUL, for
on the NASA DASHlink open database and reported excellent anomaly example, is limited; providing confidence intervals around a RUL
detection accuracy. expectation is more useful; identifying the sources of uncertainty in
As noted previously, DL algorithms are capable of handling high- the calculation and adopting models that both quantify and reduce
dimensional input data. Furthermore, when equipped with convolu­ the variability in the calculations will be more convincing for the ML
tional layers, they become highly effective in handling extreme high- technology to be adopted and have a meaningful impact in the field.
dimensional data sources such as images and videos. Li et al. [74] 2. Machine learning for fleet-level PHM, and more broadly, ML for
developed CNN model to perform RUL prediction. The authors validated system-of-system PHM: this theme was identified and discussed by
their model performance using NASA’s turbofan engine degradation Fink et al. [162]. We repeat it here because we agree with its
dataset [160,161] with multiple sensor signals as input. They reported importance and give it a slightly different twist or complementary
higher accuracy and lower root mean square error (RMSE) compared perspective. Many of the applications we examined in Section 3
with other shallow ML methods. Some applications of DRL were noted in addressed PHM issues at the component or subsystem level, ball
the previous subsection in connection with inspection and maintenance bearings for example, compression stage in a turbine, induction
planning. An excellent survey of DL applications in PHM is provided by motors, or batteries. The future challenges of PHM and promising
Fink et al. [162]. applications are likely to be at a higher level of aggregation, at the
In all applications to date, DL has significantly outperformed shallow fleet level, for example at the level of a mega-constellation of satel­
ML algorithms. While this need not reflect a universal truth, we believe lites, a fleet of trucks, or a wind turbine farm. More generally, we
the return on investment in DL methods for reliability and safety propose that one particularly fruitful direction for ML is not just in
application is far more likely to yield more dividends than any other support of PHM at the system level, but more broadly ML for
shallow methods. system-of-systems (SoS) reliability and safety applications. This will
raise several interesting multidisciplinary challenges, including the
need to better integrate ML with systems engineering models and
human factors, and to address the variability in configurations and
operational conditions of different units in the fleet or systems within

6
Learning algorithms that have no hidden layers (no depth), which in
7
essence means learning directly from the features in the data without a hier­ In the products we design and operate, and more broadly at work, at home,
archical composition of new high-level features from lower-level ones. during commutes, and while engaging in leisurely activities.

12
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

the SoS. The advantages and limitations of transfer learning will


likely be central in this context. More details on fleet level PHM can
be found Fink et al. [162].
3. Integration of ML with accident databases and Safety Manage­
ment Systems: We strongly believe there are many opportunities for
teasing out more, better insights from accident databases and safety
management systems (SMS) by using ML tools than with traditional
statistical tools. These databases can be institutional with govern­
ment entities or professional societies for example, and SMS are
pervasive across all hazardous industries (SMS also includes near
miss management systems (NMS), and these are goldmines for ML
tools to probe). The next generation of smart SMS will likely include
ML tools for seamlessly integrating safety-related heterogeneous
Fig. 5. Schematic of the GAN training process.
data collected across different time scales and through different
means (e.g., real-time online monitoring, inspections, manually
manuscripts in this area of applications. To help researchers focus on
coded NMS data) to improve the efficacy of accident precursor
more promising ML models for reliability and safety applications, we
identification and safety interventions. We believe a most promising
briefly note in this paragraph two powerful ML models that are likely
application of ML is in unleashing its power to harvest more value
to outperform others. They make excellent candidates for addressing
from near miss data and other accident databases for ultimately
the uncertainty quantification and reduction issue raised in (1) and
improving accident and occupational injury prevention. Data link­
all the other applications we listed subsequently. They are the Deep
ages with other non-accident databases can further help provide
Gaussian process (DGP) and the generative adversarial network
measures of exposures for determining accident and injury rates,
(GAN). DGP is a multilayer hierarchical generalization of the
comparing across similar contexts or industries, and identifying
Gaussian process, and it is formally equivalent to neural networks
predictive features and the strength of their associations with the
with infinitely wide hidden layers. The powerful Gaussian Process
outcomes for better interventions and accident prevention.
Regression model is a special case of DGP with a single layer [164].
4. ML integration with wearable computing/sensor for predictive
DGP is a nonparametric probabilistic model with excellent accuracy
safety analytics: one particularly useful category of applications for
and better uncertainty estimation and propagation than alternative
advancing the safety agenda noted previously is the integration of
ML models. Details about this state-of-the-art ML method can be
ML with wearable computing/sensors for predictive safety analytics.
found in Damianou and Lawrence [164]. GAN is another powerful
This can take many forms, and it can address a wide range of safety
ML generative method developed by Goodfellow et al. [165]. The
and injury prevention problems. For example: (i) falls among the
GAN architecture consists of two neural networks, a generator G and
elderly is a leading cause of death and injuries among the older
a discriminator D, competing with each other in a classification task
adults, 65+, and in the U.S. it leads to more than 2.7 million hos­
for example, the latter (D) trying to accurately classify an object, the
pitalizations and 27,000 deaths annually [163]. The associated
former (G) generating (increasingly more) fake objects to trick the
mental health toll and financial burden are significant. ML and
discriminator and degrade its performance. It is this antagonism
wearable computing are likely to be very helpful in advancing our
between the discriminator and the generator, or their cat-and-mouse
understanding of this major public health issue. This is a most worthy
game that confers superior performance to GAN models. The training
topic for ML and safety researchers to contribute to and make a dent
of GAN, shown in Fig. 5, involves both finding the optimal param­
in addressing it; (ii) in the U.S. workers sustain over 5000 fatal
eters for the discriminator to maximize its classification of accuracy
occupational injuries and 2.8 million nonfatal injuries every year.
and those of a generator to maximize the confusion of the discrimi­
Personal protective equipment (PPE) is the last line of defense
nator. The training process is iterative until a Nash equilibrium is
against the adverse consequences of an accident. A smart PPE that
reached [166].
integrates wearable sensors can be helpful in reducing this burden of
occupational injury. For example, workers at a construction site can
Taking the advantages of their excellent accuracy and efficiency in
wear smart PPE, and their collective streaming data handled by a
handling high-dimensional data, we anticipate that many promising
centralized ML system that performs a safety supervisory function
opportunities will adapt DGP and GAN models for fault detection and
and some safety analytics; (iii) body-cameras coupled with ML image
classification in a wide range of reliability and safety applications, and
and activity recognition can provide support for technicians during
more generally that future advanced PHM approaches will leverage
maintenance tasks and inspection. This in turn can be integrated
these models for more accurate diagnostic, prediction, and uncertainty
with the permit-to-work system at a company and provide significant
analysis in complex engineering systems.
safety benefits if properly designed, tested, and rolled out. Beyond
We thus conclude our Top 5 list of most promising opportunities and
these specific examples in (i), (ii), and (iii), we strongly believe there
themes for leveraging ML in service of this reliability and safety agenda.
are many fruitful opportunities for exploring applications that inte­
There are many other themes and applications we did not include for the
grate ML with wearable computing for predictive safety analytics
sake of brevity, for example the challenges of “small data” in reliability
and injury prevention.
and safety applications (beyond the familiar “big data” context of ML),
5. Deep Gaussian Process (DGP) and Generative Adversarial
the inescapable use of ML for safe autonomous transportation (ground
Network (GAN) for reliability and safety applications: Unlike the
and air), and the need to better integrate ML with risk analysis tools and
previous four suggestions which were application-centric, our last
concepts.
suggestion is tool-centric and proposes model candidates that might
effectively address our previous suggestions. The previous sections
5. Conclusion
made it clear that there are scores of machine learning models and
algorithms, and this wide availability makes it even more important
Machine learning pervades an increasing number of fields. Its impact
to benchmark the performance of different approaches for model
is profound, and it will continue to upend traditional academic disci­
selection. We recommended previously that authors, reviewers, and
plines and industries. Several academic fields have been fundamentally
editors be more mindful of this expectation of transparency for
altered by ML, controls and autonomy or computer vision for example,
comparative model analysis and selection when writing or reviewing

13
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

[6] Panch T, Szolovits P, Atun R. Artificial intelligence, machine learning and health
systems. J Glob Health 2018;8(2).
[7] Stone P, Veloso M. Multiagent systems: a survey from a machine learning
perspective. Auton Robots 2000;8(3):345–83.
[8] Mosavi A, Varkonyi A. Learning in robotics. Int J Comput Appl 2017;157(1):
8–11.
[9] Polydoros AS, Nalpantidis L. Survey of model-based reinforcement learning:
applications on robotics. J Intell Robot Syst 2017;86(2):153–73.
[10] Bhavsar P, Safro I, Bouaynaya N, Polikar R, Dera D. Machine learning in
transportation data analytics. Data analytics for intelligent transportation
systems. Elsevier; 2017. p. 283–307.
[11] Zantalis F, Koulouras G, Karabetsos S, Kandris D. A review of machine learning
and IoT in smart transportation. Future Internet 2019;11(4):94.
[12] Ermagun A, Levinson D. Spatiotemporal traffic forecasting: review and proposed
Fig. 6. ML for reliability and safety application. Not addressed in this work is directions. Transport Rev 2018;38(6):786–814.
[13] Ballestar MT, Grau-Carles P, Sainz J. Predicting customer quality in e-commerce
the reverse of this relationship, namely the reliability and safety of ML systems
social networks: a machine learning approach. Rev Managerial Sci 2019;13(3):
(and more broadly of AI systems). 589–603.
[14] Rath M. Machine Learning and Its Use in E-Commerce and E-Business. Handbook
of research on applications and implementations of machine learning techniques.
reliability engineering and safety will undoubtedly follow suit. There is IGI Global; 2020. p. 111–27.
already a large but fragmented literature on ML for reliability and safety [15] Elmurngi E, Gherbi A. An empirical study on detecting fake reviews using
applications, and it can be overwhelming to navigate and integrate into machine learning techniques. In: 2017 seventh international conference on
innovative computing technology (INTECH). IEEE; August 2017. p. 107–14.
a coherent whole. In this work, we sought to facilitate this task providing
[16] Murphy KP. Machine learning: a probabilistic perspective. MIT press; 2012.
a scaffolding for, and a wide panoramic view of this ever-expanding [17] Goodfellow I, Bengio Y, Courville A. Deep learning. MIT press; 2016.
analytical landscape and highlighting its major landmarks and [18] Zio E. Reliability engineering: old problems and new challenges. Reliab Eng Syst
pathways. Saf 2009;94(2):125–41.
[19] Li X, Zhang W, Ding Q. Deep learning-based remaining useful life estimation of
We first provided an overview of the different ML categories and sub- bearings using multi-scale feature extraction. Reliab Eng Syst Saf 2019;182:
categories or tasks. We then “looked back” and reviewed the use of ML in 208–18.
reliability engineering and safety applications. We selected and dis­ [20] Freedman DA. Statistical models: theory and practice. cambridge university press;
2009.
cussed a few publications in each category/sub-category, and we [21] Edwards JR. Polynomial regression and response surface methodology. Perspect
devoted a short subsection to the use of Deep Learning in this area to Org Fit 2007:361–72.
highlight its growing popularity and emphasize its distinctive advan­ [22] Smola AJ, Schölkopf B. A tutorial on support vector regression. Stat Comput
2004;14(3):199–222.
tages over other types of ML models. Finally, we “looked ahead” and [23] Lewis RJ. An introduction to classification and regression tree (CART) analysis.
outlined some promising future opportunities for leveraging ML in ser­ In: Annual meeting of the society for academic emergency medicine in14; May
vice of advancing reliability and safety considerations. 2000.
[24] Liaw A, Wiener M. Classification and regression by randomForest. R News 2002;2
One topic not addressed here (except for the brief discussion of safe (3):18–22.
reinforcement learning) is illustrated in Fig. 6, and it concerns the [25] Minh LQ, Duong PLT, Lee M. Global sensitivity analysis and uncertainty
reverse relationship of the scope of this work, namely the reliability and quantification of crude distillation unit using surrogate model based on Gaussian
process regression. Ind Eng Chem Res 2018;57(14):5035–44.
safety of ML systems, and more broadly of Artificial Intelligence (AI)
[26] Kabir HD, Khosravi A, Hosen MA, Nahavandi S. Neural network-based
systems. This is an important topic, and it deserves careful attention and uncertainty quantification: a survey of methodologies and applications. IEEE
dedicated treatment in a separate publication. Accidents in ML are Access 2018;6:36218–34.
defined as “unintended and harmful behavior that may emerge from [27] Qiu X, Zhang L, Ren Y, Suganthan PN, Amaratunga G. Ensemble deep learning for
regression and time series forecasting. In: 2014 IEEE symposium on
poor design” of the system [167]. There is a need for new risk analysis computational intelligence in ensemble learning (CIEL). IEEE; December 2014.
methodologies for ML/AI systems, for ways of reducing their accident p. 1–6.
risks and addressing their fundamental failure mechanisms. An intro­ [28] Zhang D, Lin J, Peng Q, Wang D, Yang T, Sorooshian S, Liu X, Zhuang J. Modeling
and simulating of reservoir operation using the artificial neural network, support
duction to this topic can be found in Amodei et al. [167], and we propose vector regression, deep learning algorithm. J Hydrol (Amst) 2018;565:720–36.
to examine it more carefully in future work. [29] Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Al Najada H. Survey of
review spam detection using machine learning techniques. J Big Data 2015;2(1):
23.
[30] Hirschberg J, Manning CD. Advances in natural language processing. Science
Declaration of Competing Interest 2015;349(6245):261–6.
[31] Hosmer Jr DW, Lemeshow S, Sturdivant RX. Applied logistic regression, 398.
The authors have declared that no competing interests exist. John Wiley & Sons; 2013.
[32] Dudani SA. The distance-weighted k-nearest-neighbor rule. IEEE Trans Syst Man
Cybern 1976;(4):325–7.
Acknowledgments [33] Chen T. Introduction to boosted trees, 22. University of Washington Computer
Science; 2014. p. 115.
[34] Breiman L. Random forests. Mach Learn 2001;45(1):5–32.
This work was supported in part by a Space Technology Research [35] Duda RO, Hart PE, Stork DG. Pattern classification. John Wiley & Sons; 2012.
Institute grant from NASA’s Space Technology Research Grants [36] Sibi P, Jones SA, Siddarth P. Analysis of different activation functions using back
Program. propagation neural networks. J Theor Appl Inf Technol 2013;47(3):1264–8.
[37] Jain AK, Murty MN, Flynn PJ. Data clustering: a review. ACM Comput Surv
(CSUR) 1999;31(3):264–323.
References [38] Saxena A, Prasad M, Gupta A, Bharill N, Patel OP, Tiwari A, Er MJ, Ding W,
Lin CT. A review of clustering techniques and developments. Neurocomputing
2017;267:664–81.
[1] Leo M, Sharma S, Maddulety K. Machine learning in banking risk management: a
[39] Teknomo K. K-means clustering tutorial. Medicine (Baltimore) 2006;100(4):3.
literature review. Risks 2019;7(1):29.
[40] Johnson SC. Hierarchical clustering schemes. Psychometrika 1967;32(3):241–54.
[2] Boughaci D, Alkhawaldeh AA. Appropriate machine learning techniques for
[41] Kriegel HP, Kröger P, Sander J, Zimek A. Density-based clustering. Wiley
credit scoring and bankruptcy prediction in banking and finance: a comparative
Interdiscip Rev: Data Min Knowl Discov 2011;1(3):231–40.
study. Risk Decis Anal 2020;(Preprint):1–10.
[42] Shah SA, Koltun V. 2018. Deep continuous clustering. arXiv preprint arXiv:1
[3] Kou G, Chao X, Peng Y, Alsaadi FE, Herrera-Viedma E. Machine learning methods
803.01449.
for systemic risk analysis in financial sectors. Technol Econ Dev Economy 2019;
[43] Ghasedi Dizaji K, Herandi A, Deng C, Cai W, Huang H. Deep clustering via joint
25(5):716–42.
convolutional autoencoder embedding and relative entropy minimization. In:
[4] Beam AL, Kohane IS. Big data and machine learning in health care. JAMA 2018;
Proceedings of the IEEE international conference on computer vision; 2017.
319(13):1317–8.
p. 5736–45.
[5] Char DS, Shah NH, Magnus D. Implementing machine learning in health
care—Addressing ethical challenges. N Engl J Med 2018;378(11):981.

14
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

[44] Chandola V, Banerjee A, Kumar V. Anomaly detection: a survey. ACM Comput [78] Sun Z, Wang J, Li R, Tong C. LIF: a new Kriging based learning function and its
Surv (CSUR) 2009;41(3):1–58. application to structural reliability analysis. Reliab Eng Syst Saf 2017;157:
[45] Tsai CF, Hsu YF, Lin CY, Lin WY. Intrusion detection by machine learning: a 152–65.
review. Expert systems with applications 2009;36(10):11994–2000. [79] Zhang J, Xiao M, Gao L. An active learning reliability method combining kriging
[46] Beam AL, Kohane IS. Big data and machine learning in health care. JAMA 2018; constructed with exploration and exploitation of failure region and subset
319(13):1317–8. simulation. Reliab Eng Syst Saf 2019;188:90–102.
[47] Cottrell M, Olteanu M, Rossi F, Villa-Vialaneix N. Theoretical and applied aspects [80] Yang X, Liu Y, Mi C, Tang C. System reliability analysis through active learning
of the self-organizing maps. Advances in self-organizing maps and learning vector Kriging model with truncated candidate region. Reliab Eng Syst Saf 2018;169:
quantization. Cham: Springer; 2016. p. 3–26. 235–41.
[48] Carpenter GA, Grossberg S. Adaptive resonance theory. Springer US; 2010. [81] Jiang C, Qiu H, Yang Z, Chen L, Gao L, Li P. A general failure-pursuing sampling
p. 22–35. framework for surrogate-based reliability analysis. Reliab Eng Syst Saf 2019;183:
[49] Manevitz LM, Yousef M. One-class SVMs for document classification. J Mach 47–59.
Learn Res 2001;2(Dec):139–54. [82] Wei P, Liu F, Tang C. Reliability and reliability-based importance analysis of
[50] Tu E, Yang J. 2019. A Review of Semi Supervised Learning Theories and Recent structural systems using multiple response Gaussian process model. Reliab Eng
Advances. arXiv preprint arXiv:1905.11590. Syst Saf 2018;175:183–95.
[51] Kingma DP, Mohamed S, Rezende DJ, Welling M. Semi-supervised learning with [83] Pecht M, Gu J. Physics-of-failure-based prognostics for electronic products. Trans
deep generative models. In: Advances in neural information processing systems; Inst Meas Control 2009;31(3–4):309–22.
2014. p. 3581–9. [84] Jouin M, Gouriveau R, Hissel D, Péra MC, Zerhouni N. Particle filter-based
[52] Subramanya A, Talukdar PP. Graph-based semi-supervised learning. Synthesis prognostics: review, discussion and perspectives. Mech Syst Signal Process 2016;
Lect Artif Intell Mach Learn 2014;8(4):1–125. 72:2–31.
[53] Zhu XJ. Semi-supervised learning literature survey. University of Wisconsin- [85] Tipping ME. The relevance vector machine. In: Advances in neural information
Madison Department of Computer Sciences; 2005. processing systems; 2000. p. 652–8.
[54] Van Engelen JE, Hoos HH. A survey on semi-supervised learning. Mach Learn [86] Girard A. Approximate methods for propagation of uncertainty with Gaussian
2020;109(2):373–440. process models (Doctoral dissertation. University of Glasgow; 2004.
[55] Yang HK, Chiang PH, Ho KW, Hong MF, Lee CY, 2019. Never forget: balancing [87] Parussini L, Venturi D, Perdikaris P, Karniadakis GE. Multi-fidelity Gaussian
exploration and exploitation via learning optical flow. arXiv preprint arXiv:1901 process regression for prediction of random fields. J Comput Phys 2017;336:
.08486. 36–50.
[56] Da Silva FL, Costa AHR. A survey on transfer learning for multiagent [88] Busby D. Hierarchical adaptive experimental design for Gaussian process
reinforcement learning systems. J Artif Intell Res 2019;64:645–703. emulators. Reliab Eng Syst Saf 2009;94(7):1183–93.
[57] Leal LE, Chapman A, Westerlund M. Reinforcement learning for extended reality: [89] Schulz E, Speekenbrink M, Krause A. A tutorial on Gaussian process regression:
designing self-play scenarios. In: HICSS; 2019. p. 1–8. modelling, exploring, and exploiting functions. J Math Psychol 2018;85:1–16.
[58] Zhao X, Xia L, Tang J, Yin D. Deep reinforcement learning for search, [90] Tamilselvan P, Wang P. Failure diagnosis using deep belief learning based health
recommendation, and online advertising: a survey by Xiangyu Zhao, Long Xia, state classification. Reliab Eng Syst Saf 2013;115:124–35.
Jiliang Tang, and Dawei Yin with Martin Vesely as coordinator. ACM sigweb [91] Tang K, Parsons DJ, Jude S. Comparison of automatic and guided learning for
newsletter. Spring; 2019. p. 4. Bayesian networks to analyse pipe failures in the water distribution system.
[59] Sutton RS, McAllester DA, Singh SP, Mansour Y. Policy gradient methods for Reliab Eng Syst Saf 2019;186:24–36.
reinforcement learning with function approximation. In: Advances in neural [92] Islam MM, Kim JM. Reliable multiple combined fault diagnosis of bearings using
information processing systems; 2000. p. 1057–63. heterogeneous feature models and multiclass support vector Machines. Reliab
[60] Watkins CJ, Dayan P. Q-learning. Mach Learn 1992;8(3–4):279–92. Eng Syst Saf 2019;184:55–66.
[61] Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, [93] Tao S, Zhang T, Yang J, Wang X, Lu W. Bearing fault diagnosis method based on
Riedmiller M, Fidjeland AK, Ostrovski G, Petersen S. Human-level control stacked autoencoder and softmax regression. In: 2015 34th Chinese Control
through deep reinforcement learning. Nature 2015;518(7540):529. Conference (CCC). IEEE; July 2015. p. 6331–5.
[62] Grondman I, Busoniu L, Lopes GA, Babuska R. A survey of actor-critic [94] Jiménez AA, Muñoz CQG, Márquez FPG. Dirt and mud detection and diagnosis on
reinforcement learning: standard and natural policy gradients. IEEE Trans Syst a wind turbine blade employing guided waves and supervised learning classifiers.
Man Cybern Part C (Appl Rev) 2012;42(6):1291–307. Reliab Eng Syst Saf 2019;184:2–12.
[63] Chang Y, Fang H. A hybrid prognostic method for system degradation based on [95] Wang J, Zhang C. Software reliability prediction using a deep learning model
particle filter and relevance vector machine. Reliab Eng Syst Saf 2019;186:51–63. based on the RNN encoder–decoder. Reliab Eng Syst Saf 2018;170:73–82.
[64] Fink O, Zio E, Weidmann U. Predicting component reliability and level of [96] Naderpour M, Rizeei HM, Khakzad N, Pradhan B. Forest fire induced Natech risk
degradation with complex-valued neural networks. Reliab Eng Syst Saf 2014;121: assessment: a survey of geospatial technologies. Reliab Eng Syst Saf 2019;191:
198–206. 106558.
[65] Liu Y, Hu X, Zhang W. Remaining useful life prediction based on health index [97] Stern RE, Song J, Work DB. Accelerated Monte Carlo system reliability analysis
similarity. Reliab Eng Syst Saf 2019;185:502–10. through machine-learning-based surrogate models of network connectivity.
[66] Li X, Zhang W, Ding Q. Deep learning-based remaining useful life estimation of Reliab Eng Syst Saf 2017;164:1–9.
bearings using multi-scale feature extraction. Reliab Eng Syst Saf 2019;182: [98] Rachman A, Ratnayake RC. Machine learning approach for risk-based inspection
208–18. screening assessment. Reliab Eng Syst Saf 2019;185:518–32.
[67] Nieto PG, Garcia-Gonzalo E, Lasheras FS, de Cos Juez FJ. Hybrid PSO–SVM-based [99] Hernandez-Perdomo E, Guney Y, Rocco CM. A reliability model for assessing
method for forecasting of the remaining useful life for aircraft engines and corporate governance using machine learning techniques. Reliab Eng Syst Saf
evaluation of its reliability. Reliab Eng Syst Saf 2015;138:219–31. 2019;185:220–31.
[68] Zhao Z, Liang B, Wang X, Lu W. Remaining useful life prediction of aircraft engine [100] Gehl P, Cavalieri F, Franchin P. Approximate Bayesian network formulation for
based on degradation pattern learning. Reliab Eng Syst Saf 2017;164:74–83. the rapid loss assessment of real-world infrastructure systems. Reliab Eng Syst Saf
[69] Ling C, Lu Z, Zhu X. Efficient methods by active learning Kriging coupled with 2018;177:80–93.
variance reduction based sampling methods for time-dependent failure [101] Marseguerra M. Early detection of gradual concept drifts by text categorization
probability. Reliab Eng Syst Saf 2019;188:23–35. and Support Vector Machine techniques: the TRIO algorithm. Reliab Eng Syst Saf
[70] Khan S, Yairi T. A review on the application of deep learning in system health 2014;129:1–9.
management. Mech Syst Signal Process 2018;107:241–65. [102] Gui G, Pan H, Lin Z, Li Y, Yuan Z. Data-driven support vector machine with
[71] das Chagas Moura M, Zio E, Lins ID, Droguett E. Failure and reliability prediction optimization techniques for structural health monitoring and damage detection.
by support vector machines regression of time series data. Reliab Eng Syst Saf KSCE J Civ Eng 2017;21(2):523–34.
2011;96(11):1527–34. [103] Bao Y, Li H, Huang Y, Ou J. Structural damage detection based on non-negative
[72] Worrell C, Luangkesorn L, Haight J, Congedo T. Machine learning of fire hazard matrix factorization and relevance vector machine. In: Health Monitoring of
model simulations for use in probabilistic safety assessments at nuclear power Structural and Biological Systems 2010. 7650. International Society for Optics
plants. Reliab Eng Syst Saf 2019;183:128–42. and Photonics; April 2010. 76503H.
[73] Vanderhaegen F, Zieba S, Enjalbert S, Polet P. A Benefit/Cost/Deficit (BCD) [104] Perrin G. Active learning surrogate models for the conception of systems with
model for learning from human errors. Reliab Eng Syst Saf 2011;96(7):757–66. multiple failure modes. Reliab Eng Syst Saf 2016;149:130–6.
[74] Li X, Ding Q, Sun JQ. Remaining useful life estimation in prognostics using deep [105] Xiang Z, Chen J, Bao Y, Li H. An active learning method combining deep neural
convolution neural networks. Reliab Eng Syst Saf 2018;172:1–11. network and weighted sampling for structural reliability analysis. Mech Syst
[75] Heimes FO. Recurrent neural networks for remaining useful life estimation. In: Signal Process 2020;140:106684.
2008 international conference on prognostics and health management. IEEE; [106] Nguyen KT, Medjaher K. A new dynamic predictive maintenance framework
October 2008. p. 1–6. using deep learning for failure prognostics. Reliab Eng Syst Saf 2019;188:251–62.
[76] Wang J, Wen G, Yang S, Liu Y. Remaining Useful Life Estimation in Prognostics [107] Myles AJ, Feudale RN, Liu Y, Woody NA, Brown SD. An introduction to decision
Using Deep Bidirectional LSTM Neural Network. In: 2018 Prognostics and System tree modeling. J Chemom: J Chemom Soc 2004;18(6):275–85.
Health Management Conference (PHM-Chongqing). IEEE; October 2018. [108] Wolpert DH, Macready WG. No free lunch theorems for optimization. IEEE Trans
p. 1037–42. Evol Comput 1997;1(1):67–82.
[77] Chen J, Jing H, Chang Y, Liu Q. Gated recurrent unit based recurrent neural [109] Bian C, Yang S, Huang T, Xu Q, Liu J, Zio E. Degradation state mining and
network for remaining useful life prediction of nonlinear deterioration process. identification for railway point machines. Reliab Eng Syst Saf 2019;188:432–43.
Reliab Eng Syst Saf 2019;185:372–82.

15
Z. Xu and J.H. Saleh Reliability Engineering and System Safety 211 (2021) 107530

[110] Reder M, Yürüşen NY, Melero JJ. Data-driven learning framework for associating [138] He R, Dai Y, Lu J, Mou C. Developing ladder network for intelligent evaluation
weather conditions and wind turbine failures. Reliab Eng Syst Saf 2018;169: system: case of remaining useful life prediction for centrifugal pumps. Reliab Eng
554–69. Syst Saf 2018;180:385–93.
[111] Sirola M, Talonen J, Lampi G. SOM based methods in early fault detection of [139] Ellefsen AL, Bjørlykhaug E, Æsøy V, Ushakov S, Zhang H. Remaining useful life
nuclear industry. In: ESANN; April 2009. predictions for turbofan engine degradation using semi-supervised deep
[112] Tibaduiza DA, Mujica LE, Rodellar J. Damage classification in structural health architecture. Reliab Eng Syst Saf 2019;183:240–51.
monitoring using principal component analysis and self-organizing maps. Struct [140] Yoon AS, Lee T, Lim Y, Jung D, Kang P, Kim D, Park K, Choi Y. 2017. Semi-
Control Health Monitor 2013;20(10):1303–16. supervised learning with deep generative models for asset failure prediction.
[113] Zhou H, Chen J, Dong G, Wang H, Yuan H. Bearing fault recognition method arXiv preprint arXiv:1709.00845.
based on neighbourhood component analysis and coupled hidden Markov model. [141] Razavi-Far R, Hallaji E, Farajzadeh-Zanjani M, Saif M. A semi-supervised
Mech Syst Signal Process 2016;66:568–81. diagnostic framework based on the surface estimation of faulty distributions. IEEE
[114] Gerassis S, Albuquerque MTD, García JF, Boente C, Giráldez E, Taboada J, Trans Ind Inf 2018;15(3):1277–86.
Martín JE. Understanding complex blasting operations: a structural equation [142] Zhao Y, Ball R, Mosesian J, de Palma JF, Lehman B. Graph-based semi-supervised
model combining Bayesian networks and latent class clustering. Reliab Eng Syst learning for fault detection and classification in solar photovoltaic arrays. IEEE
Saf 2019;188:195–204. Trans Power Electron 2014;30(5):2848–58.
[115] Fang YP, Zio E. Unsupervised spectral clustering for hierarchical modelling and [143] Feng X, Jiao Y, Lv C, Zhou D. Label consistent semi-supervised non-negative
criticality analysis of complex networks. Reliab Eng Syst Saf 2013;116:64–74. matrix factorization for maintenance activities identification. Eng Appl Artif Intell
[116] Soualhi A, Clerc G, Razik H. Detection and diagnosis of faults in induction motor 2016;52:161–7.
using an improved artificial ant clustering technique. IEEE Trans Indust Electron [144] Aria SS, Yang K, Ahn CR, Vuran MC. Near-Miss Accident Detection for
2012;60(9):4053–62. Ironworkers Using Inertial Measurement Unit Sensors. In: ISARC. Proceedings of
[117] Prabakaran K, Kaushik S, Mouleeshuwarapprabu R, Singh AB. Self-Organizing the International Symposium on Automation and Robotics in Construction. 31.
Map Based Fault Detection and Isolation Scheme for Pneumatic Actuator. Int J IAARC Publications; 2014. p. 1.
Innov Appl Stud 2014;8(3):1361. [145] Subramanya A, Talukdar PP. Graph-based semi-supervised learning. Synth Lect
[118] Malhotra P, TV V, Ramakrishnan A, Anand G, Vig L, Agarwal P, Shroff G. 2016. Artif Intell Mach Learn 2014;8(4):1–125.
Multi-sensor prognostics using an unsupervised health index based on lstm [146] Bennett KP, Demiriz A. Semi-supervised support vector machines. In: Advances in
encoder-decoder. arXiv preprint arXiv:1608.06154. Neural Information processing systems; 1999. p. 368–74.
[119] Kohonen T, Somervuo P. Self-organizing maps of symbol strings with application [147] Mũnoz-Marí J, Bovolo F, Gómez-Chova L, Bruzzone L, Camp-Valls G.
to speech recognition. In: Proceedings of WSOM. 97; June 1997. p. 4–6. Semisupervised one-class support vector machines for classification of remote
[120] Wu Z, Lee J, Meneveau C, Zaki T. Application of a self-organizing map to identify sensing data. IEEE Trans Geosci Remote Sens 2010;48(8):3188–97.
the turbulent-boundary-layer interface in a transitional flow. Phys Rev Fluids [148] Li YF, Liang DM. Safe semi-supervised learning: a brief introduction. Front
2019;4(2):023902. Comput Sci 2019;13(4):669–76.
[121] Kohonen T, Honkela T. Kohonen network. Scholarpedia 2007;2(1):1568. [149] Mohajer A, Bavaghar M, Farrokhi H. Mobility-aware load Balancing for Reliable
[122] Xing EP, Jordan MI, Russell SJ, Ng AY. Distance metric learning with application Self-Organization Networks: multi-agent Deep Reinforcement Learning. Reliab
to clustering with side-information. In: Advances in neural information processing Eng Syst Saf 2020:107056.
systems; 2003. p. 521–8. [150] Zhang N, Si W. Deep reinforcement learning for condition-based maintenance
[123] Suárez JL, García S, Herrera F. 2018. A tutorial on distance metric learning: planning of multi-component systems under dependent competing risks. Reliab
mathematical foundations, algorithms and software. arXiv preprint arXiv:181 Eng Syst Saf 2020:107094.
2.05944. [151] White CC. Markov decision processes. Springer US; 2001.
[124] Reddy KK, Sarkar S, Venugopalan V, Giering M. Anomaly detection and fault [152] Papakonstantinou KG, Shinozuka M. Optimum inspection and maintenance
disambiguation in large flight data: a multi-modal deep auto-encoder approach. policies for corroded structures using partially observable Markov decision
In: Annual Conference of the Prognostics and Health Management Society; processes and stochastic, physically based models. Probab Eng Mech 2014;37:
October 2016. 93–108.
[125] Yan W, Yu L. 2019. On accurate and reliable anomaly detection for gas turbine [153] Papakonstantinou KG, Shinozuka M. Planning structural inspection and
combustors: a deep learning approach. arXiv preprint arXiv:1908.09238. maintenance policies via dynamic programming and Markov processes. Part I:
[126] Fuertes S, Picart G, Tourneret JY, Chaari L, Ferrari A, Richard C. Improving theory. Reliab Eng Syst Saf 2014;130:202–13.
spacecraft health monitoring with automatic anomaly detection techniques. In: [154] Papakonstantinou KG, Shinozuka M. Planning structural inspection and
14th International Conference on Space Operations; 2016. p. 2430. maintenance policies via dynamic programming and Markov processes. Part II:
[127] Hundman K, Constantinou V, Laporte C, Colwell I, Soderstrom T. Detecting POMDP implementation. Reliab Eng Syst Saf 2014;130:214–24.
spacecraft anomalies using lstms and nonparametric dynamic thresholding. In: [155] Andriotis CP, Papakonstantinou KG. Managing engineering systems with large
Proceedings of the 24th ACM SIGKDD international conference on knowledge state and action spaces through deep reinforcement learning. Reliab Eng Syst Saf
discovery & data mining; July 2018. p. 387–95. 2019;191:106483.
[128] Ince T, Kiranyaz S, Eren L, Askar M, Gabbouj M. Real-time motor fault detection [156] Xiang Z, Bao Y, Tang Z, Li H. Deep reinforcement learning-based sampling
by 1-D convolutional neural networks. IEEE Trans Indust Electron 2016;63(11): method for structural reliability assessment. Reliab Eng Syst Saf 2020:106901.
7067–75. [157] Garcıa J, Fernández F. A comprehensive survey on safe reinforcement learning.
[129] Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA, Bottou L. Stacked J Mach Learn Res 2015;16(1):1437–80.
denoising autoencoders: learning useful representations in a deep network with a [158] Munos R, Stepleton T, Harutyunyan A, Bellemare M. Safe and efficient off-policy
local denoising criterion. J Mach Learn Res 2010;11(12). reinforcement learning. In: Advances in Neural Information Processing Systems;
[130] Schlechtingen M, Santos IF, Achiche S. Wind turbine condition monitoring based 2016. p. 1054–62.
on SCADA data using normal behavior models. Part 1: system description. Appl [159] LeCun Y, Bengio Y, Hinton G. Deep learning. Nature 2015;521(7553):436–44.
Soft Comput 2013;13(1):259–70. [160] Saxena A, Goebel K, Simon D, Eklund N. Damage propagation modeling for
[131] Schlechtingen M, Santos IF. Wind turbine condition monitoring based on SCADA aircraft engine run-to-failure simulation. In: 2008 international conference on
data using normal behavior models. Part 2: application examples. Appl Soft prognostics and health management. IEEE; October 2008. p. 1–9.
Comput 2014;14:447–60. [161] Saxena A, Goebel K. Turbofan engine degradation simulation data set. nasa ames
[132] Souza D, Vasconcelos T, Mattos C, Gomes J. Evaluation of data based normal prognostics data repository. Moffett Field, CA: NASA AMES Research Center;
behavior models for fault detection in wind turbines. In: 2019 8th Brazilian 2008. http://ti.arc.nasa.gov/project/prognostic-data-repository.
Conference on Intelligent Systems (BRACIS). IEEE; October 2019. p. 878–83. [162] Fink O, Wang Q, Svensén M, Dersin P, Lee WJ, Ducoffe M. Potential, challenges
[133] Bangalore P, Letzgus S, Karlsson D, Patriksson M. An artificial neural network- and future directions for deep learning in prognostics and health management
based condition monitoring method for wind turbines, with application to the applications. Eng Appl Artif Intell 2020;92:103678.
monitoring of the gearbox. Wind Energy 2017;20(8):1421–38. [163] CDC. 2020. CDC Press Releases. [online] Available at: https://www.cdc.gov/med
[134] Garcia MC, Sanz-Bobi MA, Del Pico J. SIMAP: intelligent System for Predictive ia/releases/2016/p0922-older-adult-falls.html [Accessed 7 August 2020].
Maintenance: application to the health condition monitoring of a windturbine [164] Damianou A, Lawrence N. Deep gaussian processes. In: Artificial Intelligence and
gearbox. Comput Ind 2006;57(6):552–68. Statistics; April 2013. p. 207–15.
[135] Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput 1997;9 [165] Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S,
(8):1735–80. Courville A, Bengio Y. Generative adversarial nets. In: Advances in neural
[136] Greff K, Srivastava RK, Koutník J, Steunebrink BR, Schmidhuber J. LSTM: a information processing systems; 2014. p. 2672–80.
search space odyssey. IEEE Trans Neural Netw Learn Syst 2016;28(10):2222–32. [166] Goodfellow I. 2016. NIPS 2016 tutorial: generative adversarial networks. arXiv
[137] Hu C, Youn BD, Kim T. Semi-supervised learning with co-training for data-driven preprint arXiv:1701.00160.
prognostics. In: 2012 IEEE Conference on Prognostics and Health Management. [167] Amodei D, Olah C, Steinhardt J, Christiano P, Schulman J, and Mané D. 2016.
IEEE; June 2012. p. 1–10. Concrete problems in AI safety. arXiv preprint arXiv:1606.06565.

16

You might also like