Fast Unsupervised Learning Methods For Structural Health Monitoring With Large Vibration Data From Dense Sensor Networks

Original Article
Structural Health Monitoring

1–26
Fast unsupervised learning methods Ó The Author(s) 2019

Article reuse guidelines:
sagepub.com/journals-permissions
for structural health monitoring with DOI: 10.1177/1475921719894186
journals.sagepub.com/home/shm
large vibration data from dense sensor
networks
Alireza Entezami1,2 , Hashem Shariatmadar1 and Stefano Mariani2
Abstract
Data-driven damage localization is an important step of vibration-based structural health monitoring. Statistical pattern
recognition based on the prominent steps of feature extraction and statistical decision-making provides an effective and
efficient framework for structural health monitoring. However, these steps may become time-consuming or complex
when there are large volumes of vibration measurements acquired by dense sensor networks. To deal with this issue,
this study proposes fast unsupervised learning methods for feature extraction through autoregressive modeling and
damage localization through a new distance measure called Kullback–Leibler divergence with empirical probability mea-
sure. The feature extraction approach consists of an iterative algorithm for order selection and parameter estimation
aiming to extract residuals in the training phase and another iterative process aiming to extract residuals only in the
monitoring phase. The key feature of the proposed approach is the use of correlated residual samples of the autoregres-
sive model as a new time series at each iteration, rather than handling the measured vibration response of the structure.
This is shown to highly reduce the computational burden of order selection and feature extraction; moreover, it effec-
tively provides low-order autoregressive models with uncorrelated residuals. The Kullback–Leibler divergence with
empirical probability measure method exploits a segmentation technique to subdivide random data into independent
sets and provides a distance metric based on the theory of empirical probability measure with no need to explicitly com-
pute the actual probability distributions at the training and monitoring stages. Numerical and experimental benchmarks
are then used to assess accuracy and performance of the proposed methods and compare them with some state-of-the-
art approaches. Results show that the proposed approaches are successful in feature extraction and damage localization,
with a reduced computational burden.
Keywords
Structural health monitoring, damage diagnosis, unsupervised learning, autoregressive model, feature extraction,
Kullback–Leibler divergence
Introduction non-recoverable variations of the structural stiffness

and, therefore, unfavorable vibrations, local failures,
Structural health monitoring (SHM) has recently and even collapse. To prevent such events and decrease
become a very active research area in the civil, mechan- the high costs of maintenance and rehabilitation, if
ical, and aerospace engineering fields. It essentially
aims at assessing the health of structures or systems in 1
Department of Civil Engineering, Faculty of Engineering, Ferdowsi
a broader sense and at detecting any possible damage
University of Mashhad, Mashhad, Iran
through vibration data.1 Damage in a structure can 2
Department of Civil and Environmental Engineering, Politecnico di
occur because of changes in the geometrical configura- Milano, Milano, Italy
tion or boundary conditions, or because of material
degradation due to, for example, cracks in concrete, Corresponding author:
Alireza Entezami, Department of Civil and Environmental Engineering,
loose bolts, and broken welds in steel connections, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20133 Milano, Italy.
corrosion, and fatigue. All these effects may cause Email: alireza.entezami@polimi.it
2 Structural Health Monitoring 00(0)
necessary, SHM is becoming a paradigm in our smart- Dealing with vibration data in the time domain, time
cities age, regardless of the site-specific geographical series modeling is adopted for feature extraction. In this
location and economic development. regard, Fassois and Sakellariou17 discussed parametric
Any SHM strategy can be generally classified and non-parametric time series representations for
according to whether a damage diagnosis or prognosis vibration-based SHM problems. They concluded that
is provided.2,3 The former class can be subdivided on although the use of non-parametric representations is
its own into three main levels: early damage detection simple, parametric time series models are more reliable.
(Level 1), damage localization (Level 2), and damage Application of parametric time series modeling to the
quantification (Level 3). The latter class is instead feature extraction is based on fitting an appropriate
intended to predict the remaining lifetime of the struc- mathematical model to the measured data and extract-
ture.4 Early damage detection is conceived as a global ing DSFs such as the model residuals. Depending upon
process that attempts to perceive if the damage is pres- the nature of the time series (e.g. stationarity vs non-
ent in the whole structure; however, damage localiza- stationary, linear vs nonlinear, seasonal vs non-seaso-
tion and quantification are local procedures, aiming to nal) and the type of data acquisition (e.g. input–output
provide a clear snapshot of the damage pattern along vs output-only), a broad range of representations can
with an assessment of its severity. Since damage is an be adopted to extract significant DSFs. Considering lin-
inherently local phenomenon, rigorous and robust ear and stationary vibration data, widely used time-
methods look necessary for SHM procedures targeting invariant linear models are the autoregressive (AR),18–20
damage localization. autoregressive with exogenous input (ARX),21 autore-
Damage diagnosis is generally carried out with gressive moving average (ARMA),22,23 autoregressive
model-driven or data-driven methods.5 The model- moving average with exogenous input (ARMAX),24,25
driven approach is based on an analytical or finite ele- and autoregressive and autoregressive with exogenous
ment model of the structure,6 due to the discrepancies input (ARARX)26 ones. However, vector-dependent
between the model itself and the real-life structure, in functionally pooled (VFP) models,27 Gaussian process
which case model updating becomes a mandatory (GP) time series models,28 time-varying autoregressive
task.7–13 Although such an approach is often successful moving average (TV-ARMA),29 and trigonometric Box–
in relation to damage diagnosis,14–16 some limitations Cox ARMA trend seasonal (TBATS)30 are suitable for
are represented by the necessity of a detailed model, of conditions that time series data are exposed to uncertain-
a model updating procedure, and of data reduction ties, operational and environmental variability, ambient
from raw vibration measurements. It then results to be vibration, and seasonal variations. Among the time-
excessively time-consuming and makes data-driven invariant linear representations, the AR model offers
techniques appealing as a more feasible option. remarkable advantages for feature extraction: the model
Within the data-driven methods, statistical pattern depends only on the response of the structure, so that it
recognition stands as an efficient and powerful frame- is appropriate for output-only SHM strategies; the
work since it relies upon the direct use of raw vibration model coefficients (parameters) are linked to the inherent
(time domain) measurements, in terms of excitations structural properties, regardless of the excitation sources
and relevant structural responses. The relevant frame- and their variations;31,32 the coefficients and residuals of
work for vibration-based SHM can be then decom- the AR model are known to be sensitive to damage;31 its
posed into four main steps: operational evaluation, implementation looks also simple.
data acquisition, feature extraction, and statistical deci- Feature extraction through time-invariant linear
sion-making, or statistical modeling for feature discrim- representations is generally obtained with coefficient-
ination.2 The last two steps are in need of more based and residual-based algorithms.17,33 Having set
research activities, due to their high importance and model orders and coefficients for the undamaged state,
required effectiveness. Feature extraction refers to the especially with the latter ones, the time series will no
process of learning meaningful information from the longer provide a good fit to the structural response in
measurements, supposed to be pertinent to damage; the damaged state, and residuals will increase.
such information is represented by the so-called dam- Therefore, the great benefit of the residual-based
age-sensitive features (DSFs). Statistical decision- approach is that it does not require any order determi-
making is concerned with the implementation of algo- nation and parameter estimation for the damaged or
rithms that handle the DSFs relevant to undamaged current structural states.17
and damaged conditions, to provide the sought damage The performance of AR models in extracting DSFs,
diagnosis. As the functional relationship between the with either coefficient-based or residual-based algo-
DSFs and the damaged state is difficult to define by rithms, is affected by the choice of the model order to
means of model-driven procedures, statistical methods enable the generation of uncorrelated residuals. From a
are usually adopted in the process of decision-making.2 statistical viewpoint, the so-called accuracy and
Entezami et al. 3
sufficiency of the time series orders depend strongly on for feature discrimination. A reliable way to attain this
the uncorrelatedness of the residual sequences:34,35 an latter goal is to adopt a well-established statistical
inadequate model order would not allow the AR model distance metric to measure the discrepancy between dif-
to track the underlying dynamics of the structure, ulti- ferent sets of samples. Successful approaches used for
mately leading to the extraction of damage-insensitive damage diagnosis relied upon the Mahalanobis-squared
features.33 The Akaike information criterion (AIC) and distance,40–43 the Itakura and cepstral distances,44 the
the Bayesian information criterion (BIC) are well- Kullback–Leibler divergence (KLD),45,46 the
known techniques for order selection;34 several research Kolmogorov–Smirnov statistical test (KSTS),47,48
studies have been recently conducted to propose some dynamic time warping,49 the multivariate distance cor-
alternative, even more efficient and effective methods. relation,32 and damage indices such as the Fisher criter-
Broersen36 proposed the use of combined information ion,50 Q-statistic and T2-statistic,51 and the deflection
criterion (CIC) and finite sample information criterion coefficient (DC).52
(FSIC), to enhance the capability of determining the All the aforementioned methods can be exploited
model order for large sampling data. In connection through machine learning tools, in either supervised or
with operational modal analysis through time series unsupervised learning manners.38,53,54 Damage diagno-
modeling, a new order selection method for vector sis through machine learning is so decomposed into the
autoregressive (VAR) representations was suggested by training (baseline) and monitoring (inspection) phases.2
Vu et al.,37 regardless of the measurement noise. For During training, a statistical model or classifier is
SHM problems, Figueiredo et al.33 assessed the influence learned by accounting for the training data. In the
of the AR orders on damage detection, using four differ- monitoring stage, one then attempts to make a decision
ent information criterion techniques and showing that a based on the learned model and the currently acquired
low order may provide insensitive features and weak data. The main difference between supervised and
damage detectability. To robustly select the order, with unsupervised learning algorithms is that the former
an emphasis on extracting uncorrelated residuals, needs the DSFs relevant to the undamaged and dam-
Entezami and Shariatmadar38 proposed an iterative pro- aged conditions to train a classifier, whereas the latter
cedure through the Ljung–Box Q (LBQ) test, assuring is based on DSFs of the undamaged state only to train
model accuracy and adequacy. Rezaiee-Pajand et al.39 the classifier. The clear benefit of the unsupervised
also presented a two-stage iterative algorithm, to select learning algorithms is that no information (i.e. vibra-
the model order on the basis of a residual analysis tion data or DSFs) is required beforehand for any pos-
through statistical hypothesis tests and signal filtering. sible damage condition.
Although all the above-mentioned order selection Going more into detail, the capability to locate sin-
techniques have their own advantages, it may be ques- gle and multiple damage states characterized by differ-
tionable to assess whether they are indeed time-saving ent severity levels, particularly the small ones, plays a
or time-consuming. This is especially true by consider- prominent role in establishing the effectiveness and
ing the recent advances and improvements in sensor robustness of the SHM approach. The decision-making
technology and data acquisition, overall providing a process based on an unsupervised learning strategy
great opportunity to exploit dense sensor networks for must then be fast and overcome the main obstacle of
most of the SHM applications. Handling vibration accurate damage localization, whenever a large volume
datasets with high dimensionality, an approach looks of DSFs can be extracted from data provided by dense
efficient if it requires a short computational time for sensor networks.
order selection, guarantying in the end model suffi- Accounting for all the above-mentioned issues and
ciency and accuracy by generating uncorrelated resi- limitations, a fast unsupervised learning method is here
duals. Since order selection directly affects parameter proposed to locate structural damage within the statisti-
estimation and residual extraction, a positive by- cal pattern recognition paradigm, by means of an itera-
product is the computational efficiency of the tive feature extraction approach. The approach is based
coefficient-based and residual-based feature extraction on AR modeling and a novel statistical distance
(RBFE) algorithms. The other way around, high-order method, named Kullback–Leibler divergence with
models leading to a good fit of the structural output empirical probability measure (KLDEPM). The pro-
and providing uncorrelated residuals may increase the posed feature extraction approach consists of an
probability of overfitting, which finally causes erro- improved iterative AR model order selection with para-
neous forecasts.34 An effective order selection approach meter estimation and residual extraction during the
must be therefore time-saving, or efficient and generate training phase, iterative AR residual extraction only
uncorrelated residuals without any overfitting issue. during the monitoring stage. The major novelty of this
Besides feature extraction, the methodology has to approach consists of using correlated residual samples
be robust also in terms of statistical decision-making of the AR model as a new time series dataset in each
iteration, in place of the measured vibration response where x(t) is the measured vibration response at time t;
of the structure. It is shown that this innovation can u = [u1, u2, ., up] denotes the vector of AR coeffi-
considerably reduce the computational time of AR cients, or parameters to be estimated; p represents the
order selection and residual extraction; further to that, model order; and e(t) is an uncorrelated residual
it leads to low-order AR models that guarantee to sequence (an unobservable random error) that quanti-
finally obtain uncorrelated residuals. The proposed fies the difference between the measured vibration
KLDEPM method is an enhancement of the classical response and the one predicted by the model. In this
KLD technique, in the sense that random feature sam- work, the least-squares technique is used to estimate
ples (i.e. the AR model residuals related to the normal the coefficients u of the AR model.
and damaged conditions) are subdivided into indepen-
dent segments, and information on the segmentation
(in terms of numbers of samples and segments) are Iterative RBFE technique
accounted for in the distance calculation with the the-
The feature extraction technique here proposed is sub-
ory of empirical probability measure. All these novelties
divided into the two stages of training and monitoring;
make it an effective distance approach for damage iden-
on its own, each stage consists of an initial phase and
tification, and a fast tool for decision-making. Results
of a subsequent iterative phase. In the mentioned initial
relevant to a concrete beam55 and the International
stages, an optimal AR model order is set by an ad hoc
Association for Structural Control (IASC)-American
designed order selection method, so that model
Society of Civil Engineers (ASCE) experimental bench-
residuals are provided as DSFs to the next iterative
mark structure56 have been used to verify accuracy and
phases. The monitoring phase exploits the information
performance of the proposed method. Both the iterative
provided by training and, without dealing with order
feature extraction and the KLDEPM method turn out
to be superior to other available strategies, providing determination and parameter estimation, extracts the
reliable DSFs and locating single and multiple damages residuals of the AR model as the main DSFs for the
of different severity. current structural state. Since order selection and para-
The remainder of this article is organized as follows. meter estimation tasks are not tackled in such a moni-
Section ‘‘AR modeling’’ briefly discusses the vibration toring stage, the proposed RBFE method conforms to
response modeling by an AR representation. Section unsupervised learning. Details of the whole procedure
‘‘Iterative RBFE technique’’ presents the proposed are provided in the following.
iterative RBFE approach. The novel KLDEPM
method is detailed in section ‘‘A novel statistical dis-
tance measure for damage localization.’’ The results of
Training phase: AR order selection and residual
feature extraction and damage localization for the men- extraction
tioned examples are presented in section The essential step of feature extraction by time series
‘‘Applications.’’ Finally, section ‘‘Conclusion’’ draws modeling is the determination of accurate and sufficient
the main conclusions of this work. series orders, aiming to lead to uncorrelated resi-
duals.34,35 The iterative order selection algorithm here
proposed is an improvement of a former authors’
AR modeling
approach;38 the fundamental principle of the algorithm
Within the SHM realm, a time series is a sequence of relies on the choice of the AR order, based on the resi-
data that actually consists of time spaced, successive dual analysis through the LBQ hypothesis test. This
measurements. Time series analysis is, therefore, a sta- test assesses the correlation among residual sequences
tistical tool that is meant for model identification, para- as follows
meter estimation, and model validation of forecasting
purposes.34 It is also known to be a powerful approach XL r2j
for feature extraction from vibration measurements, QLB = nðn + 2Þ ð2Þ
j=1
nj
actually consisting of excitations and/or structural
responses.
Assuming that a linear time-invariant representation where n is the number of samples in the residual vector;
can fit the structural response,19 the relevant AR model rj is the sample autocorrelation function at the jth lag
for a single-output system reads (j = 1, 2,., L); L represents the number of lags. Test
decision is based on either the null hypothesis or alter-
p
X native ones linked to a certain significance level a:
x ðt Þ = ui xðt iÞ + eðtÞ ð1Þ model residuals are thus considered uncorrelated if QLB
i=1 is less than a critical value (c-value), or if the test
Entezami et al. 5
probability value (p-value) is greater than the signifi- the coefficients uxk of the new AR model. Alike in the
cance level. initial step, the first k samples of the residual vector of
Although a purely iterative order selection technique AR(k) are null and are therefore dropped from Exk at
is able to robustly set the AR model order providing the current iteration. If the sequences of Exk satisfy the
uncorrelated residuals, it may be time-consuming or null hypothesis by the LBQ test (QLB < c-value), the
may yield excessively high-order models. With the pres- iterative process is terminated; otherwise (QLB . c-
ent approach, correlated residual sequences are used as value), the model order k is increased and the corre-
a new time series within the iterative process, in place lated residual vector is exploited as a new dataset
of the measured vibration response; in this way, the x = Exk ). When the termination condition is attained,
(^
order selection turns out to be computationally more the number of iterations provides an optimal and
efficient than with the original (purely iterative) tech- robust AR order, and the relevant uncorrelated resi-
nique. This is because the range of correlation decreases dual vector Exk can be used as the main DSF for the
at each iteration of the improved method. In other training phase. Model coefficients at each iteration are
words, the new correlated residual samples have less stored, to be later used in the residual extraction during
correlation than the residuals of the previous iterations the monitoring phase. For the sake of convenience,
and the original vibration signal. This situation allows Algorithm 1 presents the pseudo-code for the initial
us to examine fewer sample orders and selecting a small and iterative steps of the proposed feature extraction
order than the purely iterative technique due to reduc- method in the training phase.
ing the correlation of the new time series data (the resi-
dual samples) and the sufficiency of using small sample
orders. Unlike the purely iterative technique, further-
Monitoring phase: residual extraction
more, one does not need to use the original vibration In order to obtain the DSFs relevant to the current
signal at each iteration. As a result, these reasons help structural state, the initial and iterative steps previously
to expedite the order determination and enable the detailed need to be repeated again to extract the resi-
improved method to be more efficient than the purely dual sequences, as follows:
iterative technique. In the following two steps are
described in detail: Initial step. Let y = (y(1),., y(n)) 2 <n be now the
vector of measured vibration responses associated with
Initial step. Let x = (x(1),., x(n)) 2 <n be the vector the current structural state. By fitting AR(1) to these
of measured vibration responses of the structure associ- data, the residual vector Ey = (e(2),., e(n)) 2 <n – 1 is
ated with the undamaged or normal condition. AR(1) obtained. If the residual vector Ex of the initial step of
(i.e. an AR model of order one) is fitted to x, in order the training phase is used as the main DSF for the nor-
to extract the model residuals Ex = (e(2).e(n)) 2 <n – 1. mal condition, one can do the same with Ey for the
Before extracting such residuals, model coefficients ux1 current state; otherwise, Ey is used as a new time series
are estimated by the least-squares technique. Since the in the subsequent iterative step.
residual sequence e(1) for this first-order model is approx- Iterative step. The iterative procedure in this step is
imately null, it can be neglected.34 In the case of using governed by the variable order r = 2,3,., k. Using
large volume of vibration measurements, featuring n rela- the model coefficients uxr estimated in the training
tively large, the residuals of AR(1) turn out to be corre- phase, at the rth iteration, the residuals of AR(r) are
lated; hence, Ex is handled as a new series dataset for the extracted. If r = k, the residual vector
subsequent iterative step. This is done since if the resi- Eyr = (e(r(r + 1)=2 + 1) . . . e(n)) 2 <nr(r+1)=2 is used as
duals are correlated, some information present in the the DSF for the current structural condition; other-
original dataset is not captured by the model. It is worth wise, the iterative process continues by choosing Eyr as
mentioning that if the analysis of AR(1) through the the new time series data (^y). In other words, the itera-
LBQ test allows ascertaining that residuals are uncorre- tive step for the monitoring phase needs to implement
lated, which is indeed a rare event, one can adopt p = 1 the iterative loop until r = k. This means that the pro-
and Ex as model order and main DSF for the normal cess of residual extraction in the monitoring phase
condition, respectively. should be carried out k times; otherwise, any extraction
Iterative step. By increasing the model order with of residual samples without this algorithm causes unre-
k = 2,3,., AR(k) is fitted to the correlated residual liable and erroneous features. In addition, it should be
sequence furnished by the previous step if k = 2, or by mentioned that the conventional RBFE technique uti-
the previous iteration if k . 2, in order to extract the lizes one type of the model order and coefficients for
new residual vector Exk = (e(k(k + 1)=2 + 1) . . . the residual extraction in the monitoring stage; that is,
e(n)) 2 <nk(k +1)=2 . Once again, before residual extrac- it implements this process one time. For simplicity,
tion, the least-squares technique is adopted to estimate Algorithm 2 provides the pseudo-code for the initial
Algorithm 1. Initial and iterative steps of the proposed Algorithm 2. Initial and iterative steps of the proposed
feature extraction method for the training phase. feature extraction method for the monitoring phase.
Initialization Initialization
nm=Number of test measurements; nm=Number of test measurements;
ns=Number of sensors; ns=Number of sensors;
p0=Minimum AR order (default p0=1); Load
pm=Maximum AR order; Read the optimal AR orders, the model coefficients for all
Initial Step sensors and all test measurements from Algorithm 1
for i 1 to nm do if the optimal AR order=p0 or k=1 then
for j 1 to ns do Initial Step
Read the vector of vibration time-domain samples regarding for i 1 to nm do
the normal condition for the jth sensor and ith test for j 1 to ns do
measurement Read the vector of vibration time-domain samples
Estimate the coefficient of AR(p0), up0 regarding the damaged state for the jth sensor and ith
Extract the model residuals e(1).e(n) test measurement
Implement the LBQ test and obtain the test statistics QKL Fit AR(p0) to the vibration time-domain samples
and its c-value Extract the model residuals e(p0 + 1).e(n) as the
if QLB \ c-value then main features for the jth sensor and ith test
Store p0, up0, e(2).e(n) as the optimal AR order, the measurement
model coefficient, and the main features at the jth end
sensor and ith test measurement end
Terminate the iterative loop else
else Iterative Step
Iterative Step for i 1 to nm do
for k p0+ 1 to pmdo for j 1 to ns do
Set e kðk2+ 1Þ + 1 :::eðnÞ as the new time series data Set k as the optimal AR order at the jth sensor and ith
test measurement
Estimate the coefficients of AR(k), u1.uk for r p0 + 1 to k do
Store the model coefficients u1.uk for the kth Fit AR(r) using the known model coefficients
iteration, jth sensor, and ith test
measurement
u1.ur regarding the rth iteration obtained from
Extract the model residuals e kðk2+ 1Þ + 1 :::eðnÞ Algorithm 1
Implement the LBQ test and obtain the test Extract the model residuals e rðr 2+ 1Þ + 1 :::eðnÞ
statistics QKL and its c-value if r=k then
if QLB.c-value then
Continue the iterative loop Store e rðr 2+ 1Þ + 1 :::eðnÞ as the main features
Else of the damaged state at the jth sensor and ith
Store k, u1.uk, and e kðk2+ 1Þ + 1 :::eðnÞ as the test measurement
Terminate the iterative loop at the iteration r
optimal AR order, the model coefficients, and Else
the final features for the kth iteration, jth sensor,
and ith test measurement Set e rðr 2+ 1Þ + 1 :::eðnÞ as the new time series
end data
end Continue the iterative loop by a new value for r
end end
end end
end end
end
end
and iterative steps of the proposed feature extraction

probability distributions of the random samples Exk
method in the monitoring stage.
and Eyr , the general formulation for the KLD reads
A novel statistical distance measure for X Px ð zÞ

DKL Px jjPy = Px ð zÞ log ð3Þ
damage localization z
P y ð zÞ
KLD Since Px and Py are both positive, equation (3)
The KLD technique proposed by Kullback and always provides a positive distance or divergence mea-
Leibler57 is a well-known statistical distance metric. It sure. In practical terms, DKL = 0 means that the two
is a non-symmetric measure of the difference between probability distributions have the same behavior, and
two probability distributions, or better between two any deviation from the zero value is representative of
probability density functions. If Px and Py are the dissimilarity between them.
Entezami et al. 7
KLDEPM the same number of samples; for the last segment, the
Unlike the conventional KLD technique, the main empirical probability measure is written as follows
advantage of the KLDEPM method is that one does
not need to compute the probability distributions of n k ðk2+ 1Þ mN y 2n k ðk + 1Þ 2mN y
^ y ðCm Þ =
P =
k ðk + 1Þ 2n k ðk + 1Þ
the data. The key features of this method are the seg- n 2
mentation of random samples and the adoption of an ð8Þ
empirical probability measure.58 The calculation of the
empirical probability measure directly depends on the Having obtained the empirical probability measures,
way data samples are segmented. the KLDEPM is formulated as
The strategy used in the proposed KLDEPM
method is based on the maximum entropy approach. m ^
^ x ðCi Þ log Px ðCi Þ
X
^ KL Exk jjEy =

The residual vector Eyr is arranged with entries in D P ð9Þ
r
^ y ðCi Þ
P
i=1
ascending order, in such a way that it begins with the
minimum residual value and ends with the maximum The great advantage of the proposed KLDEPM
one. Subsequently, the rearranged vector Eyr is divided method, as detailed here above, is that it computes
into m sub-vectors in the following way the distance between the two random datasets through
the information given by segmentation and empirical
emin <C1 <eN y probability measures, without directly handling the
eði1ÞN y\Ci <eiN y ð4Þ random samples as with the classical KLD technique.
Since all the values obtained with the segmentation are
eN y ðm1Þ\Cm <emax ^ KL is positive as well. Ideally, a zero distance
positive, D
where i = 2, 3, ., m – 1. In equation (4), Ny denotes would denote a similarity between the residual
vectors relevant to the normal and current structural
the number of samples within each segment of Eyr ,
states, which practically means that damage has not
empirically given as the square root of the total number
occurred.
of samples in each residual vector, that is
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
k ð k + 1Þ Unsupervised learning strategy
Ny = n ð5Þ
2 In order to locate damage by means of a univariate dis-
Accordingly, the number of segments turns out to be tance method such as the KLDEPM, the distance must
m = (n k(k + 1)=2)=Ny . Since both Ny and m have to be be computed between the residual vectors for the unda-
positive integers, one needs to round off the outcomes maged and damaged conditions at each sensor location.
of equation (5) to the nearest integer. Once vector Eyr Considering Ns sensors deployed over the structure,
has been segmented, the segmentation procedure is one can build a damage localization vector by assem-
repeated almost the same for the vector Exk , since it is ^ KL values associated with all the sensors as
bling the D
not necessary to arrange it in ascending order. Next, the follows
sequences of Exk that fall into the domains of C1, C2, h i
^ KL Exk jjEy ^ KL Exk jjEy ^ KL Exk jjEy 2 <N s

., Cm are determined, given that Nx represents the D= D r 1
D r 2
D r Ns
number of samples in each segment. ð10Þ

^ x and P
Let P ^ y be the empirical probability measures
of the residual vectors Exk and Eyr , respectively. Based An efficient and rigorous way for the localization of
on the information on the segmentation provided by damage is to define a threshold limit for the distance
Ny, Nx, and m, such measures for the ith segment are values obtained from all the sensors. Let the training
given by phase consist of Nc normal conditions, where Nc repre-
sents, for example, the number of measurements during
^ x ðC i Þ = N xi 2N xi the training period: the distance between the residual
P = ð6Þ
n k ðk + 1Þ 2n k ðk + 1Þ vectors related to each normal condition and all the
2
other ones is computed according to equation (10) at
^ y Ci, i6¼m =
Ny 2N y all sensor locations. The obtained distance matrices
P = ð7Þ
n k ðk + 1Þ 2n k ðk + 1Þ Tl 2 <Ns 3Nc for each undamaged state, where l = 1, 2,
2
., Nc, are then gathered in a global distance matrix
2
where N xi denotes the number of samples of Exk belong- Tl 2 RNs 3Nc . A threshold limit is next defined, based on
ing to the segment Ci. It is worth remarking that only the upper bound of the 95% confidence interval for
the first m – 1 segments of the residual vector Eyr have each column of T, according to
Fifteen sensors were supposed to be mounted on the

top surface of the beam, to measure the time histories
of the acceleration in the transverse (vertical) direction.
The measurement period was assumed to last 2 s, to
include 4001 samples. A noise featuring a 30 dB signal-
to-noise ratio was added to the digital acceleration
records, as obtained with the finite element analyses.
Figure 1. The numerical benchmark model of a concrete The histories were low-pass filtered at 1000 Hz, in order
beam: (a) sensor locations and damaged area and (b) element to keep five vibration modes of the structure possibly
mesh in the vicinity of the crack.55
excited in the test results.
sT The crack at the bottom of the beam mid-span was
rt = mT + Z a=2 pffiffiffiffiffiffi , t = 1, 2, . . . , N 2c ð11Þ used to simulate a single damage state: crack lengths of
Ns
10, 30, 50, and 100 mm are used in this work to varying
where mT and sT are the mean and the standard devia- the amount of damage. For each damage scenario, 12
tion of each column of T; Za/2 is the confidence coeffi- test measurements are considered, so that the first 10
cient, equal to 1.96 for the 95% confidence interval are relevant to the undamaged condition, while the
under an a = 0.05 significance level. The mean value information provided by the last two varies depending
of the entries of r = ½r1 . . . rNc2 finally represents the on the crack length. To have the same amount of data
threshold limit for damage localization. In this regard, in the undamaged and damaged conditions, the struc-
the sensor locations with the distance values more than tural states and, therefore, the corresponding measure-
the threshold limit are identified as the damaged areas ments are arranged, as shown in Table 1; accordingly,
of the structure. This means that the sensors should be the acceleration response at each sensor location always
arranged close to the possible damaged areas to ensure contains 8002 data samples. The healthy states are thus
the damage sensitivity. Under such circumstances, an assumed to be five, whereas the damage cases, at vary-
optimal and accurate sensor placement is of paramount ing damage levels, are the remaining four. As a sample,
importance to locate damage. However, it should be Figure 2 shows the acceleration responses at Sensor 8
mentioned that the strategy of finding the sensor loca- in Cases 1 and 5.
tion with distance value larger than the threshold limit Before moving to the results related to damage
is often valid for non-complex structural systems. This detection, the reasons for choosing the AR representa-
is because the damage localization procedure cannot tion are discussed. An effective way to identify the
simply be boiled down to tracking the location of a appropriateness of this model is through the
larger discrepancy in sensed signals or their damage Leybourne–McCabe (LMC) hypothesis test:60 in
indicators. Damage in structural systems may manifest essence, this test assesses the null hypothesis that a uni-
in complex forms depending on the system’s complex- variate time series conforms to an AR process, against
ity, boundary conditions, and localized nonlinear ele- the alternative hypothesis relevant to a non-stationary
ments. Therefore, this strategy needs possible autoregressive integrated moving average (ARIMA)
extensions and combinations with further SHM locali- process. In other words, if the test statistics QLM is
zation criteria. smaller than a c-value, which refers to as before to the
null hypothesis, one can argue that the time series is
stationary and the AR model is suitable to represent it.
Applications Figure 3 shows the LMC test statistics for the
healthy and damaged conditions. It can be seen that all
A numerical concrete beam the test statistics are smaller than the c-value, which
To verify the accuracy and performance of the pro- amounts to 0.1460 under a 5% significance limit; all
posed methods, a numerical benchmark model is first the acceleration time histories in all the cases can then
considered as shown in Figure 1. This benchmark was be considered stationary processes conforming to AR
constructed by Kullaa et al.55 as a realistic simulation ones. Accordingly, the AR model for feature extraction
of a simply supported concrete beam, cracked at the is assumed accurate for SHM purposes. The other rea-
mid-span cross-section and featuring a length of 5 m son that may help to verify the choice of AR modeling
and a height of 0.5 m. The beam was modeled in is to use the Box–Jenkins methodology,34 which pre-
ABAQUS,59 space discretized with four-node two- sents a graphical model identification approach. It uti-
dimensional finite elements. At the beam ends, the con- lizes the autocorrelation function (ACF) and partial
straints were provided at the neutral axis. For excita- autocorrelation function (PACF) of time series data
tion purposes, a uniform transverse random load was and identifies a suitable time series model among the
applied to the top surface. AR, MA, and ARMA representations. In this regard,
Entezami et al. 9
(a) (b)
Figure 2. The acceleration responses of Sensor 8: (a) Case 1 and (b) Case 5.
Table 1. Structural conditions relevant to the numerical Using the test measurements for Case 1, Figure 5(a)
concrete beam. shows the identified AR orders at all sensor locations,
as obtained with the proposed two-stage order selection
Case no. Label Measurement no. Damage level/crack method. To demonstrate the superiority of this method,
length (mm)
the orders obtained with the original iterative approach
1 Healthy 1-2 – and the BIC technique are shown in Figure 5(b) and
Healthy 3-4 – (c). It can be seen that the range of the AR orders
Healthy 5-6 – obtained with the improved method is upper bounded
Healthy 7-8 – by 26, whereas the original approach and the BIC tech-
Healthy 9-10 –
2 Damaged 11-12 10 nique provide results with orders up to 52 or 65,
3 Damaged 11-12 30 respectively.
4 Damaged 11-12 50 To further show how the improved order selection
5 Damaged 11-12 100 method yields low-order AR models, still preserving
their adequacy and accuracy by generating uncorre-
lated residuals, Figure 6 illustrates the variations of the
LBQ test statistics with the number of iterations for the
if the samples of ACF have an exponential decay form exemplary results related to Sensors 5 and 14 and
or behave as a sine wave without any trend in being Measurements 9-10 of Case 1, see Table 1. Using a 5%
zero and the samples of PACF gradually cut off after a significance level, in this case, the c-value turns out to
specific lag, these indicate that time series data conform be 31.4104: graphs show that the obtained orders,
to the AR process. This means that the most proper
namely k = 20 for Sensor 5 and k = 26 for Sensor 14,
time series model is AR. In contrast, if the samples of
enable the AR models to generate uncorrelated resi-
both the ACF and PACF have exponential decay
duals, with outcomes of the LBQ test smaller than the
forms or treat as sine waves, one can conclude that the
c-value. For more evaluations, Figure 7 indicates the
ARMA model is suitable for the time series data.
statistics for analyzing the uncorrelatedness and
With these descriptions, Figure 4 illustrates the ACF
Gaussianity (normality) of the residual samples in
and PACF plots of some vibration signals of the con-
Cases 1–5. In this regard, the LBQ test is applied to
crete beam for the first two tests (i.e. Measurements 1-
investigate whether the residual samples are uncorre-
2) of Case 1. As can be seen, all samples of the ACF
lated. For the problem of Gaussianity of the residuals,
behave as sine waves, while the samples of the PACF
the Anderson–Darling (AD) hypothesis test61 is appli-
gradually cut off approximately after 50 lags. These
cable. The outputs of this test are similar to the LBQ
observations confirm that the AR representation is suit-
test, in which case if the AD test statistics to be smaller
able for modeling the acceleration responses. It needs
than the c-value, one can infer that the residual samples
to mention that the same conclusion can be reached for
the other vibration signals in other cases. are Gaussian.
(a) (b)
Figure 3. Stationarity assessment and AR model identification by the LMC hypothesis test: (a) Case 1 and (b) Cases 2–5.
(a) (b)
(c) (d)
(e) (f)
Figure 4. Graphical model identification between the AR and ARMA representations by the Box–Jenkins methodology in Case 1:
(a) ACF at Sensor 4, (b) PACF at Sensor 4, (c) ACF at Sensor 8, (d) PACF at Sensor 8, (e) ACF at Sensor 11, and (f) PACF at
Sensor 11.
Entezami et al. 11
(a)
(b)
(c)
Figure 5. AR model orders for all test measurements relevant to Case 1, as obtained with (a) the proposed method, (b) the
original iterative method, and (c) the BIC method.
(a) (b)
Figure 6. LBQ test statistics for the test measurements (Measurements 9-10) of Case 1: (a) Sensor 5 and (b) Sensor 14.
As can be seen in Figure 7(a) and (b), the statistics observations prove the uncorrelatedness and
of both the LBQ and AD tests are smaller than the c- Gaussianity of the residual samples extracted from the
values under the 5% significance level. These AR models. Regarding the computational efficiency
(a) (b)
Figure 7. Analysis of the residual samples of the AR models in Cases 1–5 of the numerical beam: (a) assessment of the residual
uncorrelatedness by the LBQ test and (b) assessment of the residual Gaussianity by the AD test.
(a) (b)
Figure 8. Comparison among the method-dependent computational times relevant to (a) AR order selection and (b) complete
feature extraction process (residual-based feature extraction (RBFE)).
for the AR order selection, Figure 8(a) shows the com- training, and residual extraction during monitoring are
putational time of the improved and original iterative detailed. These results have been obtained by running
algorithms, and of the BIC approach, obtained using the analyses with MATLAB R2017a on a PC featuring
the acceleration time histories from all the sensors. an Intelä Core i7-3770, 3.40–3.90 GHz CPU and
Figure 8(b) provides instead a comparison among the 16 GB RAM.
times needed for the complete process of RBFE, by Figure 8 clearly points out that the proposed method
allowing for the datasets of Measurements 1-2 of Case is more efficient than the other techniques, as far as the
1 for the training period and of Measurements 11-12 of AR order selection is concerned; the other way around,
Case 2 for the monitoring phase. In the plot, the data it is less efficient than the others as for the residual
relevant to the main steps of AR order selection, para- extraction in the monitoring phase. Anyway, the pro-
meter estimation and residual extraction during posed method generally provides a faster feature
Entezami et al. 13
(a)
(b)
(c)
Figure 9. Case-dependent segmentation of the residual vectors at all sensor locations: (a) number of segments, (b) number of
samples of Eyr in the first m – 1 segments, and (c) number of samples of Eyr in the last segment.
extraction since it does not feature a further parameter This setting is then used in equations (7) to (11), to
estimation step in the training phase, which is shown to compute the empirical probability measures for all the
be relatively time-consuming for the conventional segments, and the distance quantities through
approach. As the process of order selection has the KLDEPM at all sensors for damage localization. In
highest influence on the overall computational time of this example, the single vertical crack shown in Figure
feature extraction, the novel iterative algorithm thus 1 is located near Sensor 8; hence, the attempt is to
represents a time-saving unsupervised learning strategy assess whether the proposed KLDEPM method is able
for feature extraction. to identify such location as the damaged area. For
By adopting the model order and coefficients comparison purposes, damage localization is per-
obtained in the training phase, the residual vectors Exk formed also by the classical KLD method and by the
and Eyr at each sensor are extracted and used as the state-of-the-art KSTS technique. Based on the metho-
main DSFs for the healthy and damaged conditions, dology for threshold estimation described in section
respectively. The setting of Ny and m on the basis of ‘‘Unsupervised learning strategy,’’ the threshold values
segmentation depends on thepnumber of the residual
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi for the KLDEPM, KLD, and KSTS methods are set as
sequences, which is equal to n k(k + 1)=2; referring 0.1007, 0.2171, and 0.0361, respectively. The sensor
to the example in Figure 6(a), the order k of the AR location associated with a distance value greater than
model at Sensor 5 results to be 20, so that the number these threshold levels is accordingly identified as the
of samples within each segment and the number of seg- damaged area: Figure 11 illustrates the results of dam-
ments are rounded to Ny = m = 88. Figure 9 collects age localization obtained with the three methods,
the results in terms of Ny and m values at all sensor wherein the mean value of the distances between the
locations; here, the group numbering along the vertical residual vectors referring to the damaged and to all the
axes refers to the healthy cases reported in Table 1. normal conditions is used to identify it.
Finally, also the number Nx of samples related to the As reported in Figure 11, all the distance quantities
residual vectors in the normal conditions has to be set; furnished by KLDEPM and KLD exceed the threshold
as an example, Figure 10 shows exemplary values for limits at Sensor 8, so these methods prove successful in
the residual vectors relevant to Sensors 1–4 and to the locating the damage at varying severity levels.
first measurement of Case 1. Although the same outcome is reported by KSTS for
(a)
(b)
(c)
(d)
Figure 10. Number of samples in Exk based on the segmentation of Eyr in the first measurement of Case 1 relevant to (a) Sensor
1, (b) Sensor 2, (c) Sensor 3, and (d) Sensor 4.
Cases 3–5, it can be seen that this technique fails in the robustness of the proposed method in the presence
identifying the location in case of small damages (as of multiple damage cases. The benchmark consists of a
Case 2 corresponds to a crack length of 10 mm), giving scaled four-story steel structure, made of a 2-bay-by-2-
rise to a false-negative damage indication or Type II bay steel frame with a size of 2.5 m 3 2.5 m 3 3.6 m,
error. Figure 11 also conveys the important conclusion as shown in Figure 13(a). The members were hot-rolled
that one of the main reasons for achieving the accurate grade 300W steel, with a nominal yield strength of
damage localization with any distance method (except 300 MPa. The columns and floor beams featured
Case 2 with the KSTS technique) is due to the high sen- B100 3 9 and S75 3 11 cross sections, respectively.
sitivity to damage of the AR model residuals obtained For each bay, the bracing system consisted of two diag-
with the proposed feature extraction method. onal threaded steel rods with a diameter of 12.7 mm.
Besides the issue of damage identification, results To make the mass distribution realistic, slabs were
related to the computational times of damage localiza- placed in each bay with a mass of 1000 kg at the first,
tion are compared in Figure 12. The proposed second, and third floors and of 750 kg at the fourth
KLDEPM method is shown to be much faster than the floor.
others, and it thus results to provide the best perfor- The structure was excited in different ways; in this
mance also in this regard. The classical KLD technique work, account is taken of the experimental responses
requires instead the largest computing time, while the obtained with an electro-dynamic shaker mounted on
KSTS technique is a bit less computational demanding, the top level of the structure. Acceleration time histories
although it must be borne in mind that it also leads to caused by this excitation were measured by the 15 accel-
the erroneous damage localization in Case 2. Overall, erometers shown in Figure 13(b), at a frequency of
Figures 11 and 12 lead to the conclusion that the pro- 200 Hz; each file of acceleration response thus consists
posed KLDEPM method is able to locate damage by of 24,000 samples. The accelerometers located on the
means of a fast unsupervised learning strategy. east and west frames measured the accelerations along
the strong axis, or north-south (N-S) direction: Sensors
1, 4, 7, 10, and 13 were placed on the west side, while
A four-story steel structure Sensors 3, 6, 9, 12, and 15 were placed on the east side,
The experimental datasets of the second phase of the with numbering moving from the base and going up to
IASC-ASCE benchmark56 are now exploited, to verify the fourth floor. Accelerometers 2, 5, 8, 11, and 14 were
Entezami et al. 15
(a)
(b)
(c)
Figure 11. Damage localization in the numerical beam, as obtained with the (a) KLDEPM, (b) KLD, and (c) KSTS methods.
Acceleration time histories acquired with Sensors

1–3 mounted on the base are not incorporated in this
analysis since they do not provide relevant information
concerning the dynamic behavior of the structure. Nine
damage scenarios were simulated for this structure, by
removing some braces from the east side and southeast
corner (first damage pattern) or by loosening bolts at
the beam–column connections of the east side (second
damage pattern). The five damage scenarios belonging
to the aforementioned first damage pattern are consid-
ered next, as detailed in Table 2. For example, Figure
14 shows the excitation force and acceleration responses
at Sensor 9 in Cases 1 and 2.
First of all, the LMC hypothesis test is employed to
assure the accuracy of the adopted AR model for fea-
ture extraction. Figure 15 collects the LMC test statis-
tics at all sensors for Cases 1–5. Using a 5%
Figure 12. Comparison among KLDEPM, KLD, and KSTS
methods, in terms of the computational time for damage
significance limit, the c-value is now equal to 0.1460;
localization in the numerical beam. the plot shows that all the values of QLM are smaller
than this test c-value, so that once again all the mea-
sured acceleration responses can be assumed stationary
instead placed in the vicinity of the central columns, and compatible with an AR process. Furthermore,
again with numbering from the base to the fourth floor, Figure 16 shows the plots of the ACF and PACF for
to measure the accelerations along the weak axis or the graphical model identification through the Box–
east-west (E-W) direction. Jenkins methodology for some vibration signals of
Figure 13. (a) The four-story steel structure of the IASC-ASCE SHM benchmark problem56 and (b) details of the structure plan
for the base, first, second, third, and fourth floors.
(a)
Figure 15. Stationarity assessment of the AR model

identification by the LMC hypothesis test.
(b)
samples of the PACF (the right plots) gradually cut off
approximately after more than 50 lags. These observa-
tions demonstrate that the most suitable time series
model between AR and ARMA for the vibration sig-
nals is AR. It is worth remarking that the same conclu-
sion can be achieved for the other responses.
(c) To handle different normal conditions in the training
phase, which look essential for the threshold estima-
Figure 14. (a) The excitation force, (b) the acceleration tion, the acceleration responses associated with Case 1
response at Sensor 9 in Case 1, and (c) the acceleration are randomly contaminated by five different noise lev-
response at Sensor 9 in Case 2. els. In this way, five different acceleration datasets for
the healthy state of the IASC-ASCE structure are built,
and they can be considered equivalent to five normal
Case 1. It is seen that the samples of the ACF (the left conditions or five test measurements for Case 1 in the
plots) are similar to sine waves without any trend in training stage (Nc = 5). Figure 17 gathers the orders of
being zero. By contrast, one can discern that the all the AR models furnished by the improved and
Entezami et al. 17
Table 2. Healthy and damaged conditions of the IASC-ASCE structure (Phase II).
Case no. Label Description
1 Healthy Fully braced configuration—no damage

2 Damaged All braces removed from Floors 1–4 at the east side
3 Damaged All braces removed from Floors 1–4 at the southeast corner
4 Damaged Braces removed from Floors 1 and 4 at the southeast corner
5 Damaged Braces removed from Floor 1 at the southeast corner
IASC: International Association for Structural Control; ASCE: American Society of Civil Engineers.
(a) (b)
(c) (d)
(e) (f)
Figure 16. Graphical model identification between the AR and ARMA representations by the Box–Jenkins methodology in Case 1:
(a) ACF at Sensor 6, (b) PACF at Sensor 6, (c) ACF at Sensor 12, (d) PACF at Sensor 12, (e) ACF at Sensor 14, and (f) PACF at
Sensor 14.
original iterative methods as well as by the BIC tech- The proposed order selection method is able to guar-
nique, handling the mentioned five measurements for antee model accuracy with uncorrelated residuals:
Case 1. The results show that the improved method Figure 18 depicts the LBQ test statistics for the two
gives rise to orders smaller than the other techniques lowest and highest AR orders, respectively, obtained
also in this analysis. Overall, the BIC technique turns with the first measurement at Sensors 9 and 15. In the
out to be always the worst one in terms of two cases, the QLB value at Iterations 20 and 36
performance. becomes smaller than the c-value threshold, which
(a)
(b)
(c)
Figure 17. AR model orders for all the five measurements relevant to Case 1: outcomes of (a) the proposed method, (b) the
original method, and (c) the BIC method.
(a) (b)
Figure 18. LBQ test statistics for the first measurement of Case 1: (a) Sensor 9 and (b) Sensor 15.
amounts to 31.4104 for a 5% significance limit; similar of the LBQ and AD tests for analyzing the uncorrelat-
conclusions are arrived at for all the other locations and edness and Gaussianity of the residual samples regard-
measurements. Moreover, Figure 19 shows the statistics ing all sensors in Cases 1–5. Once again, it is seen that
Entezami et al. 19
(a) (b)
Figure 19. Analysis of the residual samples of the AR models in Cases 1–5 of the IASC-ASCE structure: (a) assessment of the
residual uncorrelatedness by the LBQ test and (b) assessment of the residual Gaussianity by the AD test.
Case 1 only are handled to set the computational time

of residual extraction in the training and monitoring
phases.
Results show that the improved order selection
method needs very little computational effort if com-
pared to the other approaches. The new RBFE method
provides also a better performance against the conven-
tional approach. As already shown for the cracked
beam, the step of residual extraction in the monitoring
stage requires more time than with the conventional
technique, due to the repeated iterative process for resi-
dual extraction. This may be a problem for SHM appli-
cations because the feature extraction in the training
phase can be practically performed only for one time in
advance, while during the monitoring phase, the fea-
ture extraction requires to be fast implemented for
more than one time as long as the damage detection is
required. Nevertheless, it should be mentioned that the
improved method possesses several advantages such as
Figure 20. Computational times for AR order selection in the determining a small and adequate order for the AR
IASC-ASCE structure, as obtained with the improved, original, model, simultaneously obtaining the model order, coef-
and BIC methods. ficients, and residuals in the training phase, and provid-
ing a total short time for a complete feature extraction,
all of which enable the improved method to be a time-
all statistics are smaller than the tests’ c-values, which saving unsupervised learning strategy for feature
approve that the extracted residual samples are uncor- extraction.
related and Gaussian. Once the residuals of the AR model at each sensor
Figures 20 and 21 illustrate the computational time have been extracted for the undamaged and damaged
relevant to the AR order selection, and to the complete conditions, their sensitivity to damage is assessed. To
process of RBFE. In Figure 20, the vibration responses aim this, the L2-norms of the residual vectors Exk and
from all the sensors in the five measurements of Case 1 Eyr at all sensors are computed, and the results are col-
are adopted to compute the time needed for the deter- lected in Figure 22; here, the samples in the range of 1–
mination of the AR orders. As for Figure 21, the accel- 60 belong to the measurements during training, while
eration datasets of Case 2 and the first measurement of the subsequent samples in the range of 61–108 are
Figure 21. Computational times for the complete process of feature extraction in the IASC-ASCE structure (residual-based
feature extraction (RBFE)): comparison among the performances of the improved, original, and BIC methods.
graphs, thresholds amount to 0.0234, 0.3577, and

0.0461 for the KLDEPM, KLD, and KSTS methods,
respectively. As before, the mean distance values
between the residual vectors relevant to each damaged
state and to the normal conditions are computed, to
provide the results in terms of damage identification.
Regarding Cases 2 and 3, Figures 25(a) and (b) to
27(a) and (b) show that the damage locations (DLs) are
correctly identified by the three methods at the east side
of Floors 1–4, since the distance values at Sensors 6, 9,
12, and 15 exceed the threshold. For Cases 4 and 5, in
Figure 22. L2-norm of the residuals in the training (1–60) and Figures 25(c) and (d) and 26(c) and (d), it is seen that
monitoring (61–108) phases. both the KLDEPM and KLD methods succeed in
locating the damaged areas at the southeast corner of
related to monitoring. As the L2-norm values relevant either Floor 1 or Floor 4. These outcomes are related
to the damaged conditions highly increase in compari- to distance values exceeding the threshold at Sensors 6
son with the values referring to the normal conditions, and 15 for Case 4, and at Sensor 6 for Case 5. Figure
a confirmation is provided of the sensitivity to damage 27(c) shows that for Case 4 that KSTS can also identify
of the residuals extracted with the proposed feature the DLs at Sensors 6 and 15, but a false-positive dam-
extraction method. age indication (Type I error) is observable at Sensor 5,
Figures 23 and 24 show the details of the segmenta- which is instead an undamaged location (UDL). This
tion of the residual sequences for Cases 2–5, in terms of technique is also incapable of identifying the damaged
the number of segments, and the number of samples area in Case 5, due to the reported false-negative indi-
within all the m segments. In the following, the number cation (Type II error) at Sensor 6 shown in Figure
of samples (Nx) of Exk is determined as shown in Figure 27(d). Furthermore, Figure 28 compares the time
24 for some exemplary sensor locations. It is to note required for damage identification through the three
that the black areas in Figure 23 are related to the infor- distance approaches: the best performance is provided
mation provided by Sensors 1–3 mounted on the base again by the KLDEPM method. Thus, also in this case,
of the structure, which as mentioned earlier has not it emerges as a fast decision-making process within an
been considered in the procedures for feature extraction unsupervised learning strategy for damage localization.
and damage localization. In most cases of SHM problems, the environmental
The KLDEPM method is adopted to locate damage, and operational variability conditions are inevitable.52
providing the results reported in Figure 25; for compar- To indicate the effects of these conditions on the proce-
ison, Figures 26 and 27 collect the same results, as dures of feature extraction and damage localization, it
obtained with the KLD and KSTS techniques. In these is assumed that Case 5 is an undamaged state with the
Entezami et al. 21
(a)
(b)
(c)
Figure 23. Measurement-dependent segmentation of the residual vectors at all sensor locations: (a) number of segments, (b)
number of samples of Eyr in the first m – 1 segments, and (c) number of samples of Eyr in the mth segment.
(a)
(b)
(c)
(d)
Figure 24. Number of samples in Exk based on the segmentation of Eyr , for the first measurement of Case 1 at (a) Sensor 4, (b)
Sensor 8, (c) Sensor 11, and (d) Sensor 14.
(a) (b)
(c) (d)
Figure 25. Damage localization in the IASC-ASCE structure by the proposed KLDEPM method: (a) Case 2, (b) Case 3, (c) Case 4,
and (d) Case 5.
(a) (b)
(c) (d)
Figure 26. Damage localization in the IASC-ASCE structure by the classical KLD technique: (a) Case 2, (b) Case 3, (c) Case 4, and
(d) Case 5.
Entezami et al. 23
(a) (b)
(c) (d)
Figure 27. Damage localization in the IASC-ASCE structure by the KSTS technique: (a) Case 2, (b) Case 3, (c) Case 4, and (d)
Case 5.
this assumption was already used by Entezami and

Shariatmadar.38 In fact, the basis of this assumption
comes from the simulation of the environmental varia-
tions by reducing the structural stiffness, which was
simulated on the laboratory frame of the Los Alamos
National Laboratory in the United States.32 Under
such circumstances, the normal conditions are Cases 1
and 5 (Nc = 2), which are applied to estimate a thresh-
old limit for damage localization. All the steps of fea-
ture extraction (i.e. AR order determination, parameter
estimation, and residual extraction in the training and
monitoring phases) are implemented to locate damage
in Cases 2–4 as shown in Figure 29, where the dashed
lines are the threshold limit equal to 0.0544. As can be
discerned, the same accurate results as Figure 25 are
observable in Figure 29. This means that the proposed
methods succeed in extracting reliable features and
Figure 28. Comparison among the KLDEPM, KLD, and KSTS
accurately locating damage in Cases 2–4 even under the
methods, in terms of the computational time for damage
localization in the IASC-ASCE structure. simulated environmental variability.
environmental variability attributable to the reduction Conclusion

in stiffness by removing the only braces on one bay at An innovative feature extraction approach, through
the first story of the IASC-ASCE structure. Note that AR modeling and a statistical distance method termed
(a) (b) (c)
Figure 29. Damage localization in the IASC-ASCE structure using the proposed KLDEPM method by considering the simulated
environmental variability: (a) Case 2, (b) Case 3, and (c) Case 4.
KLDEPM, have been proposed to locate structural KLDEPM is superior in locating small damages, with
damage based on an unsupervised learning strategy. Type I and Type II errors never observed.
The proposed method consists of an improved AR One can then conclude that the proposed feature
order selection, an iterative residual extraction algo- extraction and KLDEPM methods can be efficient and
rithm for the training stage, and another iterative resi- successful for SHM purposes, particularly regarding dam-
dual extraction process for the monitoring phase. The age localization, with the great advantage of working
KLDEPM method is based on the theory of empirical within the frame of an unsupervised learning strategy.
probability measure and rests on a segmentation of the
residual sequences into independent sets. A numerical Acknowledgements
concrete beam and the IASC-ASCE benchmark struc- The authors are indebted to Prof. Jyrki Kullaa at Aalto
ture have been considered to assess the accuracy and University and to the IASC-ASCE Structural Health
performance of the proposed method. Comparative Monitoring Task Group for providing the benchmark
analyses with alternative state-of-the-art procedures datasets.
have been conducted to testify the superior perfor-
mance of the proposed methods. Funding
The results of feature extraction have shown that the The author(s) disclosed receipt of the following financial sup-
improved order selection method guarantees the accu- port for the research, authorship, and/or publication of this
racy and sufficiency of the AR model by generating article: This research was supported by the Iran National
uncorrelated residuals and also provides model orders Science Foundation (INSF) under grant no. 96007230.
smaller than those obtained with the alternative tech-
niques. Accordingly, this method results to be less com- ORCID iDs
putationally demanding, also because the process of Alireza Entezami https://orcid.org/0000-0002-4864-2120
order selection has the highest influence on the total Hashem Shariatmadar https://orcid.org/0000-0001-5966-
time required for feature extraction. By means of the 3317
L2-norms in the undamaged and damaged conditions,
the residuals of the AR model extracted from the pro- References
posed feature extraction approach have been confirmed
1. Cawley P. Structural health monitoring: closing the gap
to be highly sensitive to damage. between research and industrial deployment. Struct
In the statistical decision-making process for damage Health Monit 2018; 17: 1225–1244.
localization, the results have proven that the KLDEPM 2. Farrar CR and Worden K. Structural health monitoring:
method is capable of locating damage of varying sever- a machine learning perspective. Hoboken, NJ: John Wiley
ity, particularly for scenarios featuring small damage & Sons Ltd, 2013.
levels. Since the same conclusion has been obtained 3. Kopsaftopoulos F and Fassois S. Vibration based health
with the (less computationally efficient) classical KLD monitoring for a lightweight truss structure: experimental
technique, the statistical distance measuring the discre- assessment of several statistical time series methods. Mech
pancy between two datasets by a logarithmic function Syst Sig Process 2010; 24: 1977–1997.
has shown to be a reliable tool for damage identifica- 4. Kopsaftopoulos FP and Fassois SD. A vibration model
residual-based sequential probability ratio test framework
tion. The comparative analysis has also confirmed that
Entezami et al. 25
for structural health monitoring. Struct Health Monit 20. Da Silva S, Gianini Gonsalez C and Lopes Jr V. Adaptive
2015; 14: 359–381. filter feature identification for structural health monitor-
5. Barthorpe RJ. On model-and data-based approaches to ing in an aeronautical panel. Struct Health Monit 2010;
structural health monitoring. Sheffield: The University of 10: 481–489.
Sheffield, 2010. 21. Adams DE and Farrar CR. Classifying linear and non-
6. Sakaris CS, Sakellariou JS and Fassois SD. Vibration- linear structural damage using frequency domain ARX
based damage precise localization in three-dimensional models. Struct Health Monit 2002; 1: 185–201.
structures: single versus multiple response measurements. 22. Entezami A and Shariatmadar H. Damage localization
Struct Health Monit 2015; 14: 300–314. under ambient excitations and non-stationary vibration
7. Mottershead JE, Link M and Friswell MI. The sensitivity signals by a new hybrid algorithm for feature extraction
method in finite element model updating: a tutorial. Mech and multivariate distance correlation methods. Struct
Syst Sig Process 2011; 25: 2275–2296. Health Monit 2019; 18: 347–375.
8. Capellari G, Eftekhar Azam S and Mariani S. Damage 23. Nair KK, Kiremidjian AS and Law KH. Time series-
detection in flexible plates through reduced-order model- based damage detection and localization algorithm with
ing and hybrid particle-Kalman filtering. Sensors 2016; application to the ASCE benchmark structure. J Sound
16: 2. Vib 2006; 291: 349–368.
9. Azam SE, Mariani S and Attari N. Online damage detec- 24. Xie L and Mita A. An innovative substructure damage
tion via a synergy of proper orthogonal decomposition identification approach for shear structures based on
and recursive Bayesian filters. Nonlinear Dyn 2017; 89: ARMAX models. Procedia Eng 2017; 188: 119–124.
1489–1511. 25. Ay AM and Wang Y. Structural damage identification
10. Eftekhar Azam S and Mariani S. Online damage detec- based on self-fitting ARMAX model and multi-sensor
tion in structural systems via dynamic inverse analysis: data fusion. Struct Health Monit 2014; 13: 445–460.
a recursive Bayesian approach. Eng Struct 2018; 159: 26. Sohn H, Allen DW, Worden K, et al. Statistical damage
28–45. classification using sequential probability ratio tests.
11. Capellari G, Chatzi E and Mariani S. Structural health Struct Health Monit 2003; 2: 57–74.
monitoring sensor network optimization through Baye- 27. Kopsaftopoulos F, Nardari R, Li Y-H, et al. A stochastic
sian experimental design. ASCE ASME J Risk Uncertain global identification framework for aerospace structures
Eng Syst A Civ Eng 2018; 4: 04018016. operating under varying flight states. Mech Syst Sig Pro-
12. Sarmadi H, Karamodin A and Entezami A. A new itera- cess 2018; 98: 425–447.
tive model updating technique based on least squares 28. Avendaño-Valencia LD, Chatzi EN, Koo KY, et al.
minimal residual method using measured modal data. Gaussian process time-series models for structures under
Appl Math Modell 2016; 40: 10323–10341. operational variability. Front Built Environ 2017; 3: 69.
13. Rezaiee-Pajand M, Entezami A and Sarmadi H. A sensi- 29. Bogoevska S, Spiridonakos M, Chatzi E, et al. A data-
tivity-based finite element model updating based on driven diagnostic framework for wind turbine structures:
unconstrained optimization problem and regularized a holistic approach. Sensors 2017; 17: 720.
solution methods. Structural Control and Health Monitor- 30. Shi H, Worden K and Cross EJ. A cointegration
ing 2019; e2481. approach for heteroscedastic data based on a time series
14. Fan W and Qiao P. Vibration-based damage identifica- decomposition: an application to structural health moni-
tion methods: a review and comparative study. Struct toring. Mech Syst Sig Process 2019; 120: 16–31.
Health Monit 2011; 10: 83–111. 31. Yao R and Pakzad SN. Autoregressive statistical pattern
15. Meruane V and Heylen W. Damage detection with paral- recognition algorithms for damage detection in civil struc-
lel genetic algorithms and operational modes. Struct tures. Mech Syst Sig Process 2012; 31: 355–368.
Health Monit 2010; 9: 481–496. 32. Entezami A and Shariatmadar H. Structural health mon-
16. Entezami A, Shariatmadar H and Sarmadi H. Structural itoring by a new hybrid feature extraction and dynamic
damage detection by a new iterative regularization time warping methods under ambient vibration and non-
method and an improved sensitivity function. J Sound stationary signals. Measurement 2019; 134: 548–568.
Vib 2017; 399: 285–307. 33. Figueiredo E, Figueiras J, Park G, et al. Influence of the
17. Fassois SD and Sakellariou JS. Statistical time series autoregressive model order on damage detection. Comput
methods for structural health monitoring. In: Boller C, Aid Civ Infrastruct Eng 2011; 26: 225–238.
Chang F-K and Fujino Y (eds) Encyclopedia of structural 34. Box GE, Jenkins GM, Reinsel GC, et al. Time series anal-
health monitoring. Hoboken, NJ: John Wiley & Sons, ysis: forecasting and control. 5th ed. Hoboken, NJ: John
Ltd, 2009, pp. 443–472. Wiley & Sons, 2015.
18. Hoell S and Omenzetter P. Optimal selection of autore- 35. Hyndman RJ and Athanasopoulos G. Forecasting: princi-
gressive model coefficients for early damage detectability ples and practice. Melbourne, VIC, Australia: OTexts, 2014.
with an application to wind turbine blades. Mech Syst Sig 36. Broersen PM. Finite sample criteria for autoregressive order
Process 2016; 70-71: 557–577. selection. IEEE Trans Sig Process 2000; 48: 3550–3558.
19. Sohn H, Czarnecki JA and Farrar CR. Structural health 37. Vu VH, Thomas M, Lakis A, et al. Operational modal
monitoring using statistical process control. Journal of analysis by updating autoregressive model. Mech Syst Sig
Structural Engineering 2000; 126: 1356–1363. Process 2011; 25: 1028–1044.
38. Entezami A and Shariatmadar H. An unsupervised learn- compromised autoregressive modeling with the aid of
ing approach by novel damage indices in structural health hypothesis tests and simulation analysis. Mech Syst Sig
monitoring for damage localization and quantification. Process 2007; 21: 1953–1982.
Struct Health Monit 2017; 17: 325–345. 49. Zhen D, Wang T, Gu F, et al. Fault diagnosis of motor
39. Rezaiee-Pajand M, Entezami A and Shariatmadar H. An drives using stator current signal analysis based on
iterative order determination method for time-series mod- dynamic time warping. Mech Syst Sig Process 2013; 34:
eling in structural health monitoring. Adv Struct Eng 191–202.
2017; 21: 300–314. 50. Mosavi AA, Dickey D, Seracino R, et al. Identifying
40. George RC, Mishra SK and Dwivedi M. Mahalanobis damage locations under ambient vibrations utilizing vec-
distance among the phase portraits as damage feature. tor autoregressive models and Mahalanobis distances.
Struct Health Monit 2018; 17: 869–887. Mech Syst Sig Process 2012; 26: 254–267.
41. Yeager M, Gregory B, Key C, et al. On using robust 51. Mujica L, Rodellar J, Fernandez A, et al. Q-statistic and
Mahalanobis distance estimations for feature discrimina- T2-statistic PCA-based measures for damage assessment
tion in a damage detection scenario. Struct Health Monit in structures. Struct Health Monit 2011; 10: 539–553.
2019; 18: 245–253. 52. Liu G, Mao Z, Todd M, et al. Damage assessment with
42. Sarmadi H and Karamodin A. A novel anomaly detec- state–space embedding strategy and singular value
tion method based on adaptive Mahalanobis-squared dis- decomposition under stochastic excitation. Struct Health
tance and one-class kNN rule for structural health Monit 2014; 13: 131–142.
monitoring under environmental effects. Mechanical Sys- 53. Entezami A, Shariatmadar H and Karamodin A. Data-
tems and Signal Processing. Epub ahead of print 9 driven damage diagnosis under environmental and opera-
December 2019. DOI: 10.1016/j.ymssp.2019.106495. tional variability by novel statistical pattern recognition
43. Sarmadi H, Entezami A and Daneshvar Khorram M. methods. Struct Health Monit 2018; 18: 1416–1443.
Energy-based damage localization under ambient vibra- 54. Figueiredo E, Park G, Farrar CR, et al. Machine learning
tion and non-stationary signals by ensemble empirical algorithms for damage detection under operational and
mode decomposition and Mahalanobis-squared distance. environmental variability. Struct Health Monit 2011; 10:
Journal of Vibration and Control. Epub ahead of print 2 559–572.
December 2019. DOI: 10.1177/1077546319891306. 55. Kullaa J, Santaoja K and Eymery A. Vibration-based
44. Zheng H and Mita A. Localized damage detection of structural health monitoring of a simulated beam with a
structures subject to multiple ambient excitations using breathing crack. Key Eng Mater 2013; 1093–1100. https://
two distance measures for autoregressive models. Struct www.scientific.net/KEM.569-570.1093
Health Monit 2009; 8: 207–222. 56. Dyke SJ, Bernal D, Beck J, et al. Experimental phase II
45. Eybpoosh M, Berges M and Noh HY. An energy-based of the structural health monitoring benchmark problem.
sparse representation of ultrasonic guided-waves for In: Proceedings of the 16th ASCE engineering mechanics
online damage detection of pipelines under varying envi- conference, 2003. https://authors.library.caltech.edu/
ronmental and operational conditions. Mech Syst Sig 34226/
Process 2017; 82: 260–278. 57. Kullback S and Leibler RA. On information and suffi-
46. Delpha C, Diallo D and Youssef A. Kullback-Leibler ciency. Ann Math Stat 1951; 22: 79–86.
divergence for fault estimation and isolation: application 58. Wang Q, Kulkarni SR and Verdú S. Divergence estima-
to Gamma distributed data. Mech Syst Sig Process 2017; tion of continuous distributions based on data-dependent
93: 118–135. partitions. IEEE Trans Inf Theory 2005; 51: 3064–3074.
47. Roy K, Bhattacharya B and Ray-Chaudhuri S. ARX 59. Dassault Systemes Simulia Corp. ABAQUS/CAE V6.12.
model-based damage sensitive features for structural Waltham, MA: Dassault Systemes Simulia Crop, 2012.
damage localization using output-only measurements. 60. Leybourne SJ and McCabe BP. A consistent test for a
J Sound Vib 2015; 349: 99–122. unit root. J Bus Econ Stat 1994; 12: 157–166.
48. Zhan Y and Mechefske CK. Robust detection of gearbox 61. Anderson TW and Darling DA. Asymptotic theory of
deterioration using compromised autoregressive modeling certain goodness of fit criteria based on stochastic pro-
and Kolmogorov–Smirnov test statistic—part I: cesses. Ann Math Stat 1952; 23: 193–212.

Fast Unsupervised Learning Methods For Structural Health Monitoring With Large Vibration Data From Dense Sensor Networks

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Fast Unsupervised Learning Methods For Structural Health Monitoring With Large Vibration Data From Dense Sensor Networks

Uploaded by

Copyright:

Available Formats

Original Article

Structural Health Monitoring

Fast unsupervised learning methods Ó The Author(s) 2019

Alireza Entezami1,2 , Hashem Shariatmadar1 and Stefano Mariani2

Introduction non-recoverable variations of the structural stiffness

and iterative steps of the proposed feature extraction

A novel statistical distance measure for X Px ð zÞ

number of samples in each segment. ð10Þ

Fifteen sensors were supposed to be mounted on the

Acceleration time histories acquired with Sensors

Figure 15. Stationarity assessment of the AR model

Case no. Label Description

1 Healthy Fully braced configuration—no damage

Case 1 only are handled to set the computational time

graphs, thresholds amount to 0.0234, 0.3577, and

this assumption was already used by Entezami and

environmental variability attributable to the reduction Conclusion

(a) (b) (c)

You might also like