Structures
journal homepage: www.elsevier.com/locate/structures
ARTICLE INFO

Keywords: Seismic fragility analysis; Structural Reliability; Probabilistic analysis; Deep Learning; Neural Networks

ABSTRACT

Bridges are critical but vulnerable components in a transportation network as they are exposed to the threats induced by long-term aging effects as well as natural hazards such as earthquakes. The traditional seismic fragility analysis is associated with high computational cost, making it infeasible for the cases requiring multiple fragility analyses, such as evaluating time-dependent seismic fragility for deteriorating facilities, or a transportation network involving many bridges. In this study, a deep learning-aided seismic fragility analysis method is proposed to improve the computational efficiency. Fragility analysis is transformed into a binary classification problem. An improved deep neural network classification algorithm with a new activation function is proposed and benchmarked with traditional deep neural networks and other machine learning counterparts. The accuracy and the robustness of the new method are demonstrated by examples.
1. Introduction

As a key component of transportation networks, bridges play a crucial role in maintaining the accessibility of a region. The damage or failure of bridges may lead to catastrophic loss of life and massive economic loss. The importance of seismic fragility assessment for bridges has been highlighted for many decades. To quantitatively evaluate the damage loss and seismic performance of bridges, fragility curves are typically constructed. A fragility curve is defined as a conditional probability giving the likelihood that a component or a structure meets or exceeds a certain level of damage for a given ground motion intensity measure (IM). Seismic fragility curves provide a reliable reference for retrofit, loss evaluation, and post-event prediction and inspection for bridge owners and decision makers [22,27,28].

Various approaches for seismic fragility analysis of civil facilities have been developed in the literature, including expert-based approaches, empirical approaches based on observed damage from historic earthquakes, and the most widely used analytical approaches, which use numerical simulations accounting for the uncertainties from ground motion and structural properties [2,3,7,26,35]. The analysis-based fragility method generates seismic fragility curves through a series of non-linear time history analyses (NLTHA) of the bridges under different ground motion intensities. A suite of ground motions is paired with the same number of bridge simulations, and a non-linear time history analysis is then conducted for each ground motion-bridge pair. The peak structural responses of key components are recorded to generate the probabilistic seismic demand model. The fragility curves can then be obtained by comparing the seismic demand and structural capacity models. The seismic demand analysis generally requires repeated, costly three-dimensional non-linear time history analyses. The high computational cost is limiting when dealing with complex engineering problems, such as generating time-dependent seismic fragility curves for deteriorating facilities considering aging effects along the service life, or conducting seismic vulnerability assessment for a transportation network with many bridges. The computational demand of such problems makes the traditional NLTHA-based fragility methodologies time-consuming and impractical.

To address the issue of computational cost, statistical learning techniques were introduced into the fragility analysis to replace the non-linear time-history analysis process, providing surrogate models for the “true” seismic demand of bridges [9]. Polynomial probabilistic seismic demand models were developed which are conditioned on the ground motion intensity IM, taking the median of the demand as a continuous function of the IM based on regression of the simulation responses [5,8,21,27]. Some researchers developed multi-parameter demand models and generated parameterized fragility curves for bridges using the response surface models and logistic regression
* Corresponding author.
E-mail addresses: mwan6678@uni.sydney.edu.au (M. Wang), hao.zhang@sydney.edu.au (H. Zhang), hzdai@hit.edu.cn (H. Dai), luming.shen@sydney.edu.au
(L. Shen).
https://doi.org/10.1016/j.istruc.2022.04.058
Received 22 November 2021; Received in revised form 12 April 2022; Accepted 20 April 2022
Available online 5 May 2022
2352-0124/© 2022 Institution of Structural Engineers. Published by Elsevier Ltd. All rights reserved.
M. Wang et al. Structures 40 (2022) 1056–1064
techniques [6,30,33]. Seo and Linzell [33] used a statistical screening approach to identify the critical range for key components of bridges and developed multi-parameter fragility curves. Kameshwar and Padgett [15] developed a multi-hazard parameterized fragility function for bridges. Stepwise logistic regression and a sequential forward selection scheme were used to generate fragility curves. A multi-span simply supported reinforced concrete (RC) bridge was studied to demonstrate the approach. Jeon, Mangalathu, Song, and Desroches [13] developed a multi-parameter fragility estimation using a Bayesian framework including the selection of bridge class, bridge attributes and the variation of geometric parameters. The multi-parameter fragility curves were developed using a Bayesian parameter estimation approach and logistic regression techniques. Pang, Dang, and Yuan [29] developed an Artificial Neural Network (ANN) based approach to generate fragility curves for bridges. Mangalathu, Heo, and Jeon [23] applied artificial neural networks in the development of multi-parameter seismic fragility curves for a skewed concrete bridge class, considering the variation in the number of spans, number of seat abutments, column bents and different degrees of skew angle.

In the present study, an efficient deep learning-based fragility estimation methodology is developed for the seismic fragility assessment of bridges. To address the limitations of the existing deep neural networks (DNN), a new improved activation function, referred to as DLog (double log function), is proposed. The performance of the proposed activation function is evaluated through experiments and benchmarks. Based on the proposed DLog activation function, a DNN-based fragility framework is developed. Numerical tests and examples were carried out on the proposed methodology to evaluate and validate its performance against the traditional non-linear time history-based fragility approach.

2. An improved activation function for deep neural networks

For highly repetitive analyses, deep learning models can be good surrogates with high accuracy and efficiency. The mechanism of generating fragility curves using deep learning can be simply regarded as a decision-making process, which compares demand with capacity to classify whether the bridge exceeds the limit state or not. Hence, the complicated problem can be transformed into a binary classification problem in which, for each damage state, a case is categorized into a 0 or 1 class, respectively representing exceeding or not exceeding the limit state.

A typical neural network generally consists of multiple layers as shown in Fig. 1: an input layer which receives the input data and starts to learn, an output layer which outputs the prediction, and multiple hidden layers in between which perform a series of filtering, delivering, and mapping operations. There are generally multiple neurons in each layer. These neurons are connected and convert input data into output data by performing an activation operation. The connections between the neurons are realized by weights, which determine the learning ability of a neural network. All weights between two layers constitute the weight matrix. A neural network learns in both the forward and the reverse direction. In forward propagation, the input vector X passes through the neurons and is operated on by the activation functions. Activation functions are key components of deep neural networks and can greatly influence the performance of a model. The activation function can be regarded as a “gate” between the input of the current neuron and its output to the next layer, as it is connected to each neuron and determines whether the neuron should be activated or not based on the input information [25]. The mathematical mechanism of a neural network with a single hidden layer can be expressed as:

$$Y = f(Xw_1 + b_1)\,w_2 + b_2 \tag{1}$$

where X is an i-dimensional row vector of model input and Y is the k-dimensional output vector of the hidden layer. $w_1$ and $b_1$ are the weight matrix and bias vector of the former layer, of size i × j and 1 × j, respectively. $w_2$ and $b_2$ are the weight matrix and bias vector of the current layer, of size j × k and 1 × k, respectively. $Xw_1 + b_1$ represents the output of the former layer, and f(•) is the activation function used in the corresponding neuron. Eq. (1) shows the working mechanism of the activation function for a single hidden layer, which can be generalized to a deep neural network with multiple hidden layers by adding more sets of weight and bias matrices in the same way. For a deep neural network with multiple hidden layers, the output of each activation function is transformed by a new weight matrix and bias vector, and is then used to feed the activation functions in the next layer.

The difference between the prediction vector computed from the forward propagation and the actual label can be measured by defining a loss function. Using the loss function, the loss of the forward training can be calculated, and the performance can be evaluated. The smaller the loss, the better the model.

As introduced, the activation function is a key component of deep neural networks, which largely influences the output, accuracy, convergence, and computational efficiency of training a model. There are various non-linear activation functions used in neural networks. The existing activation functions have some limitations, including the vanishing gradient problem, the neuron saturation problem, the dying neuron
Fig. 4. Testing accuracy vs numbers of layers for TanH, ReLU and DLog.
Fig. 5. Testing accuracy vs numbers of neurons for TanH, ReLU and DLog.
Table 1
Testing accuracy (in %) of different activation functions on four datasets.
Activation Func. MNIST IMDB Reuters CIFAR10
MNIST database is separated into three datasets, with 70% of the data for the training dataset, 15% for the validation dataset and 15% for the testing dataset. The commonly used activation functions TanH and ReLU are employed as baselines in the experiments. Four experiments are designed, each varying an individual hyper-parameter of the network while keeping the other parameters constant. The first two experiments vary the depth and scale of the neural network. In the first experiment, the number of hidden layers varies from 20 to 40. In the second experiment, the number of neurons in each layer of the network varies from 200 to 1000. The third and fourth experiments vary the batch size from 32 to 2048 and increase the dropout rate [34] from 20% to 50% for each layer, respectively. The prediction accuracy of TanH, ReLU and DLog for these four experiments is plotted in Figs. 4-7.

From Figs. 4-7, it can be observed that DLog consistently outperforms ReLU and TanH when different hyper-parameters of the neural network are changed. From Fig. 4, when the number of layers increases from 20 to 35, the accuracy of ReLU and TanH deteriorates by 60% and 80%, respectively. On the other hand, the accuracy of DLog remains stable with increasing layers; the accuracy is 80% when the number of layers is increased to 35. Fig. 5 shows that when the network is shrunk by decreasing the number of neurons in each layer from 1000 to 200, the accuracy of DLog decreases from 97.66% to 97.23%, while the accuracy of ReLU and TanH decreases from 97.47% to 96.91% and from 96.74% to 95.5%, respectively. The results of Figs. 4 and 5 indicate that, compared with TanH and ReLU, DLog shows higher prediction accuracy, higher stability, and robustness when the networks become deeper and larger. From Fig. 6, when the batch size increases from 32 to 2048, the deterioration of accuracy of DLog is significantly less than that of TanH, and slightly less than that of ReLU. Fig. 7 shows that when the dropout rate increases from 40% to 50%, the accuracy of ReLU and TanH decreases by 10% and 1.8%, respectively, while the accuracy of DLog decreases by only 0.45%. The results indicate that, compared with ReLU and TanH, DLog

• Softplus [24]:

$$f(x) = \ln(1 + e^{x}) \tag{4}$$

• Softsign [10]:

$$f(x) = \frac{x}{1 + |x|} \tag{5}$$

• Parametric ReLU (PReLU) [12]:

$$f(\alpha, x) = \begin{cases} \alpha x & \text{if } x < 0 \\ x & \text{if } x \ge 0 \end{cases} \tag{7}$$

where α is a learnable parameter initialized to 0.25.

• Exponential Linear Unit (ELU) [4]:

$$f(x) = \begin{cases} \alpha (e^{x} - 1) & \text{if } x < 0 \\ x & \text{if } x \ge 0 \end{cases} \tag{8}$$

where α = 1.0.

• Scaled Exponential Linear Unit (SELU) [17]:

$$f(x) = \lambda \begin{cases} \alpha (e^{x} - 1) & \text{if } x < 0 \\ x & \text{if } x \ge 0 \end{cases} \tag{9}$$

where α ≈ 1.6733 and λ ≈ 1.0507.

The benchmark of DLog against the nine baseline activation functions is performed on four representative datasets: MNIST, IMDB Review [19], Reuters Newswire [14] and CIFAR10 [18]. MNIST and CIFAR10 are commonly used databases for training image classification models, which contain 70,000 images in 10 classes and 60,000 images in 10 classes, respectively. IMDB Review and Reuters Newswire are commonly used datasets for training natural language processing classification models, which contain 50,000 texts in 2 classes and 11,228 texts in 46 classes, respectively. For a fair comparison, all the benchmarks are conducted by changing only the activation function in the architecture of the neural network while all the other hyper-parameters remain constant. For the MNIST, IMDB Review and Reuters Newswire datasets, 10-layer fully connected networks are used with Adaptive Moment Estimation (Adam) [16] as the optimization algorithm. For CIFAR10, convolutional neural networks are used as this dataset possesses more complicated features. It should be noted that all the testing results are the median of five different runs. The testing results for different activation functions are compared in Table 1.

The results in Table 1 show that DLog consistently outperformed the other activation functions in deep learning algorithms, with higher accuracy on different datasets, including the image classification datasets MNIST and CIFAR10 and the natural language processing datasets IMDB Review and Reuters Newswire.

Fig. 8. The configuration of the MSC slab bridge, redrawn from [26].

1. Select a suite of ground motions and sample the corresponding number of bridges. To improve sampling efficiency, the Latin hypercube sampling method can be applied to sample bridge parameters, considering the uncertainties and variations in bridge geometry and material characteristics.
2. Perform incremental dynamic analysis, scaling the ground motion intensity index from 0 to 1, to obtain seismic demands, and record the required seismic responses of structural components such as the maximum curvature of columns.
3. Compare the seismic demands with the structural capacity and get a 0 or 1 result as a label for each run, representing whether the corresponding damage state has been reached or not.
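The three steps above can be sketched in plain Python as follows. This is only an illustration under stated assumptions: `toy_demand` is a hypothetical stand-in for the non-linear time history analysis and returns made-up numbers, the parameter ranges and the capacity value are invented for the example, and the encoding 1 = damage state reached is an assumption of this sketch that can be flipped to match another labelling convention.

```python
import random


def latin_hypercube(n_samples, bounds, rng):
    """Latin hypercube sample: each parameter range is split into n_samples
    equal strata, one point is drawn per stratum, and the strata are
    shuffled independently for each parameter."""
    columns = []
    for low, high in bounds:
        strata = list(range(n_samples))
        rng.shuffle(strata)
        columns.append([low + (s + rng.random()) / n_samples * (high - low)
                        for s in strata])
    # Transpose so that each row describes one sampled bridge
    return [list(row) for row in zip(*columns)]


def toy_demand(bridge, im):
    """Hypothetical placeholder for the structural analysis: a made-up
    peak column curvature that grows with the intensity measure."""
    column_diameter, concrete_strength = bridge
    return im * 0.02 / (column_diameter * concrete_strength / 30.0)


def label_run(demand, capacity):
    """Step 3: 1 if the demand reaches the capacity (assumed encoding), else 0."""
    return 1 if demand >= capacity else 0


rng = random.Random(0)
bounds = [(0.8, 1.2),    # column diameter in m (illustrative range)
          (25.0, 40.0)]  # concrete strength in MPa (illustrative range)

bridges = latin_hypercube(5, bounds, rng)      # step 1
im_levels = [0.1 * k for k in range(1, 11)]    # step 2: IM scaled up to 1.0
capacity = 0.01                                # illustrative limit-state capacity

# Each record: (bridge parameters + IM, binary label) for the classifier
dataset = [(bridge + [im], label_run(toy_demand(bridge, im), capacity))
           for bridge in bridges for im in im_levels]
```

Each record pairs the sampled bridge parameters and the intensity level with a binary label, which is the form of training data a binary classifier needs; in practice the placeholder demand would be replaced by the recorded response from the structural analysis.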
Fig. 9. ROC curves for DLog-based DNN and other machine learning methods.
tests. The proposed DNN-based fragility method transforms the fragility analysis into a binary classification problem in which, for each damage state, a case is categorized into a 0 or 1 class, respectively representing exceeding or not exceeding the limit state.

The proposed fragility method is demonstrated using an example of a multi-span continuous reinforced concrete slab bridge. The accuracy of the DLog-based DNN is compared with five other machine learning-based methods (logistic regression, random forest, naïve Bayes, Support Vector Machine, and K-Nearest Neighbour). It was found that the average accuracy of the DLog-based DNN for model training of the four damage states is 10% higher than those of the other five machine learning candidates.

The fragility curves of the example bridge are then constructed using the proposed DNN-aided fragility analysis and compared with the conventional NLTHA-based fragility method. The example showed that the new method agrees with the NLTHA-based method reasonably well. The developed parameterised fragility curves can be used to efficiently examine the effects of variations in structural material properties and geometric parameters on the structural fragility without re-analysis for each additional set of parameters. This makes the method particularly useful for sensitivity analysis of structural parameters on fragility, or for time-dependent fragility analysis of aging facilities considering the degradation of structural resistance due to deterioration. The new method also has the potential to be used for fragility analysis of bridge networks with a group of similar bridges. The present method can be used for this purpose for a relatively small region in which the spatial variation of ground motion intensity can be ignored. For bridge networks in relatively large areas, the hazard intensity measure also needs to be parametrised in the fragility model. This is left for future work.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

[1] Baker JW, Lin T, Shahi SK, Jayaram N. New ground motion selection procedures and selected motions for the PEER transportation research program. PEER Report 2011;3.
[2] Banerjee S, Shinozuka M. Nonlinear static procedure for seismic vulnerability assessment of bridges. Comput-Aided Civ Infrastruct Eng 2007;22(4):293–305.
[3] Basöz N, Mander J. Enhancement of the highway transportation lifeline module in HAZUS. National Institute of Building Sciences 1999;16(1):31–40.
[4] Clevert D-A, Unterthiner T, Hochreiter S. Fast and accurate deep network learning by exponential linear units (ELUs). arXiv preprint arXiv:1511.07289; 2015.
[5] Cornell C, Vamvatsikos D. Incremental dynamic analysis. Earthquake Eng Struct Dyn 2002;31:491–514.
[6] Dukes JD. Application of bridge specific fragility analysis in the seismic design process of bridges in California. Georgia Institute of Technology; 2013.
[7] Gardoni P, Mosalam KM, Der Kiureghian A. Probabilistic seismic demand models and fragility estimates for RC bridges. J Earthquake Eng 2003;7(spec01):79–106.
[8] Ghosh J, Padgett JE. Aging considerations in the development of time-dependent seismic fragility curves. J Struct Eng 2010;136(12):1497–511.
[9] Ghosh J, Padgett JE, Dueñas-Osorio L. Surrogate modeling and failure surface visualization for efficient seismic vulnerability assessment of highway bridges. Probab Eng Mech 2013;34:189–99.
[10] Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks. Paper presented at the Proceedings of the thirteenth international conference on artificial intelligence and statistics. 2010.
[11] Grother PJ. NIST handprinted forms and characters database (NIST special database 19). 1995.
[12] He K, Zhang X, Ren S, Sun J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. Paper presented at the Proceedings of the IEEE international conference on computer vision. 2015.
[13] Jeon J-S, Mangalathu S, Song J, Desroches R. Parameterized seismic fragility curves for curved multi-frame concrete box-girder bridges using Bayesian parameter estimation. J Earthquake Eng 2019;23(6):954–79.
[14] Joachims T. Text categorization with support vector machines: Learning with many relevant features. Paper presented at the European conference on machine learning. 1998.
[15] Kameshwar S, Padgett JE. Multi-hazard risk assessment of highway bridges subjected to earthquake and hurricane hazards. Eng Struct 2014;78:154–66.
[16] Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980; 2014.
[17] Klambauer G, Unterthiner T, Mayr A, Hochreiter S. Self-normalizing neural networks. Paper presented at the Advances in neural information processing systems. 2017.
[18] Krizhevsky A, Hinton G. Learning multiple layers of features from tiny images. 2009.
[19] Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C. Learning word vectors for sentiment analysis. Paper presented at the Proceedings of the 49th annual meeting of the association for computational linguistics: Human language technologies-volume 1. 2011.
[20] Maas AL, Hannun AY, Ng AY. Rectifier nonlinearities improve neural network acoustic models. Proceedings of the International Conference on Machine Learning 2013;30(1):3.
[21] Mackie K, Stojadinović B. Probabilistic seismic demand model for California highway bridges. J Bridge Eng 2001;6(6):468–81.
[22] Mackie K, Stojadinović B. Post-earthquake functionality of highway overpass bridges. Earthquake Eng Struct Dyn 2006;35(1):77–93.
[23] Mangalathu S, Heo G, Jeon J-S. Artificial neural network based multi-dimensional fragility development of skewed concrete bridge classes. Eng Struct 2018;162:166–76.
[24] Nair V, Hinton GE. Rectified linear units improve restricted boltzmann machines. Paper presented at the Proceedings of the 27th international conference on machine learning (ICML-10). 2010.
[25] Nielsen MA. Neural networks and deep learning. Determination Press; 2015.
[26] Nielson BG. Analytical fragility curves for highway bridges in moderate seismic zones. PhD thesis. Georgia Institute of Technology; 2005.
[27] Nielson BG, DesRoches R. Analytical seismic fragility curves for typical bridges in the central and southeastern United States. Earthquake Spectra 2007;23(3):615–33.
[28] Padgett JE, DesRoches R. Retrofitted bridge fragility analysis for typical classes of multispan bridges. Earthquake Spectra 2009;25(1):117–41.
[29] Pang Y, Dang X, Yuan W. An artificial neural network based method for seismic fragility analysis of highway bridges. Adv Struct Eng 2014;17(3):413–28.
[30] Park J, Towashiraporn P. Rapid seismic damage assessment of railway bridges using the response-surface statistical model. Struct Saf 2014;47:1–12.
[31] Pascanu R, Mikolov T, Bengio Y. Understanding the exploding gradient problem. CoRR abs/1211.5063; 2012.
[32] PEER. PEER ground motion database. Univ. of California, Berkeley, CA; 2014.
[33] Seo J, Linzell DG. Use of response surface metamodels to generate system level fragilities for existing curved steel bridges. Eng Struct 2013;52:642–53.
[34] Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Machine Learning Res 2014;15(1):1929–58.
[35] Zhong J, Gardoni P, Rosowsky D, Haukaas T. Probabilistic seismic demand models and fragility estimates for reinforced concrete bridges with two-column bents. J Eng Mech 2008;134(6):495–504.