You are on page 1of 31

Acta Geotechnica

https://doi.org/10.1007/s11440-023-01988-0 (0123456789().,-volV)(0123456789().
,- volV)

RESEARCH PAPER

Short-term rockburst prediction in underground project: insights


from an explainable and interpretable ensemble learning model
Yingui Qiu1 • Jian Zhou1

Received: 11 December 2022 / Accepted: 23 June 2023


 The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023

Abstract
Rockburst is a frequent challenge during tunnel and other underground construction and is an extreme rock damage
phenomenon. Therefore, it is very crucial to accurately estimate the damage potential of rockburst events. Microseismic
(MS) monitoring can be used to obtain the relevant MS parameters for short-term rockburst prediction in real time that
reflect the evolution of short-term rockburst. In this study, short-term rockburst potential data containing 7 MS parameters
(cumulative number of events, cumulative released energy, cumulative apparent volume, event rate, energy rate, apparent
volume rate, and incubation time) and 91 rockburst events (none rockburst, low rockburst, moderate rockburst, and high
rockburst) were collected from the Jinping Hydropower Station diversion tunnel project in China. The objective of this
paper is to propose an ensemble learning (EML) model based on the LévyFlight-Jaya optimization (LFJaya) and fivefold
cross-validation (CV) method to achieve an accurate prediction of short-term rockburst damage potential using MS
information. The EML consists of light gradient boosting machine (LGBM), extreme gradient boosting (XGBoost),
random forest (RF), support vector machine (SVM), and logistic regression (LR), with seven MS parameters as the EML
inputs and four rockburst levels as target variables. 70% and 30% of the cases were randomly selected for training and
testing, respectively. Five metrics (accuracy, kappa, precision, recall, and F1-score) and nonparametric statistical tests were
used to evaluate the performance of the model. It can be observed from the results of this study that the proposed EML has
a higher test accuracy (89.29%) than the multiple base classifiers used in the study. With the use of the ensemble model, the
decision boundary becomes more precise and overfitting is significantly improved. Additionally, the internal decision-
making process of EML was elucidated through an analysis of the model parameters using SHapley Additive exPlanations
(SHAP) and Local Interpretable Model-agnostic Explanations (LIME). It was discovered that the cumulative released
energy, the number of MS events, and the cumulative apparent volume (which reflects the number and strength of rock
fractures) exert a significant influence on the prediction of short-term rockburst potential. Finally, developed graphical user
interface (GUI) accurately predicted six instances of rockburst in the deeply buried tunnel of Jinping. Verification results
indicated that the proposed EML exhibits strong generalization and can effectively utilize MS information to achieve
precise short-term rockburst potential predictions.

Keywords Ensemble learning  Graphical user interface  Lévy-Jaya  LIME  Model interpretation  SHAP value 
Short-term rockburst

1 Introduction

Rockburst is a typical and challenging geohazard often


encountered during mining or engineering excavation at
& Jian Zhou
j.zhou@csu.edu.cn; csujzhou@hotmail.com great depth [99]. It is described as an extreme phenomenon
of sudden rock failure caused by the strong release of
Yingui Qiu
225501014@csu.edu.cn accumulated elastic energy in a rock mass in a high-stress
environment underground [9]. Since it was first reported in
1
School of Resources and Safety Engineering, Central South the England Tin Mine in 1738, rockburst has been
University, Changsha 410083, China

123
Acta Geotechnica

recognized as one of the most serious and hazardous geo- intelligent stage. Given that rockbursts have numerous
logical hazards [19]. The impacts of rockburst can be inducing factors and complex nonlinear relationships, tra-
devastating, including injuries to underground workers, ditional methods cannot achieve the accuracy required for
damage to engineering equipment, and interruption of prediction. Therefore, efficient modern nonlinear science
underground operations. Rockburst accidents have occur- and artificial intelligence methods are gradually being
red in many mining countries around the world, with over introduced into rockburst risk assessment. Since Feng et al.
15 mines in Canada reporting cases of rockburst [6], 172 [18] predicted roadway rockbursts by neural networks
recorded rockburst events in the US between 1936 and learning and adaptive identification methods in 1994, many
1993 [55], and many mines in China reporting related intelligent models have been gradually applied to rockburst
destruction events [61, 98, 106]. In particular, during the prediction [5, 98]. These methods have achieved better
construction of the Jinping II Hydropower Station tunnel in rockburst prediction results by utilizing feature information
China, hundreds of rockburst events of varying degrees and data rules that are difficult to identify using traditional
occurred [19, 78]. Additionally, intense rockburst danger methods.
events have also occurred in Peru, Japan, Norway, and Short-term rockburst prediction, on the other hand,
Switzerland [19, 57]. In summary, rockburst is a complex mainly uses on-site measured data to establish a suit-
and destructive rock instability phenomenon influenced by able mathematical model that predicts and warns of the
various internal and external factors, which restricts the rockburst phenomenon, determines the location and timing
development of underground engineering [58]. To date, of rockburst in the area, and provides more accurate
preventing and mitigating rockburst has become a critical guidance for construction. Through appropriate on-site
issue for underground engineering worldwide, particularly monitoring methods that reflect the development mecha-
for the mining industry, due to its complexity. As the depth nism and evolution law of rockburst, useful feature infor-
of underground engineering increases, identifying and mation can be obtained. Major on-site monitoring methods
controlling the potential risks of rockburst becomes more for early warning signal monitoring and short-term rock-
challenging. To accurately estimate the actual destructive burst risk identification include microgravity, infrared
potential of rockburst events, continuous research on var- thermal imaging, electromagnetic radiation, acoustic
ious prediction strategies and technologies is required. emission, microseismic monitoring, and the photoelastic
Over the years, numerous scholars have conducted method, etc. [3, 17, 64, 80, 93]. Microseismic (MS) mon-
extensive research on the prediction and triggering mech- itoring technology is capable of detecting early signs of
anisms of rockbursts. Many related results have been damage and structural deformation, and it is one of the
achieved, and although it is not possible to completely most widely utilized methods in this regard [92]. Specifi-
control rockburst incidents, they can be predicted with cally, MS monitoring is a technique that can effectively
accuracy to enable proactive measures aimed at reducing capture elastic waves that are released during the formation
risks. It is important to note that the prediction of rock- of microcracks in rock. By analyzing the captured wave-
bursts is typically divided into two categories: long-term form, this technology can provide valuable information
and short-term prediction [99]. Long-term prediction refers regarding the spatial and temporal distribution of microc-
to preliminary assessments of the rockburst trend in the racks and the deformation patterns associated with them.
early stages of underground engineering, which are critical As an important means of rockburst monitoring, MS
for initial construction guidance. Methods used in long- monitoring has been increasingly used in engineering and
term rockburst prediction include empirical index criterion is of great significance for early warning and prediction of
methods, numerical simulation methods, and intelligent rockburst hazards [21, 24, 51]. Many scholars have con-
modeling methods [2, 98]. These empirical criteria for ducted research related to rockburst-induced mechanisms
single or multiple indicators are proposed based on the and rockburst prediction by microseismic monitoring
understanding of the mechanism of rockburst induction. methods. Some microseismic parameters (e.g., apparent
They determine different rockburst thresholds for predic- volume, energy index, b-value, number of microseismic
tion based on existing field empirical data. Although the events, event rate, and maximum magnitude) have been
empirical criterion methods are simple and easy to operate, developed to predict rockburst risk problems [44]. Brady
they have the limitation of poor applicability. The numer- and Leighton [7] recorded seismic activity information
ical simulation methods have also been used to carry out a before and after a moderate-intensity rockburst at Star
lot of work related to rockburst prediction, but their sim- Mine in Burke, Idaho. Mendecki [56] recommends quan-
ulations are sensitive to model inputs, and it is difficult to tifying potential rock instability through relevant seismic
simulate the real dynamic behavior of rockburst [98]. In parameters and finds that rock instability (rockbursts, etc.)
recent years, with the rapid development of soft computing is preceded by precursory features such as an increase in
technology, rockburst risk assessment has entered a new cumulative apparent volume and a decrease in the energy

123
Acta Geotechnica

index and Schmidt number. Alcott et al. [3] used a method impact. Yin et al. [89] proposed an optimized convolu-
based on microseismic parameters such as microseismic tional neural network to evaluate rockburst intensity using
energy, apparent stress, and seismic moment to evaluate 400 rockburst cases based on microseismic monitoring and
rockburst hazards. Tang and Xia [75] obtained that the obtained 91.67% accuracy in rockburst prediction.
probability of rockburst occurrence was reduced when the In this study, various intelligent models currently used
ratio of the stiffness of the microseismic nucleation zone to for rockburst prediction have been classified into six cat-
the stiffness of the surrounding rock outside the nucleation egories: hybrid models (such as combining two or more
zone was less than 0 based on the microseismic activity law soft computing methods), fuzzy computing, optimization
of the Dongguashan Copper Mine. Chen et al. [12] selected algorithms, general machine learning (ML) methods (such
microseismic energy as the evaluation index of rockburst as K-nearest neighbors, Naive Bayes, and SVM, etc.),
intensity and proposed a new quantitative classification ensemble learning, and neural networks. Figure 1 depicts
method to divide rockburst into five intensity levels. Xu the number of studies utilizing these intelligent models in
et al. [84] found that the concentration of microseismic rockburst prediction in the last decade. It is evident that the
events can be used as an effective precursory feature of use of methods like ML, ensemble learning, and neural
strain-type rockbursts, with a prediction accuracy of 63% networks has significantly increased over the past decade.
in 2240 rockburst cases. Ma et al. [53] found that the In particular, there is a growing interest in using ensemble
b-values of the Creighton and Kidd mines began to decline learning in rockburst prediction, indicating its robustness.
before the main shock and increased significantly near or Moreover, Table 1 provides a brief statistic of the research
during the main shock, indicating that there was an on intelligent prediction of long/short-term rockburst.
inflection point in the b-value, which can be used as a Many scholars have extensively explored this field, and the
precursor to assess the risk of the main shock. Xue et al. table shows that there is significantly less research on
[88] evaluated the risk of rockburst using the number of short-term rockburst prediction. Although ensemble
daily events N and b-value and found that rockbursts were learning methods have gained considerable attention [67],
more likely when (lgN)/b was greater than 1. With the there is a lack of research on short-term rockburst risk
accumulation of information based on microseismic mon- prediction using these methods.
itoring, the intelligent model method is helpful for the In summary, while research on rockburst assessment has
evaluation of short-term rockburst [61]. Heal [28] collected achieved many results, there are still some shortcomings in
254 sets of rockburst data from multiple underground short-term rockburst risk prediction, and the related accu-
mines in Australia and Canada. According to the obtained racy needs to be improved. Considering that rockburst
excavation vulnerability potential index and the peak par- problems are influenced by various factors unique to
ticle velocity of microseismic parameters, the logistic underground engineering, their complexity results in sig-
regression method was used to evaluate the potential fail- nificant limitations in the performance of single models in
ure probability of rockburst. Based on the rockburst data their prediction. To improve the data mining ability and
established by Heal, Zhou et al. [102] evaluated the rock- stability of the basic models, compensate for the short-
burst damage by the stochastic gradient boosting method, comings of each classifier, and reduce the risk of selecting
and the test accuracy was 61.22%. The model analysis inaccurate classifiers in short-term rockburst evaluation,
shows that peak particle velocity is the most important this paper used seven parameters from MS monitoring as
variable among the features of rockburst failure classifi- inputs and built an interpretable EML model based on the
cation. Li et al. [43] considered seven input parameters LFJaya method. The applicability of the EML model in
affecting rockburst failure, fused the damage levels R2 and short-term rockburst prediction was discussed, and GUI
R3, and then combined the rock engineering system para- interface was established to assist the EML model in the
digm and neural network algorithm to evaluate rockburst. evaluation of short-term rockburst potential.
Feng et al. [22] used six parameters based on MS moni-
toring to evaluate rockburst by combining the mean impact
value algorithm, improved firefly algorithm, and proba- 2 Methodology
bilistic neural network. Based on 6–7 microseismic
parameters, Liang et al. [44, 45] evaluated rockburst 2.1 Lévy flights-Jaya algorithm
through a variety of ensemble learning algorithms and
ensemble classifiers, and well-predicted the risk of rock- It is well known that the optimization of model hyperpa-
burst. Li et al. [42] used five microseismic parameters to rameters is a complex problem, and a suitable heuristic
construct a Bayesian network model for dynamic early algorithm Jaya was used to complete this work in this
warning of rockburst. Through analysis, it was found that paper. It is worth noting that population-based heuristic
moment magnitude and seismic energy have a significant algorithms typically require specific control parameters,

123
Acta Geotechnica

Fig. 1 Applications of different intelligent models in rockburst prediction


  
such as the convergence factor and weight coefficient in the ri;t;1 Xi;best;t  Xi;j;t  denotes the trend of the current target
whale optimization algorithm (WOA), or flight frequency, solution toward the best solution, and
  
foraging probability, and acceleration coefficient in the ri;t;2 Xi;worst;t  Xi;j;t  denotes the trend of the current
bird swarm algorithm (BSA), in addition to common con- solution away from the worst solution.
trol parameters such as population size and iteration Heuristic algorithms are essentially stochastic processes
number [65]. The adjustment of these specific parameters is as seen from the point of view of the research, and these
a crucial aspect that directly affects the performance of stochasticities are used for the algorithms to perform better
related algorithms. However, Jaya does not depend on any spatial search. To be able to better extract the optimal
specific algorithm parameters, making it a flexible parameters of the classifiers, this paper introduces the Lévy
approach for optimization. flight mechanism into the Jaya algorithm to enhance its
Considering this fact, Jaya was selected as the opti- ability to escape from local optima [29]. The Lévy Flight is
mization method in this study. Jaya is a simple and pow- named after the French mathematician Paul Pierre Lévy
erful swarm intelligence optimization algorithm proposed [30]. It refers to a random walk with a heavy-tailed prob-
by scholar Rao in 2016 [65]. In recent years, it has been ability distribution of the step size, that is, there is a rela-
used to solve many related optimization problems with and tively high probability of a large step in the process of a
without constraints [68, 105]. The basic idea is to move the random walk. Figure 2 shows an example of a three-di-
solution individuals closer to the best ones and keep them mensional Lévy flight. It can be seen that after a number of
away from the worst ones in the search. That is, to steps, it will jump suddenly, avoiding local stagnation.
approach the corresponding optimal solution while moving Mathematically, the basic form of the Lévy distribution
away from the worst solution to improve the quality of the is defined as:
solution through continuous improvement. Suppose the
initial number of populations generated by Jaya in the LðsÞ  jsj1b ; 0\b  2 ð2Þ
search space is N (j = 1, 2, …, N), the variable dimension is where s denotes the step size and b denotes the power
M (i = 1, 2, …, M), and the number of iterations is t. The exponent of the Lévy distribution, is 1.5. The step size s for
value of the i-th variable of the j-th candidate solution in the Lévy flight is as follows:
the t-th iteration is denoted by Xi,j,t, and then the formula
for generating the new solution is: s ¼ u=jvj1=b ð3Þ
  
0
Xi;j;t ¼ Xi;j;t þ ri;t;1 Xi;best;t  Xi;j;t  where u and v are random numbers of normal distribution,
      
 ri;t;2 Xi;worst;t  Xi;j;t  ð1Þ respectively, are as follows: u  N 0; r2u and v  N 0; r2v .
ru and rv are defined as follows (C denotes the gamma
where r1 and r2 are random numbers between 0 and 1
function):
(uniformly distributed), Xi,best represents the value of
variable i for the best candidate solution, Xi,worst represents
the value of variable i for the worst candidate solution,

123
Acta Geotechnica

Table 1 Some of the applications of intelligent models in long/short-term rockburst prediction


Authors Method Type Parameters Data Interpretability CV Accuracy
(%)

Li et al. [41] BO-FNN Long rh, rc, rt, rh/rc, rc/ 314 No Yes 90.48
rt,Wet
Sun et al. [73] Ensemble Long rh, rc, rt, rh/rc, rc/ 275 No Yes 85.59
rt,Wet
Kadkhodaei GEP Long rh/rc, rc/rt,Wet 335 No No 86
et al. [32]
Xu et al. [85] SSA-PNN Long rh, rc, rt, rh/rc, rc/rt, 75 No No 93.3
Wet
Zhang [91] PSO-BP Long rh/rc, rc/rt, Wet 90 No Yes 95.98
Jin et al. [31] SVM Short N, E, V, Nr, Er, Vr 107 No Yes 86
Xue et al. Copula-MC, DA, DT, Naive bayes, SVM, Long rh, rc, rt, Wet 243 No Yes 88.9
[87] KNN, Ensemble
Cao et al. [10] FDNet Short microseismic time, energy, 13058 No No 76.68
source coordinates
Wang et al. Cloud Model Long rh, rc, rt, rh/rc, rc/ 271 No Yes 93.33
[77] rt,Wet
Guo et al. BP-SVM Long rh, rc, rt, rh/rc, rc/ 420 No Yes 97.5
[27] rt,Wet
Sun et al. [74] FA-RF Long H, rh, rc, rt, Wet 279 No Yes 91
Yin et al. [89] BO-CNN Short E, M0, V, rA, Dr 400 No No 91.67
Guo et al. MARS, DF, t-SNE Long rh, rc, rt, rh/rc, rc/ 344 Yes Yes 90.8
[26] rt,Wet
Shukla et al. XGBoost, DT, SVM Long rh, rc, rt, Wet 134 No No 100
[71]
Liu et al. [49] GA-ANN Short N, E, V 74 No Yes 83.9
Liang et al. Ensemble Short N, E, V, Nr, Er, Vr, D 91 No Yes 86.67
[45]
Yin et al. [90] Ensemble Long rh, rc, rt, rh/rc,rc/ 246 No No 91.80
rt,Wet, (rc - rt)/
(rc ? rt)
Ke et al. [34] PSO-Naive bayes Long rh, rc, rt, Wet 134 No Yes 91.10
Xie et al. [83] GA-XGBoost Long rh, rc, rt, rh/rc, rc/ 275 No Yes 90
rt,Wet
Liang et al. RF, AdaBoost, XGBoost, LGBM, GBM Short N, E, V, Nr, Er, Vr 91 No Yes 80
[44]
Xue et al. PSO-ELM Long rh, rc, rt, rh/rc, rc/ 344 No Yes 88.89
[86] rt,Wet
Li et al. [43] BP-ANN Short r1/rc, ES, GSSC, GSSEM, 254 No No 71
PPV, RD
Liu et al. [48] PSO, BP, SVM, PNN Long rh, rh/rc, Wet 191 No Yes 82.61
Wu et al. [82] PSO-LSSVM Long rh, rc, rt, rh/rc, rc/ 259 No No 88.89
rt,Wet
Pu et al. [63] SVM, GP Long H, rh, rc, rt, rh/rc,rc/ 100 No Yes 52.5
rt,Wet, (rc - rt)/
(rc ? rt)
Shirani al. GA-ENN, GEP, CART Long rh, rc, rt, Wet 134 No No 85.19
[70]
Pu et al. [62] SVM Long rh, rh/rc, rc/rt,Wet 108 No Yes 95
Lin et al. [46] Cloud Model Long rh, rc, rt, rh/rc, rc/ 246 No Yes 71.05
rt,Wet
Li et al. [39] Bayesian Networks Long H, rh, rc, rt, Wet 135 Yes Yes 93.3

123
Acta Geotechnica

Table 1 (continued)
Authors Method Type Parameters Data Interpretability CV Accuracy
(%)

Li et al. [40] GA-ELM Long rh, rc, rt, Wet 30 No No 100


Zhou et al. LDA, QDA, PLSDA, KNN, ANN, DT, Long H, rh, rc, rt, rh/rc,rc/ 246 No Yes 71.18
[97] SVM, RF, GBM, Naive Bayes rt,Wet, (rc - rt)/
(rc ? rt)
Zhou et al. SGB Short r1/rc, ES, GSSC, GS, 254 No Yes 61.22
[102] PPV
Zhou et al. Cloud Model Long rh, rc, rt, rh/rc, rc/ 209 No No 76.2
[103] rt,Wet
Adoko et al. FIS, ANFIS Long rh, rc, rt, rh/rc, rc/ 174 No No 75
[1] rt,Wet
Dong et al. RF Long rh, rc, rt,Wet 46 No No 100
[16]
Liu et al. [50] Cloud Model Long rh, rc, rt, rh/rc, rc/ 164 No No 100
rt,Wet
Zhou et al. GA-SVM, PSO-SVM Long H, rh, rc, rt, rh/rc, rc/ 132 No Yes 85.71
[99] rt,Wet
Note: rh is the maximum tangential stress (MPa), rc is the uniaxial compressive strength (MPa), rt is the uniaxial tensile strength (MPa), rh/rc
is referred to as the stress concentration factor, (rc - rt)/(rc ? rt) is the rock brittleness index, Wet is the elastic energy index, rc/rt is another
rock brittleness index, r1 is the maximum principal stress (MPa), N denotes the cumulative event number (unit), E represents the cumulative
released energy (J), V is the apparent volume (m3), Nr is the event rate (unit/day), Er is the energy rate (J/day), Vr is the apparent volume rate
(m3/day), D is the incubation time (day), H represents the buried depth reflecting the levels of in-situ stresses (m), M0 is the seismic moment
(N m), rA is the apparent stress (Pa), Dr is the stress drop (Pa), r1/rc represents the stress condition factor, ES is the excavation span (m), GSSC
denotes the ground support system capacity, GS represents the influence of geological structure, SEM represents the seismic event magnitude,
RD is the rock density (kg/m3), PPV is the peak particle velocity (m/s), BO denotes Bayesian optimization, FNN denotes feedforward neural
network, GEP denotes gene expression programming, SSA denotes sparrow search algorithm, PNN denotes probabilistic neural network, PSO
denotes particle swarm optimization, BP denotes back propagation algorithm, Copula-MC denotes copula Monte Carlo method, DA denotes
discriminant analysis, DT denotes decision tree, KNN denotes K-nearest neighbor algorithm, FDNet denotes frequency domain network, FA
denotes firefly algorithm, CNN denotes convolutional neural network, MARS denotes multivariate adaptive regression splines, DF denotes deep
forest algorithm, t-SNE denotes t-distributed stochastic neighbor embedding, GA denotes genetic algorithm, ANN denotes artificial neural
network, AdaBoost denotes adaptive boosting algorithm, GBM denotes gradient boosting machine, ELM denotes extreme learning machine,
LSSVM denotes least squares support vector machine, GP denotes gaussian process algorithm, ENN denotes emotional neural network, CART
denotes classification and regression tree, LDA denotes linear discriminant analysis, QDA denotes quadratic discriminant analysis, PLSDA
denotes partial least-squares discriminant analysis, SGB denotes stochastic gradient boosting algorithm, FIS denotes fuzzy inference system, and
ANFIS denotes adaptive neuro-fuzzy inference systems

8   91=b process are changed flexibly to avoid the Jaya algorithm


< Cð1 þ bÞ sin pb =
h i
2 falling into stagnation.
ru ¼ ; rv ¼ 1 ð4Þ
:C 1þb b2ðb1Þ=2 ;
2
2.2 Machine learning algorithms
In view of the advantages of Lévy Flight, this paper uses
the stochastic search mode of Lévy Flight combined with Light Gradient Boosting Machine (LGBM) is a gradient
Jaya (namely, LFJaya) to find the global optimal solution, boosting framework based on tree learning algorithms. It
which is a combination of frequent small steps and occa- was introduced by the Microsoft team of Ke et al. [35] in
sional large steps. Thus, the following update formula is 2017, with ‘‘Light’’ as the prefix due to its high speed.
obtained [29]: Compared with other existing gradient boosting tree algo-
   rithms, LGBM has the characteristics of fast training speed,
0
Xi;j;t ¼ Xi;j;t þ jLevy1 j Xi;best;t  Xi;j;t 
    high efficiency, less memory usage, high prediction accu-
 jLevy2 j Xi;worst;t  Xi;j;t  ð5Þ
racy, and GPU learning support. In order to achieve its
Among them, Le´vy1 and Le´vy2 are random numbers distributed processing and high efficiency, LGBM uses two
obeying the Lévy distribution. Due to the mechanism of new technologies in the design: gradient-based one-side
Lévy Flight, the positions of individuals in the search sampling (OSS) and exclusive feature bundling (EFB) [14].
In addition, one feature that distinguishes the LGBM
algorithm from other boosting algorithms is that LGBM

123
Acta Geotechnica

splits the tree by leaves in a best-fit manner, while other Random Forest (RF) is a popular ensemble learning
boosting algorithms split the tree by depth or horizontally. algorithm based on decision trees, proposed by Breiman in
Therefore, when growing on the same leaf in LGBM, it is 2001 [8]. Although the decision tree has the advantages of
possible to reduce more losses than the horizontal seg- being simple, intuitive, and having strong interpretability,
mentation method, and the result has a higher accuracy it is easy to overfit. Thus, the random forest algorithm is
[14]. It is worth noting that for smaller data, leaf growth proposed in order to solve this problem. It takes decision
may lead to increased model complexity and overfitting. trees as the basic unit, and constructs random forest by
To address this issue, this study employs the LFJaya integrating a large number of decision trees. The random
algorithm to optimize LGBM parameters and control forest uses the idea of bagging to construct multiple trees to
model complexity. complete the learning task together, solving the problem
Extreme Gradient Boosting (XGBoost) is a widely used that the decision tree is easy to overfit, improving the tol-
ensemble method based on gradient boosting trees, which erance of the algorithm to noise, and having better gener-
was developed by Chen et al. [13] in 2016. This algorithm alization performance. In solving the classification
has shown excellent efficiency in industrial and learning problem, the random forest uses the voting method to
competitions, providing fast and reliable models for vari- select the majority as the final result, according to the
ous engineering simulation problems [101]. XGBoost’s decision result of each tree. For the regression problem, the
high efficiency comes from its ability to use parallel random forest calculates the average value of the predicted
computing and process sparse data efficiently. The idea value of each tree as a result. In addition, compared with
behind XGBoost is to construct a set of weak evaluators on the traditional tree algorithms, RF can be well compatible
the target data and then summarize the prediction results of with small-capacity data and high-dimensional data,
the weak evaluators, which leads to better performance avoiding overfitting and showing high performance [11].
than a single tree model. As with many algorithms, the core Support Vector Machine (SVM) is a powerful method
of XGBoost is finding the optimal solution to the objective proposed by Vapnik and his team based on the study of
function. The objective function consists of two parts: the statistical learning theory [76]. SVM is robust to uncertain
model deviation term (the difference between the predicted data and can efficiently identify the nonlinear relationships
value and the actual value) and the regularization term between input and output data. For the linear inseparable
(which controls the complexity of the model). The pre- problem, the main idea of SVM is to transform the linear
diction accuracy of the model depends on the deviation inseparable data samples of the original low-dimensional
term and the regularization term. XGBoost effectively input space into high-dimensional feature space by a non-
prevents overfitting by using regularization terms to opti- linear mapping method (kernel function) [54]. It becomes
mize the objective function [13]. linearly separable data so that SVM can perform linear
analysis on the nonlinear features of high-dimensional
feature space samples to construct an optimal decision
boundary (i.e., hyperplane) for classification tasks. The
definition of the optimal hyperplane is that there is a
classification surface that maximizes the distance from the
edge points of the two-point sets to this plane.
Logistic Regression (LR) is a generalized linear
regression analysis model. Its calculation method is similar
to the regression process, but it is mainly used to solve the
binary classification problem (it can also solve the multi-
classification problem) [37]. It is assumed that the training
data D0 is composed of the training examples d = (Xd, td),
where Xd = (xd1, xd2, …, xdm) is an m-dim numerical vector
containing m values of input attributes, and td 2 f0; 1g is a
real class label. LR maps xdi to the corresponding td, to
correctly classify the examples, and its output represents
the probability that the sample belongs to a certain cate-
gory. The model output is obtained by calculating the
weighted sum of the parameters learned during the training
Fig. 2 3D Lévy flights in 1000 steps process and then applying an activation function, which is
interpreted as a probability with a class label of 1. The
traditional LR activation function is a sigmoid function.

123
Acta Geotechnica

2.3 Ensemble learning deep-buried tunnel in the Jinping II Hydropower Station


project on the Yalong River in Sichuan Province, China,
Due to the multifactorial complexity of rockburst, it is and 91 sets of rockburst case data were collected in this
difficult to expect that only a stable model can predict the study. The dataset was derived from the microseismic
rockburst risk well. Ensemble learning (EML) has attracted monitoring system, which provides many relevant model
more and more attention in machine learning and statistics parameters that can be used to quantify MS events [19].
because of its effectiveness in numerous problem-domain The average tunnel length of the diversion tunnels under
applications [60]. EML is not a single machine learning the Jinping II Hydropower Station is about 16.67 km, and
algorithm but a combination of multiple learning algo- the tunnel system consists of seven parallel tunnels (four
rithms. It fuses multiple classifiers into a multi-classifica- diversion tunnels and two auxiliary tunnels), which is
tion system through a specific integration strategy to obtain currently the largest diversion tunnel project in the world.
a cumulative decision result with better performance [66]. The maximum burial depth of the overlying rock of the
The combination rule of an ensemble model is an diversion tunnel group is about 2525 m, and the average
important factor in improving its classification perfor- burial depth is about 1.5–2 km, among which the tunnel
mance [81]. In this study, the stacking strategy was used to sections with overburden depth greater than 1.7 km
develop the corresponding ensemble learning model, which account for more than 75%, indicating the characteristics
can combine the members of the ensemble through a single of large burial depth, large hole diameter, and long hole
classifier to make the fusion of the model more scientific. line. The lithology of the strata along its diversion tunnels
The basic idea is to use the prediction results of the base is mainly marble, tuff, and sandstone, and most of the
learners of the previous layers as a new model feature tunnel rocks are composed of marble, which is character-
matrix to train the learner of the final layer and finally to ized by brittleness and high strength (saturated uniaxial
use its output as the final prediction result of this strategy compressive strength of 30–120 MPa, Young’s modulus of
combination framework. about 25–40 GPa, tensile strength of 3–6 MPa)
The training process of EML is relatively complex [22, 44, 52]. Consequently, the deep engineering in such
compared to the training process of a single algorithm. geological conditions has a high risk of rockburst, which
First, N base classifiers (tier-1) are constructed and the often occurs at different intensities during the construction
training data is divided into B subsets. Then each base of Jinping tunnels (refer to Fig. 4 for the deep-buried
classifier (tier-1) is trained on different B-1 subsets for B tunnels and related geology of the Jinping II Hydropower
times, and the B-th subset is retained each time to evaluate Station).
the classifiers. At the same time, the output results of these Since MS events are closely linked to rock rupture
base classifiers are spliced as the training data of the tier-2 damage, the selected variables should reflect microfracture
model [60]. The EML model proposed in this study con- activity during rockburst evolution. Therefore, seven MS
sists of two layers, and its structure and training process are parameters related to rock rupture intensity, time, and
shown in Fig. 3. The base learners of tier-1 select several cumulative deformation were selected, which are as fol-
models that are widely used and have strong extraction lows: cumulative event number N (unit) to evaluate the
capabilities of nonlinear relationships, including LGBM, dynamic trend of rupture; cumulative release energy E (J)
XGBoost, RF, and SVM. And new data generated by the to reflect the overall rupture intensity of the regional rock
four classifiers are then combined (as shown in Fig. 3) and mass; cumulative apparent volume V (m3) to indicate the
used for the training of the tier-2 LR model, which finally degree of damage to the rock mass; event rate Nr (unit/day)
obtains the prediction results of EML. to indicate the frequency of microseismic events; energy
rate Er (J/day) to reflect the magnitude of the rate of energy
release; apparent volume rate Vr (m3/day) to reflect the
3 Data analysis time evolution pattern of rock deformation; rockburst
incubation time D (day) to reflect the latent period of
In general, the precursor information of microseismic rockburst occurrence. The calculation formulas for seismic
cracking is often available before the occurrence of sub- energy and apparent volume are as follows: Eqs. (6) and
surface dynamic hazards. The continuous monitoring and (7) [25, 44, 47]. In these formulas, q represents the density
analysis of microseismic waves released from rock rupture of the material, c0 represents the velocity of the wave,
by microseismic (MS) monitoring is used to record some v2(f0) is the velocity power spectrum, f0 represents the
precursor characteristics of rockburst and obtain high- frequency, l represents the shear modulus, and P repre-
quality rockburst data, which is crucial for safety studies of sents the seismic potency.
rockburst [19]. Based on the study by Feng et al. [19] of

123
Acta Geotechnica

complexity of the rockburst formation mechanism. As


shown in Fig. 5, each input indicator has a unique distri-
bution, and some parameter ranges overlap. The trend
analysis in the figure indicates that most MS parameters
(excluding D) have a positive correlation with the target
values, with these parameters showing an increase in val-
ues with increasing rockburst intensity.
A correlation analysis of variables can be conducted to
determine the relationships between different parameters,
which can lead to a better understanding of the corre-
sponding data and phenomena. In the correlation graph
presented in Fig. 6, it can be observed that most parameters
exhibit a positive correlation, except for the negative cor-
relation coefficients between parameters Nr and D, as well
as Vr and D. Moreover, it can be seen that, except for
parameter D, all other parameters have a correlation
coefficient greater than 0.5 with the rockburst potential. It
is noteworthy that the correlation coefficients between E
and Er (0.97) as well as V and Vr (0.89) are very high.
Fig. 3 The learning process of ensemble model Usually, these parameters are considered for deletion;
however, the time effect of the MS parameters has a sig-
Z1 nificant contribution to the development of short-term
E ¼ 8pqc0 v2 ðf0 Þdf ð6Þ rockbursts [31] and is therefore included in the analysis.
Furthermore, in the study conducted by Liang et al. [45],
0
including all parameters related to the time effect resulted
2
VA ¼ lP =E ð7Þ in the best prediction performance, indicating that the time
These parameters provide information on microseismic effect is a key factor influencing rockburst prediction. In
activity and reflect the process of rockburst formation in summary, to fully utilize the contribution of these param-
the monitored area. Table 2 presents a statistical summary eters and to better predict the rockburst potential, all seven
of the seven input parameters used in this study. To parameters listed in Table 2 are considered in this study.
facilitate calculation and compression of variable scales, When the data distribution of the training and testing
the values of cumulative release energy, cumulative sets are similar, the evaluation results of a model can more
apparent volume, energy rate, and apparent volume rate are accurately reflect its performance in application. Figure 7
expressed in logarithmic form [44]. The output variable is displays the parameters distribution of the training and
rockburst intensity level, which is classified into four cat- testing data used in this study. As shown in the figure, the
egories based on radiation energy and damage, following distribution of the different parameters in the training and
the work of Feng et al. [19]: none, low, moderate, and high. testing data is largely consistent, ensuring the reliability of
In this paper, the target variable was encoded as follows: 0 the model testing.
(NB), 1 (LB), 2 (MB), and 3 (HB) correspond to none, low,
moderate, and high rockburst, respectively. Further infor-
mation on microseismic monitoring can be found in Feng 4 Evaluation indicators
et al. [20].
To address the issue of commonly used data visualiza- Performance evaluation is an essential step in the model
tions failing to effectively represent redundant data infor- development process, and the appropriate selection of
mation, this study adopted the raincloud plot visualization evaluation metrics can help to identify the model’s
method proposed by Allen et al. [4]. Figure 5 presents the strengths and weaknesses and provide guidance for further
raincloud plot, which integrates multiple visualization improvements. In this study, five metrics, including two
techniques to provide a clear overview of the statistical global metrics (accuracy and kappa value) and three intra-
information on the overall distribution and central trends of class metrics (precision, recall, and F1-score), were used
the seven MS parameters selected for this study across the for the performance evaluation of rockburst prediction
four rockburst intensity levels. Moreover, this plot also models [95, 96].
reveals a few outliers for each parameter, which reflects the Accuracy measures the proportion of correct predic-
tions, kappa measures the agreement between actual and

123
Acta Geotechnica

Fig. 4 Location and related geology of Jinping II Hydropower Station

Table 2 Statistical information of MS parameters


Indicator Statistical descriptions Number Level
Standard deviation Mean Min Max

N (unit) 13.1337 12.3407 1 70 34 NB


E (J) 1.4501 4.3639 0.780 7.094
V (m3) 0.6595 4.1329 2.511 5.168 21 LB
Nr (unit/day) 1.6892 1.6625 0.111 12.250
Er (J/day) 1.3389 3.5392 0.178 5.890 25 MB
Vr (m3/day) 0.5886 3.3188 1.666 4.393
D (day) 4.4224 7.7692 1 23 11 HB

predicted values beyond chance, and it is a robust index. true positives among all actual positives. F1-score is the
According to the literature [38], kappa values range from - harmonic mean of precision and recall. Together, these
1 (complete disagreement) to 1 (complete agreement), and metrics provide a comprehensive understanding of model’s
in general, its value is below 0.4 for poor performance, performance. Finally, the computation of these metrics is
while 0.4 and above indicates a good result, and kappa shown in Fig. 8, where Po is the observed proportion of
value above 0.8 indicates almost perfect agreement. Pre- agreement, and Pe is the proportion of agreement expected
cision measures the proportion of true positives among all by chance.
predicted positives, while recall measures the proportion of

123
Acta Geotechnica

Fig. 5 Raincloud plot visualization of MS parameters across rockburst intensity

5 Ensemble model construction base (b) The internal parameters of the classifiers have a great
on LFJaya influence on their performance. After parametric
experiments and literature analysis for this study, the
Generally, the performance of individual model used in important parameters of each classifier were selected
rockburst prediction can be limited. Thus, an ensemble and then combined with LFJaya and cross-validation
framework base on multiple classifiers (i.e., LGBM, (CV) method for classifier optimization. It is worth
XGBoost, RF, SVM, and LR) in this paper was employed noting that the MS input data were normalized before
to evaluate rockburst potential. The development process the model training to improve the modeling effi-
of this framework was illustrated in Fig. 9 and consists of ciency and eliminate the interference of data dimen-
the following main steps: sion and range.
(c) Classifiers such as LGBM were used as base
(a) Firstly, 91 sets of microseismic information-based learners, and then multiple base learners were
rockburst case data were collected from the Jinping combined to construct a first-layer ensemble frame-
II Hydropower Station project, and randomly divided work with the original rockburst data as the input.
into training and testing sets with a ratio of 70% and Then a final hybrid ensemble model (LFJaya-EML)
30%, respectively, according to the literature was composed by using the first-layer framework
[45, 107]. results as the input of the second-layer learners.

123
Acta Geotechnica

6 Results and discussion

An ensemble framework is a strategic combination of


multiple different models, each of which provides specific
functionality, resulting in better predictive performance for
a given target. In this paper, five data mining models (i.e.,
LGBM, XGBoost, RF, SVM, and LR) were used as the
base learners for the EML model. The LFJaya method was
employed to optimize and adjust the model parameters.
The selection of these base learners provides the EML
model with diversity in rockburst prediction: LGBM has
fast training speed and higher efficiency to learn and infer
nonlinear relationships between rockburst parameters;
XGBoost can adapt to less rockburst training data and is
relatively insensitive to irrelevant parameters; RF performs
well on handling high-dimensional data; SVM has strong
overfitting processing ability; and LR provides well-cali-
brated prediction probabilities while being fast to compute.
Fig. 6 Correlation analysis of model parameters They have effectively solved related problems in many
fields and enhanced the generalization and reliability of the
These base learners were used to generate new data EML while achieving a highly diverse ensemble system.
features for model retraining, and the final ensemble Since the number of rockburst cases is relatively small in
results were obtained. this study, which may affect the predictive performance of
(d) Multiple performance comparisons were performed, each learner, it is meaningful to analyze the performance
and the contribution of input parameters and inter- differences between the ensemble framework and base
action effects were analyzed based on the EML learners.
learning results.

Fig. 7 Comparison of training and testing data distribution

123
Acta Geotechnica

process for each model resulted in a decreasing trend of the


loss value over iterations, with the lowest loss value
obtained at the end of optimization. While the iteration
curves differed for each base learner, they all showed a
similar trend. Notably, the LFJaya-XGBoost model had a
significantly smaller convergent validity compared to the
other LFJaya models, indicating a less desirable internal
parameter search. The LFJaya-RF model had the lowest
final loss value among the four hybrid models, with the loss
mean value of the cross-validation process reduced from
about 0.90 to about 0.70. The overall loss mean value
reduced significantly within about 10 iterations, demon-
strating the efficient convergence performance of the
LFJaya algorithm. The overall mean loss of the model
leveled off to the optimal state around 160 iterations.
In addition, to ensure a fair evaluation of the search
process, this study adopted a K-fold cross-validation (CV)
method to test the performance of each base classifier.
Specifically, the rockburst training set was divided into k
Fig. 8 Calculation of evaluation metrics subsets, with k-1 subsets used for training and the
remaining subset used for evaluation. This process was
6.1 Hyperparameters tuning repeated k times, with each subset used exactly once for
evaluation. In this study, a fivefold CV was used due to
To achieve the best classification of rockburst potential, an computational constraints. Figure 11 presents the hyper-
ensemble model based on multiple machine learning parameter tuning changes of the four LFJaya models dur-
algorithms was constructed. The selected base learners ing the CV optimization search, including the different
were optimized and adjusted using the LFJaya method, parameter combinations and their corresponding mean
with the cross-entropy loss function used as the fitness losses.
evaluation.
To effectively build and evaluate the EML model, the 6.2 Performance comparison
rockburst database was randomly divided in a ratio of 7:3
for training and testing. Four base learners were proposed To compare the performance of different models, this study
based on LFJaya, and the same sets were used for these employed five performance evaluation metrics and con-
models. It is worth noting that excessively large population ducted a nonparametric statistical test to rank the models
sizes would increase computation time. After repeated based on each metric. The results, presented in Table 4,
attempts, it was found that the four base learners were indicate that only LFJaya-LGBM and LFJaya-RF have
insensitive to population values. Therefore, a population accuracy values above 0.8, with most of the hybrid models
size of 50 and 200 iterations was chosen, which provided based on LFJaya showing better test performance. Among
sufficient computational efficiency and search space cov- the LFJaya-based hybrid tree models, LFJaya-RF demon-
erage for completing the optimal parameter search for each strated the best prediction performance, as evidenced by its
base learner. In addition, based on the experience of tuning superior performance on all evaluation metrics compared
parameters for each model and several experiments, the to other methods, including single models not optimized by
corresponding parameters in Table 3 were selected in this the LFJaya algorithm. In contrast, XGBoost performed
study as the main model parameters for optimization. The poorly on rockburst prediction, significantly underper-
settings of the parameter ranges and the results of optimal forming the other two tree models. For more details on the
hyperparameter combinations during optimization are performance results, please refer to Table 4.
summarized in Table 3, and their performance will be Given the large number of models and metrics in
evaluated in the next section. Table 4, this study utilized a statistical analysis of perfor-
To effectively evaluate the performance of the four mance to comprehensively evaluate the models. Specifi-
LFJaya models, iteration curves for these models were cally, this study employed the suggestion of Demšar [15]
plotted based on the values of the fitness function during and performed a nonparametric Friedman test, followed by
optimization, as shown in Fig. 10. The optimization the corresponding Nemenyi test [97]. The Friedman test,
which is a nonparametric equivalent of the repeated

123
Acta Geotechnica

Fig. 9 Flowchart of the ensemble learning framework for rockburst potential prediction

Table 3 Parameter ranges and optimal values for LFJaya models in top-ranked rank. The Friedman statistic was used to mea-
rockburst prediction sure the degree of difference in performance between the
Algorithm Optimized value Parameters Xmin Xmax models. The Friedman statistic was calculated using the
following formula:
LGBM 26 num_boost_round 2 500 " #
12NT X k
kðk þ 1Þ2
0.2408 learning_rate 0.001 0.99 2
xf ¼ 2
R  ð8Þ
XGBoost 12 num_boost_round 2 500 kðk þ 1Þ j¼1 j 4
0.5342 learning_rate 0.001 0.99
RF 15 n_estimators 2 500
where k denotes the number of classifiers, NT represents the
amount of data, and Rj denotes the ranking of classifiers on
5 max_depth 1 15
all data.
SVM 3 C 0.1 100
The Nemenyi test calculates the critical difference (CD)
0.9793 gamma 0.01 50
between the rankings of two rockburst classifiers, which
represents the minimum significant difference between the
measure analysis of variance, was used to rank each model two classifiers. In other words, the two classifiers must
separately based on their performance in rockburst pre- have a critical difference in order to be considered to have
diction. The model with the better performance was given a

123
Acta Geotechnica

Fig. 10 The loss optimization curves of LFJaya models

a statistically significant difference. The critical difference The comprehensive evaluation results of the models
(CD) is calculated using the following formula [97]: mentioned above are presented in Fig. 12. It can be
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi observed that LFJaya-XGBoost demonstrated inferior
kðk þ 1Þ performance compared to LGBM. In addition, the majority
CD ¼ qa  ð9Þ
6NT of hybrid models based on LFJaya showed satisfactory
prediction performance. Among all compared models,
where qa is the critical value in the Nemenyi test and
LFJaya-RF had the best overall performance in predicting
depends on the number of rockburst classifiers as well as
rockburst. This finding is consistent with the conclusions
the significance level a.
drawn in Fig. 10, which demonstrate that LFJaya-RF

Fig. 11 Optimization tunings for parameters of LFJaya models: a LFJaya-LGBM; b LFJaya-XGBoost; c LFJaya-RF; d LFJaya-SVM

123
Acta Geotechnica

Table 4 Comparison of base learners performance before and after identify model bias and variance and reveal model weak-
LFJaya optimization nesses, facilitating further improvements. Based on the
Models Evaluation Indicators results shown in Fig. 14, the main diagonal of the confu-
sion matrix represents the number of correctly classified
Accuracy Kappa Precision Recall F1-
score rockburst potentials, while the off-diagonal entries indicate
the number of incorrectly classified ones. Misclassifying
LFJaya-LGBM 0.8214 0.7477 0.7806 0.7917 0.7840 none rockburst (NB) cases can lead to unnecessary alarm
LFJaya- 0.7143 0.6021 0.7743 0.6917 0.7121 and increase the cost of risk prevention measures. There-
XGBoost
fore, accurate prediction of the NB class is particularly
LFJaya-RF 0.8571 0.7993 0.8812 0.8417 0.8395
important. As shown in Fig. 14, the EML model achieved
LFJaya-SVM 0.7500 0.6519 0.8021 0.7417 0.7478 high accuracy for the discrimination of the NB class, with a
LR 0.6071 0.438 0.6979 0.5667 0.5806 value of 100%, and also exhibited the highest number of
LGBM 0.7857 0.6973 0.7389 0.7417 0.7386 correct predictions for the MB class. Moreover, EML had
XGBoost 0.6786 0.5579 0.6944 0.6667 0.6639 the largest total number of correctly classified cases across
RF 0.6786 0.5640 0.7976 0.6917 0.7000 all classes (in Fig. 14, NB corresponds to the none rock-
SVM 0.6786 0.5602 0.7739 0.6917 0.6959 burst, LB corresponds to the low rockburst, MB corre-
sponds to the moderate rockburst, and HB corresponds to
the high rockburst).
exhibited the best performance across all metrics. In In practical applications, moderate and high rockburst
summary, the results indicate that the hybrid models based can lead to more serious consequences in engineering and
on LFJaya are effective in improving the rockburst pre- should receive greater attention. Meanwhile, to better
diction performance of the models. evaluate the discrimination performance of the models
After tuning the parameters of multiple classifiers, their regarding the presence or absence of rockburst, their
predictive results were compared and the corresponding classification performances were analyzed in different
optimal parameter combinations were recorded. The scenarios, as presented in Fig. 15.
detailed results can be found in Table 3. Once the training In Fig. 15a, the target was divided into two states: with
of the EML model was completed, its predictive perfor- and without rockburst. Misjudging the none rockburst (NB)
mance was tested, validated, and compared with that of the cases can result in unnecessary losses. EML and LR
base classifiers. The detailed indicator results are presented achieved 100% prediction accuracy for NB cases. In terms
in Table 5. As shown in Table 5, the EML classifier of predicting the occurrence of rockburst, EML exhibited
combines some of the advantages of the base classifiers and the highest accuracy of approximately 0.8333. In Fig. 15b,
achieves the best predictive performance for rockburst the NB and LB classes were combined into a low-hazard
potential, with a precision value of 0.9023 and an accuracy rockburst class, while the MB and HB classes were com-
value of 0.8929, which is close to 0.9. It can be observed bined into a high-hazard rockburst class. For low-hazard
that the accuracy of EML is 4% higher than that of LFJaya- rockburst, EML and LFJaya-RF showed the best accuracy
RF. These results suggest that the EML model outperforms (0.9333). For high-hazard rockburst, EML achieved the
the individual base classifiers that make up the EML in highest accuracy of about 0.8462, while the accuracy of
predicting rockburst potential. other models was lower than 0.8. Overall, based on the
Similarly, based on the statistical analysis described results in Fig. 15, EML was found to be the best choice for
above, a comprehensive evaluation was conducted to rockburst prediction in this study.
compare the predictive performance of the EML model and To gain a more intuitive understanding of the rockburst
the base classifiers. The evaluation results are presented in classifiers, it is essential to use the graph visualization
Fig. 13, which shows that some classifiers, such as LR and about discrimination. The model decision chart can effec-
LFJaya-XGBoost, do not perform well on the same dataset, tively present the working situation of the classifiers, val-
while the EML model, which combines multiple base idate model performance, and deepen knowledge of
learners, outperforms them. The EML model is ranked underlying processes. Figure 16 analyzes the decision
significantly higher than its base learners, with LFJaya-RF boundary of the EML and other base classifiers.
following closely behind. The results demonstrate the In Fig. 16, it can be seen that different rockburst types
superior predictive performance of the EML in this study, were represented in different colors, and the decision
further confirming its ensemble effect. boundaries became more precise as the model was inte-
Confusion matrix [72] is a table used to evaluate the grated. Moreover, the red circles encircle parts that repre-
performance of classification models, which can help sent rockburst classification errors. Elements immediately
adjacent to the boundary are relatively difficult to classify,

123
Acta Geotechnica

Fig. 12 Comprehensive comparison of base classifiers performance before and after optimization

Table 5 Performance comparison of EML and base learners classes. Overall, the results in Fig. 16 demonstrate that
EML combines the advantages of base learners and pro-
Models Evaluation Indicators
vides more accurate decisions.
Accuracy Kappa Precision Recall F1- During model prediction, overfitting often occurs, which
score
indicates poor generalization performance of the model.
LFJaya-LGBM 0.8214 0.7477 0.7806 0.7917 0.784 Overfitting is a situation where a model is overly complex
LFJaya- 0.7143 0.6021 0.7743 0.6917 0.7121 and learns noise in the data, resulting in poor performance
XGBoost on new data. Therefore, it is essential to avoid overfitting.
LFJaya-RF 0.8571 0.7993 0.8812 0.8417 0.8395 In this paper, the relationship between the prediction
LFJaya-SVM 0.75 0.6519 0.8021 0.7417 0.7478 accuracy on the training and testing sets of the 10 classi-
LR 0.6071 0.438 0.6979 0.5667 0.5806 fiers is analyzed using an overfitting plot shown in Fig. 17.
EML 0.8929 0.8453 0.9023 0.8417 0.8631 The larger the deviation of the model from the 1:1 line, the
more severe the overfitting. Ideally, a point should fall on
the line, indicating that the model has good generalization
performance. From Fig. 17, it can be seen that XGBoost
while those further away have a higher degree of certainty
and RF models have the most severe overfitting among all
regarding rockburst class attribution. EML shows fewer
models, but their overfitting is significantly reduced by
decision errors in the none rockburst (NB) class and has the
using the LFJaya optimization technique. The overfitting
highest accuracy among the six models. The decision
plot analysis shows that EML can effectively reduce
regions of LFJaya-SVM and LR are distinctly jagged
overfitting in rockburst prediction and improve the overall
compared to EML. However, the results in Fig. 16 also
generalization performance of the model compared to other
suggest that low rockburst and high rockburst classes are
classifiers.
relatively more difficult to discriminate, which may be due
to the relatively small number of cases for these two

123
Acta Geotechnica

Fig. 13 Comprehensive comparison of performance for EML and base classifiers

Fig. 14 The confusion matrix of EML and base learners

123
Acta Geotechnica

Fig. 15 Discrimination comparison of models for different rockburst states: a with and without rockburst; b Low-hazard and high-hazard
rockburst

6.3 Rockburst potential interpretation based used to interpret the rockburst prediction results of EML
on ensemble model model and quantify the impact of associated risk factors
were discussed. Shapley value is a concept from Cooper-
The development of an accurate rockburst potential pre- ative Game Theory that was first proposed in 1953 [69].
diction model is essential for better understanding the SHAP is an additive explanation method based on Shapley
relationship between rockburst potential and risk factors. value [23]. The prediction value of the model is understood
However, it is also important to have an understanding of as the sum of the attribution values of each input feature.
how the model works and to explore the contribution of The most important characteristic of SHAP is that it can
parameter features to the model output. effectively reflect the influence of the features in each
Ensemble model with high complexity and good accu- sample (i.e., the extent to which the features contribute to
racy generally lack interpretability. In this section, how the predictive ability of the model) and show the positivity
SHapley Additive exPlanations (SHAP) value and Local or negativity of that influence. In addition, the SHAP
Interpretable Model-agnostic Explanations (LIME) can be method can capture feature interactions directly based on

123
Acta Geotechnica

Fig. 16 Decision boundaries for EML and base learners

the Shapley interaction index to reveal hidden relationships LIME is a popular method for local interpretation [59].
between features, which can further enhance the local It works by perturbing the data of input instances to create
interpretation of the model. The calculation of Shapley a new dataset, which is then used to train a simple and
values is as follows [100]: interpretable local model (such as a linear model) to fit the
X jSj!ðjN0 j  jSj  1!Þ decision boundary of the model being interpreted [59, 104].
Ui ¼ ½fS[i ðxS[i Þ  fS ðxS Þ ð10Þ This local model can then be used to explain the decision
SN nðiÞ
jN0 j!
0 made by the model being interpreted and provide insight
where N0 denotes the set of all features in the rockburst into the factors that contribute to that decision for a given
dataset, S denotes the set after index i is removed, the rockburst sample. The weight coefficients of the local
importance of feature i to the rockburst-model output is model reflect the importance of the rockburst features in
represented by Ui, xs denotes the vector of rockburst fea- the current decision. The definition of LIME is as follows:
tures in set S, and the feature contributions are calculated nðxÞ ¼ arg min Qðfx ; g; px Þ þ XðgÞ ð11Þ
g2G
with the corresponding function f.

123
Acta Geotechnica

time effect parameters like D still have significant contri-


butions to the classification of NB and LB, which high-
lights their importance in rockburst potential evaluation.
Previous studies have shown that factors such as the
number, strength, size, and temporal effects of rock frac-
tures and radiant energy are critical for rockburst risk
prediction [12, 45], which are consistent with the influen-
tial results of the parameters shown in Fig. 18 of this paper.
Figure 19 provides further insights into the hidden
contribution relationship between the most influential fea-
tures through an intuitive three-dimensional interaction
analysis. The analysis presented in Fig. 19a demonstrates a
negative correlation effect between cumulative release
energy and cumulative apparent volume on the prediction
of NB. Specifically, as the cumulative release energy
increases, there is a hindering effect on the prediction of
Fig. 17 Prediction accuracy diagram on training and testing sets NB. However, this hindering effect can be weakened if the
cumulative apparent volume is relatively small. Neverthe-
where n is defined as the objective function, the function Q less, overall, the effect is still negative. Figure 19b reveals
is used to measure the distance of prediction between the that cumulative release energy and apparent volume values
interpretable model g and the model fx to be interpreted, X in an intermediate range consistently have a promoting
(g) is used to measure the complexity of the agent model, effect on the prediction of LB. However, as the energy
G denotes the set of agent models, and px denotes the value increases beyond this range, the prediction of LB is
distance from the sampled instances to the original negatively influenced, since larger values of energy and
instances x. apparent volume correspond to higher intensity rockbursts.
This paragraph describes the influential factors on The overall trend shown in Fig. 19c is similar to that in
rockburst potential classification by using SHAP values. Fig. 19b, but the cumulative release energy and cumulative
Figure 18 shows the SHAP values of each feature point for apparent volume values that have a promoting effect on the
rockburst samples, where positive values indicate a pro- prediction of MB are higher. In Fig. 19d, the analysis
motion of the model prediction and negative values indi- demonstrates that the interaction between cumulative
cate a negative impact on the prediction [94]. The release energy and cumulative apparent volume has a
figure presents a comparison of the influential factors for positive effect on the prediction of HB. In summary, the
different rockburst potential levels separately and in order. cumulative release energy and apparent volume values
In Fig. 18a, factor E has the largest influence on the pre- have the significant effect on rockburst predictions, and the
diction of the NB class, where larger values of cumulative red part of Fig. 19, which indicates promoting prediction,
release energy result in a negative impact on the prediction, shifts from the lower left to the upper right (in the x–y
indicating a higher potential for rockburst occurrence. plane) as the intensity of rockbursts increases.
Similarly, in Fig. 18b–d, the factors E and V have the In the above content, an overall analysis based on SHAP
greatest effects on low, moderate, and high rockbursts. The was conducted to study the key input variables that affect
large values of cumulative release energy and cumulative rockburst potential. In order to enhance the understanding
apparent volume promote the prediction of moderate and of the behavior of the EML model and the rockburst
high rockbursts, which is consistent with the actual situa- classification results, LIME was used for a more detailed
tion. The overall contribution chart presented in Fig. 18e case study of these four types of rockbursts. LIME can
displays the contribution values of seven parameters: N, E, analyze the prediction results of the EML model on specific
V, Nr, Er, Vr, and D, which are 0.2116, 0.3105, 0.2714, instances and reveal how the model makes decisions based
0.0747, 0.0989, 0.0525, and 0.0264, respectively. The on input. Additionally, it can display the confidence level
results indicate that the parameters E, V, and N have the of the model prediction results, which can be used to
most significant effects on rockburst potential classifica- evaluate the reliability of the model prediction results for
tion. Although time effect parameters like D have rela- specific instances. Through the use of LIME analysis, the
tively lower contribution scores, experiments have aim is to better understand the factors that lead to each
demonstrated that including these parameters in the study specific type of rockburst and to validate the results
improves the predictive performance. It is worth noting that obtained through SHAP analysis. The results of the SHAP
and LIME analyses for cases with different rockburst types

123
Acta Geotechnica

Fig. 18 Summary of SHAP values for the impact of seven variables on model output: a none rockburst; b low rockburst; c moderate rockburst;
d high rockburst; e total contribution

are presented in Figs. 20, 21, 22, 23. Despite being cal- with feature values of 3.977, 8.0, 5.204, 4.72, 2.667, and
culated using different methods, similar results were 3.5 for V, N, E, Er, Nr, and Vr, respectively. Compared to
obtained from both methods. It’s worth noting that the base the results in Fig. 20a, the increased values of V and N in
value in the figure represents the target mean value, and Fig. 21a have a promoting effect on the prediction of LB.
f(x) is the output value of the rockburst type in the case. Similarly, Fig. 21b shows that V and N have a promoting
The red and blue arrows represent that increase and sup- effect, showing a maximum rockburst predictive proba-
press the rockburst predictions, respectively, while the bility of 0.48 for LB. It is worth noting that unlike the NB
arrow lengths indicate the degree of influence of the vari- case in Fig. 20, E in Fig. 21 has a positive effect since LR
ables on the prediction [36]. represents the state in which rockburst has already occur-
Figure 20 presents a case study of the NB class, with red. Figure 22 presents a case study of the MB class with
feature values of 0.111, 1.680, 2.0, 2.936, and 5.16 for Nr, feature values of 4.263, 5.865, and 19.0 for V, E, and N,
Vr, N, V, and E, respectively. The SHAP analysis in respectively. As shown in Fig. 22a, high values of V, E,
Fig. 20a indicates that a larger value of E has a hindering and N promote the prediction of MB, with V being the
effect on the prediction of NB, while smaller values of N most influential feature, followed by E and N. Figure 22b
and V have a promoting effect. The LIME analysis in shows a maximum rockburst predictive probability of 0.67
Fig. 20b produces similar results, showing a maximum for MB, and the ranking of the main contributing features
rockburst predictive probability of 0.63 for NB, and almost is consistent with that in Fig. 22a, all having a positive
identical contribution degrees and positive and negative effect. Figure 23 presents a case study of the HB class,
effects of the corresponding features as obtained from the with feature values of 4.995, 6.419, 49.0, 5.817, 4.393, and
SHAP analysis. Figure 21 illustrates a case of the LB class 12.25 for V, E, N, Er, Vr, and Nr, respectively. In Fig. 23a,

123
Acta Geotechnica

Fig. 19 Analysis of the interaction effects between the main variables: a none rockburst; b low rockburst; c moderate rockburst; d high rockburst

it is evident that V, E, and N exhibit large values and rockburst potential based on 91 rockburst samples from the
contribute significantly to the prediction of HB. The LIME Jinping II Hydropower Station project. The proposed
analysis presented in Fig. 23b shows the maximum rock- method achieves satisfactory rockburst prediction perfor-
burst probability of 0.90 for HB, with a significant dis- mance with a small dataset. Previously, Liang et al. [45]
criminatory effect. Furthermore, the feature contribution proposed a weight-based ensemble classifier and achieved
ranking is consistent between Fig. 23a, b, all presenting a the highest accuracy of 0.8667. In this study, the same 7:3
positive effect. In summary, the SHAP and LIME analysis ratio was used to construct the predictive model, and the
results demonstrate the robustness and reliability of the highest accuracy of 0.8929 was achieved by the proposed
EML model for the classification of rockburst potential. EML model, and 100% accuracy was obtained for the none
The positive and negative effects of different variables on rockburst class, indicating that the prediction performance
each rockburst type, as well as the strength of their rela- of EML has been significantly improved by integrating
tionships, are generally consistent. multiple learners. Therefore, EML developed in this study
can provide effective guidance for rockburst risk assess-
6.4 Study significance and limitations ment, and promote rockburst risk prediction in deep
underground engineering. Furthermore, Table 6 presents a
As underground engineering goes deeper, existing rock- comparison of this EML with previous studies on related
burst prediction methods become inadequate to meet safety rockburst data for a more intuitive form.
requirements of engineering at great depth [33, 79]. Although the overall prediction performance of the
Therefore, improving rockburst prediction accuracy and EML model is satisfactory, there are some limitations. The
reducing the risk and intensity of rockbursts is crucial. This confusion matrix shows that the prediction performance of
study proposes an LFJaya-based EML model to predict some rockburst classes is undesirable, which may be due to

123
Acta Geotechnica

Fig. 20 Probabilistic interpretation and contribution analysis for NB: a SHAP value; b LIME

Fig. 21 Probabilistic interpretation and contribution analysis for LB: a SHAP value; b LIME

the relatively small number of cases for these rockburst dataset to include other stress or rock indicators. Small
classes. Therefore, in future studies, more attention should datasets may lead to overfitting of the model and reduce its
be given to this aspect to improve the performance of the generalization. However, ensemble classifiers have better
model. In addition, the rockburst dataset is small in feature robustness and data scalability, which is a significant
types and has a limited variety of features based solely on advantage.
microseismic monitoring. Future studies could expand the

123
Acta Geotechnica

Fig. 22 Probabilistic interpretation and contribution analysis for MB: a SHAP value; b LIME

Fig. 23 Probabilistic interpretation and contribution analysis for HB: a SHAP value; b LIME

6.5 Case verification the original 91 samples. These new validation data were
obtained from Feng et al. [19, 20], which recorded six
To further demonstrate the effectiveness of the LFJaya- rockburst events in the Jinping II Hydropower Station
EML classifier proposed in this study in differentiating project. Table 7 presents the specific MS parameter infor-
between different rockburst potentials, this section applied mation and rockburst types of these events. A graphical
it to newly collected rockburst data that was not included in user interface (GUI) was developed for the corresponding

123
Acta Geotechnica

Table 6 Corresponding indicator values and levels for rockburst validation cases
Author Model Parameters Accuracy (%)

Liang et al. [44] RF, AdaBoost, XGBoost, LGBM, GBM N, E, V, Nr, Er, Vr 80
Liang et al. [45] Weight-based Ensemble N, E, V, Nr, Er, Vr, D 86.67
Jin et al. [31] SVM N, E, V, Nr, Er, Vr 86
This work LFJaya-EML N, E, V, Nr, Er, Vr, D 89.92

Table 7 Corresponding indicator values and levels for rockburst validation cases
N (unit) E (J) V (m3) NR (unity/day) ER (J/day) VR (m3/day) D (day) Level EML

1 2 1.940 3.250 2.0 1.920 3.250 1 NB NB


2 3 1.250 4.944 3.0 1.250 4.944 1 NB NB
3 42 6.284 5.050 6.0 5.439 4.204 7 HB HB
4 45 4.803 4.838 4.1 3.762 3.796 11 HB HB
5 17 3.172 5.015 1.7 2.172 4.015 10 LB LB
6 18 3.828 4.703 3.0 3.046 3.924 6 MB MB

Fig. 24 The graphical user interface of EML

123
Acta Geotechnica

EML prediction based on this study, and the results are analysis showed that release energy, the number
displayed in Fig. 24. It can be seen that the EML model and intensity of rock fractures are crucial for short-
accurately predicted all six rockburst events, further term rockburst potential prediction, and their com-
proving the good generalization of the ensemble model and plex interactions have a significant combined effect
its effectiveness and practicality. on prediction.
(4) Finally, a corresponding GUI was constructed based
on the proposed EML for new rockburst data
7 Conclusions validation. The results showed that EML can accu-
rately predict the potential levels of six rockburst
Rockburst is a hazardous phenomenon in underground rock events in this deeply buried tunnel according to the
engineering, which cannot be entirely controlled or pre- real-time MS monitoring information.
vented from occurring. However, it can be predicted in
In summary, the combination of EML with real-time
advance to take appropriate measures and minimize the
MS monitoring information achieved satisfactory results in
associated risks and losses. Therefore, this study aimed to
predicting short-term rockburst potential. These results
investigate the evolutionary prediction of microseismic
demonstrate that the EML model has good generalization
parameters for short-term rockburst using the ensemble
and can effectively utilize MS information for short-term
learning model based on 91 rockburst data obtained from
rockburst risk warning. By combining multiple methods in
the microseismic monitoring of Jinping Hydropower Sta-
the EML, the reliability of rockburst potential estimation
tion. An EML model based on LFJaya was proposed to
and early warning can be improved. Future work could
evaluate the short-term rockburst potential in this paper.
explore the potential benefits of combining MS information
The main findings of this study are summarized as follows:
with other feature types of rock-related data for multi-
(1) This study evaluated five commonly used ML variate feature fusion and the expansion of rockburst data.
algorithms for subsurface engineering analysis,
Acknowledgements This research is partially supported by the
including three tree-based models (LGBM,
National Natural Science Foundation Project of China (42177164 and
XGBoost, and RF), nonlinear SVM, and logistic 41807259) and the Distinguished Youth Science Foundation of
regression. To overcome the limitations of a single Hunan Province of China (2022JJ10073).
algorithm and achieve better rockburst potential
prediction, an ensemble strategy was employed to Declarations
combine these algorithms and obtained an EML
model. Furthermore, to effectively extract the opti- Conflict of interest The authors declare that they have no known
competing financial interests or personal relationships that could have
mal combination of parameters for the models, an
appeared to influence the work reported in this paper.
improved Jaya algorithm based on Lévy-Flight was
utilized in this study for the optimal parameter search
of the base learners in the EML model. The References
performance of the model was evaluated fair using
fivefold cross-validation. 1. Adoko AC, Gokceoglu C, Wu L, Zuo QJ (2013) Knowledge-
(2) The overfitting phenomenon of base learners was based and data-driven fuzzy modeling for rockburst prediction.
Int J Rock Mech Min Sci 61:86–95. https://doi.org/10.1016/j.
significantly reduced after LFJaya tuning. The EML ijrmms.2013.02.010
model achieved the best performance among all 2. Afraei S, Shahriar K, Madani SH (2019) Developing intelligent
methods, with an accuracy of 0.8929, kappa of classification models for rock burst prediction after recognizing
0.8453, precision of 0.9023, recall of 0.8417, and F1- significant predictor variables, Section 1: literature review and
data preprocessing procedure. Tunn Undergr Space Technol
score of 0.8631 on the testing set. Additionally, the 83:324–353. https://doi.org/10.1016/j.tust.2018.09.022
EML model obtained the highest ranking in the 3. Alcott JM, Kaiser PK, Simser BP (1999) Use of microseismic
nonparametric statistical test, which strongly con- source parameters for rockburst hazard assessment. Seism
firms the effectiveness of the ensemble classifier Caused Mines Fluid Inject Reserv Oil Extr. https://doi.org/10.
1007/978-3-0348-8804-2_4
proposed in this study. 4. Allen M, Poggiali D, Whitaker K, Marshall TR, Kievit RA
(3) In contrast to previous rockburst prediction methods, (2019) Raincloud plots: a multi-platform tool for robust data
the LFJaya-EML model proposed in this study visualization. Wellcome Open Res. https://doi.org/10.12688/
employed SHAP and LIME to provide a new wellcomeopenres.15191.1
5. Askaripour M, Saeidi A, Rouleau A, Mercier-Langevin P (2022)
approach for analyzing the contribution of main Rockburst in underground excavations: a review of mechanism,
MS parameters to rockburst prediction for better classification, and prediction methods. Undergr Space. https://
model interpretation. The parameter contribution doi.org/10.1016/j.undsp.2021.11.008

123
Acta Geotechnica

6. Blake W, Hedley DG (2003) Rockbursts: case studies from 24. Ghosh G, Sivakumar C (2018) Application of underground
North American hard-rock mines. SME. microseismic monitoring for ground failure and secure longwall
7. Brady BT, Leighton F (1977) Seismicity anomaly prior to a coal mining operation: a case study in an Indian mine. J Appl
moderate rock burst: a case study. Int J Rock Mech Min Sci Geophys 150:21–39. https://doi.org/10.1016/j.jappgeo.2018.01.
Geomech Abstr 14(3):127–132. https://doi.org/10.1016/0148- 004
9062(77)90003-1 25. Glazer S (2018) Mine seismology: data analysis and interpre-
8. Breiman L (2001) Random forests. Mach Learn 45:5–32. https:// tation. Springer, Berlin. https://doi.org/10.1007/978-3-319-
doi.org/10.1023/A:1010933404324 32612-2
9. Cai M (2013) Principles of rock support in burst-prone ground. 26. Guo D, Chen H, Tang L, Chen Z, Samui P (2021) Assessment of
Tunn Undergr Space Technol 36:46–56. https://doi.org/10.1016/ rockburst risk using multivariate adaptive regression splines and
j.tust.2013.02.003 deep forest model. Acta Geotechnica. https://doi.org/10.1007/
10. Cao A, Liu Y, Yang X, Li S, Liu Y (2022) FDNet: Knowledge s11440-021-01299-2
and data fusion-driven deep neural network for coal burst pre- 27. Guo J, Guo J, Zhang Q, Huang M (2022) Research on rockburst
diction. Sensors 22(8):3088. https://doi.org/10.3390/s22083088 classification prediction based on BP-SVM model. IEEE Access
11. Chen K, Chen H, Zhou C, Huang Y, Qi X, Shen R, Liu F, Zuo 10:50427–50447. https://doi.org/10.1109/ACCESS.2022.
M, Zou X, Wang J (2020) Comparative analysis of surface water 3173059
quality prediction performance and identification of key water 28. Heal D (2010) Observations and analysis of incidences of
parameters using different machine learning models based on rockburst damage in underground mines.
big data. Water Res 171:115454. https://doi.org/10.1016/j. 29. Iacca G, dos Santos Junior VC, de Melo VV (2021) An
watres.2019.115454 improved Jaya optimization algorithm with Lévy flight. Expert
12. Chen BR, Feng XT, Li QP, Luo RZ, Li S (2015) Rock burst Syst Appl 165:113902. https://doi.org/10.1016/j.eswa.2020.
intensity classification based on the radiated energy with dam- 113902
age intensity at Jinping II hydropower station, China. Rock 30. Ingle KK, Jatoth RK (2020) An efficient JAYA algorithm with
Mech Rock Eng 48:289–303. https://doi.org/10.1007/s00603- lévy flight for non-linear channel equalization. Expert Syst Appl
013-0524-2 145:112970. https://doi.org/10.1016/j.eswa.2019.112970
13. Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting 31. Jin A, Basnet PMS, Mahtab S (2022) Microseismicity-based
system. In: Proceedings of the 22nd ACM SIGKDD interna- short-term rockburst prediction using non-linear support vector
tional conference on knowledge discovery and data mining, machine. Acta Geophys 70(4):1717–1736. https://doi.org/10.
https://doi.org/10.1145/2939672.2939785 1007/s11600-022-00817-4
14. Chen C, Zhang Q, Ma Q, Yu B (2019) LightGBM-PPI: Pre- 32. Kadkhodaei MH, Ghasemi E, Sari M (2022) Stochastic assess-
dicting protein-protein interactions through LightGBM with ment of rockburst potential in underground spaces using Monte
multi-information fusion. Chemom Intell Lab Syst 191:54–64. Carlo simulation. Environ Earth Sci 81(18):447. https://doi.org/
https://doi.org/10.1016/j.chemolab.2019.06.003 10.1007/s12665-022-10561-z
15. Demšar J (2006) Statistical comparisons of classifiers over 33. Kaiser PK, Cai M (2012) Design of rock support system under
multiple data sets. J Mach Learn Res 7:1–30. https://doi.org/10. rockburst condition. J Rock Mech Geotech Eng 4(3):215–227.
5555/1248547.1248548 https://doi.org/10.3724/SP.J.1235.2012.00215
16. Dong LJ, Li XB, Peng K (2013) Prediction of rockburst clas- 34. Ke B, Khandelwal M, Asteris PG, Skentou AD, Mamou A,
sification using random forest. Trans Nonferr Metals Soc China Armaghani DJ (2021) Rock-burst occurrence prediction based
23(2):472–477. https://doi.org/10.1016/S1003-6326(13)62487-5 on optimized Naı̈ve Bayes models. IEEE Access
17. Fajklewicz Z (1983) Rock-burst forecasting and genetic research 9:91347–91360. https://doi.org/10.1109/ACCESS.2021.
in coal-mines by microgravity method. Geophys Prospect 3089205
31(5):748–765. https://doi.org/10.1111/j.1365-2478.1983. 35. Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu
tb01083.x TY (2017) Lightgbm: A highly efficient gradient boosting
18. Feng X, Chen B, Li S, Zhang C, Xiao Y, Feng G, Zhou H, Qiu decision tree. Adv Neural Inform Process Syst 30:3146–3154.
S, Zhao Z, Yu Y (2012) Studies on the evolution process of https://doi.org/10.5555/3294996.3295074
rockbursts in deep tunnels. J Rock Mech Geotech Eng 36. Kim Y, Kim Y (2022) Explainable heat-related mortality with
4(4):289–295. https://doi.org/10.3724/SP.J.1235.2012.00289 random forest and SHapley Additive exPlanations (SHAP)
19. Feng XT, Chen BR, Zhang CQ, Li SJ, Wu SY (2013) Mecha- models. Sustain Cities Soc 79:103677. https://doi.org/10.1016/j.
nism, warning and dynamic control of rockburst development scs.2022.103677
process. Science Press, Beijing 37. Kleinbaum DG, Dietz K, Gail M, Klein M, Klein M (2002)
20. Feng GL, Feng XT, Chen BR, Xiao YX, Yu Y (2015) A Logistic regression. Springer-Verlag, New York. https://doi.org/
microseismic method for dynamic warning of rockburst devel- 10.1007/978-1-4419-1742-3
opment processes in tunnels. Rock Mech Rock Eng 38. Landis JR, Koch GG (1977) The measurement of observer
48:2061–2076. https://doi.org/10.1007/s00603-014-0689-3 agreement for categorical data. Biometrics. https://doi.org/10.
21. Feng XT, Liu J, Chen B, Xiao Y, Feng G, Zhang F (2017) 2307/2529310
Monitoring, warning, and control of rockburst in deep metal 39. Li N, Feng X, Jimenez R (2017) Predicting rock burst hazard
mines. Engineering 3(4):538–545. https://doi.org/10.1016/J. with incomplete data using Bayesian networks. Tunn Undergr
ENG.2017.04.013 Space Technol 61:61–70. https://doi.org/10.1016/j.tust.2016.09.
22. Feng G, Xia G, Chen B, Xiao Y, Zhou R (2019) A method for 010
rockburst prediction in the deep tunnels of hydropower stations 40. Li TZ, Li YX, Yang XL (2017) Rock burst prediction based on
based on the monitored microseismicity and an optimized genetic algorithms and extreme learning machine. J Central
probabilistic neural network model. Sustainability 11(11):3212. South Univ 24(9):2105–2113. https://doi.org/10.1007/s11771-
https://doi.org/10.3390/su11113212 017-3619-1
23. Futagami K, Fukazawa Y, Kapoor N, Kito T (2021) Pairwise 41. Li D, Liu Z, Xiao P, Zhou J, Armaghani DJ (2022) Intelligent
acquisition prediction with SHAP value interpretation. J Financ rockburst prediction model with sample category balance using
Data Sci 7:22–44. https://doi.org/10.1016/j.jfds.2021.02.001 feedforward neural network and Bayesian optimization. Undergr

123
Acta Geotechnica

Space 7(5):833–846. https://doi.org/10.1016/j.undsp.2021.12. 57. Myrvang A, Grimstad E (1983) Rockburst problems in Nor-
009 wegian highway tunnels—recent case histories. Rockbursts:
42. Li X, Mao H, Li B, Xu N (2021) Dynamic early warning of prediction and control. Symposium pp 133–139
rockburst using microseismic multi-parameters based on Baye- 58. Naji AM, Emad MZ, Rehman H, Yoo H (2019) Geological and
sian network. Eng Sci Technol Int J 24(3):715–727. https://doi. geomechanical heterogeneity in deep hydropower tunnels: a
org/10.1016/j.jestch.2020.10.002 rock burst failure case study. Tunn Undergr Space Technol
43. Li N, Zare Naghadehi M, Jimenez R (2020) Evaluating short- 84:507–521. https://doi.org/10.1016/j.tust.2018.11.009
term rock burst damage in underground mines using a systems 59. Peng J, Zou K, Zhou M, Teng Y, Zhu X, Zhang F, Xu J (2021)
approach. Int J Min Reclam Environ 34(8):531–561. https://doi. An explainable artificial intelligence framework for the deteri-
org/10.1080/17480930.2019.1657654 oration risk prediction of hepatitis patients. J Med Syst 45:1–9.
44. Liang W, Sari A, Zhao G, McKinnon SD, Wu H (2020) Short- https://doi.org/10.1007/s10916-021-01736-5
term rockburst risk prediction using ensemble learning methods. 60. Polikar R (2012) Ensemble machine learning: Methods and
Nat Hazards 104:1923–1946. https://doi.org/10.1007/s11069- applications. Springer, New York, pp 1–34. https://doi.org/10.
020-04255-7 1007/978-1-4419-9326-7_1
45. Liang W, Sari YA, Zhao G, McKinnon SD, Wu H (2021) 61. Pu Y, Apel DB, Liu V, Mitri H (2019) Machine learning
Probability estimates of short-term rockburst risk with ensemble methods for rockburst prediction-state-of-the-art review. Int J
classifiers. Rock Mech Rock Eng 54:1799–1814. https://doi.org/ Min Sci Technol 29(4):565–570. https://doi.org/10.1016/j.ijmst.
10.1007/s00603-021-02369-3 2019.06.009
46. Lin Y, Zhou K, Li J (2018) Application of cloud model in rock 62. Pu Y, Apel DB, Wang C, Wilson B (2018) Evaluation of burst
burst prediction and performance comparison with three liability in kimberlite using support vector machine. Acta
machine learning algorithms. IEEE Access 6:30958–30968. Geophys 66:973–982. https://doi.org/10.1007/s11600-018-0178-
https://doi.org/10.1109/ACCESS.2018.2839754 2
47. Liu JP, Feng XT, Li YH, Sheng Y (2013) Studies on temporal 63. Pu Y, Apel DB, Wei C (2019) Applying machine learning
and spatial variation of microseismic activities in a deep metal approaches to evaluating rockburst liability: a comparation of
mine. Int J Rock Mech Min Sci 60:171–179. https://doi.org/10. generative and discriminative models. Pure Appl Geophys
1016/j.ijrmms.2012.12.022 176(10):4503–4517. https://doi.org/10.1007/s00024-019-02197-
48. Liu Y, Hou S (2020) Rockburst prediction based on particle 1
swarm optimization and machine learning algorithm. In: Infor- 64. Qiu L, Liu Z, Wang E, He X, Feng J, Li B (2020) Early-warning
mation Technology in Geo-Engineering: Proceedings of the 3rd of rock burst in coal mine by low-frequency electromagnetic
International Conference (ICITG), Guimarães, Portugal 3 radiation. Eng Geol 279:105755. https://doi.org/10.1016/j.
pp 292–303. https://doi.org/10.1007/978-3-030-32029-4_25 enggeo.2020.105755
49. Liu GF, Jiang Q, Feng GL, Chen DF, Chen BR, Zhao ZN (2021) 65. Rao R (2016) Jaya: a simple and new optimization algorithm for
Microseismicity-based method for the dynamic estimation of the solving constrained and unconstrained optimization problems.
potential rockburst scale during tunnel excavation. Bull Eng Int J Ind Eng Comput 7(1):19–34. https://doi.org/10.5267/j.ijiec.
Geol Env 80:3605–3628. https://doi.org/10.1007/s10064-021- 2015.8.004
02173-x 66. Rokach L (2010) Ensemble-based classifiers. Artif Intell Rev
50. Liu Z, Shao J, Xu W, Meng Y (2013) Prediction of rock burst 33:1–39. https://doi.org/10.1007/s10462-009-9124-7
classification using the technique of cloud models with attribu- 67. Sagi O, Rokach L (2018) Ensemble learning: a survey. Wiley
tion weight. Nat Hazards 68:549–568. https://doi.org/10.1007/ Interdiscip Rev Data Min Knowl Discov 8(4):e1249. https://doi.
s11069-013-0635-9 org/10.1002/widm.1249
51. Ma T, Lin D, Tang L, Li L, Tang CA, Yadav KP, Jin W (2022) 68. Sauer J, Mariani VC, dos Santos CL, Ribeiro MHDM, Ram-
Characteristics of rockburst and early warning of microseismic pazzo M (2021) Extreme gradient boosting model based on
monitoring at qinling water tunnel. Geomat Nat Haz Risk improved Jaya optimizer applied to forecasting energy con-
13(1):1366–1394. https://doi.org/10.1080/19475705.2022. sumption in residential buildings. Evolving Syst. https://doi.org/
2073830 10.1007/s12530-021-09404-2
52. Ma T, Tang C, Tang L, Zhang W, Wang L (2015) Rockburst 69. Shapley LS (1953) 17. A value for n-person games. In: Kuhn
characteristics and microseismic monitoring of deep-buried HW, Tucker AW (eds) Contributions to the theory of games
tunnels for Jinping II Hydropower Station. Tunn Undergr (AM-28), volume II. Princeton University Press, Princeton,
Space Technol 49:345–368. https://doi.org/10.1016/j.tust.2015. pp 307–318. https://doi.org/10.1515/9781400881970-018
04.016 70. Shirani Faradonbeh R, Taheri A (2019) Long-term prediction of
53. Ma X, Westman E, Slaker B, Thibodeau D, Counter D (2018) rockburst hazard in deep underground openings using three
The b-value evolution of mining-induced seismicity and main- robust data mining techniques. Eng Comput 35(2):659–675.
shock occurrences at hard-rock mines. Int J Rock Mech Min Sci https://doi.org/10.1007/s00366-018-0624-4
104:64–70. https://doi.org/10.1016/j.ijrmms.2018.02.003 71. Shukla R, Khandelwal M, Kankar P (2021) Prediction and
54. Mammone A, Turchi M, Cristianini N (2009) Support vector assessment of rock burst using various meta-heuristic approa-
machines. Wiley Interdiscip Rev Comput Stat 1(3):283–289. ches. Min Metall Explor 38:1375–1381. https://doi.org/10.1007/
https://doi.org/10.1002/wics.49 s42461-021-00415-w
55. Mark C (2016) Coal bursts in the deep longwall mines of the 72. Sokolova M, Lapalme G (2009) A systematic analysis of per-
United States. Int J Coal Sci Technol 3(1):1–9. https://doi.org/ formance measures for classification tasks. Inf Process Manage
10.1007/s40789-016-0102-9 45(4):427–437. https://doi.org/10.1016/j.ipm.2009.03.002
56. Mendecki A, Gibowicz S, Lasocki S (1997) Keynote lecture: 73. Sun L, Hu N, Ye Y, Tan W, Wu M, Wang X, Huang Z (2022)
principles of monitoring seismic rockmass response to mining. Ensemble stacking rockburst prediction model based on Yeo-
In: Gibowiez SJ (ed) Proceedings of the fourth international Johnson, K-means SMOTE, and optimal rockburst feature
symposium on rockbursts and seismieity in mines pp 69–80 dimension determination. Sci Rep 12(1):15352. https://doi.org/
10.1038/s41598-022-19669-5

123
Acta Geotechnica

74. Sun Y, Li G, Zhang J, Huang J (2021) Rockburst intensity 91. Zhang M (2022) Classification prediction of rockburst in railway
evaluation by a novel systematic and evolved approach: tunnel Based on hybrid PSO-BP neural network. Geofluids.
machine learning booster and application. Bull Eng Geol Env https://doi.org/10.1155/2022/4673073
80:8385–8395. https://doi.org/10.1007/s10064-021-02460-7 92. Zhang M, Liu S, Shimada H (2018) Regional hazard prediction
75. Tang LZ, Xia K (2010) Seismological method for prediction of of rock bursts using microseismic energy attenuation tomogra-
areal rockbursts in deep mine with seismic source mechanism phy in deep mining. Nat Hazards 93:1359–1378. https://doi.org/
and unstable failure theory. J Cent South Univ Technol 10.1007/s11069-018-3355-3
17(5):947–953. https://doi.org/10.1007/s11771-010-0582-5 93. Zhao Y, Jiang Y (2010) Acoustic emission and thermal infrared
76. Vapnik VN (1995) The nature of statistical learning. Theory. precursors associated with bump-prone coal failure. Int J Coal
https://doi.org/10.1007/978-1-4757-3264-1 Geol 83(1):11–20. https://doi.org/10.1016/j.coal.2010.04.001
77. Wang J, Liu P, Ma L, He M, Xiong H (2021) A rockburst 94. Zheng S, He C, Hsu SC, Sarkis J, Chen JH (2020) Corporate
proneness evaluation method based on multidimensional cloud environmental performance prediction in China: an empirical
model improved by control variable method and rockburst study of energy service companies. J Clean Product 266:121395.
database. Lithosphere. https://doi.org/10.2113/2022/5354402 https://doi.org/10.1016/j.jclepro.2020.121395
78. Wang J, Zhang J (2010) Preliminary engineering application of 95. Zhou J, Huang S, Qiu Y (2022) Optimization of random forest
microseismic monitoring technique to rockburst prediction in through the use of MVO, GWO and MFO in evaluating the
tunneling of Jinping II project. J Rock Mech Geotech Eng stability of underground entry-type excavations. Tunn Undergr
2(3):193–208. https://doi.org/10.3724/SP.J.1235.2010.00193 Space Technol 124:104494. https://doi.org/10.1016/j.tust.2022.
79. Wang SM, Zhou J, Li CQ, Armaghani DJ, Li XB, Mitri HS 104494
(2021) Rockburst prediction in hard rock mines developing 96. Zhou J, Huang S, Zhou T, Armaghani DJ, Qiu Y (2022)
bagging and boosting tree-based ensemble techniques. J Central Employing a genetic algorithm and grey wolf optimizer for
South Univ 28(2):527–542 optimizing RF models to evaluate soil liquefaction potential.
80. Wang M, Zhu ZM, Liu JH (2012) The photoelastic analysis of Artif Intell Rev 55(7):5673–5705
stress intensity factor for cracks around a tunnel. Appl Mech 97. Zhou J, Li X, Mitri HS (2016) Classification of rockburst in
Mater 142:197–200. https://doi.org/10.4028/www.scientific.net/ underground projects: comparison of ten supervised learning
AMM.142.197 methods. J Comput Civ Eng 30(5):04016003. https://doi.org/10.
81. Woźniak M, Grana M, Corchado E (2014) A survey of multiple 1061/(ASCE)CP.1943-5487.0000553
classifier systems as hybrid systems. Inform Fusion 16:3–17. 98. Zhou J, Li X, Mitri HS (2018) Evaluation method of rockburst:
https://doi.org/10.1016/j.inffus.2013.04.006 state-of-the-art literature review. Tunn Undergr Space Technol
82. Wu S, Wu Z, Zhang C (2019) Rock burst prediction probability 81:632–659. https://doi.org/10.1016/j.tust.2018.08.029
model based on case analysis. Tunn Undergr Space Technol 99. Zhou J, Li XB, Shi XZ (2012) Long-term prediction model of
93:103069. https://doi.org/10.1016/j.tust.2019.103069 rockburst in underground openings using heuristic algorithms
83. Xie X, Jiang W, Guo J (2021) Research on rockburst prediction and support vector machines. Safety Sci 50(4):629–644. https://
classification based on GA-XGB model. IEEE Access doi.org/10.1016/j.ssci.2011.08.065
9:83993–84020. https://doi.org/10.1109/ACCESS.2021. 100. Zhou J, Qiu Y, Khandelwal M, Zhu S, Zhang X (2021)
3085745 Developing a hybrid model of Jaya algorithm-based extreme
84. Xu N, Li T, Dai F, Zhang R, Tang C, Tang L (2016) Micro- gradient boosting machine to estimate blast-induced ground
seismic monitoring of strainburst activities in deep tunnels at the vibrations. Int J Rock Mech Min Sci 145:104856
Jinping II hydropower station, China. Rock Mech Rock Eng 101. Zhou J, Qiu Y, Zhu S, Armaghani DJ, Khandelwal M, Mohamad
49:981–1000. https://doi.org/10.1007/s00603-015-0784-0 ET (2021) Estimation of the TBM advance rate under hard rock
85. Xu G, Li K, Li M, Qin Q, Yue R (2022) Rockburst intensity conditions using XGBoost and Bayesian optimization. Undergr
level prediction method based on FA-SSA-PNN model. Ener- Space 6(5):506–515
gies 15(14):5016. https://doi.org/10.3390/en15145016 102. Zhou J, Shi XZ, Huang RD, Qiu XY, Chong C (2016) Feasi-
86. Xue Y, Bai C, Qiu D, Kong F, Li Z (2020) Predicting rockburst bility of stochastic gradient boosting approach for predicting
with database using particle swarm optimization and extreme rockburst damage in burst-prone mines. Trans Nonferr Metals
learning machine. Tunn Undergr Space Technol 98:103287. Soc China 26(7):1938–1945. https://doi.org/10.1016/S1003-
https://doi.org/10.1016/j.tust.2020.103287 6326(16)64312-1
87. Xue Y, Li G, Li Z, Wang P, Gong H, Kong F (2022) Intelligent 103. Zhou KP, Yun L, Deng HW, Li JL, Liu CJ (2016) Prediction of
prediction of rockburst based on Copula-MC oversampling rock burst classification using cloud model with entropy weight.
architecture. Bull Eng Geol Env 81(5):209. https://doi.org/10. Trans Nonferr Metals Soc China 26(7):1995–2002. https://doi.
1007/s10064-022-02659-2 org/10.1016/S1003-6326(16)64313-3
88. Xue R, Liang Z, Xu N, Dong L (2020) Rockburst prediction and 104. Zhou J, Zhu S, Qiu Y, Armaghani DJ, Zhou A, Yong W (2022)
stability analysis of the access tunnel in the main powerhouse of Predicting tunnel squeezing using support vector machine
a hydropower station based on microseismic monitoring. Int J optimized by whale optimization algorithm. Acta Geotech
Rock Mech Min Sci 126:104174. https://doi.org/10.1016/j. 17(4):1343–1366
ijrmms.2019.104174 105. Zitar RA, Al-Beta MA, Awadallah MA, Doush IA, Assaleh K
89. Yin X, Liu Q, Huang X, Pan Y (2021) Real-time prediction of (2022) An intensive and comprehensive overview of JAYA
rockburst intensity using an integrated CNN-Adam-BO algo- algorithm, its versions and applications. Archiv Comput Method
rithm based on microseismic data and its engineering applica- Eng 29(2):763–792. https://doi.org/10.1007/s11831-021-09585-
tion. Tunn Undergr Space Technol 117:104133. https://doi.org/ 8
10.1016/j.tust.2021.104133 106. Zhou J, Chen C, Wang M, Khandelwal M (2021) Proposing a
90. Yin X, Liu Q, Pan Y, Huang X, Wu J, Wang X (2021) Strength novel comprehensive evaluation model for the coal burst lia-
of stacking technique of ensemble learning in rockburst pre- bility in underground coal mines considering uncertainty factors.
diction with imbalanced data: comparison of eight single and Int J Min Sci Technol 31(5):799–812
ensemble models. Nat Resour Res 30:1795–1815. https://doi. 107. Zhou J, Zhang R, Qiu Y, Khandelwal M (2023) A true triaxial
org/10.1007/s11053-020-09787-0 strength criterion for rocks by gene expression programming.

123
Acta Geotechnica

J Rock Mech Geotech Eng. https://doi.org/10.1016/j.jrmge. Springer Nature or its licensor (e.g. a society or other partner) holds
2023.03.004 exclusive rights to this article under a publishing agreement with the
author(s) or other rightsholder(s); author self-archiving of the
Publisher’s Note Springer Nature remains neutral with regard to accepted manuscript version of this article is solely governed by the
jurisdictional claims in published maps and institutional affiliations. terms of such publishing agreement and applicable law.

123

You might also like