Professional Documents
Culture Documents
2021-Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization- A comparison between logistic regression and random forest
2021-Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization- A comparison between logistic regression and random forest
Engineering Geology
journal homepage: www.elsevier.com/locate/enggeo
A R T I C L E I N F O A B S T R A C T
Keywords: This study aims to develop two optimized models of landslide susceptibility mapping (LSM), i.e., logical
Landslide regression (LR) and random forest (RF) models, premised on hyperparameter optimization using the Bayesian
Bayesian hyperparameter optimization algorithm, and compare their applicability in a typical landslide-prone area (Fengjie County, China). First, data
Logical regression
for 1520 historical landslides occurring was collected from field investigations and literature reviews, to
Random forest
construct a spatial database of 16 conditioning factors. Subsequently, the Bayesian algorithm was adopted to
Landslide susceptibility mapping
optimize the hyperparameters of the LR and RF models, premised on the dataset of all cells (including landslides
and non-landslides). Finally, the two optimized models were estimated and compared with the area under curve
(AUC) and confusion matrix. Based on the Bayesian algorithm, the AUC value of the test dataset in LR model is
improved by 4%, while the AUC value of the test dataset in RF model is improved by 10%, indicating that both
models’ hyperparameter optimization premised on the Bayesian algorithm have delivered considerable impact
on the accuracy of the models; so hyperparameter optimization is very important for models of LSM. Although
both models exhibit reasonable performances, the optimized RF model premised on hyperparameter optimiza
tion has a better stability and predictive capability in case area. These findings make up for the crucial step in
LSM (hyperparameter optimization) through the Bayesian algorithm, and provide a comparison case between LR
and RF models after comprehensive consideration of hyperparameter optimization, so as to increase the
convincing power of the comparison of these models and provide a knowledge base for model comparison:
comparison premised on hyperparameter optimization.
* Corresponding author: Key Laboratory of New Technology for Construction of Cities in Mountain Area, Ministry of Education, Chongqing 400045, China.
E-mail address: jhw@cqu.edu.cn (H. Wen).
https://doi.org/10.1016/j.enggeo.2020.105972
Received 19 May 2020; Received in revised form 9 December 2020; Accepted 10 December 2020
Available online 16 December 2020
0013-7952/© 2020 Elsevier B.V. All rights reserved.
D. Sun et al. Engineering Geology 281 (2021) 105972
generated (Wang et al. 2020a). As the two most frequently adopted dataset, and it is the general model parameter; the number of decision
models for LSM, it is evident that both Logical Regression (LR) and trees in the RF model cannot be obtained through data training, but shall
Random Forest (RF) are suitable for analyzing the presence/absence of a be set before the model training, and this is the hyperparameter. In
landslide; a few studies have been published regarding the comparison machine learning, the performance of models is closely related to their
of these two models. By illustration, Tsangaratos et al. reported that the hyperparameters. By constantly adjusting the hyperparameters’ setting,
RF model has a slightly higher predictive capability than the LR model in the accuracy, operating speed and reliability of models can be greatly
Nancheng (China) (Tsangaratos et al. 2016), while Hong et al. demon improved (Xie et al. 2021). For this reason, the accuracy of models not
strated that the LR model exhibits a higher predictive capability than the only depends on the algorithm used, but also on the hyperparameters,
RF model in Lianhua (China) (Hong et al. 2016). Be that as it may, these rendering optimization of hyperparameters indispensable in any model.
studies have overlooked a crucial step: they have failed to consider the However, discussions on hyperparameter optimization mostly appear in
hyperparameters of their models. computer algorithm science. Premised on the Gaussian Kernel, Wang
Unlike general model parameters obtained through data training, et al. proposed a Support Vector Machine hyperparameter selection
hyperparameters are set before model training. For instance, the coef method, which includes two stages: selecting the kernel parameters and
ficient of the LR model could be obtained through training on the training the optimal penalty factors (Wang et al. 2014). This method has
2
D. Sun et al. Engineering Geology 281 (2021) 105972
3
D. Sun et al. Engineering Geology 281 (2021) 105972
4
D. Sun et al. Engineering Geology 281 (2021) 105972
Table 2 the search to only focus on the areas of the input space that are expected
Classification of landslide-conditioning factors. to provide the most effective information about the solution to the
Conditioning factor Class Classification standard optimization problem. In particular, the probabilistic model for
Bayesian optimization is a GP because GPs can easily calculate the
Elevation / (m) 7 1. <340; 2. 340–595; 3. 595–850; 4. 850–1105; 5.
1105–1360; 6. 1360–1615; 7. >1615 predictive distribution of the target.
Slope / ( )
◦
6 1. <10◦ ; 2. 10◦ –20◦ ; 3. 20◦ –30◦ ; 4. 30◦ –40◦ ; 5. When GP is adopted as the basic model, the assumption is that the
40◦ –50◦ ; 6. 50◦ –60◦ optimized black-box objective function f (x) is randomly sampled from
Aspect / (◦ ) 9 1. flat; 2. north; 3. northeast; 4. east; 5. southeast; GP. To be specific, f(x) ~ GP(m(x), k (x, x’)), where k (x, x’) represents a
6. south; 7. southwest; 8. west; 9. northwest
Slope position 6 1. valley; 2. lower slope; 3. flats slope; 4. middle
covariance function, and m(x) refers to a mean function. The covariance
slope; 5. upper slope; 6. ridge function k (x, x’) specifies the intrinsic characteristics of the f (x)
Micro-landform 10 1. canyons, and deeply incised streams; 2. objective lens (such as smoothness, level of additive noise). The output
midslope drainages, and shallow valleys; 3. result is the covariance of f(x) and f(x’). In a general GP model, the
upland drainages, and headwaters; 4. U-shape
probability of each feature is calculated and added, while there is a
valleys; 5. plains; 6. open slopes; 7. upper slopes,
and mesas; 8. local ridges/hills in valleys; 9. necessity to construct a covariance matrix and utilize the probability
midslope ridges, and small hills in plains; 10. values of all feature vectors in a multivariate GP model. The ultimate
mountain tops, and high narrow ridges multivariate GP model is as follows:
Profile curvature 7 1. <− 1.0; 2. − 1 to − 0.5; 3. -0.5–0; 4. 0–0.5; 5. ( )
0.5–1.0; 6. 1.0–1.5; 7. >1.5 1 1
TWI 7 1. <10; 2. 10–12; 3. 12–14; 4. 14–16; 5. 16–18; 6.
P(x) = n 1 exp − (x − μ )T
cov(x − μ )− 1
(4)
(2π)2 |cov|2 2
18–20; 7. >20
Lithology 12 1. J3p/J3s; 2. J2s/J2xs; 3. J1-2z/J1z; 4. T3xj; 5. T3b1; where μ (a mean), and cov (a covariance) are as follows:
6. T2b2; 7. T1d; 8. T1j; 9. P2; 10. P1; 11. D3/D2; 12.
S1-2 1∑n
Distance from faults/ 6 1. <500; 2. 500–1000; 3. 1000–1500; 4. μ= xi (5)
n i=1
(m) 1500–2000; 5. 2000–2500; 6. 2500–3000; 7.
>3000 1∑n
CRDS 7 1. dip-slope I; 2. dip-slope II; 3. outward slope; 4. cov = (xi − μ)(xi − μ)T (6)
oblique slope; 5. tangential slope; 6. reverse slope; n i=1
7. flat In the present research, the Bayesian algorithm was employed to
NDVI 7 1. <0.10; 2. 0.10–0.20; 3. 0.20–0.30; 4.
optimize the objective function with the accuracy of cross-validation.
0.30–0.40; 5. 0.40–0.50; 6. 0.50–0.60; 7. >0.60
Distance from rivers/ 7 1. <100; 2. 100–200; 3. 200–300; 4. 300–400; 5.
(m) 400–500; 6. 500–600; 7. >600
Annual average 8 1. < 1221; 2. 1221–1251; 3. 1251–1276; 4.
3.4. Model performance and validation
rainfall / (mm) 1276–1308; 5. 1308–1343; 6. 1343–1389; 7.
1389–1440; 8. >1440 Assessments require validation to ensure scientific significance;
Land cover 6 1. farmland; 2. forest; 3. grassland; 4. building; 5. hence, it is necessary to evaluate the validity of the landslide suscepti
water; 6. not used
bility models used. The confusion matrix and AUC were used to analyze
Distance from roads / 7 1. <100; 2. 100–200; 3. 200–300; 4. 300–400; 5.
(m) 400–500; 6. 500–600; 7. >600 the accuracy. In the confusion matrix, examples can be divided into
Distance from 7 1. <100; 2. 100–200; 3. 200–300; 4. 300–400; 5. positive and negative ones. If the sample point is a landslide, it is posi
buildings / (m) 400–500; 6. 500–600; 7. >600 tive; if the sample point is non-landslide, it is negative. If the status of the
instance is “non-landslide” and is predicted as “landslide,” it is recorded
as TN (true negative); if the state of the instance is “landslide” and is
variables in the nodes of the tree are randomly arranged (Breiman
predicted to be “non-landslide,” it is recorded as FN (false negative). The
2001). In the model-building process, RF grows multiple decision trees.
formulae for accuracy and precision are as follows:
The generalization error of RF depends on the accuracy of a single tree
and the correlation between the trees; the final prediction result is TP + TN
Accuracy = (7)
determined by a majority vote of the decision trees (Sahin et al. 2018). TP + TN + FP + FN
RF ranks the importance of factors based on the Gini index, repre
senting a random variable in the sample set. The probability that the Precision =
TP
(8)
selected sample was misclassified is determined as follows: TP + FP
Confusion-matrix-based statistical measures such as accuracy, pre
G = p(s) × p(m) (3)
cision, and recall were also used to evaluate the predictive capabilities of
where G is the Gini index, the larger the value of G, the higher the the models. Receiver operating characteristic (ROC) is a type of curve
uncertainty of data; p(s) is the probability of a sample being selected; based on confusion matrixes, which consider sensitivity and specificity
and p(m) is the probability of a sample being misclassified. as the horizontal and vertical axes, respectively. The value of AUC under
the ROC curve can quantitatively express the accuracy of model pre
3.3.3. Bayesian Optimization dictions: the value “1” represents an ideal model, while “0” represents
The Bayesian Optimization algorithm is extensively adopted to the model is lack of sufficient information (Wang et al. 2020b).
determine the optimal hyperparameter value of a model, owing to the
ability thereof to rapidly obtain optimal values (Garrido-Merchán and 4. Results and analyses
Hernández-Lobato 2020). Owing to the use of the GP, the Bayesian al
gorithm can completely master prior knowledge with strong robustness. 4.1. Model hyperparameter optimization
By increasing the number of samples, this algorithm can fit the posterior
distribution of the objective function, thereby obtaining the optimal Table 3 lists the five main hyperparameters included in the LR
value to realize the hyperparameter optimization of the models. model: Tol had a default value of 1e− 4; max_iter had a default value of
The Bayesian algorithm relies on fitting the probability model to the 100, with int as its default data type; and solver, penalty, and C were
observed value of the black-box target to be optimized. By considering optimized, with the hyperparameter values obtained in each iteration as
the predictive distribution (specifying the potential value of the target at output. As can be observed in Fig. 3, the AUC values ranged from 0.755
each point in the input space), the Bayesian optimization method guides to 0.799 under different hyperparameter values. When the AUC value
5
D. Sun et al. Engineering Geology 281 (2021) 105972
Table 3 Fig. 4, the AUC values range from 0.917 to 0.946 under different
Main hyperparameters involved in logistic regression. hyperparameter values. When the AUC value reached the maximum
Hyperparameters Explanation (0.946), the optimal hyperparameter values were as follows: [‘n_esti
mators’: ‘50’, ‘max_depths’: ‘16’, ‘max_features’: ‘10’, ‘min_samples_s
solver Loss function minimization algorithm, including sag, Newton
CG, lbfgs, and liblinear. plits’: ‘4’].
penalty Regularization methods include L1 and L2. For sag, Newton CG, Both models showed that their hyperparameters had a great impact
and lbfgs, only L2 can be chosen. on their accuracy in machine learning. The AUC values of LR and RF
C Regularization intensity value. models using 10-fold cross-validation were 5.8% and 3.2%, respectively.
max_iter Maximum number of iterations.
Tol Tolerance to stop the iteration.
Therefore, the RF model was determined to be relatively stable.
Fig. 6 shows the LSM produced by the RF model based on the opti
mized hyperparameters for Fengjie County, overlaid with landslides.
This indicates that the LSM matched well with the distribution of the
actual historical landslides.
The confusion matrix and ROC plots of the LR and RF models are
depicted in Table 4 and Fig. 7. Further, three evaluation metrics,
namely, the AUC, overall accuracy, and precision, were utilized to
evaluate the models based on the training and test datasets. The AUC of
the training dataset is an indication of the success power of the models,
while the AUC of the test dataset is an indication of the predictive ca
pabilities of the models (Tsangaratos et al. 2016). As shown in Fig. 7, the
AUC values of the training dataset of the LR and RF models were 0.78
Fig. 4. Receiver operating characteristic curves of random forest under and 1.00, respectively, while the AUC values of the test dataset of the LR
different hyperparameter values. model and the RF model were 0.80 and 0.95, respectively. In addition,
6
D. Sun et al. Engineering Geology 281 (2021) 105972
7
D. Sun et al. Engineering Geology 281 (2021) 105972
Table 4 Table 5
Confusion matrix of the logistic regression (LR) and random forest (RF) models. Statistics of the susceptibilities classified based on the logistic regression (LR)
LR Actual value
and random forest (RF) models.
Class Model Very Low Middle High Very
Landslide Non-landslide
low high
(1) (0)
Coverage (%) LR 50 20 10 10 10
Predicted Landslide (1) 2 5 Accuracy:
RF 50 20 10 10 10
value 0.909
Landslide LR 14.14 20.86 15.00 20.26 29.74
Non-landslide 1518 15,195 Precision:
proportion (%) RF 7.30 12.89 12.70 19.61 47.50
(0) 0.286
Landslide density LR 0.11 0.39 0.56 0.76 1.12
RF Actual value
(point /km2) RF 0.05 0.24 0.48 0.74 1.78
Landslide Non-landslide
(1) (0)
Predicted Landslide (1) 1317 13 Accuracy:
value 0.987 were located in the same area (10%). These results reveal that the
Non-landslide 203 15,187 Precision: proportion of landslides located in the low-susceptibility regions of LSM
(0) 0.990 generated by RF was lower in the same area. However, the proportion of
those in high-susceptibility regions was higher, exhibiting that the RF
model has a better application in indicating the landslide susceptibility
in the whole area compared to the LR model.
Furthermore, new landslide data from the study area in 2017 were
collected for further comparison. By overlaying the 25 coordinated
landslides in 2017 on the LSMs generated by the LR and RF models
(Fig. 8), an observation can be made that most of the new landslides fell
into the high- or very-high-susceptibility regions of LSMs generated by
both LR and RF models, showing a certain prediction ability. In January
2017, Zhoulaoshiliangzi suffered a medium-sized landslide, with 185
people affected, 180 buildings damaged, and properties worth 80
million yuan compromised. In the same month, a medium-sized land
slide occurred in Houpoli, with 10 people affected and properties worth
3 million yuan damaged. Comparative analyses indicated that the
landslides in both areas were located in very-low-susceptibility regions
of LSM generated by the LR model but fell within the high- or very-high-
susceptibility regions of LSM generated by the RF model. For this reason,
the prediction ability of the RF model was exhibited to be significantly
better than that of the LR model.
5. Discussion
8
D. Sun et al. Engineering Geology 281 (2021) 105972
The “Mean Decrease Gini” in the RF model also indicated that the occurrence of landslides (Guo et al. 2015). By comprehensively
distance from faults had little effect on the occurrence of landslides in exploring the mechanism of landslides, we determined that the land
the study region (Figs. 9 and 10). As seen in Fig. 11b, only a few small slides in the study area were triggered by rainfall, while earthquakes
faults exist in the study area. It is necessary to note that some scholars triggered most of the landslides in the areas considered in other studies.
arrived at a contrasting conclusion: faults were essential for the Consequently, the present study maintained that there was no strong
9
D. Sun et al. Engineering Geology 281 (2021) 105972
1.00, while the corresponding AUC values of the test dataset are 0.85
and 0.95, respectively. Additionally, for all the datasets summarized in
Table 6, the overall accuracy and precision of RF before the optimization
are 0.866 and 0.962, respectively, and the overall accuracy and preci
sion of the RF model after optimization are 0.987 and 0.990, respec
tively. The results highlight that the RF model indexes increased after
the hyperparameter optimization, in both the test dataset and the whole
data set. For this reason, hyperparameter optimization can optimize the
performance of the model to a certain extent, thereby increasing its
accuracy.
10
D. Sun et al. Engineering Geology 281 (2021) 105972
Fig. 11. Thematic layers of distance from buildings and distance from faults.
Table 6
Confusion matrix of the random forest (RF) models before and after hyper
parameter optimization.
RF (before) Actual value
Landslide Non-landslide
(1) (0)
6. Conclusion Fig. 12. Receiver operating characteristic curves of the random forest models
(a) before and (b) after hyperparameter optimization.
In this study, the optimized LR and RF landslide susceptibility
models were proposed through hyperparameter optimization. A com both models’ hyperparameter optimization premised on the
parison of these two models was conducted predicated on research on a Bayesian algorithm have delivered considerable impact on the
typical landslide-prone area, Fengjie County, China. The following accuracy of the models; so hyperparameter optimization is very
conclusions were drawn: important for models of LSM.
(2) Although both models exhibit reasonable performances, the
(1) Based on the Bayesian algorithm, the AUC value of the test optimized RF model premised on hyperparameter optimization
dataset in LR model is improved by 4%, while the AUC value of has a better stability and predictive capability in case area.
the test dataset in RF model is improved by 10%, indicating that
11
D. Sun et al. Engineering Geology 281 (2021) 105972
(3) Three major conditioning factors, i.e., annual average rainfall, Heckmann, T., Gegg, K., Gegg, A., et al., 2014. Sample size matters: investigating the
effect of sample size on a logistic regression susceptibility model for debris flows.
elevation, and distance from buildings, played an important role
Nat. Hazards Earth Syst. Sci. 14 (2), 259–278. https://doi.org/10.5194/nhess-14-
in landslide occurrence, whereas distance from faults was 259-2014.
impractical for explaining the spatial distribution of landslides in Hong, H., Pourghasemi, H.R., Pourtaghi, Z.S., 2016. Landslide susceptibility assessment
this study region. in Lianhua County (China): a comparison between a random forest data mining
technique and bivariate and multivariate statistical models. Geomorphology 259,
105–118. https://doi.org/10.1016/j.geomorph.2016.02.012.
These findings make up for the crucial step in LSM (hyperparameter Huang, Y., Zhao, L., 2018. Review on landslide susceptibility mapping using support
optimization) through the Bayesian algorithm, and provide a compari vector machines. Catena 165, 520–529. https://doi.org/10.1016/j.
catena.2018.03.003.
son case between LR and RF models after comprehensive consideration Hussin, H.Y., Zumpano, V., Reichenbach, P., et al., 2016. Different landslide sampling
of hyperparameter optimization, so as to increase the convincing power strategies in a grid-based bi-variate statistical susceptibility model. Geomorphology
of the comparison of these models and provide a knowledge base for 253, 508–523. https://doi.org/10.1016/j.geomorph.2015.10.030.
Kang, L., Chen, R.-S., Xiong, N., et al., 2019. Selecting Hyper-Parameters of Gaussian
model comparison: comparison premised on hyperparameter Process Regression based on Non-Inertial Particle Swarm Optimization in Internet of
optimization. Things. IEEE Access 7, 59504–59513. https://doi.org/10.1109/
access.2019.2913757.
Lee, J.-H., Sameen, M.I., Pradhan, B., et al., 2018. Modeling landslide susceptibility in
Funding data-scarce environments using optimized data mining and statistical methods.
Geomorphology 303, 284–298. https://doi.org/10.1016/j.geomorph.2017.12.007.
This research was funded by the National Key Research and Devel Li, C., Fu, Z., Wang, Y., et al., 2019. Susceptibility of reservoir-induced landslides and
strategies for increasing the slope stability in the Three Gorges Reservoir Area: Zigui
opment Program of China (No. 2018 YFC 1505501), the Natural Science
Basin as an example. Eng. Geol. 261 https://doi.org/10.1016/j.
Foundation of Chongqing (Grant No. cstc2020jcyj-msxmX0841), and enggeo.2019.105279.
Humanities and Social Sciences Foundation of the Ministry of Education Lombardo, L., Mai, P.M., 2018. Presenting logistic regression-based landslide
of China (Grant No. 20XJAZH002). susceptibility results. Eng. Geol. 244, 14–24. https://doi.org/10.1016/j.
enggeo.2018.07.019.
Merghadi, A., Yunus, A.P., Dou, J., et al., 2020. Machine learning methods for landslide
Declaration of Competing Interest susceptibility studies: a comparative overview of algorithm performance. Earth Sci.
Rev. 207 https://doi.org/10.1016/j.earscirev.2020.103225.
Pradhan, B., Lee, S., 2010. Landslide susceptibility assessment and factor effect analysis:
The authors declare no conflict of interest. backpropagation artificial neural networks and their comparison with frequency
ratio and bivariate logistic regression modelling. Environ. Model Softw. 25 (6),
Acknowledgments 747–759. https://doi.org/10.1016/j.envsoft.2009.10.016.
Reichenbach, P., Rossi, M., Malamud, B.D., et al., 2018. A review of statistically-based
landslide susceptibility models. Earth Sci. Rev. 180, 60–91. https://doi.org/
We want to express our gratitude to Chongqing Meteorological 10.1016/j.earscirev.2018.03.001.
Administration for providing essential meteorological data and also to Sahin, E.K., Colkesen, I., Kavzoglu, T., 2018. A comparative assessment of canonical
correlation forest, random forest, rotation forest and logistic regression methods for
Chongqing Institute of Geology and Mineral Resources for providing landslide susceptibility mapping. Geocarto Int. 35 (4), 341–363. https://doi.org/
valuable research data on historical landslides. We are also grateful to 10.1080/10106049.2018.1516248.
the editors and anonymous reviewers for their valuable comments on Sun, D., Wen, H., Wang, D., et al., 2020a. A random forest model of landslide
susceptibility mapping based on hyperparameter optimization using Bayes
this manuscript.
algorithm. Geomorphology 362. https://doi.org/10.1016/j.geomorph.2020.107201.
Sun, D., Wen, H., Zhang, Y., et al., 2020b. An optimal sample selection-based logistic
Appendix A. Supplementary data regression model of slope physical resistance against rainfall-induced landslide.
Natural Hazards. https://doi.org/10.1007/s11069-020-04353-6.
Trigila, A., Iadanza, C., Esposito, C., et al., 2015. Comparison of Logistic Regression and
Supplementary data to this article can be found online at https://doi. Random Forests techniques for shallow landslide susceptibility assessment in
org/10.1016/j.enggeo.2020.105972. Giampilieri (NE Sicily, Italy). Geomorphology 249, 119–136. https://doi.org/
10.1016/j.geomorph.2015.06.001.
Tsangaratos, P., Ilia, I., Hong, H., et al., 2016. Applying Information Theory and GIS-
References based quantitative methods to produce landslide susceptibility maps in Nancheng
County, China. Landslides 14 (3), 1091–1111. https://doi.org/10.1007/s10346-016-
Bourenane, H., Bouhadad, Y., Guettouche, M.S., et al., 2014. GIS-based landslide 0769-4.
susceptibility zonation using bivariate statistical and expert approaches in the city of Wang, X., Huang, F., Cheng, Y., 2014. Super-parameter selection for Gaussian-Kernel
Constantine (Northeast Algeria). Bull. Eng. Geol. Environ. 74 (2), 337–355. https:// SVM based on outlier-resisting. Measurement 58, 147–153. https://doi.org/
doi.org/10.1007/s10064-014-0616-6. 10.1016/j.measurement.2014.08.019.
Breiman, L., 2001. Random Forests. Mach. Learn. 45 (1), 5–32. https://doi.org/10.1023/ Wang, Y., Sun, D.L., Wen, H.J., et al., 2020a. Comparison of Random Forest Model and
A:1010933404324. Frequency Ratio Model for Landslide Susceptibility Mapping (LSM) in Yunyang
Chen, W., Zhang, S., Li, R., et al., 2018. Performance evaluation of the GIS-based data County (Chongqing, China). Int. J. Environ. Res. Public Health 17, 4206. https://doi.
mining techniques of best-first decision tree, random forest, and naive Bayes tree for org/10.3390/ijerph17124206.
landslide susceptibility modeling. Sci. Total Environ. 644, 1006–1018. https://doi. Wang, Y.M., Feng, L.W., Li, S.J., et al., 2020b. A hybrid model considering spatial
org/10.1016/j.scitotenv.2018.06.389. heterogeneity for landslide susceptibility mapping in Zhejiang Province, China.
Garrido-Merchán, E.C., Hernández-Lobato, D., 2020. Dealing with categorical and Catena 188. https://doi.org/10.1016/j.catena.2019.104425.
integer-valued variables in Bayesian Optimization with Gaussian processes. Xie, W., Chen, W., Shen, L., et al., 2021. Surrogate network-based sparseness hyper-
Neurocomputing 380, 20–35. https://doi.org/10.1016/j.neucom.2019.11.004. parameter optimization for deep expression recognition. Pattern Recogn. 111.
Goetz, J.N., Brenning, A., Petschko, H., et al., 2015. Evaluating machine learning and https://doi.org/10.1016/j.patcog.2020.107701.
statistical prediction techniques for landslide susceptibility modeling. Comput. Zhao, Y., Wang, R., Jiang, Y., et al., 2019. GIS-based logistic regression for rainfall-
Geosci. 81, 1–11. https://doi.org/10.1016/j.cageo.2015.04.007. induced landslide susceptibility mapping under different grid sizes in Yueqing,
Guo, C., Montgomery, D.R., Zhang, Y., et al., 2015. Quantitative assessment of landslide Southeastern China. Eng. Geol. 259 https://doi.org/10.1016/j.enggeo.2019.105147.
susceptibility along the Xianshuihe fault zone, Tibetan Plateau, China.
Geomorphology 248, 93–110. https://doi.org/10.1016/j.geomorph.2015.07.012.
12