Professional Documents
Culture Documents
https://doi.org/10.1007/s00366-017-0545-7
ORIGINAL ARTICLE
Received: 3 February 2017 / Accepted: 10 November 2017 / Published online: 16 November 2017
© Springer-Verlag London Ltd., part of Springer Nature 2017
Abstract
The application of models provided by artificial neural network (ANN) in predicting bearing capacity of driven pile is under-
lined in several investigations. However, weakness of ANN in slow rate of convergence as well as finding reliable testing
output is known to be the major drawbacks of implementing ANN-based techniques. The present study aims to introduce
and evaluate an optimized ANN with imperialism competitive algorithm (ICA) model based to estimate bearing capacity
of driven pile in cohesionless soil. The training data for optimizing the ICA-ANN structure are based on the in situ study.
To develop the ICA-ANN model, the input parameters are internal friction angle of soil located in shaft (φ shaft), and tip (φ
tip), pile length (L), effective vertical stress at pile toe (σv), and pile area (A) while the output is the total driven pile bear-
ing capacity in cohesionless soil. The predicted results are compared with a pre-developed ANN model to demonstrate the
ability of the hybrid model. As a result, coefficient of determination (R2) values of (0.885 and 0.894) and (0.964 and 0.974)
was obtained for testing and training datasets of ANN and ICA-ANN models, respectively. In addition, values of variance
account for (VAF) of (88.212 for training and 89.215 for testing) and (96.369 for training and 97.369 for testing, respectively)
were obtained for ANN and ICA-ANN models, respectively. The obtained results declare high reliability of the developed
ICA-ANN model. This model can be introduced as a new model in field of deep foundation engineering.
Keywords Driven pile · ICA-ANN · ANN · Cohesionless soil · Bearing capacity · Optimization.
13
Vol.:(0123456789)
348 Engineering with Computers (2018) 34:347–356
density, average effective vertical stress, deformability of the resistances of concrete piles. In this study, an ANN-based
soil beneath the pile toe, and the well-known bearing capac- predictive model for estimating axial bearing capacity of
ity factor of the piles (Nq and β). A database comprising 28 bored piles and its distribution is developed. To construct
axially compressive pile load tests were collected. Compari- the network, 36 PDA (pile driving analyzer) experiments
son of measured and predicted compressive capacity of an were undertaken on various concrete piles. The data are
independent database indicated that the proposed formula is collected from different project sites. The PDA results, pile
reliable and accurate. Yang et al. [22] emphasized the behav- geometrical characteristics as well as soil investigation data
ior of jacked and driven piles in sandy soil. This paper illus- were used for training the ANN models. Findings indicate
trates a comprehensive is situ investigation that was aimed the feasibility of ANN in predicting ultimate, shaft and tip
to study the similarities and differences between the behav- bearing resistances of piles. The coefficients of determina-
ior of driven H-piles and jacked H-piles. The instrumented tion, R2, equal to 0.941, 0.936, and 0.951 for testing data
piles varied in pile length from 32 to 55 m and having a illustrate that the shaft, tip and ultimate bearing capacities of
pile design bearing capacity of up to 3540 kN. A correla- piles predicted by proposed ANN-based model are in good
tion was observed between the mean standard penetration agreement with those of in situ pile. In addition, after sensi-
test (N) value and the ultimate shaft friction, which sug- tivity analysis, it was observed that the area and length of the
gests that the shaft friction capacity can be taken as 1.5 (N) piles are dominant factors in the developed predictive model.
over bar to 2 (N) over bar (kPa) for both jacked and driven The main objectives of the present study are to find a new
H-piles. Lee et al. [23] investigated on the combined load reliable mathematical equation to estimate the total bearing
(i.e., vertical and lateral load) response of driven pile mod- capacity of the driven pile installed in cohesionless soil. It
els in sand experimentally. A series of lateral load experi- is also aimed to assess the influence of important param-
ments were conducted on the pile simultaneously subjected eters such as internal friction angle of soil located in shaft
to vertical loads. The results of combined load test indicated (φ shaft), and tip (φ tip), pile length (L), effective vertical
that the presence of a vertical compression load (driven pile stress at pile toe (σv), and pile area (A) on predicted output.
installed in sand) is detrimental to its lateral capacity. As a
result, the bending moments, in the presence of axial loads,
at the pile head were substantially increased (by 10, 36 and 2 Methods
39% for loose, medium dense and dense sand, respectively).
Samui [24] adopted an MARS (multivariate adaptive regres- 2.1 Artificial neural network (ANN)
sion spline) approach to determine the ultimate capacity of
driven piles installed in sands. MARS approach used differ- The ANN is a tool to model the complex systems in approxi-
ent parameters consisting of driven pile area (A), angle of mation problems such as medicine, finance and engineer-
shear resistance of the soil surrounding the shaft (φ shaft) ing. ANN is a data-processing analysis system making a
and soil at the tip of the derived pile (φ tip), pile length (L), simulation of the structure and functions of human brain. It
and effective vertical stress at the tip of the pile (σv) as input is an extremely interconnected multilayer structure includ-
variables, where the output of MARS is taken to be pile ing a large number of neurons. This network is enable to
ultimate bearing capacity. The results of MARS are com- recognize similarities, especially when they are presented
pared with other developed ANN based such as the GRNN with new input terms after properly predicting the proposed
model (Generalized Regression Neural Network model). output pattern. The ANN is generally applicable as an alter-
Finally, an equation was provided based on the proposed native for some complex statistical analysis techniques such
MARS. Dzagov and Razvodovskii [25] investigated on the as autocorrelation, trigonometric, multivariable regression,
driven pile bearing capacity in clays. With respect to the sur- and linear regression. It is a well-established network that
rounding soil properties and pile length, driven piles could can be defined using three basic components known as: (1)
provide the basic characteristics to be known as both end- transfer function, (2) network architecture, and (3) learn-
bearing and/or friction piles. Momeni et al. [3] developed a ing law [27]. These components are considered to select
new hybrid ANN-GA (genetic algorithm) based on predict- the most appropriate model for a given problem(s). Up to
ing pile bearing capacity. The hammer weight, geometrical now, numerous algorithms have been suggested to train
properties of pile, pile set, and drop height were taken to the neural networks, among which the feedforward neural
be the network inputs and, on the other hand, the ultimate networks (FFNN) and back-propagation (BP) algorithm are
bearing capacity of the pile was considered to be the output known as the most reliable and accurate technique [28, 29].
of the proposed GA-based ANN model. Results indicate that For instance, BP can solve predictive complex geotechnical
the predicted pile bearing capacities by proposed model are problems; it makes back-propagation so popular among all
in excellent agreement with measured values in the field. existing algorithms for training ANN. The FFNN are the
Momeni et al. [26] applied ANN for predicting tip and shaft most common neural networks consisting of multiple hidden
13
Engineering with Computers (2018) 34:347–356 349
layers containing weight matrices, bias vectors and nonlinear a result of looping this competition, stronger empires expand
transfer functions. Using such a network, it is possible to their power by taking possession of weak colonies located in
find nonlinear complicated relations between inputs and out- weaker empires. This process is continuously repeated until
put data sets via a training procedure. The neural networks the process stopped after being satisfied by a pre-defined
extracted relations are not exact and there is always an error stopping criterion. A detailed description of the designed
between the networks estimated data and the real data. The steps in ICA algorithm alone is widely available in the litera-
components of weight and bias are tunable constants which ture. The readers are recommended to see Armaghani et al.
should be tuned to minimize the network error. The process [31], Ghorbani and Jokar [32] and Al Dossary and Nasrabadi
of tuning of these constants is called training of network. [33] for more detail of ICA.
The act of training is similar to an optimization process.
Various mathematical approaches are used to train the neural 2.3 Combination of ICA‑ANN
networks. Most of these approaches are basically analytical
such as Levenberg Marquardt (LM), Bayesian regularization Many attempts have been conducted to improve the perfor-
(BR), and BFGS quasi-Newton (BFG). LM approach is used mance of ANNs through the use of optimization algorithms
for batch training of the networks in this paper. (OAs) like ICA, particle swarm optimization and genetic
algorithm in engineering problems (e.g.[11, 28, 34–37]).
2.2 Imperialist competitive algorithm (ICA) Since BP is a local search learning algorithm, the optimum
search process of ANN may fail and return unsatisfied solu-
Imperialist competitive algorithm (ICA) is firstly proposed tion [38]. OAs can be utilized to adjust the bias and weight
by Atashpaz-Gargari and Lucas [30] to be used in optimiza- of the ANN to improve its performance level. Regarding
tion problems. It is a global search population-based sys- the local minimum in ANN system, there is normally more
tem that its process is similar to many other evolutionary probability of convergence, while OAs are able to discover
algorithms. ICA gets started with an initial population (or a global minimum. So, hybrid systems like ICA-ANN enjoy
candidate solutions), that with the ICA consists of countries. search properties of all ANN and ICA techniques. In search
These countries are then divided into two categories: impe- space, ICA searches for global minimum, and then ANN
rialists (i.e., some of the best countries) and colonies (i.e., employs it for finding the best results of the system.
the remaining countries) (see Fig. 1). To generate empires
the colonies are distributed among the imperialists, as deter-
mined by a pre-defined criterion, according to their rela-
tive strength. The empires then compete with each other to
expand their power and control more colonies. Therefore, as
13
350 Engineering with Computers (2018) 34:347–356
3 Established database on this equation and Ni = 5, it seems that a range of 1–11 can
be solved bearing capacity problem. A series of ANN mod-
In previous section, structure of the ICA-ANN is introduced els were analyzed and their results were evaluated accord-
and discussed. The data used for development of the predic- ing to RMSE values as presented in Table 2. Each model
tive models in this study are collected from Mohanty et al. was iterated five times. RMSE was selected as one of the
[39] comprising an extensive in-situ driven pile load tests most popular performance indices to evaluate the predictive
from the installed location. To train predictive models, a models. Average RMSE values presented in the last column
database including 47 datasets is used for training and 12 of Table 2 show that iterations with 9 hidden nodes (0.075
in situ experiments are used for testing. All 59 used data- and 0.066 for training and testing datasets, respectively)
sets in this study are tabulated in Table 1. In this table, the are the best among all constructed models. Therefore, the
in situ test results together with their effective parameters architecture of 5 × 9 × 1 was selected as the optimum ANN
are presented. In the predictive models, angle of shear resist- architecture to predict ultimate bearing capacity of driven
ance of soil at the shaft (φ shaft) and at the tip (φ tip) of pile. Evaluation of the iterations 1–5 of model no. 9 is dis-
the pile, effective overburden pressure (σ′v) at the tip of the cussed later.
pile, length of pile (L) and cross-sectional area of pile (A)
were utilized as model inputs. In the following section, the 4.2 ICA‑ANN
process of ANN and ICA-ANN models will be presented.
In modeling of ICA-ANN, the most important factors on
ICA should be investigated and subsequently designed.
4 Prediction of ultimate bearing capacity The most important factors on ICA are Ncountry, Nimp and
Ndecade. Various values of Ncountry have been utilized to
In the present section, two predictive approaches, namely approximate problems of geotechnical engineering. The
pre-developed ANN and hybrid ICA-ANN were designed values of 40, 56, and 135 were recommended for Ncountry
to estimate ultimate bearing capacity of driven pile. Many by Ahmadi et al. [44], Marto et al. [45], and Hajihassani
models were constructed using various values of the effec- et al. [46], respectively. Based on the results of the men-
tive parameters on ANN and ICA-ANN. In the follow- tioned studies, it seems that a parametric study is needed to
ing subsections, procedure of their modeling is explained obtain the proper Ncountry. Therefore, a series of ICA-ANN
clearly. analyses were conducted using various Ncountr ranging from
25 to 500. In these models, Ndecade equal to 200 and Nimp
4.1 ANN equal to 5 were utilized. The obtained results showed that
Ncountry = 300 can provide higher performance capacity of
At the beginning of ANN modeling, as mentioned by Liou the ICA-ANN models in comparison with the other Ncountry.
et al. [38], the developed datasets should be normalized to Therefore, value of 300 was selected as optimum Ncountry in
simplify the design procedure using the following equation: modeling of ICA-ANN.
In the next stage of ICA-ANN, there is a need to obtain
(1)
( ) ( )
Xnorm = X − Xmin ∕ Xmax − Xmin the optimum Nimp considering another sensitivity analysis.
where X and Xnorm are the measured and normalized values, In this regard, a range of 5–65 for Nimp was used to deter-
respectively. Xmax and Xmin are the maximum and minimum mine the best Nimp in modeling of ultimate bearing capac-
values of the X. ity of driven pile. Based on the obtained results, Nimp = 15
Then, for developing and evaluating the model, all shows better network performance compared to other num-
datasets should be divided into training and testing parts, ber of imperialists. Therefore, this value was considered as
respectively. A range of (20–30%) of whole datasets were optimum Nimp. Determining Ndecade is considered as the next
recommended for testing datasets in the investigation con- stage of ICA-ANN modeling procedure. Another parametric
ducted by Nelson and Illingworth [40]. So, in this study, study was conducted to investigate the effect of the Ndecade
20% of whole datasets (59 datasets) were considered as on performance of the network. To achieve this aim, the
testing datasets. Many investigations reported the success- Ndecade was set to be 500 in this study. Figure 2 displays
ful utilization of LM training algorithm [41, 42]. Because network’ results of using different number of Ndecade in esti-
of that, in this study, the mentioned algorithm was utilized mating ultimate bearing capacity of driven pile. As it can
to design ANN. Additionally, it is well established that an be seen, the changes in the network performance (RMSE)
ANN with one hidden layer can approximate any continuous after Ndecade = 350 are not significant and remarkable. As
function. For determining the No. of hidden node, Hornik mentioned earlier, RMSE as one of the most popular perfor-
et al. [43] stated that the maximum number of hidden node mance indices to evaluate predictive models was chosen in
is ≤ 2 × Ni + 1, where Ni is the number of input layers. Based determining Ndecade in this study. Hence, the optimum Ndecade
13
Engineering with Computers (2018) 34:347–356 351
Table 1 Training database used in this study, the data collected by [39]
Test number φ shaft φ tip σ′v (kN/m2) L (m) A (m2) Qm (total) (kN)
13
352 Engineering with Computers (2018) 34:347–356
Table 1 (continued)
Test number φ shaft φ tip σ′v (kN/m2) L (m) A (m2) Qm (total) (kN)
1 0/086 0/090 0/077 0/062 0/080 0/059 0/099 0/061 0/085 0/090 0/085 0/072
2 0/088 0/085 0/074 0/066 0/095 0/051 0/091 0/060 0/082 0/090 0/086 0/070
3 0/085 0/088 0/073 0/065 0/086 0/056 0/084 0/069 0/078 0/084 0/081 0/072
4 0/081 0/078 0/078 0/060 0/080 0/058 0/080 0/068 0/080 0/088 0/080 0/070
5 0/079 0/074 0/079 0/066 0/082 0/05 0/082 0/07 0/076 0/085 0/080 0/069
6 0/078 0/071 0/080 0/067 0/085 0/052 0/076 0/062 0/077 0/087 0/079 0/068
7 0/077 0/072 0/076 0/068 0/084 0/053 0/080 0/061 0/072 0/086 0/078 0/068
8 0/076 0/075 0/075 0/066 0/081 0/051 0/081 0/059 0/073 0/090 0/077 0/068
9 0/075 0/070 0/070 0/067 0/082 0/046 0/078 0/057 0/070 0/092 0/075 0/066
10 0/077 0/078 0/081 0/070 0/084 0/051 0/080 0/060 0/072 0/099 0/079 0/072
11 0/076 0/071 0/072 0/069 0/083 0/049 0/080 0/059 0/073 0/094 0/077 0/068
13
Engineering with Computers (2018) 34:347–356 353
Table 3 R2, VAF, and RMSE Method Stage Model R2 RMSE VAF Rating for R2 Rat- Rating Rank value
results of the developed models ing for for VAF
together with ranking values RMSE
TR training, TS testing
13
354 Engineering with Computers (2018) 34:347–356
Table 4 Obtained values of Method Model Total rank number 5 (with R2, RMSE and VAF values of 0.964, 0.039
total rank for the proposed and 96.369 for training and 0.974, 0.051 and 97.369 for test-
techniques ANN 1 21 ing, respectively) show the highest total rank values among
2 22 their groups (22 and 24, respectively). More details about the
3 16 simple ranking method can be found in the study conducted
4 20 by Zorlu et al. [47].
5 15 The relationships between the best models of ICA-ANN
ICA-ANN 1 19 and ANN in predicting ultimate bearing capacity of driven
2 20 pile and the measured ones for testing and training datasets
3 21 are displayed in Figs. 3 and 4, respectively. Results of the
4 13 developed ANN model based on R2 values were obtained
5 24 as 0.885 and 0.894 for training and testing datasets, respec-
tively, whereas values of 0.964 and 0.974 were achieved
for R2 of the ICA-ANN model, respectively. The obtained
and 3 were assigned for their ranks, respectively. Addition- results revealed that the proposed hybrid ICA-ANN model
ally, this process was repeated/applied for other performance can be introduced as a more accurate in predicting ultimate
indices and also for testing datasets. Afterwards, for each bearing capacity of driven pile. The results showed that by
predictive model, the ratings of the performance indices for developing an ICA-ANN model, performance prediction
both training and testing datasets were summed up (see total based on R2 can be improved from about 0.89 to about 0.97.
rank column in Table 4). According to the total rank results, Therefore, the developed ICA-ANN model can be used/
ANN dataset number 2 (with R2, RMSE and VAF values introduced as a new model with high accuracy level in esti-
of 0.885, 0.070 and 88.212 for training and 0.894, 0.067 mating ultimate bearing capacity of driven pile.
and 89.215 for testing, respectively) and ICA-ANN dataset
Fig. 3 Relationship between measured and predicted ultimate bearing capacity developing ANN model
Fig. 4 Relationship between measured and predicted ultimate bearing capacity developing ICA-ANN model
13
Engineering with Computers (2018) 34:347–356 355
13
356 Engineering with Computers (2018) 34:347–356
29. Armaghani DJ, Hajihassani M, Sohaei H et al (2015) Neuro- 38. Liou S-W, Wang C-M, Huang Y-F (2009) Integrative discovery
fuzzy technique to predict air-overpressure induced by blast- of multifaceted sequence patterns by frame-relayed search and
ing. Arab J Geosci 8:10937–10950. https://doi.org/10.1007/ hybrid PSO-ANN. J UCS 15:742–764
s12517-015-1984-3 39. Mohanty R, Suman S, Das SK (2016) Prediction of vertical pile
30. Atashpaz-Gargari E, Lucas C (2007) Imperialist competitive capacity of driven pile in cohesionless soil using artificial intel-
algorithm: an algorithm for optimization inspired by imperialis- ligence techniques. Int J Geotech Eng 1–8
tic competition. In: Evol. Comput. 2007. CEC 2007. IEEE Congr, 40. Nelson MM, Illingworth WT (1991) A practical guide to neural
IEEE, pp 4661–4667 nets. Addison-Wesley, Reading
31. Jahed Armaghani D, Hasanipanah M, Tonnizam Mohamad E 41. Monjezi M, Khoshalan HA, Varjani AY (2012) Prediction of fly-
(2016) A combination of the ICA-ANN model to predict air- rock and backbreak in open pit blasting operation: a neuro-genetic
overpressure resulting from blasting. Eng Comput. https://doi. approach. Arab J Geosci 5:441–448
org/10.1007/s00366-015-0408-z 42. Ornek M, Laman M, Demir A, Yildiz A (2012) Prediction of
32. Ghorbani A, Jokar MRA (2016) A hybrid imperialist compet- bearing capacity of circular footings on soft clay stabilized with
itive-simulated annealing algorithm for a multisource multi- granular soil. Soils Found 52:69–80
product location-routing-inventory problem. Comput Ind Eng 43. Hornik K, Stinchcombe M, White H (1989) Multilayer feed-
101:116–127 forward networks are universal approximators. Neural Netw
33. Al Dossary MA, Nasrabadi H (2016) Well placement optimi- 2:359–366
zation using imperialist competitive algorithm. J Pet Sci Eng 44. Ahmadi MA, Ebadi M, Shokrollahi A, Majidi SMJ (2013) Evolv-
147:237–248 ing artificial neural network and imperialist competitive algorithm
34. Jahed Armaghani D, Mohd Amin MF, Yagiz S et al (2016) Predic- for prediction oil flow rate of the reservoir. Appl Soft Comput
tion of the uniaxial compressive strength of sandstone using vari- 13:1085–1098
ous modeling techniques. Int J Rock Mech Min Sci 85:174–186. 45. Marto A, Hajihassani M, Jahed Armaghani D et al (2014) A novel
https://doi.org/10.1016/j.ijrmms.2016.03.018 approach for blast-induced flyrock prediction based on imperialist
35. Jahed Armaghani D, Hajihassani M, Marto A et al (2015) Pre- competitive algorithm and artificial neural network. Sci World J.
diction of blast-induced air overpressure: a hybrid AI-based pre- https://doi.org/10.1155/2014/643715
dictive model. Environ Monit Assess. https://doi.org/10.1007/ 46. Hajihassani M, Jahed Armaghani D, Marto A, Tonnizam
s10661-015-4895-6 Mohamad E (2014) Ground vibration prediction in quarry blast-
36. Mohamad ET, Faradonbeh RS, Armaghani DJ et al (2016) An ing through an artificial neural network optimized by imperial-
optimized ANN model based on genetic algorithm for predicting ist competitive algorithm. Bull Eng Geol Environ 74:873–886.
ripping production. Neural Comput Appl 1–14 https://doi.org/10.1007/s10064-014-0657-x
37. Jahed Armaghani D, Hasanipanah M, Mahdiyar A et al (2016) Air- 47. Zorlu K, Gokceoglu C, Ocakoglu F et al (2008) Prediction of uni-
blast prediction through a hybrid genetic algorithm-ANN model. axial compressive strength of sandstones using petrography-based
Neural Comput Appl. https://doi.org/10.1007/s00521-016-2598-8 models. Eng Geol 96:141–158
13