You are on page 1of 10

Soils and Foundations 2014;54(2):233–242

The Japanese Geotechnical Society

Soils and Foundations

www.sciencedirect.com
journal homepage: www.elsevier.com/locate/sandf

Predicting pile dynamic capacity via application


of an evolutionary algorithm
I. Alkrooshn, H. Nikraz
Department of Civil Engineering, Curtin University, Perth, Western Australia, Australia
Received 21 May 2013; received in revised form 11 October 2013; accepted 14 November 2013
Available online 3 April 2014

Abstract

This study presents the development of a new model obtained from the correlation of dynamic input and SPT data with pile capacity. An
evolutionary algorithm, gene expression programming (GEP), was used for modelling the correlation. The data used for model development
comprised 24 cases obtained from existing literature. The modelling was carried out by dividing the data into two sets: a training set for model
calibration and a validation set for verifying the generalization capability of the model. The performance of the model was evaluated by
comparing its predictions of pile capacity with experimental data and with predictions of pile capacity by two commonly used traditional methods
and the artificial neural networks (ANNs) model. It was found that the model performs well with a coefficient of determination, mean, standard
deviation and probability density at 50% equivalent to 0.94, 1.08, 0.14, and 1.05, respectively, for the training set, and 0.96, 0.95, 0.13, and 0.93,
respectively, for the validation set. The low values of the calculated mean squared error and mean absolute error indicated that the model is
accurate in predicting pile capacity. The results of comparison also showed that the model predicted pile capacity more accurately than traditional
methods including the ANNs model.
& 2014 The Japanese Geotechnical Society. Production and hosting by Elsevier B.V. All rights reserved.

Keywords: Pile; Dynamic capacity; Gene expression programming; Training; Validation

1. Introduction as the main purpose of driving formulae is using the driving


record of the pile to establish the safe working load for a pile, or
Although it is common in design practice to predict pile to determine the driving requirements for a required working load
capacity by static analysis, a pile driving formula or a dynamic (Ng et al., 2004).
formula is perhaps the most frequently used method for evaluat- Numerous researchers have proposed different procedures
ing the capacity of driven piles, as described by Poulos and Davis for evaluating pile capacity based on dynamic input. However,
(1980). Evaluation of pile dynamic capacity is considered useful there are two approaches most commonly used dynamic
formulae and wave equations. Despite the frequent and wide-
spread use of these methods, their reliability is still question-
n
Corresponding author. able. Dynamic formulae have been investigated by researchers
Peer review under responsibility of The Japanese Geotechnical Society.
(e.g. Flaate, 1964; Housel, 1966), who concluded that pile
capacities determined from dynamic formulae correlate poorly
with static load test results and have a wider scatter when
statistically compared. The Manual for Design and Construc-
tion of Driven Pile Foundations by Hannigan et al. (1996)

http://dx.doi.org/10.1016/j.sandf.2014.02.013
0038-0806/& 2014 The Japanese Geotechnical Society. Production and hosting by Elsevier B.V. All rights reserved.
234 I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242

clearly characterizes unsatisfactory prediction in pile capacity engineering problems (e.g. Cevic and Cabalar, 2009; Alkroosh
by dynamic formulae. The wave equation analysis is also and Nikraz, 2011a, 2011b; Gandomi, 2011; Gandomi and
criticized by a number of researchers such as Svinkin and Alavi, 2012). The objectives of this paper are as follows:
Woods (1998), who explained that this method does not take
into account changes in soil properties after pile installation;  To investigate the feasibility of using GEP to correlate
thus the method apparently cannot predict reliable pile capacity dynamic input data and SPT results with pile capacity;
for various elapsed times after driving has ceased.  To evaluate the performance of the developed GEP model in
The limited success of dynamic methods in achieving training and validation sets by comparing its prediction of pile
accurate evaluation of pile capacity can be attributed to the capacity with experimental data and with predictions of pile
assumptions on which these methods are based along with an capacity by traditional methods along with the ANN model;
oversimplification of pile behaviour. Pile driving formulae  To conduct a parametric study to evaluate the influence of
assumes that the work done in forcing down the pile (i.e., the the input variables on the performance of the model.
product of the weight of the ram and the stroke) is equal to the
product of the ultimate soil resistance. The main shortcoming
of this assumption is that there is a difficulty in estimating the
actual energy transmitted by the ram to the pile through the
cap block, pile cap and cushion. Thus the energy losses in a 2. Overview of gene expression programming
real pile driving situation cannot be accounted for accurately
(Coduto, 1994). The wave equation assumes that static resistance GEP is an instance of an evolutionary algorithm from the
is a function of dynamic force and the velocity generated by field of evolutionary computation, invented by Ferreira (2001)
hammer blows and damping coefficient. This assumption presents as a global optimization algorithm. It has similarities to other
two difficulties: (1) the total resistance is time dependent and evolutionary algorithms such as genetic algorithms (GAs), as
different variations in the method produce different results; (2) the well as other evolutionary automatic programming techniques
dimensionless damping coefficient has a questionable correlation such as genetic programming (GP). Similar to GAs, GEP uses
to soil type and needs to be calibrated for the specific pile, soil the evolution of linear computer programs (individuals or
and site condition (Ng et al., 2004). chromosomes) of fixed length and likewise the GP the evolved
The complexity of pile behaviour and the presence of many programs are expressed in nonlinear forms of expression trees
involving factors have made it difficult to develop an accurate (ETs) of different sizes and shapes. However, GEP implements
model based on traditional modelling procedures. Artificial a different evolutionary computational method. The GEP
intelligence techniques may be a better alternative, due to the distinguishes itself from GAs in that the evolved solutions
capability of these techniques being able to deal with complex are expressed in the form of parse trees of different sizes and
and highly nonlinear functions, and employing the consider- structures and unlike GP, genetic variations are performed on
able capacity of computers to perform enormously iterated chromosomes before they are translated into ETs.
work. A number of researchers (e.g. Chan et al.,1995; Teh The GEP chromosomes can be composed of single or multiple
et al., 1997; Abu-Kiefa, 1998; Das and Basudhar, 2006; genes; each gene is encoded in a smaller sub-program. Every gene
Ardalan et al., 2009; Shahin, 2010; Ornek et al., 2012; has a constant length and includes a head that contains functions
Tarawneh, 2013) have successfully applied artificial neural (e.g. þ ,–) and terminals (e.g. d1, d2, which are the symbolic
networks (ANNs) for the modelling of pile behaviour. The representation of the input variables), and a tail composed of
modelling advantage of ANNs is their ability to capture the terminals only. The genetic code represents a one-to-one relation-
nonlinear and complex relationships between the targeted ship between the symbols of the chromosome, the functions or
output and the factors affecting it, without having to assume terminals. The process of information decoding from chromosomes
a priori formulae describing this relationship. However, the to ETs is called translation; this is based on sets of rules that
main shortcoming of ANNs is the complexity of their network determine the spatial organization of the functions and terminals in
structure, as they represent the knowledge in terms of weight the ETs and the type of interaction (link) between the sub-ETs
matrices together with biases that are not accessible to the user (Ferreira, 2002). The principal terms of the GEP are described in
(Rezania and Javadi, 2007). In this regard, the genetic the following subsections.
programming (GP) may present a better alternative. The main
advantage of the GP over ANNs is the ability to provide the
relationship between a set of inputs and the corresponding
outputs in a simple mathematical form considered accessible to 2.1. Initial population
the user (Rezania and Javadi, 2007). Recently, the GP has been
applied with success in solving engineering problems (e.g. In GEP, the search for a solution begins when a number of
Javadi et al., 2006; Rezania and Javadi, 2007; Alavi et al., computer programs (individuals or chromosomes), referred to
2011). In this paper, pile dynamic capacity has been correlated as the initial population, are randomly created from the set of
with SPT data and dynamic input using a developed version of functions and terminals defined by the user. Each program is
genetic programming; that is, gene expression programming expressed, evaluated and assigned ‘fitness’ according to how
(GEP). Recently, GEP has been applied successfully in solving well it performs with regard to achieving the desired objective.
I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242 235

2.2. Genes and expression trees Create chromosomes of initial population

Genes are sub-programs encoded in the chromosome with


the gene having a fixed length and composed of a head and a Express chromosomes & evaluate their fitness

tail. The length of the head is usually predefined by the user


Yes
during data setting, while the length of the tail is determined by Stopping criterion is satisfied Designate results

t ¼ hðn 1Þ þ 1 ð1Þ
No
End
where t is the tail length, h is the head length, and n is the number
of arguments of used function. A typical GEP gene is written as Select chromosomes & keep the
fittest for next generation
follows: +.sqrt.c1.+.×./.d0.d1.d2.d0.d1.d1.d2, where: “.” is the
separation mark between the symbols, sqrt is the square root
function, c1 is a constant, and d0, d1, and d2 are variables known Perform genetic modifications via genetic
as terminals. The blue symbols represent the gene head, while the operators and gene recombination
black bold symbols represent the tail. This written format is named
K-expression or Karva notation (Ferreira, 2002), which can be
New generation of chromosomes
converted into the ET as shown in Fig. 1. The tree is a spatial
illustration demonstrating the interactions among the gene's Fig. 2. Flow chart of gene expression algorithm.
components on the map of solution.

2.3. Mutation  the two chromosomes pair and trade in the selected parts; and
 two offspring belonging to the new population are obtained.
In GEP, mutation means randomly selecting any component
of the gene's head or tail and replacing it with any other
randomly selected component from the function or terminal
set. In the heads, any component can change into another 2.5. Modelling process
(function or terminal), whereas in the tails, terminals can only
change into terminals. The mutation may take place at one or As illustrated in Fig. 2, the process that the GEP implements
two points within the chromosome and there are no con- for developing the solution to the problem begins with creating an
straints, neither in the kind of mutation nor the number of initial population of computer programs chosen randomly from
mutations. In all cases, the newly created individuals are the sets of functions and terminals. The functions can contain
syntactically correct programs. basic mathematical operators (e.g. þ ,  ,  , /) or any other
user-defined functions, whereas the terminals may consist of
2.4. Recombination numerical constants, logical constants or variables. Each program
(chromosome) is executed and its fitness is evaluated through the
The last significant step during each cycle of program fitness function, which measures how well the chromosome
evolution includes the introduction of genetic variation by performs with regard to competition with the rest of population.
recombination. The variations take place when two chromo- Chromosomes are then selected for further development based on
somes are paired and split at exactly the same point in order to their fitness. The ones that have a higher fitness level are given a
exchange their components downward to the merging point. higher chance of being reselected, whereas the chromosomes with
The following steps explain how recombination is performed: less fitness are deleted or given a slim chance of reselection. The
selected programs are then exposed to further developmental
 two chromosomes are selected randomly from the population; operations, which are performed through genetic variations such
 one part of each chromosome is selected randomly; as mutation and recombination. New offspring of chromosomes
with new traits are generated and used to replace the existing
population. The chromosomes of the new generation are then
Root subjected to the same developmental process, which is repeated
+
until the stopping criteria are satisfied.
Sqrt c1

Functional nodes 3. GEP model development


+
The GEP model developed in this work is based on results
* / Terminal nodes from 24 case records collected from the literature and reported
upon by Lee and Lee (1995). Since the piles were only briefly
d0 d1 d2 d0 described by Lee and Lee (1995), the authors of this paper
recalled papers in which more information could be found. The
Fig. 1. A typical example of an expression tree. authors were successful in collecting data from 18 case records
236 I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242

found in Bozozuk et al. (1979), Thompson and Thompson not available. As a result, this reduces the rate of success in
(1979), and Likins et al. (1992). It was difficult to find detailed reaching a solution to the problem.
information from all of the case records. In conjunction with
the information reported by Lee and Lee (1995), the authors
have added additional information obtained from the original 3.1. Model input and output
sources of the data. The case records were collected from
different sites from regions in the United States. The piles were The authors adopted the input variables suggested by Lee
tested under axial compression with no further details available and Lee (1995). The most significant factors assumed to
on the testing procedures. The tests were performed on driven influence pile capacity were presented to the GEP model as
piles made of steel or concrete with different shapes (square, potential model inputs. These included the penetration ratio, l/d
pipe, and H-Piles), embedded into layered cohesionless soils. (pile embedment length by pile diameter); average standard
The piles had a penetration depth to diameter ratio ranging penetration test number (N-value) near the pile tip in a zone
from 16.32 to 129.92. extends between 1.5–2 times pile diameter above and below the
The modelling process was carried out using the com- toe, Nb; average standard penetration test number (N-value)
mercially available software package GeneXpro Tools 4.0 over shaft length, Ns; pile set, S (final penetration depth mm /
(Gepsoft, 2002). Using the commercial version of the GEP blow); and hammer energy, E. The interpreted failure load (pile
software is useful when solutions to complex problems, such capacity), Qu, is the single model output. The failure load was
as the one in this study, are required. This is due to the taken as the plunging load for the well-defined failure cases,
commercial version of the GEP giving the modeller more and Davisson's (1972) criterion was applied for the cases where
options along with the freedom to choose the setting para- failure load was not clearly defined. Fig. 3 presents the
meters of the model. For example, the modeller can choose the geotechnical profile and the definition of the failure load for
mutation rate needed to develop a robust solution to the a case record obtained in Thompson and Thompson (1979); for
studied problem, whereas in the free version such an option is better presentation, the figure was redrawn.

0
Fill
5

7.6 m
10
Depth (m)

Silty clay
till
15

18.4 m
Silt till 20
22.5 m
24.4 m
Shell 25
305 mm bedrock 0 20 40 60 80 100 120
Penetration resistance (blows/300 mm)

4.5
4
3.5
3
Load (MN)

2.5
2
1.5 Failure load = 2.7 MN
according to Davisson (1972)
1
0.5
0
0 10 20 30 40 50 60 70
Settlement (mm)

Fig. 3. Typical geotechnical profiles for a reported case history: (a) pile geometry and soil profile; (b) standard penetration test profile; and (c) load settlement plot.
I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242 237

3.2. Data division Table 1


GEP model input and output statistics.
Dividing the available data into subsets (a training set and a Data set Mean SDa Mina Maxa Range
validation set) is a necessary step in modelling with GEP.
Usually, GEP models involve a large number of programs, and l/d
as such, they have a high tendency towards over-fitting, Training 73 35 16.3 130 114
Validation 61 24 31.7 98 66
particularly, if the training data is noisy. Over-fitting refers
to the large error in predictions when new data is presented to Ns
the trained model (i.e. ability of the model to memorise rather Training 17 17 8.0 78 70
Validation 15 8 10.0 30 20
than generalise the form of the relationship between input and
output data). Generally, over-fitting is expected when data Nb
points in training sets are scanty (Das, 2013). Training 131 92 10.0 247 237
Validation 129 102 10.0 247 237
The main aim of data division is to prevent the model from
over-fitting which may take place during the training phase. S (mm/blow)
The training data are used for the adjustment of the model Training 4 5 0.1 19 19
Validation 5 6 0.2 17 17
parameters in order to reduce the error between the model
output and the corresponding targeted output. The validation E (kJ)
set is independent data not included in the training phase, and Training 67 54 27.0 211 184
Validation 86 60 33.2 187 154
it is used to test the generalisation ability of the model and to
verify its performance in the real world. Measured capacity, Qu (MPa)
In the literature, there is no definite ratio of the used data to Training 21 11 6.6 49.7 43.1
Validation 19 9 7.4 33 25.6
be assigned to each subset, but in general 10–20% of the
available data is suggested to be used as a validation set, and a
SD, standard deviation; Min, minimum; Max, maximum.
80–90% as a training set (Ferreira, 2002).
In order to develop a robust model, researchers suggest that
all of the patterns contained in the available data should be phase. In this study the data sets were scaled between 0 and 1
contained in the calibration set. Likewise, all of the patterns in using the following equation:
the available data should also be contained in the validation v vmin
vs ¼ ð2Þ
set. This provides the toughest evaluation of the generalisation vmac  vmin
ability of the model (Bowden et al., 2006). To achieve this, where vs is the scaled variable value, v is the value required to
several researchers (e.g. Ferreira, 2002) suggest that data be scaled, vmin is the minimum value of the variable among the
subsets should be statistically consistent; training and valida- data set, vmax is the maximum value of the variable among the
tion sets should possess similar statistical properties including data set.
mean, standard deviation, maximum and minimum.
In this work, the data was randomly divided into two
statistically consistent sets, as recommended by Master 3.4. Modelling process and determination of GEP model
(1993) and detailed by Shahin et al. (2004). In total, 18 case
records (75%) of the available 24 cases were used for training, The search for the GEP model was carried out as follows:
and six cases (25%) for validation. The statistics of the data
used for the training and validation sets are presented in 3.4.1. Obtaining the optimum setting parameters
Table 1, which include the mean, standard deviation, mini- The success of the modelling process using GEP depends
mum, maximum and range. It should be noted that like all significantly on the design of the structure of the model. In
empirical models, GEP performs best in interpolation rather this, the optimal model parameters are determined to ensure
than extrapolation, thus the extreme values of the data used that the best performing model is achieved. In the search for a
were included in the training set. model using the GEP, the number of chromosomes, chromo-
some structure, functional set, fitness function, linking function
3.3. Data pre-processing and rates of genetic operators play important roles during the
modelling process, and choosing suitable input of these
After completing the data division, the data pre-processing parameters can considerably reduce modelling time and effort
was carried out before starting the training phase. Pre- and produce a robust solution.
processing can vary from simple scaling or range compression In this work, a trial-and-error approach was used to
to complex techniques such as polynomial expansion and determine the values for the setting of parameters. This
Fourier transformation (Prasad and Beg, 2009). In this study, approach involved using different settings and conducting
data pre-processing was performed using a form of data runs in steps. During each step, runs were carried out and
scaling. Although data scaling is not necessary when using the value of one of the setting parameters was varied, whereas
GEP, scaling can be a useful step as it ensures all variables the values of the other parameters were set as constant (i.e.
receive the same attention from the GEP during the training number of chromosomes ¼ 30, number of genes ¼ 3, gene head
238 I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242

size ¼ 8, functions set ¼ þ ,  ,  , and /, fitness func- 22 20


tion ¼ mean squared error (MSE), linking function ¼ þ , muta- Training set

Mean squred error × 10-3 (validation)


tion rate ¼ 0.04, and gene recombination rate ¼ 0.1). The runs Validation set

Mean squred error × 10-3 (training)


18 16
were stopped after 25,000 generations, which were found to be
sufficient to evaluate the fitness of the output. At the end of
each run, the MSEs for both training and validation sets were 14 12

recorded in order to identify the values that gave the smallest


MSE. In the first step, the number of chromosomes was 10 8
determined. Several runs were conducted where the number of Selected rate
chromosomes was varied (i.e. 15, 16, 17,…, 25), whilst the
6 4
other parameters were set to be constant. The number of
chromosomes found to correspond to the smallest MSE in both
the training and validation sets was selected. 2 0
0 0.02 0.04 0.06 0.08 0.1
In the same way, the chromosome architecture, i.e. the head
Mutation rate
size and number of genes per chromosome, were determined.
Several runs were carried out using gene head sizes of 6, 7, 8, Fig. 5. Effect of mutation rate on the performance of the model.
…, 14, and number of genes per chromosome as 1, 2, 3, …, 5.
The fitness of the output of the runs was then compared to
determine the optimum chromosome architecture. 22 20
Training set
In the next step, the best set of functions was determined.

Mean squred error × 10-3 (validation)


Mean squred error × 10-3 (training)
Validation set
The initial run began with the use of the four basic arithmetic 18 16
operators ( þ ,  ,  , /). In the subsequent run an additional
function such as root square was added to the set, and so on.
14 12
The addition and multiplication linking functions were then
used in different runs to determine which of these functions
best suited this problem. 10 8
The last step was to search for the optimum rates for each of
the genetic operators. The focus was predominantly on 6 4
mutation and gene recombination, as they are the main Selected rate

parameters in creating genetic variations.


The results of the search for model setting parameters are 2 0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
shown in Figs. 4–6. Fig. 4 shows that the model performs best Gene recommbination rate
when the number of chromosomes is 22. Figs. 5 and 6 present
the influence of the rates of the genetic operators – mutation Fig. 6. Effect of gene recombination rate on the performance of the model.
and gene recombination – on the performance of the GEP
model. It can be seen that the GEP model performs best when
mutation and gene recombination rates are 0.06 and 0.6, Table 2
respectively. Input parameters used for the GEP models.
Fig. 6 shows that the model performs best when the gene Parameter Used input
recombination rate is 0.6. However, the recombination rate of
Number of chromosomes 22
21 8 Number of genes 3
Training set Head size 9
pffiffiffi pffiffiffi
Functions set þ;  ; ; =; 2 x; 3 x; Exp; Power; x2 ; x3
Mean squred error × 10-2 (validation)

18 Validation set
Mean squred error × 10-3 (training)

Fitness function Mean squared error


15
6 Linking function þ
Mutation rate 0.06
Recombination rate 0.4
12
4
9
Selected 0.4 was selected in order to avoid the risk of subjecting
number
6
2
already-evolved individuals to high rates of genetic variation,
which could lead to a reduction in their fitness.
3 The results of the optimum settings for the parameters are
presented in Table 2. The table shows that the optimum
0 0
12 15 18 21 24 27 30 chromosome structure consists of 3 genes with a head size of
Number of chromosomes 9. The presence of a function such as the Exp (exponential)
Fig. 4. Effect of number of chromosomes on the performance of the model.
among the functions group is recognizable in the models of
I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242 239

evolutionary algorithms. During the evolution process, this input and the output. The ETs can be easily translated into a
function is selected randomly by the program to improve the mathematical expression which can be simplified and rear-
fitness of the solution. ranged to read as follows:
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
pffiffiffiffiffiffi l=d
3.4.2. Selection of the GEP model Qsp ¼ ðN s Þ2 3 m1 þ EðN m b Þ þ
2 3
 0:034 ð3Þ
m3
After finding the optimum setting parameters, the GEP
model was determined by conducting runs using these para- where
meters. The outputs of the runs were several chromosomes pffiffiffiffiffiffi
m1 ¼ ðl=dÞexp 3 N b ð4Þ
(models) which represent potential solutions to the problem.
The best model was determined by screening these solutions m2 ¼ 7:5E þ S  ðl=dÞ ð5Þ
through two selection criteria defined as follows: the model has
to have a correlation coefficient, rZ 0.80, for both of the m3 ¼ expðl=dÞ þ expðsÞ  ðl=dÞþ 2E ð6Þ
training and validation sets; and it has to have mean values
within 10%. A desirable criterion for the model is that it be Qsp, scaled predicted pile capacity; Ns, average of SPT
presented as a short and simple expression. blows along shaft; Nb, average of SPT blows within pile base;
l/d, penetration ratio; S, setting; and E, hammer energy. It
3.4.3. Optimization and simplification of the GEP model should be emphasized here that before using Eqs. (3)–(6), the
The third stage was to develop the model selected from the input variable must first be scaled by applying Eq. (1).
previous stage. The model that satisfied the selection criteria The output of Eq. (3) is unscaled to calculate the normal
was further developed with the optimization and simplification values of predicted capacity, Qp, using:
procedures available in the program. Qp ¼ 43:1Qsp þ 6:6 ð7Þ

4. Results and evaluation of the performance of the GEP


model
4.1. Model evaluation
One of the advantages of GEP is that it presents the
relationship between the input and output in a form of ETs, The accuracy of the GEP model was evaluated by compar-
as shown in Fig. 7. ing its predictions of pile capacity in training and validation
The figure illustrates the mathematical operations and sets with experimental data and with predictions of pile
interactions between the components of the solution. This capacity by number of existing methods. Two traditional
can give insight into the nature of the relationship between the methods, Meyerhof (1976) and Shioi and Fukui (1982), along
with ANNs, developed by Lee and Lee (1995) were used for
Sub-ET 1 Sub-ET 2 comparison. A brief description of the compared methods is
- /
provided in Table 3.
The results of the comparison are presented numerically in
l /d +
* 0.034 Table 4 and graphically in Fig. 8. Numerically, the coefficient
+ - of determination, R, mean, standard deviation, probability
X2 3Rt
density, P50, MSE and mean absolute error (MAE), were used
Exp Exp * l /d to evaluate the accuracy of the model. The R is calculated
Ns * according to Rodgers and Nicewander (1988) from:
l /d S 2 E
Exp Sub-ET 3 ∑ni ¼ 1 ðQmi  Qm ÞðQpi  Qp Þ
l /d
r ¼ qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ð8Þ
Sqrt ∑ni ¼ 1 ðQmi  Qm Þ2 ∑ni 1 ðQpi  Qm Þ2
3Rt
* R ¼ r2 ð9Þ

Nb E X3 where r is the coefficient of correlation, Qmi is the measured


value of case i, Qpi is the predicted value of case i, Qm is the
average of measured values, Qp is the average of predicted
Pow
values.
Nb + The optimal value of R is unity, which means that a perfect
fit is achieved between predicted and measured values. Table 4
-
* shows that the model performs well with R ¼ 0.94 for the
E S l /d
training set and R ¼ 0.96 for the validation set. This indicates
7.5
that the model is capable of achieving an accurate correlation
Fig. 7. Expression trees of the developed model; X2, to power 2; X3, to power between input and output and that it performs better than the
3; Sqrt, square root; 3Rt, cubic root; Pow, power other methods.
240 I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242

Table 3
Selected SPT methods for prediction of pile-bearing capacity.

No Method Unit shaft Unit base resistance (MPa) Remarks


resistance (kPa)

1 Meyerhof (1976) fs ¼nsNs qp ¼ 0.4N1C1C2 ns ¼ 1 (low displacement piles)


C1 ¼ (Bþ 0.05/2B)n ns ¼ 2 (high displacement piles)
C2 ¼2, C2 ¼D/10B N1 ¼N value at the base level
when penetration in dense410B n ¼1, 2, 3 for loose, medium and dense sand, respectively
2 Shioi and Fukui (1982) fs ¼nsNs qp ¼ (1þ 0.04(Db/B)Nb ¼0.3Nb Ns ¼2 for sand and 10 for clay.
Ns ¼average value of N around pile embedment
depth; Nb ¼ average value of N around pile base
3 Lee and Lee (1995) No mathematical Results are based on artificial neural network modelling
expressions are available

Table 4
Numerical results of statistical analysis.

Statistical measure Prediction method

Data set Proposed Meyerhof (1976) Shioi and Fukui (1982) Lee and Lee (1995)

R Training 0.94 0.01 0.01 0.30


Validation 0.96 0.43 0.20 0.20
Mean Training 1.08 1.86 3.40 1.13
Validation 0.95 1.92 3.60 1.06
Standard deviation Training 0.14 1.17 0.83 0.43
Validation 0.13 1.21 0.67 0.64
P50 Training 1.05 3.50 4.00 1.11
Validation 0.93 3.00 4.00 0.97
Mean squared error Training 8.67 2325.88 4834.67 102.85
Validation 5.10 2084.36 3072.62 77.86
Mean absolute error Training 2.18 40.33 53.87 7.52
Validation 1.96 38.01 50.017 6.65

The mean and standard deviation were calculated according capacity by 10%, whereas the traditional and ANNs methods
to Long and Wysockey (1999) from the following equation: tend to over-predict the measured pile capacity by considerably
 i ¼ n   more than that. The low values of standard deviation can also
1 Qpi
μ ¼ Exp ∑ Ln ð10Þ be considered as an indicator to the reliability of the model.
n i¼1 Qmi The cumulative probability, P50, is calculated from Eq. (12)
sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi by sorting the values of the predicted capacity by the measured
   2 capacity (Qp/Qm) in ascending order for all cases. The smallest
1 i¼n Qpi
s¼ ∑ Ln  LnðμÞ ð11Þ Qp/Qm is given number i ¼ 1, and the largest is given i¼ n. The
n1 i ¼ 1 Qmi
value of Qp/Qm that corresponds to P ¼ 50% is considered as
where μ is the mean, s is the standard deviation, Qpi is the P50:
predicted capacity of case i, Qmi is the measured capacity of i
case i, n is the number of measurements. P¼ ð12Þ
nþ1
The optimal value of the mean is unity. Less than unity
indicates that the model tends to under-predict the pile The values of P50 are provided in Table 4. The closeness to
capacity. Conversely, more than unity is an indication of unity of the values in the two sets indicates that the model
over-prediction. The optimum value of standard deviation is performs well. It also indicates that on average, the model may
zero; the closer to zero the greater the accuracy. The calculated tend to over-predict the pile capacity. It can also be seen that
values of μ were 1.08 and 0.95 for the training and validation the other methods may tend to over-predict the measured pile
sets respectively, as presented in Table 4. This indicates that capacity by three times according to the Meyerhof method,
the model performs well and that the low mean values of the fourfold according to Shioi and Fukui, and by more than 10%
two sets suggest that the model possesses a high capability to as per the ANNs method.
predict pile capacity. The results also indicate that the model, The predictive ability of the GEP model is also evaluated by
on average, may tend to over-predict the measured pile calculating the error. As presented in Table 4, the calculated
I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242 241

1
100
Hammer energy
Pile set
0.8 Nb
Nsa

Scaled pile capacity


80 Penetration depth ratio
0.6
Predicted pile capacity (MPa)

60 0.4

0.2
40
GEP (Proposed) 0
0 0.2 0.4 0.6 0.8 1
Meyerhof (1976)
Scaled input variable
20
Shioi & Fukui (1982) Fig. 9. Sensitivity analyses.
Lee & Lee (1995)
4.2. Sensitivity analysis
0
0 20 40 60 80 100
The significance of the input variables and the response of
Measured pile capacity (MPa)
the model to their variations were investigated by conducting
sensitivity analyses. Hypothetical values of one input variable
100 were varied within the range of the training data, whereas the
other variables were assumed as constant. For example, the
effect of the penetration ratio, l/d, was investigated by allowing
80 it to change while all other input variables were set to selected
constant values. The inputs were then accommodated into the
Predicted pile capacity (MPa)

GEP model, and the predicted pile capacity was calculated.


This process was repeated for the next input variable and so
60
on, until the model response had been examined for all inputs.
The results of the sensitivity analyses are presented in Fig. 9.
It can be seen that the most influential variable on the pile
40 GEP (Proposed) capacity is the average of the standard penetration number
along the pile shaft. Variation in this variable resulted in
Meyerhof (1976) significant variations in the value of the corresponding output
20 Shioi & Fukui (1982)
capacity. The figure also shows that the variations in the
average of the standard penetration number within the pile tip
Lee & Lee (1995) zone and penetration depth ratio have significant influence on
the pile capacity, whereas pile set and hammer energy have a
0 converse and insignificant effect on pile capacity.
0 20 40 60 80 100
Measured pile capacity (MPa)
4.3. Some weaknesses in GEP
Fig. 8. Comparison the performance of the GEP model and the other methods:
(a) in training set; (b) in validation set.
(i) The GEP approach is data driven; the accuracy of the GEP
model depends on the accuracy of the data input;
(ii) GEP models cannot extrapolate answers (i.e. they can only
MSE and MAE indicate that the GEP model has great ability to provide predictions within the range of the training data);
predict pile capacity. The table also shows that the model gives (iii) the structure of the solution increases with the increase of
lowest error in comparison with the other methods. Hence, fitness;
GEP model predicts pile capacity better. (iv) a significant number of runs may be required to obtain a
A graphic presentation of the results for the two sets in suitable answer for complex problems.
Fig. 8 reveals that the GEP model achieved minimum scatter
of the points around the line of equality between the measured 5. Conclusion
and predicted pile capacities. This provides confirmation of the
capability of the model to perform well. The majority of the This study shows that the GEP technique is capable of
points are sited on or below the line of equality, hence the modelling the dynamic capacity of pile foundations. The
model may tend to under-predict the measured pile capacity. developed model can predict the targeted capacity with a high
242 I. Alkroosh, H. Nikraz / Soils and Foundations 54 (2014) 233–242

level of accuracy and performs well. The closeness of the Gandomi, A.H., 2011. Non-linear genetic based models for prediction of flow
results in the training and validation sets indicates that the GEP number of asphalt mixtures. ASCE J. Mater. Civil Eng. 23 (3), 248–263.
Gandomi, A.H., Alavi, A.H., 2012. A new multi-gene genetic programming
model has a great capacity for generalising the solution.
approach to nonlinear system modelling, part II geotechnical and earth-
Statistical analysis indicates that model predicts measured pile quake engineering problems. Neural Comput. Appl. 21, 189–201.
capacity more effectively than the traditional and ANNs Gepsoft. Genexprotools 4.0. Standard Edition ed., 2002. 〈http://www.gepsoft.
methods. A sensitivity analysis revealed that the most influen- com〉.
tial factor on the pile capacity is Ns, the number along the pile Hannigan P.J., Goble,G.G, Tendean,G., Likins,G.E., Rausche,F., 1996. Design
shaft, followed by Nb and l/d. These factors also have an and Construction of Driven Pile Foundations.Workshop Manual, Publica-
tion No. FHWA-HI-97-014-1996.
incremental relationship to pile capacity. Conversely, the Housel, W.S., 1966. Pile load capacity-estimates and test results. Soil Mech.
factors with least effect on pile capacity are the hammer Found. Eng. ASCE 92 (SM4), 1–29.
energy and pile set. These two factors have a converse Javadi, A.A., Rezania, M., Nezhad, M.M., 2006. Evaluation of liquefaction
relationship to pile capacity. Overall, the output of this study induced lateral displacements using genetic programming. Comput. Geo-
has demonstrated that while the GEP approach has some small tech. 33 (4-5), 222–233.
Lee, I-M., Lee, J.H., 1995. Prediction of pile bearing capacity using artificial
setbacks, the resulting model is competent in predicting neural networks. Comput. Geotech. 18 (3), 189–200.
dynamic pile capacity. Likins, G., DiMaggio, J., Rausche, F., Teferra, W., 1992. A solution for high
damping constants in sand. In: Proceeding of the 4th International
References Conference on Application of Stress-Wave Theory to Piles, Balkema,
Rotterdam, pp. 117–120.
Abu-Kiefa, M., 1998. General regression neural networks for driven piles Long, J., Wysockey, M., 1999. Accuracy of Methods for Predicting Axial
incohesionless soils. J. Geotech. Geoenviron. Eng. 124 (12), 1177–1185. Capacity of Deep Foundations, 88. Geotechnical Special Publication
Alavi, A.H., Aminian, P., Gandomi, A.H., Esmaeili, M.A., 2011. Genetic 181–195.
Master, T., 1993. Practical Neural Network Recipes in C þ þ . Academic Press,
based modeling of uplift capacity of suction caissons. Exp. Syst. Appl. 38,
12608–12618. San Diego, California.
Alkroosh, I., Nikraz, H., 2011a. Correlation of pile axial capacity and CPT data Meyerhof, G.G, 1976. Bearing capacity of settlement of pile foundations. The
using gene expression programming. Geotech. Geol. J. 29 (5), 725–748. eleventh Terzaghi lecture. ASCE J. Geotech. Eng. 102 (GT3), 195–228.
Ng, C., Simon, W., Menzies, B., 2004. Soil-Structure Engineering of Deep
Alkroosh, I., Nikraz, H., 2011b. Predicting axial capacity of driven piles in
cohesive soils using intelligent computing. Eng. Appl. Artif. Intell. 25 (3), Foundations Excavations and Tunnels. Thomas Telford Ltd, London.
618–627. Ornek, M., Laman, M., Demir, A., Yildiz, A., 2012. Prediction of bearing
Ardalan, H., Eslami, A., Nariman-Zadeh, N., 2009. Piles shaft capacity from capacity of circular footings on soft clay stabilized with granular soil. Soils
Found. 52 (1), 69–80.
CPT and CPTu data by polynomial neural networks and genetic algo-
rithms. Comput. Geotech. 36, 616–625. Poulos, H.G., Davis, E.H., 1980. Pile Foundation Analysis and Design. Wiley,
Bowden, G.J., Nixon, J.B., Dandy, G.C., Maier, H.R., Holmes, M., 2006. New York.
Prasad, P.W., Beg, A., 2009. Investigating data preprocessing methods for
Forecasting chlorine residuals in a water distribution system using a general
regression network. Math. Comput. Modell. 44 (5–6), 469–484. circuit complexity models. Exp. Syst. Appl. 36 (1), 519–526.
Bozozuk, M., Keenan, G.H., Pheeney, P.E., 1979. Analysis of Load Tests on Rezania, M., Javadi, A., 2007. A new genetic programming model for
Instrumented Steel Test Piles in Compressible Silty Soil. Behaviour of predicting settlement of shallow foundations. Can. Geotech. J. 44,
1462–1473.
Deep Foundations, ASTM STP 670. American Society for Testing and
Materials153–180. Rodgers, J.L, Nicewander, W.A, 1988. Thirteen ways to look at correlation
Cevic, A., Cabalar, A.F., 2009. Modelling damping ratio and shear modulus of coefficient. Am. Stat. 42 (1), 59–66.
sand–mica mixtures using genetic programming. Exp. Syst. Appl. 36 (4), Shahin, M., 2010. Intelligent computing for modeling axial capacity of pile
foundations. Can. Geotech. J. 47, 230–243.
7749–7757.
Chan, W., Chow, Y., Liu, L., 1995. Neural network: an alternative to pile Shahin, M., Maier, H., Jaksa, M., 2004. Data division for developing neural
driving formulas. Comput. Geotech. 17, 135–156. networks applied to geotechnical engineering. J. Comput. Civil Eng. 18 (2),
105–114.
Coduto, D.P., 1994. Foundation Design. Prentice-Hall Inc, New Jersey.
Das, S.K., 2013. Artificial neural networks in geotechnical engineering: Shioi, Y., Fukui, J., 1982. Application of N-value to design of foundation in
modeling and application issues. In: Yan, X., Gandomi, A.H., Talatahati, Japan. In: The Second European Symposium on Penetration Testing,
S., Alavi, A.H (Eds.), Metaheuristics in Water. Elsevier, pp. 231–270. Amsterdam, vol. 1, pp. 159–164.
Svinkin, M.R., Woods, R.D., 1998. Accuracy of determining pile capacity by
Das, S.K., Basudhar, P.K., 2006. Undrained lateral capacity of piles in clay
using artificial neural network. Comput. Geotech. 33 (8), 454–459. dynamic methods. In: Conference Papers, 7th International Conference and
Davisson, M. T., 1972. Lecture Series, Innovations in Foundation Construc- Exhibition on Piling and Deep Foundations, Austria, 1998, pp. 1.2.1–1.2.8.
tion. In: Proceedings American Society of Civil Engineers', Illinois Section. Tarawneh, B., 2013. Pipe pile set up: database and prediction model using
artificial neural network. Soils Found. 53 (4), 607–615.
Ferreira, C., 2001. Gene expression programming: a new adoptive algorithm
for solving problem. Complex Syst. 13 (2), 87–129. Teh, C.I., Wong, K.S., Goh, A.T., Jaritngam, S., 1997. Prediction of pile
Ferreira, C., 2002. Gene Expression Programming: Mathematical Modeling by capacity using neural networks. J. Comput. Civil Eng. 11 (2), 129–138.
Thompson, C.D., Thompson, D.E., 1979. Influence of Driving Stress on the
Artificial Intelligence. Angra do Heroismo, Portugal.
Flaate, K.S., 1964. An Investigation of the Validity of Three Pile Driving Development of High Pile Capacities. Behavior of Deep Foundations,
Formulae in Cohesionless Material. Pub. No. 56. Norwegian Geotechnical ASTM STP 670. American Society for Testing and Materials562–577.
Institute, Oslo.

You might also like