You are on page 1of 10

Waste Management xxx (2016) xxx–xxx

Contents lists available at ScienceDirect

Waste Management
journal homepage: www.elsevier.com/locate/wasman

Forecasting municipal solid waste generation using artificial intelligence


modelling approaches
Maryam Abbasi ⇑, Ali El Hanandeh
Griffith School of Engineering, Griffith University, Nathan, QLD, Australia

a r t i c l e i n f o a b s t r a c t

Article history: Municipal solid waste (MSW) management is a major concern to local governments to protect human
Received 23 May 2015 health, the environment and to preserve natural resources. The design and operation of an effective
Revised 16 April 2016 MSW management system requires accurate estimation of future waste generation quantities. The main
Accepted 18 May 2016
objective of this study was to develop a model for accurate forecasting of MSW generation that helps
Available online xxxx
waste related organizations to better design and operate effective MSW management systems. Four intel-
ligent system algorithms including support vector machine (SVM), adaptive neuro-fuzzy inference sys-
Keywords:
tem (ANFIS), artificial neural network (ANN) and k-nearest neighbours (kNN) were tested for their
Municipal solid waste
Artificial intelligence
ability to predict monthly waste generation in the Logan City Council region in Queensland, Australia.
Support vector machine Results showed artificial intelligence models have good prediction performance and could be successfully
Artificial neural network applied to establish municipal solid waste forecasting models. Using machine learning algorithms can
k-nearest neighbours reliably predict monthly MSW generation by training with waste generation time series. In addition,
Adaptive neuro-fuzzy inference system results suggest that ANFIS system produced the most accurate forecasts of the peaks while kNN was suc-
cessful in predicting the monthly averages of waste quantities. Based on the results, the total annual
MSW generated in Logan City will reach 9.4  107 kg by 2020 while the peak monthly waste will reach
9.37  106 kg.
Ó 2016 Elsevier Ltd. All rights reserved.

1. Introduction period from 1970 to 2014, more than 80 studies addressing the
topic of forecasting of MSW generation have been published.
Accurate projection of municipal solid waste quantities is MSW forecasting methods can be broadly classified into five main
important for the successful planning of efficient waste manage- categories: descriptive statistical methods (Bider and Franklin,
ment system. Future estimations of municipal solid waste genera- 1980; Even et al., 1981; Sha’Ato et al., 2007); regression analysis
tion serve as a basis in the development of existing waste (Denafas et al., 2014; Franchetti, 2012; Vivekananda and Nema,
management infrastructures as well as their further sustainable 2014; Wei et al., 2013); material flow model (Liu et al., 2014;
development and optimization. Imprecise forecasts may lead to Schiller et al., 2010; Tonjes and Greene, 2012; Zhang et al.,
widespread problems, such as inadequate or excessive waste dis- 2012); time series analysis (Katsamaki et al., 1998; Navarro-Esbrí
posal infrastructure (collection, incineration, landfilling or process- et al., 2002; Xu et al., 2013); and artificial intelligence models
ing) (Buenrostro et al., 2001). Therefore, the demand for reliable (Abbasi et al., 2014; Antanasijević et al., 2013; Kumar et al.,
data concerning waste generation is implicitly included in the 2011; Noori et al., 2010). However, all modelling approaches have
majority of national waste management laws (Antanasijević their own strengths and weaknesses.
et al., 2013). However, the process of forecasting MSW generation Conventional and descriptive statistical methods of forecasting
is often challenging and compound by rapidly changing and MSW generation usually use population growth and average per-
uncontrollable parameters (Beigl et al., 2008). capita waste generation as the main predictor (Abdoli et al.,
There is a growing body of literature on forecasting municipal 2012). However, this method is no longer effective due to the
solid waste (MSW) generation which includes a high heterogeneity dynamic characteristics of MSW generation process (Abbasi et al.,
of applied models from purely application-oriented to highly 2013). Regression analysis is a widely used modelling technique
sophisticated academically inclined tools (Chung, 2010). Over the because of the simplicity of underlying mathematics and well-
developed statistical theory. In regression analysis, MSW genera-
⇑ Corresponding author. tions are associated with economic and demographic variables
E-mail address: m.abbasi@griffith.edu.au (M. Abbasi). (Abdoli et al., 2011). In order to conform with the theoretical

http://dx.doi.org/10.1016/j.wasman.2016.05.018
0956-053X/Ó 2016 Elsevier Ltd. All rights reserved.

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
2 M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx

assumptions of regression, input variables must meet stringent term periods (Abbasi et al., 2013; Abdoli et al., 2012; Noori et al.,
requirements such as independence, constant variance and nor- 2010).
mality of errors (Hockett et al., 1995). All of these requirements Ordóñez-Ponce et al. (2006) employed multi-layer perceptron
place limitations on the suitability of regression for predicting neural network to predict long-term generation rate of MSW in
MSW generation as a complex real-world problem. The material Chile. Using a range of variables which covered socio-
flow model can fully characterize the dynamic properties in the demographic, economic, geographic and waste-related factors,
process of solid waste generation. However, this modelling ANN was able to predict waste generation with great accuracy
approach can be used for predicting total waste rather than col- (R2 = 0.819). Ordóñez-Ponce et al. (2006) concluded that popula-
lected waste (Beigl et al., 2008). Nevertheless, collected waste tion, percentage of urban population, years of education, number
may still be estimated where recycling and littering rates are of libraries and number of indigents were the most important fac-
known. However, Hockett et al. (1995), after analysing results tors which affected waste generation in Chile.
obtained from input-output models like material flow analysis, The ability of ANN to predict short-term MSW generation was
highlighted that comparisons of the results with real observed also examined by other researchers (Noori et al., 2009a,b). These
waste data on the highest aggregation levels were questionable studies focused on forecasting MSW generation by analysing time
due to the presence of different aggregations or because of low series of waste generation rather than analysing effective factors in
consistency within the studies. Hekkert et al. (2000) further sug- waste generation. Results showed that feed-forward ANN with one
gested that comparison may even prove to be impossible because hidden layer and 16 neurons was the best structure to forecast
studies on final consumption are almost completely lacking. In short-term waste generation rates (Noori et al., 2009a,b). However,
contrast with the previously mentioned methods, time series anal- ANN accuracy may suffer when faced with large database due to
ysis does not rely on the estimation of social and economic factors; the effect of irrelevant, redundant and noise in the data. Therefore,
thus it has the advantage of overcoming the lack of social parame- different input selection methods such as principal component
ters and other predictors. Time series data of waste generation are analysis, wavelet transform and gamma test were introduced to
dynamic in nature, and it is possible to employ non-linear tools in deal with accuracy loss (Noori et al., 2009a,c). Although ANN model
order to discern relationships within the time series. has good ability to forecast MSW generation, its performance suf-
In recent years, artificial intelligence models and machine fers because of its tendency to over-fitting training, local mini-
learning such as support vector machine (SVM), adaptive neuro- mum, and poor generalization.
fuzzy inference system (ANFIS) and artificial neural network
(ANN) have been gaining popularity because of their high flexibil- 2.2. Adaptive neuro-fuzzy inference systems
ity and proven prediction abilities (Abbasi et al., 2013, 2014;
Antanasijević et al., 2013). Intelligent models are shown to be cap- Adaptive neuro-fuzzy inference systems (ANFIS) are a well-
able of predicting MSW generation on long, medium and short known data driven modelling technique that combines ANN and
term scales (Abbasi et al., 2014; Abdoli et al., 2012; Jalili Ghazi fuzzy logic. ANFIS is composed of two parts, antecedent and con-
Zade and Noori, 2008). However, there is limited information about clusion, which are connected to each other by fuzzy rules based
monthly prediction of MSW generation as well as optimal algo- on the network form. Limited attempts, 3 studies, to predict waste
rithm for this purpose. This paper will review the state of the art generation using ANFIS were found in the literature. The studies
of the intelligent modelling approaches for MSW generation fore- compared the performance of ANN and ANFIS models ability to
cast and then apply them to a real case scenario in order to identify predict MSW generation (Chen and Chang, 2000; Noori et al.,
the most suitable algorithm to predict MSW collected by kerbside 2009c; Tiwari et al., 2012). Tiwari et al. (2012) suggested that
service on a medium-term scale. ANFIS is a more reliable model than ANN for forecasting the aggre-
gate impact of economic trend, population changes, and recycling
on solid waste generation. Chen and Chang (2000), on the other
2. Application of artificial intelligence in forecasting MSW
hand, demonstrated the ability of ANFIS to forecast waste genera-
generation
tion with limited input data. Chen and Chang (2000) and later
Noori et al. (2009c) applied fuzzy goal regression method to
Advanced artificial intelligence forecast systems have shown
improve the overall prediction accuracy of ANFIS.
superiority to conventional models in engineering problems as
well as in waste management research (Abdoli et al., 2012). Recent
2.3. Support vector machine
research in this topic focused on using artificial intelligence models
to deal with the non-linearity of the historical data. In this section,
Support vector machine (SVM), a novel neural network algo-
application of the techniques including ANN, ANFIS, SVM and k-
rithm, was developed by Vapnik and colleagues (Vapnik, 1995).
nearest neighbours (kNN) will be reviewed in the field of MSW
SVMs are a type of maximum margin classifiers which seek to find
generation.
a maximum margin hyperplane to the best line across data. The
hyperplane thus obtained is called the optimal separating hyper-
2.1. Artificial neural network plane and the training examples that are closest to the maximum
margin hyperplane are called support vectors. While most of con-
Artificial neural networks are cellular information processing ventional neural network models implement the empirical risk
systems designed and developed on the basis of the perceived minimization principle, SVM implements the structural risk mini-
notion of the human brain and its neural system (Firat et al., mization principle. Neural network model seeks to minimize the
2010). One of the most beneficial and significant features of ANN misclassification error or deviation from correct solution of the
in forecasting is its learning ability. ANN can construct a complex training data but the SVM searches to minimize an upper bound
nonlinear system through a set of input/output examples. Conse- of generalization error. Therefore, the solution of SVM may be glo-
quently, ANN has been successfully employed in nonlinear system bal optimum while other neural network models may tend to fall
modelling (Firat et al., 2010). Accordingly, the nonlinear structure into a local optimal solution. Thus, SVM is unlikely to result in
of MSW generation makes the ANN an ideal candidate for forecast- overfitting (Kim, 2003).
ing waste generation. Literature survey returned applications of SVM model was used to forecast weekly MSW generation in
ANN to forecast MSW generation in short, medium and long- Tehran city, Iran by Abbasi et al. (2013). They concluded that

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx 3

SVM could predict MSW generation in a short term scale with rea- 3. Material and method
sonable accuracy. Further, Abbasi et al. (2014) found that pre-
processing of input variables by wavelet transform method can Time series analysis comprises methods that attempt to under-
improve the accuracy and robustness of the model. stand the nature of the series and is often useful for future forecast-
ing and simulation. In time series forecasting, past observations are
2.4. k-nearest neighbours collected and analysed to develop a suitable mathematical model
which captures the underlying data generating process for the ser-
Due to its simplicity and intuitiveness, k-nearest neighbours ies. The future events are then predicted using the model. This
algorithm is widely adopted for regression and classification (Wu approach is particularly useful when there is not much knowledge
et al., 2008a). The application of kNN to time series forecasting about the statistical pattern followed by the successive observa-
under nonparametric locally weighted regression condition was tions or when there is a lack of a satisfactory explanatory model.
presented independently by Yakowitz (1987) and Cleveland Here a model was built to predict one step ahead and the vari-
(1979). kNN has also drawn attention in the field of time series able which is used as an input is the waste generation, where Wt is
forecasting because of work presented by Meade (2002). The the waste generation at time instance t. A simple time series pre-
underlying intuition to apply kNN to univariate time series is that diction performs the following functional mapping:
consistent data-generating processes often produce observations W tþ1 ¼ f ðW t ; W t1 ; . . . ; W tp1 Þ ð1Þ
of repeated patterns of behaviour. Therefore, if a previous pattern
can be identified as similar to the current behaviour of the time In Eq. (1), the waste generation at time (t + 1) is a function of the
series, the subsequent behaviour of previous pattern can provide waste generations at previous time. Therefore, at time t, a one step
valuable information to predict the behaviour in the immediate ahead forecast Wt+1 is computed using p number of the past obser-
future. In the kNN regression algorithm introduced by Meade vations Wt, Wt1, . . . , Wtp1. While, p was 12 in this study, Wt+1
(2002), the target variable of a time series forecasting problem is was forecasted by waste generation in past twelve month equiva-
presented as a sequence of interval scaled values. Given a pattern lent to a year. To compare forecasting ability of artificial intelli-
whose future value is to be predicted, the algorithm identifies gence models to predict MSW generation, real data on waste
the k most similar past patterns and combines them. So far, no generation obtained from the city of Logan, Australia was used.
attempt has been made to evaluate the ability of kNN to forecast
MSW generation. 3.1. Theory and development of ANFIS

The most basic and common Sugeno method of fuzzy inference


2.5. Summary of present studies
structure system which has validation capability by training data
was used in this study (Rezaee and Zarandi, 2010). Fig. 1 shows
Forecasting of MSW generation can be classified into 3 different
an ANFIS system with two inputs, one output and two rules. This
groups according to how far in the future the forecast period is:
system has two inputs x and y and one output, where its rules are:
short-term forecasting (usually ranging from days to few months);
mid-term forecasting (usually range from few months to 3– If x is A1 and y is B1 ; then f 1 ¼ p1 x þ q1 y þ r1 ð2Þ
5 years); and long-term forecasting (usually attempting to forecast If x is A2 and y is B2 ; then f 2 ¼ p2 x þ q2 y þ r2 ð3Þ
many years ahead). Table 1 summarises the work which applied
intelligent modelling approach to forecasting MSW generation. Ai and Bi are fuzzy sets, fi is the output within the fuzzy region
Choosing the best modelling approach for waste generation specified by the fuzzy rule, pi, qi and ri are the design parameters
depends on waste stream type, economic factors, purpose of the that are determined during the training process.
modelling, and availability of data (Beigl et al., 2008). However, Every node computes the degree of activation of any rules in the
aside from ANN and ANFIS, no attempt has been made to compare second layer. The membership functions are then multiplied in this
the ability of different artificial models to predict MSW generation, layer:
especially in midterm period with monthly time step. The follow- wi ¼ lAi ðxÞ  lBi ðyÞ i ¼ 1; 2 ð4Þ
ing section will compare the ability of artificial intelligence models
including ANFIS, SVM, ANN and kNN for monthly prediction of where lAi (x) is membership degree of x in Ai set, lBi (y) is the mem-
MSW collected by kerbside service. To the best of the authors’ bership degree of y in Bi set.
knowledge, this study is the first to apply kNN for forecasting The ith node computes (third layer) the ratio of activity degree
MSW generation as well as evaluate prediction ability of SVM, of i rule to the sum of activation degrees of all rules. wi is normal-
ANN and ANFIS models for estimating monthly waste generation. ized membership degree of i rule.
wi
 ¼
w i ¼ 1; 2 ð5Þ
Table 1 w1 þ w2
Summary of MSW generation classified by forecast period and algorithm.
The output of any node is calculated in the fourth layer:
Forecast Reference Algorithm
period wi f i ¼ wi ðpi x þ qi y þ r i Þ ð6Þ
Short-term Noori et al. (2009c) ANFIS and ANN where p, q and r are changeable consequent parameters. The final
Jalili Ghazi Zade and Noori (2008), Noori ANN
network output f is produced by the node of the fifth layer as a sum-
et al. (2009a,b, 2010)
Abbasi et al. (2013, 2014), Noori et al. SVM
mation of all incoming signals. The final outputs (number of nodes
(2009d) equals to output parameters) of all nodes are derived in the fifth layer.
X P
Medium-term Lupu et al. (2008) ANN wi f i
Ov erall Output ¼ wi f i ¼ Pi ð7Þ
i wi
Long-term Chen and Chang (2000), Tiwari et al. ANFIS and ANN
i
(2012)
Abdoli et al. (2012), Antanasijević et al. ANN In the ANFIS system, each input parameter might be clustered
(2013), Jahandideh et al. (2009), Kumar
into several class values to build up fuzzy rules, and each fuzzy rule
et al. (2011), Ordóñez-Ponce et al. (2006)
would be constructed using two or more membership functions

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
4 M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx

Fig. 1. Structure of the ANFIS network.

(Abraham, 2001). Several methods have been proposed to classify X


N

the input data and to make the rules. In this study, the one-pass ðai  ai Þ ¼ 0
i¼1
subtractive clustering algorithm is used for data classification. It ð11Þ
is a fast algorithm for estimating the number of clusters and the 0 6 ai 6 C
cluster centres in a set of data by getting the cluster radius (r).
0 6 ai 6 C ; i ¼ 1; 2; . . . ; N
The radius (r) of each data cluster should be optimized and may
vary between 0.1 and 0.95 (Noori et al., 2009c). The above objective function in Eq. (10) is a convex function,
and the solution to Eq. (10) is unique and optimal. Subsequent to
3.2. Theory and development of SVM the determination of Lagrange multipliers in Eq. (10), the parame-
ters w and b in support vector regression function can be calculated
For forecasting purpose, e-type support vector regression under Karush-Kuhn-Tucker complementarities conditions (Wu,
method was employed in this study (Clarke et al., 2005). The e- P
2009), where W ¼ Ni¼1 ðai  ai Þ/ðxi Þ. Consequently, the support
insensitive sparse solution saves CPU time during prediction vector regression function can be written as
because it does not take into account small residual errors. In a
SVM regression model, the functional dependence of the depen- X
N

dent variable y on a set of independent variables x should be esti- f ðxÞ ¼ ðai  ai ÞKðxi ; xÞ þ b ð12Þ
i¼1
mate by a deterministic function f(x) (f(x) = wT  K(x) + b) plus the
addition of some noise (y = f(x) + noise). The noise can be usually the kernel function K corresponding to
defined by error term (e).
To find a functional form for f(x) the SVM model should be cal- Kðxi ; xÞ ¼ /ðxi ÞT  /ðxÞ ð13Þ
ibrated on a sample set, a process that involves the sequential opti-
Kernels can be any symmetric function satisfying the Mercer’s
mization of an error function. Then, w and b are derived by
condition. Different kernels can be selected to construct different
minimizing the error function (Eq. (8)) subject to Eq. (9):
types of SVM. Typical examples include (Noori et al., 2009d):
1 T XN XN
w  w þ C ni þ C ni ð8Þ 1. Polynomial kernels K(x, xi) = [x  xi + 1]a
2 i¼1 i¼1
2. Radial basis function (RBF) kernels K(x, xi)=exp(c||x  xi||2)
3. Linear kernels K(x, xi) = x  xi
wT  /ðxi Þ þ b  yi 6 e þ ni
yi  wT  /ðxi Þ  b 6 e þ ni ð9Þ where x and xi is the unlabeled input and each of the training
ni ; ni P 0 ; i ¼ 1; . . . ; N inputs respectively. Where c and a are constants and the parame-
ters of the kernels. Gaussian radial basis function due to its popu-
where C indicates a positive constant, u denotes a set of nonlinear larity and accuracy was used as a kernel function in this study
transformations, N is the sample size, and ni and ni are slack vari- (Kumar et al., 2011).
ables specifying the upper and lower training error subject to an It is well known that SVM generalization performance depends
error term e. In the regression problem, most data examples are on a good setting of meta-parameters, especially: regularization
expected to be within the e-tube. If a data example falls outside parameter (C); precision parameter (e) and kernel parameter (c).
the tube, then an error ni and ni will exist. SVM reduces under- The regularization parameter (C) determines penalties to estima-
fitting and over-fitting problem by minimizing both the regulariza- tion errors, e. The parameter e represents the radius of the tube
P
tion term wT  w/2 and the training error term C Ni¼1 ðni þ ni Þ in Eq. located around the regression function. In other words, choice of
(8). Therefore, introducing a dual set of Lagrange multiplier i.e. ai e defines the magnitude of errors that can be neglected. Improper
and ai allows the optimization problem to be solved by maximizing selection of these parameters can cause over-fitting or under-
the quadratic programming algorithm (Eq. (10)) subjected to Eq. fitting problems. In data mining and machine learning 10-fold
(11). cross-validation is the most common (Refaeilzadeh et al., 2009).
This method has been employed to obtain optimal SVM parame-
X
N X
N X
N
yi ðai  ai Þ  e ðai þ ai Þ  0:5 ðai  ai Þ ters in two step grid search method. In the first step, a coarse grid
i¼1 i¼1 i;j¼1 search is taken using sets of values for C, c and e. In the second step,
a finer grid search is conducted around the optimal values obtained
ðaj  aj Þ/ðxi ÞT  /ðxj Þ ð10Þ
in the first step.

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx 5

 ðkÞ
3.3. Theory and development of k-nearest neighbours ðkþ1Þ ðkÞ @E
wij ¼ wij  k ð20Þ
@wij
k-nearest neighbours (kNN) regression is an instance based lazy
learning algorithm (Wu et al., 2008b). It is a nonparametric where k is the learning rate. The training mode begins with random
method, where a new observation is placed into the class of the numbers of the weights and proceeds iteratively. The crucial prob-
observation from the learning set that is closest to the new obser- lem in the model is how to determine the numbers of hidden units.
vation, with respect to the covariates used. The determination of An optimal network topology from the number of inputs and out-
this similarity is based on distance measures. Formally this simple puts depends critically on the number of training cases, the amount
fact can be described as follows: Let of noise and the complexity of the function or classification. The
optimal number of neurons in hidden layer is determined by trial
L ¼ fðyi ; xi Þ; i ¼ 1; . . . ; nL g ð14Þ and error in this analysis.
be a training or learning set of observed data, where yi 2 {1, . . . , c}
denotes class membership and the vector xi = (xi1, . . . , xip) represents 3.5. Evaluation metrics
the predictor values. The determination of the nearest neighbours is
based on an arbitrary distance function D (x,xi). Then for a new There are many criteria that can be used to evaluate the perfor-
observation (y, x) the nearest neighbour (y(1), x(1)) within the learn- mances of forecasting models in the numerical study. In this paper,
ing set is determined by: the prediction performance of the proposed forecasting models is
evaluated through statistical metrics: coefficient of determination
Dðx; xð1Þ Þ ¼ minðDðx; xi ÞÞ ð15Þ (R2), Root Mean Squared Error (RMSE), Mean Absolute Percentage
Error (MAPE), and Mean Absolute Error (MAE).
and y^ ¼ yð1Þ , the class of the nearest neighbour, is selected as predic-
Pn
ðW i  W i Þ
2
tion for y. R2 ¼ 1  Pi¼1 ð21Þ
2
The notation x(j) and y(j) here describes the jth nearest neigh- n
i¼1 ðW i  W i Þ
bour of x and its class membership, respectively. rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1 Xn
The weighted kNN learning has two user-specific parameters to RMSE ¼ jW i  W i j2 ð22Þ
be determined including the number of nearest neighbours, k, and n i¼1
n   
the weights assigned to the selected neighbours. Specially, the per- 1X W i  W i 
MAPE ¼  ð23Þ
formance of kNN learning mainly depends on the locality control n i¼1 Wi 
parameter, k, which is usually chosen by empirical cross-
1X n
validation. The weights are specified by kernel methods using a MAE ¼ jW i  W i j ð24Þ
n i¼1
density function. In the implementation of kNN regression used
in this study, Gaussian kernel method is used with Gaussian den- where W, W⁄ and n is actual weight of generated waste, model out-
sity function. put and number of observation, respectively. All computations were
run on Windows XP operating system and MATLAB (Version
3.4. Theory and development of artificial neural network R2009a, Mathwork Inc.).

Multi-layer feedforward neural network is the most popular 3.6. Case study and data
ANN method. It has been applied widely in many fields because
of its superior ability to approximate non-linear functions with Logan City Council is one of the fastest growing regions in south
high accuracy. Most applications of feedforward neural network east Queensland, Australia. It is situated between the City of Bris-
use some variation of the gradient technique, back propagation bane to the north and Gold Coast to the south as shown in Fig. 2.
algorithm, to optimize neural networks (He et al., 2005). A feedfor- Logan city has a population of more than 280,000 inhabitants.
ward network is made up of several layers of processing elements. The city is divided into 63 suburbs and 12 divisions with one Coun-
The three layer feed forward neural network constructed in this cil. Logan has a subtropical climate, typical of the eastern coastal
study with input layer, one hidden layer and output layer. Each locations of the high pressure belt of the southern hemisphere
neuron in a certain layer is connected with all neurons in the next (Mollasalehi, 2015).
layer. The connection between the ith and jth neuron is character- Logan Council manages a range of waste services and facilities,
ized by the weight coefficient wij. The output value of xi is deter- across residential and commercial waste streams. In addition, it
mined by Eqs. (16) and (17). operates all waste facilities in the City and manages contracts with
private companies for collection services. The majority of waste
X i ¼ f ðni Þ ð16Þ
X received and managed by the council is classified as MSW. The
ni ¼ wij xj ð17Þ monthly average MSW generation was 6,400,000 kg in 2014. Based
on Council audits of domestic waste bins, Council’s MSW is com-
where ni is the potential of the ith neuron and function f (ni) is called posed of 25% food waste, 27% garden waste, 15% paper and card-
the transfer function. board (recyclable), 4% textiles, 4% nappies, 6% recyclable glass,
metal, and plastics, and 19% ‘others’ (including wood, rubber,
f ðnÞ ¼ 1=ð1 þ expðnÞÞ ð18Þ
leather, non-recyclable paper, cardboard, metal, and glass) (Logan
The weight coefficients wij is revised to minimize the sum of the city council, 2014). In this study, monthly time series of MSW gen-
squared differences between the computed and required output eration were collected over a eighteen-year period from July 1996
values. This is accomplished by minimization of the object function to June 2014.
E:
4. Result and discussion
E ¼ 1=2ðx0  x0r Þ2 ð19Þ
where x0 and x0r are the computed and required activities of the This study simulated monthly MSW generation by 4 different
output neuron. A back-propagation training algorithm is used to artificial intelligence models, SVM, ANN, ANFIS and kNN, to find
vary the weight coefficients. It holds that the best predictive model. Before starting the modelling process,

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
6 M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx

Fig. 2. Location of Logan city in Australia.

we needed to determine the best model structure for each algo- Fig. 3 demonstrates the model performance with different r values
rithm by determining model parameters. These parameters are dif- with the optimum value marked by an arrow.
ferent according to each model theory as discussed above. For SVM model, optimization using grid search method showed
One-pass subtractive clustering algorithm was used for data that optimum values for C, c and e were 100, 41 and 0.001, respec-
classification in ANFIS, radius, r, was determined by trial and error tively. Optimal values were obtained by R2 and MAPE estimators as
method. The cluster radius indicates the range of influence of a shown in Fig. 4(a), and (b) respectively. Fig. 4 visually illustrates
cluster and consequently number of rules. In general, as the num- parameter interdependencies, as expected, of C and c. In SVM,
ber of rules is increased, the difference between the predicted and the trade-off between complexity of decision rule and frequency
the obtained values decreases and more complex relations can be of error is controlled by changing the parameter C. the c parameter
modelled with a large number of rules. In this study, ANFIS model defines the influence of a single training input. The bulge in Fig. 4
was developed with different cluster radius to determine the num- suggest that increasing C and c lead to better results but no longer
ber of clusters by results obtained from the model. Results showed after the optimum value. Moreover, same results were obtained
the best fuzzy structure in neural-fuzzy network for estimating using MAE and RMSE.
MSW generation was at radius of 0.14 where R2, MAE, RMSE, and In kNN method, a large k value produces more precise predic-
MAPE values were 0.98, 52.16, 175.88, and 0.008 respectively. tions as it reduces the overall noise; however, the compromise is

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx 7

Fig. 5. Determining k variable in kNN method.

Fig. 3. Optimal r to construct the best structure for ANFIS model.

completely solve the problem or parts of it, and a too complex net-
that the distinct boundaries within the feature space are blurred. work has the tendency to memorize the data. It concentrates too
Fig. 5 provides optimization information. As it can be seen, opti- much on the data presented for learning and tends toward mod-
mum k was 6 with R2 = 0.51, MAE = 250.26, RMSE = 308.19 and elling the noise which may lead to underfitting or overfitting.
MAPE = 0.4. Number of neurons needs to be optimized by trial and error
As mentioned above, ANN includes the interconnections of a method. The Optimal number of neurons in hidden layer was
number of neurons. The information received by the input layer determined to be 8 where R2 is 0.46 as shown in Fig. 6. As expected,
is processed by neurons in hidden layer with appropriate non- performance of the model was improved as number of neurons
linear transfer functions. The number of hidden neurons affects increased to 8 and then performance decreased due to overfitting
the network complexity. If the network is too small, it will not problem.

Fig. 4. SVM parameter optimization by grid search method.

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
8 M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx

Fig. 8. Comparison of models’ performance by R2 and RMSE.


Fig. 6. Determining optimal neuron number in ANN method.

for forecasting monthly waste generation. Although kNN was less


4.1. Forecasting monthly MSW generation by artificial intelligence
accurate than ANFIS, it could predict average value of waste gener-
models
ation accurately. This can be explained according to the fundamen-
tal of kNN theory which estimates future values based on nearest
In order to get an effective evaluation of kNN, ANN, ANFIS and
data. In addition, results revealed ANN had the lowest performance
SVM models performance, the best model structures were used.
in comparison with the other models.
85% of the data was used for training, 15% for testing and 10% of
Estimating maximum and average quantities of waste produced
the training dataset for validation. Monthly MSW generation in
by household in a region is critical for designing and operating the
Logan city was predicted by ANFIS, ANN, SVM and kNN as shown
waste management system. Understanding maximum waste gen-
in Figs. 7 and 8. Moreover, Table 2 summarises accuracy of each
erated in a city allows managers to estimate required facilities
model during training and testing stages.
related to waste collection and disposal accurately at the highest
As evident from Figs. 7 and 8 and Table 2, the results indicated
demand. In addition, regular operation of a waste management
that all models performed well during the training stage with a
system depends on estimating an average generation rate in a year.
high R2, more than 0.8. However, during the test stage, there were
As shown in Fig. 6, ANFIS could produce accurate estimates at
substantial differences between the models. ANFIS performed the
peaks, while results obtained from kNN model were close to the
best (R2 = 0.98). It showed good agreement with the observed data,
average value of waste generated in Logan City Council region.
and close predictions of the observed peaks. SVM ranked second in
Therefore, in the next section, ANFIS and kNN were employed to
terms of its accuracy in matching the observed data (R2 = 0.71).
determine maximum and average of monthly waste generation
However, it tended to underestimate the peaks and generated vol-
for the future period 2015–2020.
umes. Prediction ability of kNN was in middle. Nevertheless, it was
the closest in estimating monthly average values of generated
waste. ANN was the least accurate in matching the observed data 4.2. Projection of monthly MSW generation in Logan city
(R2 = 0.46).
ANFIS is a class of adaptive networks that provides nonlinear As demonstrated in the previous section, ANFIS accurately esti-
ability, capacity for fast learning, and adaptation capability mated waste generation at peaks while kNN was successful in esti-
(Tiwari et al., 2012). Earlier studies demonstrated good ability of mating monthly average values. Fig. 9 shows projections of
SVM and ANFIS for forecasting MSW generation (Abbasi et al., monthly MSW generation in Logan city for the coming five years
2014; Noori et al., 2009c). Results obtained from this study support period (2015–2020). Results predict that 1,270,000 kg will be
these findings, but also shows the superiority of ANFIS algorithm added to the monthly average value of generated waste by 2020.

Fig. 7. Modelling results produced by ANFIS, ANN, SVM and kNN.

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx 9

Table 2
Accuracy of SVM, ANFIS, ANN and kNN models to forecast monthly MSW generation in Logan city.

Train Test
2
MAE RMSE MAPE R MAE RMSE MAPE R2
ANFIS 0.001 0.002 3.39E06 0.99 52.16 175.18 0.008 0.98
SVM 203.03 300.70 0.05 0.93 206.42 231.99 0.033 0.71
ANN 335.03 498.43 0.07 0.83 226.50 290.55 0.037 0.46
kNN 251.80 387.21 0.06 0.88 250 308.19 0.040 0.51

Fig. 9. Monthly MSW generation predicted by ANFIS, SVM and kNN.

Moreover, peak waste flow is forecast to grow by 2.7% in compar- is also the first to apply kNN modelling to waste generation fore-
ison to year 2014 peak value. Therefore, Logan Council may need casting. kNN was able to estimate average monthly waste genera-
more facilities to deal with the peak demand over the next 5 years tion rates with good accuracy.
period.

Acknowledgements
5. Conclusion
The authors would like to acknowledge Logan Council for pro-
To ensure suitable MSW management in a town, the problem
viding municipal waste generation data. The authors would also
must be addressed from its roots. Forecasting solid waste genera-
like to thank Mr Brett Lee from Logan Council for his co-
tion is a fundamental step in planning and operation of a waste
operation. The authors would also like to thank Ms Siti Amri for
management system. Therefore, it is necessary for MSW managers
preparing the study area map.
and decision makers to develop a tool for accurate predictions of
solid waste quantities generated.
Due to limitation of conventional methods of waste generation References
estimation, choosing one statistical model may limit the ability to
find the best predictive model for waste generation data. In artifi- Abbasi, M., Abduli, M.A., Omidvar, B., Baghvand, A., 2013. Forecasting municipal
solid waste generation by hybrid support vector machine and partial least
cial intelligence models, the data inform the choice of model rather square model. Int. J. Environ. Res. 7, 27–38.
than imposing a specific model a priori. In this study, four algo- Abbasi, M., Abduli, M.A., Omidvar, B., Baghvand, A., 2014. Results uncertainty of
rithms of artificial intelligence were compared for their accuracy support vector machine and hybrid of wavelet transform-support vector
machine models for solid waste generation forecasting. Environ. Prog. Sust.
of predicting waste generation rates. They were then employed Energy 33, 220–228.
to forecast monthly MSW generation in Logan city, Australia. Our Abdoli, M.A., Falah Nezhad, M., Salehi Sede, R., Behboudian, S., 2012. Longterm
results showed that artificial intelligence models provide promis- forecasting of solid waste generation by the artificial neural networks. Environ.
Prog Sust. Energy 31, 628–636.
ing tools that may allow decision makers to successfully forecast Abdoli, M.A., Falahnezhad, M., Behboudian, S., 2011. Multivariate econometric
future trends in MSW generation for planning and design of approach for solid waste generation modeling: impact of climate factors.
MSW management purposes. Environ. Eng. Sci. 28, 627–633.
Abraham, A., 2001. Neuro fuzzy systems: state-of-the-art modeling techniques. In:
Based on the results obtained from the study, it can be con-
Mira, J., Prieto, A. (Eds.), Connectionist Models of Neurons, Learning Processes,
cluded that kNN, ANFIS and SVM models can give good prediction and Artificial Intelligence. Springer, Berlin Heidelberg, pp. 269–276.
performance and may be applied to establish the forecasting mod- Antanasijević, D., Pocajt, V., Popović, I., Redžić, N., Ristić, M., 2013. The forecasting of
els that could provide accurate and reliable monthly MSW genera- municipal waste generation using artificial neural networks and sustainability
indicators. Sust. Sci. 8, 37–46.
tion prediction. In addition, results suggest that ANFIS model Beigl, P., Lebersorger, S., Salhofer, S., 2008. Modelling municipal solid waste
produced more accurate forecasts than kNN and SVM. This study generation: a review. Waste Manage. 28, 200–214.

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018
10 M. Abbasi, A. El Hanandeh / Waste Management xxx (2016) xxx–xxx

Bider, W.L., Franklin, W.E., 1980. A Method for Determining Processible Waste for a Navarro-Esbrí, J., Diamadopoulos, E., Ginestar, D., 2002. Time series analysis and
Resource Recovery Facility, pp. 211–219. forecasting techniques for municipal solid waste management. Resour. Conserv.
Buenrostro, O., Bocco, G., Vence, J., 2001. Forecasting generation of urban solid Recycl. 35, 201–214.
waste in developing countries – a case study in Mexico. J. Air Waste Manage. Noori, R., Abdoli, M.A., Farrokhnia, A., Ghaemi, A., 2009a. Solid waste generation
Assoc. 51, 86–93. predicting by hybrid of artificial neural network and wavelet transform. J.
Chen, H.W., Chang, N.B., 2000. Prediction analysis of solid waste generation based Environ. Stud. 35, 25–30.
on grey fuzzy dynamic modeling. Resour. Conserv. Recycl. 29, 1–18. Noori, R., Abdoli, M.A., Ghazizade, M.J., Samieifard, R., 2009b. Comparison of neural
Chung, S.S., 2010. Projecting municipal solid waste: the case of Hong Kong SAR. network and principal component-regression analysis to predict the solid waste
Resour. Conserv. Recycl. 54, 759–768. generation in Tehran, Iran. J. Public Health 38, 74–84.
Clarke, S.M., Griebsch, J.H., Simpson, T.W., 2005. Analysis of support vector Noori, R., Abdoli, M.A., Farokhnia, A., Abbasi, M., 2009c. Results uncertainty of solid
regression for approximation of complex engineering analyses. J. Mech. Des. waste generation forecasting by hybrid of wavelet transform-ANFIS and
127, 1077–1087. wavelet transform-neural network. Expert Sys. Appl. 36, 9991–9999.
Cleveland, W.S., 1979. Robust locally weighted regression and smoothing Noori, R., Abdoli, M.A., Ameri Ghasrodashti, A., Jalili Ghazizade, M., 2009d.
scatterplots. J. Am. Stat. Assoc. 74, 829–836. Prediction of municipal solid waste generation with combination of support
Denafas, G., Ruzgas, T., Martuzevičius, D., Shmarin, S., Hoffmann, M., Mykhaylenko, vector machine and principal component analysis: a case study of Mashhad.
V., Ogorodnik, S., Romanov, M., Neguliaeva, E., Chusov, A., Turkadze, T., Environ. Prog. Sust. Energy 28, 249–258.
Bochoidze, I., Ludwig, C., 2014. Seasonal variation of municipal solid waste Noori, R., Karbassi, A., Salman Sabahi, M., 2010. Evaluation of PCA and Gamma test
generation and composition in four East European cities. Resour. Conserv. techniques on ANN operation for weekly solid waste prediction. J. Environ.
Recycl. 89, 22–30. Manage. 91, 767–771.
Even Jr., J.C., Arberg, P., Parker, J.R., Alter, H., 1981. Residential waste generation. A Ordóñez-Ponce, E., Samarasinghe, S., Torgerson, L., 2006. Artificial neural networks
case study. Resour. Conserv. 6, 187–201. for assessing waste generation factors and forecasting waste generation: a case
Firat, M., Turan, M.E., Yurdusev, M.A., 2010. Comparative analysis of neural network study of Chile. J. Solid Waste Technol. Manage. 32, 167–184.
techniques for predicting water consumption time series. J. Hydrol. 384, 46–51. Refaeilzadeh, P., Tang, L., Liu, H., 2009. Cross-validation. Encyclopedia of Database
Franchetti, M.J., 2012. Development of a solid waste prediction, characterization, Systems. Springer, pp. 532–538.
and modeling tool for the assessment of manufacturing and service waste Rezaee, B., Zarandi, M.F., 2010. Data-driven fuzzy modeling for Takagi–Sugeno–
management systems. J. Solid Waste Technol. Manage. 38, 38–57. Kang fuzzy system. Inf. Sci. 180, 241–255.
He, Y., Qiu, Y., Liu, G., Lei, K., 2005. Optimizing weights of neural network using an Schiller, F., Raffield, T., Angus, A., Herben, M., Young, P.J., Longhurst, P.J., Pollard, S.J.
adaptive tabu search approach. Advances in Neural Networks–ISNN 2005. T., 2010. Hidden flows and waste processing-an analysis of illustrative futures.
Springer, pp. 672–676. Environ. Technol. 31, 1507–1516.
Hekkert, M.P., Joosten, L.A.J., Worrell, E., 2000. Analysis of the paper and wood flow Sha’Ato, R., Aboho, S.Y., Oketunde, F.O., Eneji, I.S., Unazi, G., Agwa, S., 2007. Survey of
in The Netherlands. Resour. Conserv. Recycl. 30, 29–48. solid waste generation and composition in a rapidly growing urban area in
Hockett, D., Lober, D.J., Pilgrim, K., 1995. Determinants of per capita municipal solid Central Nigeria. Waste Manage. 27, 352–358.
waste generation in the southeastern United States. J. Environ. Manage. 45, Tiwari, M.K., Bajpai, S., Dewangan, U.K., 2012. Prediction of industrial solid waste
205–217. with ANFIS model and its comparison with ANN model-A case study of Durg-
Jahandideh, S., Jahandideh, S., Asadabadi, E.B., Askarian, M., Movahedi, M.M., Bhilai twin city India. Int. J. Eng. Innov. Technol. 6, 192–201.
Hosseini, S., Jahandideh, M., 2009. The use of artificial neural networks and Tonjes, D.J., Greene, K.L., 2012. A review of national municipal solid waste
multiple linear regression to predict rate of medical waste generation. Waste generation assessments in the USA. Waste Manage. Res. 30, 758–771.
Manage. 29, 2874–2879. Vapnik, V.N., 1995. The Nature of Statistical Learning Theory. Springer-Verlag, New
Jalili Ghazi Zade, M., Noori, R., 2008. Prediction of municipal solid waste generation York Inc.
by use of artificial neural network: a case study of Mashhad. Int. J. Environ. Res. Vivekananda, B., Nema, A.K., 2014. Forecasting of solid waste quantity and
2, 13–22. composition: a multilinear regression and system dynamics approach. Int. J.
Katsamaki, A., Willems, S., Diamadopoulos, E., 1998. Time series analysis of Environ. Waste Manage. 13, 179–198.
municipal solid waste generation rates. J. Environ. Eng. 124, 178–183. Wei, Y., Xue, Y., Yin, J., Ni, W., 2013. Prediction of municipal solid waste generation
Kim, K.-J., 2003. Financial time series forecasting using support vector machines. in china by multiple linear regression method. Int. J. Comput. Appl. 35, 136–140.
Neurocomputing 55, 307–319. Wu, H.-C., 2009. The Karush–Kuhn–Tucker optimality conditions in multiobjective
Kumar, J.S., Subbaiah, K.V., Rao, P., 2011. Prediction of municipal solid waste with programming problems with interval-valued objective functions. Eur. J. Oper.
RBF network-a case study of Eluru, AP, India. Int. J. Innov., Manage. Technol. 2, Res. 196, 49–60.
238–243. Wu, X., Kumar, V., Ross, Q.J., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G.J., Ng, A.,
Liu, J., Xue, Y.H., Yang, Y.N., 2014. A tentative study on quantity of construction and Liu, B., Yu, P.S., Zhou, Z.H., Steinbach, M., Hand, D.J., Steinberg, D., 2008a. Top 10
demolition waste generation in Tianjin. In: 3rd International Conference on algorithms in data mining. Knowl. Inf. Syst. 14, 1–37.
Energy and Environmental Protection, ICEEP 2014. Trans Tech Publications Ltd., Wu, X., Kumar, V., Ross Quinlan, J., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G., Ng,
Xi’an, pp. 2727–2730. A., Liu, B., Yu, P., Zhou, Z.-H., Steinbach, M., Hand, D., Steinberg, D., 2008b. Top
Logan city council, 2014. Waste and recycling <http://www.logan.qld.gov.au/ 10 algorithms in data mining. Knowl. Inf. Syst. 14, 1–37.
environment-water-and-waste/waste-and-recycling>. Xu, L., Gao, P., Cui, S., Liu, C., 2013. A hybrid procedure for MSW generation
Lupu, V., Lupu, C., Morariu, N., 2008. The implementation of clean production and forecasting at multiple time scales in Xiamen City, China. Waste Manage. 33,
the use of neural networks in forecasting waste management. WSEAS Trans. 1324–1331.
Syst. Control 3, 722–736. Yakowitz, S., 1987. Nearest-neighbour methods for time series analysis. J. Time
Meade, N., 2002. A comparison of the accuracy of short term foreign exchange Series Anal. 8, 235–247.
forecasting methods. Int. J. Forecasting 18, 67–83. Zhang, L., Yuan, Z., Bi, J., Huang, L., 2012. Estimating future generation of obsolete
Mollasalehi, A., 2015. GIS based modelling for fuel reduction using controlled burn household appliances in China. Waste Manage. Res. 30, 1160–1168.
in Australia: case study: Logan City, Queensland LUMA-GIS Thesis <https://lup.
lub.lu.se/student-papers/search/publication/5276964>.

Please cite this article in press as: Abbasi, M., El Hanandeh, A. Forecasting municipal solid waste generation using artificial intelligence modelling
approaches. Waste Management (2016), http://dx.doi.org/10.1016/j.wasman.2016.05.018

You might also like