Professional Documents
Culture Documents
PII: S0959-6526(17)31185-X
DOI: 10.1016/j.jclepro.2017.06.016
Please cite this article as: Wei Sun, Caifei Wang, Chongchong Zhang, Factor analysis and
forecasting of CO2 emissions in Hebei, using extreme learning machine based on particle swarm
optimization, Journal of Cleaner Production (2017), doi: 10.1016/j.jclepro.2017.06.016
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to
our customers we are providing this early version of the manuscript. The manuscript will undergo
copyediting, typesetting, and review of the resulting proof before it is published in its final form.
Please note that during the production process errors may be discovered which could affect the
content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
Abstract
control CO2 emissions, therefore, of great significance is the study to analyze what
accurately and harness the growth from the source. In this paper, basing on 22
adopted to extract the latent factors which essentially affect emissions and 8 special
(ELM) whose input weights and bias threshold were optimized by particle swarm
emissions and testify the availability of the factor analysis. Case studies reveal that the
factor analysis which generates 8 factors as input can highly improve prediction
accuracy. And the simulation results demonstrate that the built model PSO-ELM
outperforms the compared ELM and back propagation neural network in forecasting
CO2 emissions. Eventually, the analysis made in this study can provide valuable
policy implications for Hebei’s CO2 emissions reduction and strategic low-carbon
development.
ACCEPTED MANUSCRIPT
Keywords:
factor analysis
1 Introduction
Drawing high attention from the international society, global climate change has
become one of the most tough challenges faced by most countries. As the
greenhouse effect, becoming primary cause for the climate change. At present, China,
proved to be the second largest energy consumer, energy importer and the biggest
emitter of CO2 in the world, has been in the spotlight. During the twelfth “Five-Year”
with times, the target that carbon emissions of China will reach its peak in 2030 is put
methods based on deep analysis of influential factors are entailed for predicting CO2
emissions and then matching effectual measures must be taken to reduce emissions.
can be classified into 2 categories, one is the analysis on influence factors of CO2
emissions, and the other is predicting CO2 emissions on the basis of influencing
factors.
In the case of factors associated with CO2 emissions, there are three major
methods: (1) structural decomposition analysis such as input-output method (Wei and
ACCEPTED MANUSCRIPT
statistical analysis, containing grey relational analysis (Huang, 2016), cluster analysis
From the perspective of the content and extending factor analytic methods, a wealth
industry (Summerbell, 2016), iron and steel industry (Sun, 2012) and power industry
2014) and others. For specific factor analytic methods, logarithmic mean divisia index
(Wang, 2017), adaptive weighted divisia (Greening and Davis, 1998) or else
combined with kaya model (Duro, 2013) and so on are prevailing in factors analysis
forecast the evolution of carbon dioxide emissions in Bahrain over the 2012-2021
exponential smoothing state space model (ETS), the Holt-Winters Model, the
BATS/TBATS model, ARIMA, the structural time series model (STS), the naive
model, and the neural network time series forecasting method (NNAR), which arrived
at the conclusion that Bahrain cannot meet its assumed target. Another study by Wang
ACCEPTED MANUSCRIPT
(2017) clearly demonstrated the influence of various factors related to energy on CO2
STIRPAT model based on the classical IPAT identify. Otherwise, vector auto-
regression model (Xu and Lin, 2015a), DDEPM method (Zhao and Liu, 2013) and
other original methods were used to study CO2 emissions by scholars at home and
abroad.
recent years. Back propagation neural network (BPNN), singly or in conjunction with
(2016) are wielded extensively within the realm of predicting CO2 emissions.
Furthermore, support vector machine (SVM) (Mladenovic, Sokolov et al., 2016) and
least squares support vector machine (LSSVM) (Sun and Liu, 2016), along with
objects and circumstance, the advent of scenario analysis (Rayle and Pai, 2010)
Nonetheless, flaws such as easily plunging into local optimum,over-fitting and poor
elements in Hebei to study CO2 emissions and utilizes factor analysis to seek common
factors, which furnishes more entry points for emission mitigation measures. In
ACCEPTED MANUSCRIPT
addition, the hybrid model PSO-ELM is firstly introduced for CO2 emissions
The innovation and contribution of the present study mainly lies in the following
aspects. First, this paper comprehensively considers 22 influential factors and make
bivariate correlation analysis to identify there existing real relevance between selected
indicators and CO2 emissions. In this foundation, total factors are divided into 6
categories based on relativity and economic connotation. And factor analysis which
dimensions without distortion and dig out potential factors hiding in the 6 classes.
Then, extreme learning machine based on particle swarm optimization, tapping PSO
into optimizing the weight of input layer and the bias of hidden layer in ELM, is
applied in Hebei for CO2 emissions prediction from 1995 to 2014. The proposed PSO-
ELM not only maximize the merit of PSO’s global search capability and ELM’s fast
learning speed, but also overcome the inherent instability of ELM. Through case
studies, taking the factors uncovered by factor analysis as input variables for PSO-
ELM, there is evidence that the hybrid model makes a favorable performance in
2 Methodology
Proposed by Kennedy and Eberhart (1995) in 1995, PSO calls the bird “particle”.
Each particle represents a potential optimal solution of the problem, initialized at the
commencement of the iterations by position, velocity and fitness value, among them,
ACCEPTED MANUSCRIPT
the velocity determines the direction and the distance of the particle.
was updated by tracking two extremes, one is called Pbest which is the optimal position
found by the particle itself, and the other called Gbest is the global best position of the
and the ith of which can be expressed by Xi = (xi1, xi2, …, xiD)T. Subsequently, the
velocity and the Pbest of the ith particle are respectively stated as Vi = (vi1, vi2, …, viD)T
and Pi = (pi1, pi2, …, piD)T. The finest position of all is similarly denoted as Pg = (Pg1,
Where wi conceived as inertia weight can modify the search coverage. c1 and c2
are acceleration factors equal to 1.49445. Then, rand1 and rand2 are generated
uniformly in the interval [0, 1]. To avert blind searching, particles’ position and
algorithm based on single hidden layer feed-forward neural network (SFLN). SLFN
adopts gradient descent algorithm to adjust the weights and parameters in training
process. Nevertheless, multiple parameters that need to update during each iteration
and chain of driring principle lead to SLFN unsuitable for the problem of large scale
ACCEPTED MANUSCRIPT
neural network.
In order to address those, ELM, as shown in Fig. 1, was put forward and adopted
the Moore-penrose inverse to calculate output matrices of the hidden layer, turning
training program into striving for the solution of the least square problem. Meanwhile,
with the random setting of input weights and bias of hidden layer, ELM immensely
Hidden layer
Input layer Output layer
x beta y
w
...
...
...
Although ELM has a desirable performance in most cases, there are still some
unnecessary weight values and thresholds will degrade the performance of ELM and
result in unstable results. In addition, ELM requires for plenty of hidden layer nodes
to achieve desired results in some practical applications, but excessive nodes shall add
optimization
machine based on PSO (PSO-ELM) which taps PSO into optimizing the weights of
input layer and the bias of hidden layer in ELM, and thereby obtains an optimal
emissions are identified by bivariate correlation analysis and on the basis of this,
factor analysis is used for extracting input variables. Part 2 is the particle swarm
Part 1
22 Indicators
Bivariate correlation
Input factors Factor analysis Start
analysis
CO2 emissions
Optimum w and b
Searching for pid and Pgd
N Output
Y
Terminal condition
The present study analyzes the carbon dioxide emissions and it’s factors of Hebei
province in China from 1995 to 2014, where the data stems from “China Statistical
the coefficients promulgated in the 2006 IPCC Guidelines for National Greenhouse
Gas Inventories (IPCC, 2006), CO2 emissions is calculated through different types of
energy consumption that the unit has been transformed as standard coal. From the
results presented in figure 3, CO2 emissions in Hebei accounted for about 6.7 percent
of the nation, occupying a considerable scale. And conversion coefficients of the three
Table 1
cement, steel and vehicle, power generation, R&D, general public budget expenditure,
Bivariate correlation analysis and the two-tailed significance test in SPSS were
preliminary used to verify the relevance between carbon dioxide emissions and pre-
selected factors, and the Pearson coefficient was reckoned to reflect the relatedness.
As illustrated in Table 2, entire factors are tightly linked with CO2 emissions, passing
through the two-tailed significance test at the confidence level of 99% universally.
Table 2
Production of vehicle 0.930** The fuel and power purchase price index 0.966**
and a certain degree of correlation between variables may put the subsequent research
to some inconvenience and deviation. Therefore, this paper employs factor analysis in
SPSS to process variables and further identify the latent common factors which
statistical technique that can explore common factors from the variable group.
interrelated variables are classified into one group, thus, rendering the same group of
variables with fairly correlation and different groups of variables less associated.
Those variables within one group can be substituted by a mutual thing that is
distinguished as a common factor. Not only does factor analysis reduce indicators’
dimensions, but also the common factors it elicited to portray and replace primitive
variables can commendably mirror and explain the complicated relationship between
variables, keeping data messages with essentially no less and facilitating the later
research.
After correlation analysis for the pre-selection factors, indicators are divided into
the relevance on value. Taking the type of energy intensive industry as an example,
hiding in the four indicators. First, statistic value of Kaiser Meyer Olkin (KMO) that
served as the criteria to judge whether factor analysis can be done is 0.785, more than
0.7, really suitable for factor analysis. Whilst the probability of barlett test of
sphericity is 0.000, rejecting the null hypothesis and denying the correlation matrix is
Likewise, the results of KMO and others can be attained from the following
Table 4 to 8. Particularly, in Table 4, two common factors are extracted through factor
rotation to vividly construe the indicators. If nothing but F11 was determined, the
group can only be decoded by 53.922%, whereas coupled with F12, the accumulative
contribution rate of the two factors can be 96.965% . As we can see, GDP of primary
industry, secondary industry and tertiary industry have greater load on F11 than export,
as opposed to export loading more on F12, thus, F11 and F12 is designated as output
value factor and export factor respectively. Similarly, F31 and F32 are parts of the
factor. F4, F5, F6 are separately defined as consumption factor, city factor and
Table 3
Table 4
Indicator Loading of F11 F12 Scoring coefficient of F11 F12 Index Value
GDP of secondary industry 0.749 0.662 0.662 -0.472 Bartlett's test 250.755
Table 5
Indicator Loading of F31 F32 Scoring coefficient of F31 F32 Index Value
Table 6
The fuel and power purchase price index 0.98 0.253 Contribution rate 96.414%
ACCEPTED MANUSCRIPT
Table 7
Table 8
To sum up, six classes of factors, actually 8 factors, are obtained as input for the
forecasting model, containing output value factor F11 and export factor F12, energy
consumption factor F2, general investment factor F31 and pollution investment factor
F32, consumption factor F4, city factor F5 and transportation factor F6. Transformed by
consequence, the following study will take the 8 factors as input variables to predict
To examine the rational alternative of factors, the PSO-ELM model was applied
in Hebei’s carbon dioxide emission forecasting. Meanwhile, root mean square error
(RMSE) and mean absolute percentage error (MAPE) are in the utilization of
checking PSO-ELM’s performance. The equations of the two error criterion are as
below:
ACCEPTED MANUSCRIPT
2
1 n yˆ t yt
RMSE
n t 1 yt
(3)
1 n yˆ t yt
MAPE
n t 1 yt
*100% (4)
Where n stands for the quantity of the test sample, yt is the tth real CO2
It was PSO that optimized the matrix of output weight in ELM. In PSO, the max-
generation = 100, the size of population = 50, whilst in ELM, nodes of the hidden
Table 9
Parameters Value
present study also makes comparisons with different models, including back
propagation neural network (BPNN) and ELM, to testify the effectiveness of PSO-
ELM.
As illustrated in Fig.3, it can apparently draw the conclusion that the fitting curve
from 2010 to 2014 of the suggested PSO-ELM model is superior to others in CO2
emissions forecasting of Hebei. Relatively, the tracks of ELM algorithm and BPNN
model have a worse fitting effect with some predicted points deviating from the
ACCEPTED MANUSCRIPT
authentic values.
both RMSE and MAPE of PSO-ELM are less than other two models, exhibiting high
prediction exactitude. As for BPNN, the value of MAPE is higher than ELM and
PSO-ELM about 1.15% and 2.02%. Moreover, results forecast by ELM have already
shown a good precision for its MAPE value is merely 1.18%. Nonetheless, the
improved ELM --- PSO-ELM whose MAPE and RMSE are only 0.31% and 91.63,
achieves a pretty good consequence. According to the results, not only are the factors
selected by factor analysis available and efficient, but also the PSO-ELM model is
proved to be applicable.
Table 10
indicators, the accurate predictions further corroborated that total factors or the basic
orientation and conception for policy makers regarding CO2 emissions reduction.
vehicle and others are the leading contributors to manufacturing industry’s CO2
emissions in China (Xu and Lin, 2016). Corresponding measures like reorganization
thermal power units in Hebei, so that replacing small units with large ones, oxy-fuel
enriched combustion to facilitate carbon capture and sequestration and clean energies
Since referring to power, it is well known that coal reflected by the consumption
factor F4 is the primary fuel of the thermal power. Similarly emphasized in (Sun and
Xu, 2016), coal consumption offers insights into measures such as promotion of new
catalyst for coal combustion, usage of high quality coal, excise tax on coal and other
effective techniques.
As regards city factor F5, environmental issue is highlighted as an area for further
managing and the augment of urban green area such as vertical planting and vacant
According to (Xu and Lin, 2015b), private cars leads to increased CO2 emissions
in the early stages of rapid growth, but with the increased use of electric and hybrid
cars and other advancement, it’s emissions intensity will gradually decline. In light of
this, discerning from the transportation factor F6, odd and even license plate rule, free
ACCEPTED MANUSCRIPT
public transits in special period, substitution of electric power for petrol and
congestion charge in big cities are fairly encouraged. As discussed previously, it has
been evident from this research, the government could carry out proper actions
5 Conclusion
Hebei, and based on the classification while considering correlation and economic
connotation of the total variables, 8 factors are eventually identified through factor
analysis. Afterwards, the 8 factors are input into PSO-ELM to forecast CO2
embodying ELM and BPNN, it is regarded as evident to conclude that: (1) Factor
analysis of the total selected indicators influencing CO2 emissions is available and
conducive to enhance predicting accuracy of the results; (2) Compared with other
methods, the PSO-ELM model has the lowest RMSE and MAPE values, which is
carbon economy. Therefore, researches on CO2 emissions and its influential factors
reference for relevant agencies to take mitigation measures and improve carbon
trading market internally and externally. Compared with most previous studies, factor
analysis applied in this paper takes more influencing factors of CO2 emissions into
ACCEPTED MANUSCRIPT
References
Antanasijevic, D.Z., Ristic, M.D., Peric-Grujic, A.A., Pocajt, V.V., 2014. Forecasting GHG
emissions using an optimized artificial neural network model based on correlation and principal
Change, I.P.O.C, 2006. 2006 IPCC Guidelines for National Greenhouse Gas Inventories.
Duro, J.A., 2013. Weighting vectors and international inequality changes in environmental
indicators: An analysis of CO2 per capita emissions and Kaya factors. Energy Econ. 39, 122-127.
Greening, L.A., Davis, W.B., Schipper, L., 1998. Decomposition of aggregate carbon intensity for
the manufacturing sector: comparison of declining trends from 10 OECD countries for the period
Huang, G.B., Zhu, Q.Y., Siew, C.K., 2006. Extreme learning machine: Theory and applications.
Huang, M., Wang, B., 2016. Factors influencing CO2emissions in China based on grey relational
Karmellos, M., Kopidou, D., Diakoulaki, D., 2016. A decomposition analysis of the driving
factors of CO2 (Carbon dioxide) emissions from the power sector in the European Union
Kennedy, J., Eberhart, R.C., 1995. Particle swarm optimization. In: Proceedings of IEEE
International Conference on Neutral Networks (ICNN’95), vol. IV. Perth, Australia, pp. 1942-
1948.
Lee, J.B., Byun, W., Lee, S.H., Do, M., 2014. Correlation between optimal carsharing locations
ACCEPTED MANUSCRIPT
and carbon dioxide emissions in urban areas. Int. J. Environ. Sci. Te. 11, 2319-2328.
Liimatainen, H., Kallionpää, E., Pöllänen, M., Stenholm, P., Tapio, P., McKinnon, A., 2014.
Decarbonizing road freight in the future - Detailed scenarios of the carbon emissions of Finnish
road freight transport in 2030 using a Delphi method approach. Technol. Forecast. Soc. Change
81, 177-191.
Mishina, Y., Taniguchi, Y., Muromachi, Y., 2011. Why carbon dioxide emissions from Japanese
passenger cars peaked in 2001: Complete decomposition analysis from 1990 to 2008. Transport.
Mladenović, I., Sokolov-Mladenović, S., Milovančević, M., Marković, D., Simeunović, N., 2016.
Management and estimation of thermal comfort, carbon dioxide emission and economic growth
Ouyang, X.L., Lin, B.Q., 2017. Carbon dioxide (CO2) emissions during urbanization: A
comparative study between China and Japan. J. Clean. Prod. 143, 356-368.
Rayle, L., Pai, M., 2010. Scenarios for future urbanization: Carbon dioxide emissions from
passenger travel in three Indian cities. Transport. Res. Rec. 2193, 124-131.
Schipper, L., Fulton, L., 2013. Dazzled by diesel? the impact on carbon dioxide emissions of the
Summerbell, D.L., Barlow, C.Y., Cullen, J.M., 2016. Potential reduction of carbon emissions by
performance improvement: A cement industry case study. J. Clean. Prod. 135, 1327-1339.
Sun, W., Liu, M.H., 2016. Prediction and analysis of the three major industries and residential
consumption CO2 emissions based on least squares support vector machine in China. J. Clean.
Sun, W., Xu, Y.F., 2016. Using a back propagation neural network based on improved particle
swarm optimization to study the influential factors of carbon dioxide emissions in Hebei Province,
Sun, W., Ye, M., Xu, Y., 2016. Study of carbon dioxide emissions prediction in Hebei province,
Sun, W.Q., Cai, J.J., Yu, H., Dai, L., 2012. Decomposition analysis of energy-related carbon
dioxide emissions in the iron and steel industry in China. Front. Env. Sci. Eng. 6, 265-270.
Tudor, C., 2016. Predicting the Evolution of CO2 Emissions in Bahrain with Automated
Vaninsky, A., 2014. Factorial decomposition of CO2 emissions: A generalized Divisia index
Wang, C.J., Wang, F., Zhang, X.L., Yang, Y., Su, Y.X., Ye, Y.Y., Zhang, H.O., 2017. Examining
the driving factors of energy related carbon emissions using the extended STIRPAT model based
Wang, M., Feng, C., 2017. Decomposition of energy-related CO2 emissions in China: An
empirical analysis based on provincial panel data of three sectors. Appl. Energy 190, 772-787.
Wei, J., Huang, K., Yang, S.S., Li, Y., Hu, T.T., Zhang, Y., 2016. Driving forces analysis of
Xu, B., Lin, B., 2015a. Carbon dioxide emissions reduction in China's transport sector: A dynamic
Xu, B., Lin, B., 2015b. Factors affecting carbon dioxide (CO2) emissions in China's transport
ACCEPTED MANUSCRIPT
sector: a dynamic nonparametric additive regression model. J. Clean. Prod. 101, 311-322.
Xu, B., Lin, B., 2016. Reducing carbon dioxide emissions in China's manufacturing industry: a
Zhao, X., Liu, G.W., Qi, J.M., 2013. China carbon emission forecast based on the Discrete
Difference Equation Prediction Model. J. Arid Land Resour. Environ. 27, 63-69.