38 - Factor Analysis and Forecasting of CO2 Emissions in Hebei, Using

Accepted Manuscript
Factor analysis and forecasting of CO2 emissions in Hebei, using extreme

learning machine based on particle swarm optimization
Wei Sun, Caifei Wang, Chongchong Zhang
PII: S0959-6526(17)31185-X
DOI: 10.1016/j.jclepro.2017.06.016
Reference: JCLP 9766
To appear in: Journal of Cleaner Production
Received Date: 21 January 2017
Revised Date: 02 June 2017
Accepted Date: 03 June 2017
Please cite this article as: Wei Sun, Caifei Wang, Chongchong Zhang, Factor analysis and
forecasting of CO2 emissions in Hebei, using extreme learning machine based on particle swarm
optimization, Journal of Cleaner Production (2017), doi: 10.1016/j.jclepro.2017.06.016
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to
our customers we are providing this early version of the manuscript. The manuscript will undergo
copyediting, typesetting, and review of the resulting proof before it is published in its final form.
Please note that during the production process errors may be discovered which could affect the
content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
Factor analysis and forecasting of CO2 emissions in Hebei, using
extreme learning machine based on particle swarm optimization
Wei Suna, Caifei Wangb,*, Chongchong Zhangc

Department of Business Administration, North China Electric Power University,
Baoding 071000, China
* Corresponding author. Tel.: +8615028250807.
E-mail address: 840400817@qq.com (C. Wang).
Abstract
In the prevailing low-carbon economy, China is under enormous pressure to
control CO2 emissions, therefore, of great significance is the study to analyze what
influential factors mainly contribute to emissions, so as to forecast emissions
accurately and harness the growth from the source. In this paper, basing on 22
influencing factors identified by bivariate correlation analysis, factor analysis is then
adopted to extract the latent factors which essentially affect emissions and 8 special
factors transformed by scoring coefficients are acquired. Extreme learning machine
(ELM) whose input weights and bias threshold were optimized by particle swarm
optimization (PSO), hereafter referred as PSO-ELM, is established to predict CO2
emissions and testify the availability of the factor analysis. Case studies reveal that the
factor analysis which generates 8 factors as input can highly improve prediction
accuracy. And the simulation results demonstrate that the built model PSO-ELM
outperforms the compared ELM and back propagation neural network in forecasting
CO2 emissions. Eventually, the analysis made in this study can provide valuable
policy implications for Hebei’s CO2 emissions reduction and strategic low-carbon
development.
ACCEPTED MANUSCRIPT
Keywords:
CO2 emissions, extreme learning machine, particle swarm optimization algorithm,
factor analysis
1 Introduction
Drawing high attention from the international society, global climate change has
become one of the most tough challenges faced by most countries. As the
predominant greenhouse gas, carbon dioxide contributes approximately 63% to the
greenhouse effect, becoming primary cause for the climate change. At present, China,
proved to be the second largest energy consumer, energy importer and the biggest
emitter of CO2 in the world, has been in the spotlight. During the twelfth “Five-Year”
plan, China is dedicated to reduce carbon intensity by 40-50% by 2020. Advancing
with times, the target that carbon emissions of China will reach its peak in 2030 is put
forward in 2016 through the thirteenth “Five-Year” plan.
In view of the claims mentioned above, researches concerning scientific forecast
methods based on deep analysis of influential factors are entailed for predicting CO2
emissions and then matching effectual measures must be taken to reduce emissions.
Throughout recent studies, various methods employed to analyze CO2 emissions
can be classified into 2 categories, one is the analysis on influence factors of CO2
emissions, and the other is predicting CO2 emissions on the basis of influencing
factors.
In the case of factors associated with CO2 emissions, there are three major
methods: (1) structural decomposition analysis such as input-output method (Wei and
ACCEPTED MANUSCRIPT
Huang, 2016); (2) index decomposition analysis mainly encompassing Laspeyres
decomposition (Schipper, 2013) and Divisia decomposition (Vaninsky, 2014); (3)
statistical analysis, containing grey relational analysis (Huang, 2016), cluster analysis
(Liimatainen, 2014), principal component analysis (Antanasijevic, 2014) and so on.
From the perspective of the content and extending factor analytic methods, a wealth
of researches has been conducted in differing ways. Regarding concrete factors,
emphasis is placed on industrial department which can be subdivided into cement
industry (Summerbell, 2016), iron and steel industry (Sun, 2012) and power industry
(Karmellos, 2016), transportation department (Mishina, 2011), urban areas (Lee,
2014) and others. For specific factor analytic methods, logarithmic mean divisia index
(Wang, 2017), adaptive weighted divisia (Greening and Davis, 1998) or else
combined with kaya model (Duro, 2013) and so on are prevailing in factors analysis
of CO2 emissions. In general, statistical analysis is capable of processing multivariate
situation so that influential factors can be considered more comprehensive.
In regard to the second part, there is abundant literature on estimation of CO2
emissions, most of which vary in constructed models. Tudor (2016) attempted to
forecast the evolution of carbon dioxide emissions in Bahrain over the 2012-2021
decade by employing seven Automated Forecasting Methods, including the
exponential smoothing state space model (ETS), the Holt-Winters Model, the
BATS/TBATS model, ARIMA, the structural time series model (STS), the naive
model, and the neural network time series forecasting method (NNAR), which arrived
at the conclusion that Bahrain cannot meet its assumed target. Another study by Wang
ACCEPTED MANUSCRIPT
(2017) clearly demonstrated the influence of various factors related to energy on CO2
emissions that are different in three development stages by using an extended
STIRPAT model based on the classical IPAT identify. Otherwise, vector auto-
regression model (Xu and Lin, 2015a), DDEPM method (Zhao and Liu, 2013) and
other original methods were used to study CO2 emissions by scholars at home and
abroad.
However, compared with the model referred above, artificial intelligence
technology are overwhelmingly exploited in forecasting CO2 emissions due to its
strong nonlinear fitting capability, robustness, self-learning ability and so forth in
recent years. Back propagation neural network (BPNN), singly or in conjunction with
miscellaneous algorithm such as genetic algorithm (GA) proposed by Sun and Ye
(2016) are wielded extensively within the realm of predicting CO2 emissions.
Furthermore, support vector machine (SVM) (Mladenovic, Sokolov et al., 2016) and
least squares support vector machine (LSSVM) (Sun and Liu, 2016), along with
plenty of mixed models offer viewpoints into forecast. In consideration of discrepant
objects and circumstance, the advent of scenario analysis (Rayle and Pai, 2010)
facilitates the construes on CO2 emissions under feasible scenario settings.
Nonetheless, flaws such as easily plunging into local optimum，over-fitting and poor
generalization capability still remain.
Summing up researches of predecessors, this thesis considers more influential
elements in Hebei to study CO2 emissions and utilizes factor analysis to seek common
factors, which furnishes more entry points for emission mitigation measures. In
ACCEPTED MANUSCRIPT
addition, the hybrid model PSO-ELM is firstly introduced for CO2 emissions
forecasting with high precision in this article.
The innovation and contribution of the present study mainly lies in the following
aspects. First, this paper comprehensively considers 22 influential factors and make
bivariate correlation analysis to identify there existing real relevance between selected
indicators and CO2 emissions. In this foundation, total factors are divided into 6
categories based on relativity and economic connotation. And factor analysis which
extends the common statistical approaches is introduced to decrease variables’
dimensions without distortion and dig out potential factors hiding in the 6 classes.
Then, extreme learning machine based on particle swarm optimization, tapping PSO
into optimizing the weight of input layer and the bias of hidden layer in ELM, is
applied in Hebei for CO2 emissions prediction from 1995 to 2014. The proposed PSO-
ELM not only maximize the merit of PSO’s global search capability and ELM’s fast
learning speed, but also overcome the inherent instability of ELM. Through case
studies, taking the factors uncovered by factor analysis as input variables for PSO-
ELM, there is evidence that the hybrid model makes a favorable performance in
contrast to other ELM and BPNN methods.
2 Methodology
2.1 Particle swarm optimization algorithm
Proposed by Kennedy and Eberhart (1995) in 1995, PSO calls the bird “particle”.
Each particle represents a potential optimal solution of the problem, initialized at the
commencement of the iterations by position, velocity and fitness value, among them,
ACCEPTED MANUSCRIPT
the velocity determines the direction and the distance of the particle.
To explore the best position, particle’s position calculated by fitness function
was updated by tracking two extremes, one is called Pbest which is the optimal position
found by the particle itself, and the other called Gbest is the global best position of the
entire swarm currently.
Supposed in the D-dimensional search space, the swarm consists of n particles,
and the ith of which can be expressed by Xi = (xi1, xi2, …, xiD)T. Subsequently, the
velocity and the Pbest of the ith particle are respectively stated as Vi = (vi1, vi2, …, viD)T
and Pi = (pi1, pi2, …, piD)T. The finest position of all is similarly denoted as Pg = (Pg1,
Pg2, ..., PgD). Afterwards, the core of PSO is as following:
Vidk 1  wVidk  c1rand1 ( Pidk  X idk )  c2 rand 2 ( Pgdk  X idk ), (1)
X idk 1  X idk  Vk 1id k  1,2,..., n; d  1,2,..., D , (2)
Where wi conceived as inertia weight can modify the search coverage. c1 and c2
are acceleration factors equal to 1.49445. Then, rand1 and rand2 are generated
uniformly in the interval [0, 1]. To avert blind searching, particles’ position and
velocity are limited in [-Xmax , Xmax] and [-Vmax , Vmax] separately.
2.2 Extreme learning machine
Extreme learning machine introduced by Huang (2006) is an innovative
algorithm based on single hidden layer feed-forward neural network (SFLN). SLFN
adopts gradient descent algorithm to adjust the weights and parameters in training
process. Nevertheless, multiple parameters that need to update during each iteration
and chain of driring principle lead to SLFN unsuitable for the problem of large scale
ACCEPTED MANUSCRIPT
neural network.
In order to address those, ELM, as shown in Fig. 1, was put forward and adopted
the Moore-penrose inverse to calculate output matrices of the hidden layer, turning
training program into striving for the solution of the least square problem. Meanwhile,
with the random setting of input weights and bias of hidden layer, ELM immensely
accelerates the learning speed and strengthens the generalization ability. As a
consequence, ELM has been widely applied in regression, fitting, classification,
pattern recognition and other fields.
Hidden layer
Input layer Output layer
x beta y
w
...
...
...
Fig. 1 Topological structure of ELM.
Although ELM has a desirable performance in most cases, there are still some
shortcomings reducing accuracy. In the learning process, the possible non-optimal or
unnecessary weight values and thresholds will degrade the performance of ELM and
result in unstable results. In addition, ELM requires for plenty of hidden layer nodes
to achieve desired results in some practical applications, but excessive nodes shall add
complications and make it prone to over-fitting.
2.3 An extreme learning machine based on particle swarm

ACCEPTED MANUSCRIPT
optimization
Aiming at the above problems, this thesis proposes an extreme learning
machine based on PSO (PSO-ELM) which taps PSO into optimizing the weights of
input layer and the bias of hidden layer in ELM, and thereby obtains an optimal
network whose flowchart is in Fig. 2. In part 1, the influencing factors of CO2
emissions are identified by bivariate correlation analysis and on the basis of this,
factor analysis is used for extracting input variables. Part 2 is the particle swarm
optimization algorithm and part 3 is the forecasting of extreme learning machine.
Part 1
22 Indicators
Bivariate correlation
Input factors Factor analysis Start
analysis
CO2 emissions
Initializing particles’Xi and The topological

Part 2 structure of ELM Part 3
Vi and setting parameters by
PSO
Initializing ELM connection

Error of Training by ELM weight w and bias threshold b
served as fitness function
Optimum w and b
Searching for pid and Pgd
Updating vid and xid Optimum matrix of the output

weights β
Renewal of the fitness value

Forecasting CO2
emission using ELM
Updating pid and pgd
N Output
Y
Terminal condition
Fig. 2 Flowchart of the PSO-ELM model.

ACCEPTED MANUSCRIPT
3 Data analysis in SPSS
3.1 Data conversion and factor pre-selection
The present study analyzes the carbon dioxide emissions and it’s factors of Hebei
province in China from 1995 to 2014, where the data stems from “China Statistical
Yearbook” and “Hebei Economic Yearbook”.
However, the statistical information of carbon dioxide emissions isn’t directly
disclosed, which must be converted through pertinent coefficients. In accordance with
the coefficients promulgated in the 2006 IPCC Guidelines for National Greenhouse
Gas Inventories (IPCC, 2006), CO2 emissions is calculated through different types of
energy consumption that the unit has been transformed as standard coal. From the
results presented in figure 3, CO2 emissions in Hebei accounted for about 6.7 percent
of the nation, occupying a considerable scale. And conversion coefficients of the three
energies which foremost induce CO2 emissions, are listed in Tab. 1.
Fig. 3 CO2 emissions of Hebei province and China.
Table 1
Conversion coefficients of CO2 emissions for various energies
Energy Coal Crude oil Natural gas Primary Power
Coefficient(C/t/t) 0.747 0.585 0.448 1.814
With regard to the factors affecting CO2 emissions in Hebei, taking

ACCEPTED MANUSCRIPT
comprehensive consideration of economy, energy and other relating aspects, this
article pre-selects 22 indicators: GDP of primary industry, secondary industry and
tertiary industry, total export, level of urbanization, coal consumption, production of
cement, steel and vehicle, power generation, R&D, general public budget expenditure,
total investment in fixed assets, investment in the treatment of industrial pollution,
owned automobile quantity, coverage of railway and highway, household
consumption, final consumption expenditure, urban green areas, ownership structure
which is the quotient of employment in state-owned units and overall employment,
and the fuel and power purchase price index.
3.2 Bivariate correlation analysis
Bivariate correlation analysis and the two-tailed significance test in SPSS were
preliminary used to verify the relevance between carbon dioxide emissions and pre-
selected factors, and the Pearson coefficient was reckoned to reflect the relatedness.
As illustrated in Table 2, entire factors are tightly linked with CO2 emissions, passing
through the two-tailed significance test at the confidence level of 99% universally.
Table 2
Bivariate correlation of CO2 and various factors
Indicator Pearson correlation Indicator Pearson correlation
Primary industry GDP 0.956** Coverage of railway 0.983**
Secondary industry GDP 0.961** Coverage of highway 0.961**
Tertiary industry GDP 0.946** Urban green areas 0.991**
R&D 0.88** Household consumption 0.948**
Level of urbanization 0.990** Ownership structure -0.985**
Total export 0.953** Owned automobile quantity 0.897**
Coal consumption 0.998** Final consumption expenditure 0.955**
Production of cement 0.97** Total investment in fixed assets 0.898**

ACCEPTED MANUSCRIPT
Production of steel 0.964** General public budget expenditure 0.899**
Production of vehicle 0.930** The fuel and power purchase price index 0.966**
Power generation 0.98** Investment in the treatment of industrial pollution 0.688**
Note: ** denotes a significant correlation at the significance level of 1%.
Despite the feasible selections of the influencing elements, abounding variables
and a certain degree of correlation between variables may put the subsequent research
to some inconvenience and deviation. Therefore, this paper employs factor analysis in
SPSS to process variables and further identify the latent common factors which
virtually impacts the experimental subject.
3.3 Factor analysis
First submitted by British psychologist C.E. Spearman, factor analysis refers to a
statistical technique that can explore common factors from the variable group.
Laboratory data collected tends to be not independent, in view of this, highly
interrelated variables are classified into one group, thus, rendering the same group of
variables with fairly correlation and different groups of variables less associated.
Those variables within one group can be substituted by a mutual thing that is
distinguished as a common factor. Not only does factor analysis reduce indicators’
dimensions, but also the common factors it elicited to portray and replace primitive
variables can commendably mirror and explain the complicated relationship between
variables, keeping data messages with essentially no less and facilitating the later
research.
After correlation analysis for the pre-selection factors, indicators are divided into
6 categories based on relativity and economic connotation: comprehensive strength,

ACCEPTED MANUSCRIPT
energy intensive industry, investment, consumption, city and transportation, aiming to
refrain variables’ dimensions from being considerably plummeted, merely because of
the relevance on value. Taking the type of energy intensive industry as an example,
factor analysis are shown in Table 3.
In Table 3, there is one common factor whose contribution rate up to 96.965%
hiding in the four indicators. First, statistic value of Kaiser Meyer Olkin (KMO) that
served as the criteria to judge whether factor analysis can be done is 0.785, more than
0.7, really suitable for factor analysis. Whilst the probability of barlett test of
sphericity is 0.000, rejecting the null hypothesis and denying the correlation matrix is
a unit matrix. In conclusion, the common factor indicated as F2 in this article, is
nominated as energy consumption factor.
Likewise, the results of KMO and others can be attained from the following
Table 4 to 8. Particularly, in Table 4, two common factors are extracted through factor
rotation to vividly construe the indicators. If nothing but F11 was determined, the
group can only be decoded by 53.922%, whereas coupled with F12, the accumulative
contribution rate of the two factors can be 96.965% . As we can see, GDP of primary
industry, secondary industry and tertiary industry have greater load on F11 than export,
as opposed to export loading more on F12, thus, F11 and F12 is designated as output
value factor and export factor respectively. Similarly, F31 and F32 are parts of the
investment factor, named as general investment factor and pollution investment
factor. F4, F5, F6 are separately defined as consumption factor, city factor and
transportation factor as well.

ACCEPTED MANUSCRIPT
Table 3
Factor analysis of energy intensive industry
Indicator Loading of F2 Scoring coefficient of F2 Index Value
Production of cement 0.965 0.249 KMO 0.785
Production of steel 0.994 0.256 Bartlett's test 181.367
Production of vehicle 0.984 0.254 Prob. 0.000
Power generation 0.997 0.257 Contribution rate 96.965%
Table 4
Factor analysis of comprehensive strength
Indicator Loading of F11 F12 Scoring coefficient of F11 F12 Index Value
GDP of primary industry 0.765 0.643 0.643 -1.242 KMO 0.838
GDP of secondary industry 0.749 0.662 0.662 -0.472 Bartlett's test 250.755
GDP of tertiary industry 0.776 0.629 0.629 -1.77 Prob. 0.000
Total export 0.639 0.769 0.769 4.192 Contribution rate 99.897%
Table 5
Factor analysis of investment
Indicator Loading of F31 F32 Scoring coefficient of F31 F32 Index Value
R&D 0.765 0.643 1.488 -1.242 KMO 0.843
General public budget

0.749 0.662 0.778 -0.472 Bartlett's test 142.724
expenditure
Total investment in fixed

0.776 0.629 1.975 -1.77 Prob. 0.000
assets
Investment in the treatment

0.639 0.769 -3.528 4.192 Contribution rate 99.255%
of industrial pollution
Table 6
Factor analysis of consumption
Household consumption 0.986 0.256 KMO 0.704
Final consumption expenditure 0.989 0.256 Bartlett's test 215.552
Coal consumption 0.974 0.254 Prob. 0.000
The fuel and power purchase price index 0.98 0.253 Contribution rate 96.414%
ACCEPTED MANUSCRIPT
Table 7
Factor analysis of city
Level of urbanization 0.998 0.335 KMO 0.781
Urban green areas 0.995 0.334 Bartlett's test 140.51
Ownership structure -0.997 -0.335 Prob. 0.000
Contribution rate 99.298%
Table 8
Factor analysis of transportation
Owned automobile quantity 0.952 0.339 KMO 0.785
Coverage of railway 0.977 0.348 Bartlett's test 181.367
Coverage of highway 0.973 0.346 Prob. 0.000
Contribution rate 93.637%
To sum up, six classes of factors, actually 8 factors, are obtained as input for the
forecasting model, containing output value factor F11 and export factor F12, energy
consumption factor F2, general investment factor F31 and pollution investment factor
F32, consumption factor F4, city factor F5 and transportation factor F6. Transformed by
scoring coefficients in Table 3-8, specific factor variables are constructed. As a
consequence, the following study will take the 8 factors as input variables to predict
CO2 emissions in Hebei.
3.4 Application of the PSO-ELM
To examine the rational alternative of factors, the PSO-ELM model was applied
in Hebei’s carbon dioxide emission forecasting. Meanwhile, root mean square error
(RMSE) and mean absolute percentage error (MAPE) are in the utilization of
checking PSO-ELM’s performance. The equations of the two error criterion are as
below:
ACCEPTED MANUSCRIPT
2
1 n yˆ t  yt
RMSE  
n t 1 yt
(3)
1 n yˆ t  yt
MAPE  
n t 1 yt
*100% (4)
Where n stands for the quantity of the test sample, yt is the tth real CO2
emissions, while ŷt is the matching predicted output.
It was PSO that optimized the matrix of output weight in ELM. In PSO, the max-
generation = 100, the size of population = 50, whilst in ELM, nodes of the hidden
layer is 20, and the rest parameters are setting as Table 9.
Table 9
Parameters’ setting of the PSO-ELM model
Parameters Value
The maximum number of velocity 0.5
The minimum number of velocity -0.5
The maximum number of population 5
The minimum number of population -5
4 Results and discussion
The integral procedure of PLO-ELM was executed by MATLAB 2016a. In
addition to demonstrating the reasonableness of factors’ selection and processing, the
present study also makes comparisons with different models, including back
propagation neural network (BPNN) and ELM, to testify the effectiveness of PSO-
ELM.
As illustrated in Fig.3, it can apparently draw the conclusion that the fitting curve
from 2010 to 2014 of the suggested PSO-ELM model is superior to others in CO2
emissions forecasting of Hebei. Relatively, the tracks of ELM algorithm and BPNN
model have a worse fitting effect with some predicted points deviating from the
ACCEPTED MANUSCRIPT
authentic values.
Fig.4 Fitting curves of the four predictions and desired output
Simultaneously, from the perspective of error criterion, Table 10 reveals that
both RMSE and MAPE of PSO-ELM are less than other two models, exhibiting high
prediction exactitude. As for BPNN, the value of MAPE is higher than ELM and
PSO-ELM about 1.15% and 2.02%. Moreover, results forecast by ELM have already
shown a good precision for its MAPE value is merely 1.18%. Nonetheless, the
improved ELM --- PSO-ELM whose MAPE and RMSE are only 0.31% and 91.63,
achieves a pretty good consequence. According to the results, not only are the factors
selected by factor analysis available and efficient, but also the PSO-ELM model is
proved to be applicable.
Table 10
Error analysis of the compared models
PSO-ELM ELM BPNN
RMSE(million tons) 91.63 317.22 660.15
MAPE(%) 0.31% 1.18% 2.33%
Furthermore, attending to the input factors which converted by 22 primitive
indicators, the accurate predictions further corroborated that total factors or the basic
indicators do have practical impacts on CO2 emissions in Hebei, which provides

ACCEPTED MANUSCRIPT
orientation and conception for policy makers regarding CO2 emissions reduction.
Considering F2, highly energy-consuming industries including steel, cement,
vehicle and others are the leading contributors to manufacturing industry’s CO2
emissions in China (Xu and Lin, 2016). Corresponding measures like reorganization
of migratory large industrial enterprises, carbon emissions trading and so on must be
intensively implemented. As we know, power supply in F2, mainly depends on
thermal power units in Hebei, so that replacing small units with large ones, oxy-fuel
enriched combustion to facilitate carbon capture and sequestration and clean energies
can strive to develop with the intention of reducing emissions.
Since referring to power, it is well known that coal reflected by the consumption
factor F4 is the primary fuel of the thermal power. Similarly emphasized in (Sun and
Xu, 2016), coal consumption offers insights into measures such as promotion of new
catalyst for coal combustion, usage of high quality coal, excise tax on coal and other
effective techniques.
As regards city factor F5, environmental issue is highlighted as an area for further
urbanization (Ouyang and Lin, 2017). Thus, beneficial is a head of pollution
managing and the augment of urban green area such as vertical planting and vacant
land afforests that can absorb already emitted CO2.
According to (Xu and Lin, 2015b), private cars leads to increased CO2 emissions
in the early stages of rapid growth, but with the increased use of electric and hybrid
cars and other advancement, it’s emissions intensity will gradually decline. In light of
this, discerning from the transportation factor F6, odd and even license plate rule, free
ACCEPTED MANUSCRIPT
public transits in special period, substitution of electric power for petrol and
congestion charge in big cities are fairly encouraged. As discussed previously, it has
been evident from this research, the government could carry out proper actions
directing at the 8 factors to harness CO2 emissions in Hebei.
5 Conclusion
In this article, 22 indexes are selected as influential factors of CO2 emissions in
Hebei, and based on the classification while considering correlation and economic
connotation of the total variables, 8 factors are eventually identified through factor
analysis. Afterwards, the 8 factors are input into PSO-ELM to forecast CO2
emissions. Through comparing the recommended model with other models
embodying ELM and BPNN, it is regarded as evident to conclude that: (1) Factor
analysis of the total selected indicators influencing CO2 emissions is available and
conducive to enhance predicting accuracy of the results; (2) Compared with other
methods, the PSO-ELM model has the lowest RMSE and MAPE values, which is
effective and promising in carbon dioxide emissions forecasting.
As we know, the onus is on China reducing CO2 emissions to achieve a low
carbon economy. Therefore, researches on CO2 emissions and its influential factors
are significant to provide data in support of governments’ negotiation in United
Nations Framework Convention on Climate Change (UNFCCC). And also it is a
reference for relevant agencies to take mitigation measures and improve carbon
trading market internally and externally. Compared with most previous studies, factor
analysis applied in this paper takes more influencing factors of CO2 emissions into
ACCEPTED MANUSCRIPT
consideration, which offers policy makers more breakthrough points.
References
Antanasijevic, D.Z., Ristic, M.D., Peric-Grujic, A.A., Pocajt, V.V., 2014. Forecasting GHG
emissions using an optimized artificial neural network model based on correlation and principal
component analysis. Int. J. Greenh. Gas Con. 20, 244-253.
Change, I.P.O.C, 2006. 2006 IPCC Guidelines for National Greenhouse Gas Inventories.
Duro, J.A., 2013. Weighting vectors and international inequality changes in environmental
indicators: An analysis of CO2 per capita emissions and Kaya factors. Energy Econ. 39, 122-127.
Greening, L.A., Davis, W.B., Schipper, L., 1998. Decomposition of aggregate carbon intensity for
the manufacturing sector: comparison of declining trends from 10 OECD countries for the period
1971-1991. Energy Econ. 20, 43-65.
Huang, G.B., Zhu, Q.Y., Siew, C.K., 2006. Extreme learning machine: Theory and applications.
Neurocomputing 70, 489-501.
Huang, M., Wang, B., 2016. Factors influencing CO2emissions in China based on grey relational
analysis. Energy Source. Part A. 38, 555-561.
Karmellos, M., Kopidou, D., Diakoulaki, D., 2016. A decomposition analysis of the driving
factors of CO2 (Carbon dioxide) emissions from the power sector in the European Union
countries. Energy 94, 680-692.
Kennedy, J., Eberhart, R.C., 1995. Particle swarm optimization. In: Proceedings of IEEE
International Conference on Neutral Networks (ICNN’95), vol. IV. Perth, Australia, pp. 1942-
1948.
Lee, J.B., Byun, W., Lee, S.H., Do, M., 2014. Correlation between optimal carsharing locations
ACCEPTED MANUSCRIPT
and carbon dioxide emissions in urban areas. Int. J. Environ. Sci. Te. 11, 2319-2328.
Liimatainen, H., Kallionpää, E., Pöllänen, M., Stenholm, P., Tapio, P., McKinnon, A., 2014.
Decarbonizing road freight in the future - Detailed scenarios of the carbon emissions of Finnish
road freight transport in 2030 using a Delphi method approach. Technol. Forecast. Soc. Change
81, 177-191.
Mishina, Y., Taniguchi, Y., Muromachi, Y., 2011. Why carbon dioxide emissions from Japanese
passenger cars peaked in 2001: Complete decomposition analysis from 1990 to 2008. Transport.
Res. Rec. 2252, 152-160.
Mladenović, I., Sokolov-Mladenović, S., Milovančević, M., Marković, D., Simeunović, N., 2016.
Management and estimation of thermal comfort, carbon dioxide emission and economic growth
by support vector machine. Renew. Sust. Energy Rev. 64, 466-476.
Ouyang, X.L., Lin, B.Q., 2017. Carbon dioxide (CO2) emissions during urbanization: A
comparative study between China and Japan. J. Clean. Prod. 143, 356-368.
Rayle, L., Pai, M., 2010. Scenarios for future urbanization: Carbon dioxide emissions from
passenger travel in three Indian cities. Transport. Res. Rec. 2193, 124-131.
Schipper, L., Fulton, L., 2013. Dazzled by diesel? the impact on carbon dioxide emissions of the
shift to diesels in Europe through 2009. Energy Policy 54, 3-10.
Summerbell, D.L., Barlow, C.Y., Cullen, J.M., 2016. Potential reduction of carbon emissions by
performance improvement: A cement industry case study. J. Clean. Prod. 135, 1327-1339.
Sun, W., Liu, M.H., 2016. Prediction and analysis of the three major industries and residential
consumption CO2 emissions based on least squares support vector machine in China. J. Clean.
Prod. 122, 144-153.

ACCEPTED MANUSCRIPT
Sun, W., Xu, Y.F., 2016. Using a back propagation neural network based on improved particle
swarm optimization to study the influential factors of carbon dioxide emissions in Hebei Province,
China. J. Clean. Prod. 112, 1282-1291.
Sun, W., Ye, M., Xu, Y., 2016. Study of carbon dioxide emissions prediction in Hebei province,
China using a BPNN based on GA. J. Renew. Sustain. Ener. 8, 043110-494.
Sun, W.Q., Cai, J.J., Yu, H., Dai, L., 2012. Decomposition analysis of energy-related carbon
dioxide emissions in the iron and steel industry in China. Front. Env. Sci. Eng. 6, 265-270.
Tudor, C., 2016. Predicting the Evolution of CO2 Emissions in Bahrain with Automated
Forecasting Methods. Sustainability-Basel 8, 923.
Vaninsky, A., 2014. Factorial decomposition of CO2 emissions: A generalized Divisia index
approach. Energy Econ. 45, 389-400.
Wang, C.J., Wang, F., Zhang, X.L., Yang, Y., Su, Y.X., Ye, Y.Y., Zhang, H.O., 2017. Examining
the driving factors of energy related carbon emissions using the extended STIRPAT model based
on IPAT identity in Xinjiang. Renew. Sust. Energy Rev. 67, 51-61.
Wang, M., Feng, C., 2017. Decomposition of energy-related CO2 emissions in China: An
empirical analysis based on provincial panel data of three sectors. Appl. Energy 190, 772-787.
Wei, J., Huang, K., Yang, S.S., Li, Y., Hu, T.T., Zhang, Y., 2016. Driving forces analysis of
energy-related carbon dioxide (CO2) emissions in Beijing: An input-output structural
decomposition analysis. J. Clean. Prod. 1-11.
Xu, B., Lin, B., 2015a. Carbon dioxide emissions reduction in China's transport sector: A dynamic
VAR (vector autoregression) approach. Energy 83, 486-495.
Xu, B., Lin, B., 2015b. Factors affecting carbon dioxide (CO2) emissions in China's transport
ACCEPTED MANUSCRIPT
sector: a dynamic nonparametric additive regression model. J. Clean. Prod. 101, 311-322.
Xu, B., Lin, B., 2016. Reducing carbon dioxide emissions in China's manufacturing industry: a
dynamic vector autoregression approach. J. Clean. Prod. 131, 594-606.
Zhao, X., Liu, G.W., Qi, J.M., 2013. China carbon emission forecast based on the Discrete
Difference Equation Prediction Model. J. Arid Land Resour. Environ. 27, 63-69.

38 - Factor Analysis and Forecasting of CO2 Emissions in Hebei, Using

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

38 - Factor Analysis and Forecasting of CO2 Emissions in Hebei, Using

Uploaded by

Copyright:

Available Formats

Accepted Manuscript

Factor analysis and forecasting of CO2 emissions in Hebei, using extreme

Wei Sun, Caifei Wang, Chongchong Zhang

Reference: JCLP 9766

To appear in: Journal of Cleaner Production

Received Date: 21 January 2017

Revised Date: 02 June 2017

Accepted Date: 03 June 2017

Factor analysis and forecasting of CO2 emissions in Hebei, using

extreme learning machine based on particle swarm optimization

Wei Suna, Caifei Wangb,*, Chongchong Zhangc

In the prevailing low-carbon economy, China is under enormous pressure to

influential factors mainly contribute to emissions, so as to forecast emissions

influencing factors identified by bivariate correlation analysis, factor analysis is then

factors transformed by scoring coefficients are acquired. Extreme learning machine

optimization (PSO), hereafter referred as PSO-ELM, is established to predict CO2

CO2 emissions, extreme learning machine, particle swarm optimization algorithm,

predominant greenhouse gas, carbon dioxide contributes approximately 63% to the

plan, China is dedicated to reduce carbon intensity by 40-50% by 2020. Advancing

forward in 2016 through the thirteenth “Five-Year” plan.

In view of the claims mentioned above, researches concerning scientific forecast

Throughout recent studies, various methods employed to analyze CO2 emissions

Huang, 2016); (2) index decomposition analysis mainly encompassing Laspeyres

decomposition (Schipper, 2013) and Divisia decomposition (Vaninsky, 2014); (3)

(Liimatainen, 2014), principal component analysis (Antanasijevic, 2014) and so on.

of researches has been conducted in differing ways. Regarding concrete factors,

emphasis is placed on industrial department which can be subdivided into cement

(Karmellos, 2016), transportation department (Mishina, 2011), urban areas (Lee,

of CO2 emissions. In general, statistical analysis is capable of processing multivariate

situation so that influential factors can be considered more comprehensive.

In regard to the second part, there is abundant literature on estimation of CO2

emissions, most of which vary in constructed models. Tudor (2016) attempted to

decade by employing seven Automated Forecasting Methods, including the

emissions that are different in three development stages by using an extended

However, compared with the model referred above, artificial intelligence

technology are overwhelmingly exploited in forecasting CO2 emissions due to its

strong nonlinear fitting capability, robustness, self-learning ability and so forth in

miscellaneous algorithm such as genetic algorithm (GA) proposed by Sun and Ye

plenty of mixed models offer viewpoints into forecast. In consideration of discrepant

facilitates the construes on CO2 emissions under feasible scenario settings.

generalization capability still remain.

Summing up researches of predecessors, this thesis considers more influential

forecasting with high precision in this article.

extends the common statistical approaches is introduced to decrease variables’

contrast to other ELM and BPNN methods.

2.1 Particle swarm optimization algorithm

To explore the best position, particle’s position calculated by fitness function

entire swarm currently.

Supposed in the D-dimensional search space, the swarm consists of n particles,

Pg2, ..., PgD). Afterwards, the core of PSO is as following:

Vidk 1  wVidk  c1rand1 ( Pidk  X idk )  c2 rand 2 ( Pgdk  X idk ), (1)

X idk 1  X idk  Vk 1id k  1,2,..., n; d  1,2,..., D , (2)

velocity are limited in [-Xmax , Xmax] and [-Vmax , Vmax] separately.

2.2 Extreme learning machine

Extreme learning machine introduced by Huang (2006) is an innovative

accelerates the learning speed and strengthens the generalization ability. As a

consequence, ELM has been widely applied in regression, fitting, classification,

pattern recognition and other fields.

Fig. 1 Topological structure of ELM.

shortcomings reducing accuracy. In the learning process, the possible non-optimal or

complications and make it prone to over-fitting.

2.3 An extreme learning machine based on particle swarm

Primary industry GDP 0.956 Coverage of railway 0.983

Secondary industry GDP 0.961 Coverage of highway 0.961

Tertiary industry GDP 0.946 Urban green areas 0.991

R&D 0.88 Household consumption 0.948

Level of urbanization 0.990 Ownership structure -0.985

Total export 0.953 Owned automobile quantity 0.897

Coal consumption 0.998 Final consumption expenditure 0.955

Production of cement 0.97 Total investment in fixed assets 0.898

Production of steel 0.964 General public budget expenditure 0.899

Power generation 0.98 Investment in the treatment of industrial pollution 0.688