Professional Documents
Culture Documents
Energy
journal homepage: www.elsevier.com/locate/energy
a r t i c l e i n f o a b s t r a c t
Article history: The rapid development of big data and smart technology in the natural gas industry requires timely and
Received 10 September 2018 accurate forecasting of natural gas consumption on different time horizons. In this work, we propose a
Received in revised form robust hybrid hours-ahead gas consumption method by integrating Wavelet Transform, RNN-structured
20 April 2019
deep learning and Genetic Algorithm. The Wavelet Transform is used to reduce the complexity of the
Accepted 23 April 2019
Available online 29 April 2019
forecasting tasks by decomposing the original series of gas loads into several sub-components. The RNN-
structured deep learning method is built up via combining a multi-layer Bi-LSTM model and a LSTM
model. The multi-layer Bi-LSTM model can comprehensively capture the features in the sub-components
Keywords:
Natural gas demand forecasting
and the LSTM model is used to forecast the future gas consumption based on these abstracted features.
Deep learning To enhance the performance of the RNN-structured deep learning model, Genetic Algorithm is employed
Recurrent neural network to optimize the structure of each layer in the model. Besides, the dropout technology is applied in this
Genetic algorithm work to overcome the potential problem of overfitting. In this case study, the effectiveness of the
Long short time memory model developed method is verified from multiple perspective, including graphical examination, mathematical
errors analysis and model comparison, on different data sets.
© 2019 Elsevier Ltd. All rights reserved.
1. Introduction this work, our research focuses on hourly gas forecasting at the
customer level. In other words, the forecasting method aims to be
A rapid increase of demand of natural gas, as an important used on the customer level, like power plants, factories and dis-
source of clean energy, is occurring in many countries. The tribution companies.
important role of natural gas in the world energy portfolio and the Efforts have been made to forecast well the gas demand. The
increasing awareness of environment issues have accelerated the literature surveys [2,3] indicate that the exploration of natural gas
development of natural gas industry. The usage of natural gas has forecasting can be divided according to different rules, such as
penetrated in varies field, e.g., power generation, urban heating forecasting horizon, forecasting tools, data type and applied area.
supplying, public transportation, manufacturing and so on. On the For different types of forecasting work, i.e., different horizons and
other hand, the uncertainties in natural gas demand increase the applied area, the methods used are different. According to Ref. [2],
difficulty of management of the gas production and distribution the forecasting horizon can be hourly, daily, monthly, annually and
system and the risk of interruption of gas supply, which poses combined. The applied area can refer to world level, national level,
threats on the economy and society [1]. Robust and accurate fore- regional level, gas distribution system level and individual
casting of demand of natural gas is one of the critical problems for customer level. Methods to forecast natural gas demand should be
maintaining a reliable supply of gas for different applications. In carefully selected to fit the specific conditions of the forecasting
problem. According to the literature research, the methods for gas
demand forecasting can be mainly grouped as time series model,
regression model, artificial neural network and hybrid method [4].
* Corresponding author. College of Mechanical and Transportation Engineering,
Generally, the choice of the forecasting method depends on the
China University of Petroleum, Fuxue Road 18, Changping District, 102249, Beijing,
China. forecasting scenario and the type of input data.
E-mail address: zhangjj@cup.edu.cn (J. Zhang). Time series (TS) models are used to forecast the gas demand
https://doi.org/10.1016/j.energy.2019.04.167
0360-5442/© 2019 Elsevier Ltd. All rights reserved.
586 H. Su et al. / Energy 178 (2019) 585e597
based on the collected data, without prior knowledge [4]. The re- Various works have explored the abilities of hybrid forecasting
ports in literature indicate that TS models can be applied for a wide models. In general, hybrid forecasting models have better perfor-
range of forecasting horizons (from annual to hourly). ARIMA mance in flexibility, robustness, computing efficiency than most
model was applied to forecast annual or monthly gas demand of individual forecasting methods. For example, in Ref. [25], different
Turkey, with the consideration of GDP and price of gas [5]. A fore- methods including wavelet transform, genetic algorithm, adaptive
casting model based on SARIMAX was developed for short-term neuro-fuzzy inference system and feedforward neural network
prediction of the daily gas demand, with the consideration of were integrated for day-ahead demand forecasting in Greece.
temperature, pressure, humidity and cloudiness [6]. Structural time According to the above literature survey, we can tentatively
series model was used to forecast the annual gas demand consid- conclude that the critical part of demand forecasting relates to
ering multiple factors, such as future trend of natural gas con- finding the inherent features and relationships hidden in the nat-
sumption, determinants income and natural gas price [7]. The ural gas demand data. In this paper, we originally develop a robust
results of literature indicate that SARIMA and SARIMAX have better natural gas demand forecasting model by integrating Wavelet
performances in capturing seasonal factors in the time series of the Transform, stacked Bi-Directional LSTM model, Genetic Algorithm
demand than ARIMAX, which is able to provide qualified annual (GA) and LSTM model. The wavelet transform is used to decompose
forecasting for demand of gas [4]. the original demand data, to reduce the difficulties to learn the
Besides time series models, regression models are also widely relationships in data. The stacked Bi-Directional LSTM model can
used for natural gas demand forecasting. Generally, linear regres- comprehensively learn the inherent knowledge from both forward
sion models are preferred for long-term horizons, country level and backward directions in each decomposed component. GA
forecasting, based on some main independent factors [8,9]. For method is used to optimize the structure of the stacked Bi-
example, linear regression was applied to forecast annual natural Directional LSTM model to improve its performance of feature
gas demand in South Korea with four variables: population, GDP, learning. Finally, the LSTM model with a dropout part is applied for
export and import amounts [10]. Linear regression model consid- hourly demand forecasting based on chronologically-arranged
ering temperature, GDP per capita and natural gas price was data. This hourly forecasting method can contribute to fulfill
applied for long-term forecasting of natural gas demand at the research and application for predictive optimization of natural gas
country level [11]. Besides linear regression models, the OLS (or- pipeline networks and real-time demand side management in gas
dinary least squares) regression model [6] and some nonlinear- supply systems.
regression based statistical methods [12] have shown good abili- The main contribution of this work is summarized as follows:
ties for natural gas demand forecasting at different levels.
In recent years, a number of forecasting models based on arti- (1) This paper proposes a hybrid model, which is able to effec-
ficial neural networks have been developed and these models tively learn the knowledge in natural gas demand data and
significantly improve the accuracy and efficiency of natural gas make accurate forecasting, with high efficiency. Natural gas
demand forecasting. Feedforward neural network, fuzzy neural consumption data has the characteristics of series data, but
network, recurrent neural network or some hybrid neural networks these have not been paid enough attention for gas demand
have been applied at different horizons and levels [13e15]. The forecasting. Under this consideration, the deep RNN model
comparison of the forecasting results show that the neural for time series data analysis is originally used for natural gas
network-based models have strong abilities for natural gas demand demand data forecasting in this work. To the best of the
forecasting [16]. For example, reference [17] indicated that the authors' knowledge, this is the first time that deep recurrent
developed neural network model outperforms the condition de- neural networks are used for natural gas demand forecasting.
mand analysis method and the engineering model. The comparison (2) This paper originally proposes the idea of bi-direction feature
carried out in Ref. [10] showed that the developed multilayer per- learning, to fundamentally improve the performance of
ceptron model has better prediction performance than a linear natural gas demand forecasting by effectively mining the
regression model and an exponential model. features from the sequential structured consumption data.
Among the neural network models, recurrent neural network The results prove that considering the backward relationship
(RNN) models, which process data by internal memory loops and can significantly improve the accuracy of forecasting and
maintain a chain-structure, are more suitable to learn the features shorten the model training process.
of time series data [18,19]. However, the deep chain-like structure
increase the difficulty of training RNN models by backpropagation. 2. Development of natural gas demand forecasting method
This is not the case for long-short time memory (LSTM) model [20],
whose great power of LSTM mode for analyzing data sequence has For a clear illustration, the developed forecasting method is
been proved by successful applications in many areas, including introduced in three parts: data decomposing part, data feature
speech recognition [21], human trajectory prediction [22], traffic learning part and demand forecasting part.
prediction [23], etc. However, the potential advantages of LSTM
model for forecasting are far from being totally exploited because 2.1. Decomposition of the original time series of natural gas
most of LSTM-based forecasting models are shallow-structured and demand
the knowledge embedded in the data can not be fully learned [24].
Also, because time series data are fed chronologically to a LSTM Data pre-processing is usually performed to improve the per-
model, the information is passed in forward direction along the formances of prediction. Most of the time series data of natural gas
chain-like structure and the traditional LSTM model can learn only demand contain trends and volatilities, which means forecasting
the forward relationship in the data. This results in the fact that difficulty. To handle this difficulty, wavelet transform is applied to
LSTM models may filter out valuable information of backward de- decompose the original time series of gas demands into several
pendencies of data. This is quite relevant for the case that of interest high-frequency and one low-frequency subseries in the wavelet
because, generally, demands of natural gas have relatively strong domain, in order to reduce the difficulty of feature learning and
periodicity and regularity, which means that backward temporal improve the forecasting performance [26,27].
dependencies constitude an important part of the natural gas de- A wavelet transform can provide useful information in both time
mand pattern. and frequency, especially when the time series is non-stationary,
H. Su et al. / Energy 178 (2019) 585e597 587
X
T1
t b2a
Wða; bÞ ¼ 2½2
a
f ðtÞf (1)
t¼0
2a
2.2.3. Enhancement of the performance of the deep forecasting explained by the following equations:
model
The developed deep-structure network model is capable to r ðlÞ BernoulliðpÞ (9)
learn the complex relationships of the data and the selected factors,
but many of these so-called relationships may be just noises. In this ðlÞ
condition, overfitting would be a serious problem, threatening the y ¼ r ðlÞ *yðlÞ
b (10)
forecasting accuracy. To address this problem, the dropout tech-
nique is introduced in this model. The dropout technique can effi- ðlþ1Þ ðlþ1Þ ðlÞ ðlþ1Þ
zi ¼ wi b
y þ bi (11)
ciently prevent overfitting by randomly dropping a unit [35]. With
reference to the generic layer, classical dropout mechanism can be
H. Su et al. / Energy 178 (2019) 585e597 589
where * represents an element-wise product. For the specific layer ½d1; d2; d3; a34N ¼ DWTðDÞ (15)
l, r(l) denotes a vector of independent Bernoulli random variables,
whose probability of being equal to 1 is p. This vector is randomly where d1, d2, d3, a3 and D2ℝ1N .
sampled and, then, multiplied element-wise by the outputs of the The stacked Bi-LSTM model G*(j) takes in input the decomposed
layer (y(l)), to obtain the “thinned” outputs b
ðlÞ
y . Then the outputs b
y
ðlÞ signal component iðjÞ , outputs the feature Y *ðjÞ 2ℝ1N , with the
are fed into the next layer. The dropout operation is repeated at number of neurons in the Bi-LSTM layers given by Mj.
each layer.
It is necessary also to optimize the architecture of the proposed Y *ðjÞ ¼ G*ðjÞ iðjÞ ; Mj (16)
deep-learning model, i.e., the number of neurons in each layer, to
enhance its forecasting performance [36]. In this work, we used where j denotes the index of the decomposed signal component
Genetic Algorithm, which has been effectively employed to solve obtained by DWT in Eq. (15), j ¼ 1,2,3,4, corresponding to the
many kinds of optimization problems [25,37e39]. In this work, the components d1, d2, d3, a3, iðjÞ 2ℝ1N denotes the input, i.e., the j-th
GA minimizes the Root Mean Square Error (RMSE) between the decomposed signal component, i2{d1, d2, d3, a3}, Mj denotes the
forecasted natural gas demands F and the real natural gas demand number of neurons in the Bi-LSTM layers.
data T. The optimization problem is formulated by Equations (13) The LSTM model G(j)takes in input the output Y*(j) of the stacked
and (14) below. The framework of the deep forecasting method Bi-LSTM model G*(j) and provide in output the prediction of the
and the enhancements are shown in Fig. 5. decomposed signal component Y(j), with the number of neurons in
the LSTM layer Nj given by:
Minimize : RMSEðF; TÞ (13)
Y ðjÞ ¼ GðjÞ Y *ðjÞ ; Nj (17)
F ¼ PðD; M1 ; M2 ; M3 ; M4 ; N1 ; N2 ; N3 ; N4 Þ (14)
in which P represents the deep forecasting model needing to be where j denotes the index of the decomposed signal component
optimized. D represents the vector of the input data of natural gas, obtained by DWT, j ¼ 1,2,3,4, corresponding to the component d1,
Mi denotes the vector of the decision variables which determine the d2, d3, a3.
structure of the deep RNN model for the component i, Ni denotes The InverseWT model takes in input all the predictions of the
the number of neurons of the LSTM model for the component i, T decomposed signal component and provides in output the pre-
represents the real natural gas demand data. The ranges for the diction of the original data of natural gas demand D:
search of the decision variables are pre-defined.
The prediction model P is composed by four major sub-models, F ¼ InverseWT Y ð1Þ ; Y ð2Þ ; Y ð3Þ ; Y ð4Þ (18)
including Discrete Wavelet Transform model DWT (shown in
Equation (1)), stacked Bidirectional LSTM model G* (shown in Finally, let DWT(D)(j) (j ¼ 1,2,3,4) denotes the decomposed signal
Equation (2)), LSTM model G (shown in Equation (3)) and Inverse component d1, d2, d3, a3, respectively. We get the detailed math-
Wavelet Transform model InverseWT (shown in Equation (4)). ematical presentation of Eq. (14) as follows:
F ¼ InverseWT Gð1Þ G*ð1Þ DWTðDÞð1Þ ; M1 ; N1 ; …; Gð4Þ G*ð4Þ DWTðDÞð4Þ ; M4 ; N4 (19)
3. Applications
3.1. Preparation
dxðtÞ axðt tM Þ
¼ bxðtÞ (20)
dt 1 þ xc ðt tM Þ
N
1 XN 1 X
Fi Ti
MRE ¼ (23)
MAE ¼ jF Ti j
N i¼1 i
(21) N i¼1 Ti
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
u N
XN 1uX
Fi Ti RMSE ¼ t ðF Ti Þ2 (24)
RE ¼ T (22) N i¼1 i
i¼1 i
Fig. 8. “Predicted demands” compared with “actual demands” on data generated by Mackey- Glass model.
592 H. Su et al. / Energy 178 (2019) 585e597
forecasting; sy is the variance of the time series. The calculation of error as: population size ¼ 30, crossover probability ¼ 0.4 and
ACF is performed for every wavelet component and the input sizes mutation probability ¼ 0.6. For the application of the GA, one needs
of the components are determined by the lengths of their first to trade off the performance improvement and the computational
period (Fig. 7). For example, the values of ACF of wavelet compo- burden for the optimality search. For this reason, the number of
nents of one group of data collected on OpenEI indicate that the generations in the GA is set to 100, leading to a significant
input sizes for the wavelet components should be longer than improvement of the forecasting accuracy with acceptable compu-
around 25 h (Fig. 7). In the cases based on the real world data, the tation time. Considering that, the maximal number of generations
available information of date and time (hour) is also chosen as in the GA are set by 100, which can effectively improve the fore-
another input of the forecasting model and some other important casting accuracy under acceptable computation time consuming.
information, such as weather, climate and gas price, should also be During the training process, the maximum number of epochs is
carefully selected as the inputs of gas consumption forecasting. 600. The dropout rate is set by 0.4, to avoid the overfitting problem.
The developed forecasting model is used on the simulated data
by the Mackey Glass model and to predict the gas consumption
3.2. Performance evaluation based on two sets of real world data (named as Set I and Set II). For
these latter, only the consumption data of winter are used.
For each deep RNN model, the number of Bi-LSTM layer is set to Considering this research focuses on hourly gas load forecasting,
2, by trial and error. The numbers of neurons in the Bi-LSTM layers the prediction time interval of these presented applications is
and the LSTM layer are optimized by GA in a search within the chosen as 10 h. For the three different forecasting tasks, the struc-
range of [100, 110, 120, 130, 140, 150], to find good structures for tures of the deep RNN models, each of which includes four stacked
each RNN layer. The main parameters of the GA are set by trial and
Fig. 10. Predicted demands compared with actual demands on data Set II.
594 H. Su et al. / Energy 178 (2019) 585e597
Bi-LSTM layers and four LSTM layers, are optimized by the GA. The To further verify the effectiveness of the forecasting model, a
optimization variables are the numbers of neurons in every layer of comparison of the forecasting performance is performed among
the deep RNN model. For example, in the model for forecasting the the developed model, a three-layer-LSTM model and a Non-linear
simulated data, the optimized numbers of the neurons of the two- Autoregressive (NAR) model. The three-layer-LSTM model is
layer Bi-LSTM parts and the LSTM parts for the wavelet components introduced here to verify the effectiveness of the Wavelet Trans-
of d1, d2, d3 and a3 are [130, 120, 130], [130, 110, 100], [120, 130, formation and the Bi-LSTM model. The Non-linear Autoregressive
120], [130, 110, 100], respectively. During the training processes of model is a classical method for time series prediction and is used
the deep RNN models, the developed models show good conver- here to compare the overall performance of the developed model.
gence rates. The normalized RMSEs of the training processes are The three-layer-LSTM shares the same structure with the optimized
lower than 0.1 after around 100 epochs. Some oscillation are Deep RNN model. These three models are used to forecast the data
observed because of the dropout technique. of 10 h ahead (10 steps) in the future, based on the three data sets.
Both the prediction results of the original series and the wavelet Then, the forecasting performances are presented by relative errors
components are shown in Figs. 8e10, to give a comprehensive and compared with each other in the form of Cumulative Distri-
picture of the performances of the forecasting model. bution Function (CDF). The analysis results are shown in
These Figures show that the developed forecasting model is able Figs. 11e13.
to make accurate predictions and this indicates that the deep RNN From Figs. 11e13, we can conclude that the developed fore-
model has a strong ability to capture the features behind the gas casting model outperforms the other models on different types of
consumption data. However, the model shows relatively poor
performance on the wavelet components of d1, whose periodical
behaviors are not obvious compared with the other components.
From the opposite perspective, this observation indicates that the
data decomposition process, via the wavelet method, can reduce
the complexity of the forecasting task and improve the accuracy of
the overall results.
The error measures are listed in Table 1. The errors are given
based on different lengths of forecasting horizon, which are 1 h, 5 h
and 10 h, to present a comprehensive information of model per-
formance and its sensitivity under different requirements of fore-
casting time horizon.
The error analysis results in Table 1 indicate that the developed
model is capable to perform accurate forecasting on different data
sets. The forecasting based on Mackey Glass series has an accuracy
level of about 99%, even when the forecasting horizon is increased
to 10 steps ahead. For the real world data in Set I and Set II, the
MREs of the forecasting errors are maintained (at acceptable values
of 5.84% and 6.78%), even when the model is used to perform 10-
steps ahead predictions.
We notice that the forecasting performance of the developed
model degenerates as the forecasting horizon extends. Generally,
the reason of this degeneration is that the strength of the rela-
tionship between the future data of consumption and the current Fig. 11. Performance comparison based on the CDFs of the relative error values (the
data decreases as the forecasting horizon increases, and this in- Mackey Glass series).
creases the difficulty for the Deep RNN model to learn such rela-
tionship. Hence, if we need to make a forecasting for a relatively
long time in a real application, it is not a good choice to extend the
forecasting horizon of the model without limitation. To overcome
this problem, one needs to firstly determine the limitation of the
forecasting model based on the data and the accuracy requirement,
and, then, apply controlled recursive processes that make use of the
forecasting results to enhance the ability of the deep learning
model [42].
Table 1
Prediction performances for different forecasting horizons (winter data for data Set I
and Set II).
data sets. According to Figs. 12 and 13, we can also observe that the The aim of this work is to introduce a new and highly reforming
developed model has superior capacity compared to the three- hourly forecasting method of natural gas demand forecasting. The
Fig. 14. Predicted demands compared with actual demands on the data summer (1 h ahead).
Fig. 15. Predicted demands compared with actual demands on the data summer (5 h ahead).
Fig. 16. Predicted demands compared with actual demands on the data summer (10 h ahead).
596 H. Su et al. / Energy 178 (2019) 585e597
Table 2 [8] Zhu L, Li MS, Wu QH, Jiang L. Short-term natural gas demand prediction based
Prediction performances for different forecasting horizons (summer data for data on support vector regression with false neighbours filtered. Energy 2015;80:
Set II). 428e36.
[9] Herbert JH, Sitzer S, Eades-Pryor Y. A statistical evaluation of aggregate
Data basis Task MAE MRE RMSE monthly industrial demand for natural gas in the U. SA,” Energy Dec.
1987;12(12):1233e8.
Set II 1 h forecasting 10.2701 0.0094 11.8352
[10] Geem ZW, Roper WE. Energy demand estimation of South Korea using arti-
5 h forecasting 11.9608 0.0110 14.1134 ficial neural network. Energy Policy Oct. 2009;37(10):4049e54.
10 h forecasting 55.1135 0.0501 69.2558 [11] Khan MA. Modelling and forecasting the demand for natural gas in Pakistan.
Renew Sustain Energy Rev Sep. 2015;49:1145e59.
[12] Wang Q, Li S, Li R. Forecasting energy demand in China and India: using
single-linear, hybrid-linear, and non-linear time series forecast techniques.
method is developed based on the integrated Wavelet Transform, Energy Jul. 2018;161:821e31.
Bi-LSTM model, LSTM model and Genetic Algorithm. The Wavelet [13] Chen Y, Chua WS, Koch T. Forecasting day-ahead high-resolution natural-gas
demand and supply in Germany. Appl Energy Oct. 2018;228:1091e110.
Transform is used to decompose the original series into sub-
[14] Ceperi c E, Zikovic S, Ceperi c V. Short-term forecasting of natural gas prices
components, to reduce the difficulty of forecasting. Several Bi- using machine learning and feature selection algorithms. Energy Dec.
LSTM models are stacked together to comprehensively learn the 2017;140:893e900.
[15] Yu F, Xu X. A short-term load forecasting model of natural gas based on
complicated relationships behind each sub-components and the
optimized genetic algorithm and improved BP neural network. Appl Energy
LSTM model is adopted to forecast the future values of these 2014;134:102e13.
components. To enhance the forecasting performances, the struc- [16] Soldo B. Forecasting natural gas consumption. Appl Energy Apr. 2012;92:
tures of the Bi-LSTM models and the LSTM models are optimized by 26e37.
[17] Aydinalp-Koksal M, Ugursal VI. Comparison of neural network, conditional
the GA method. To avoid potential overfitting, dropout is performed demand analysis, and engineering approaches for modeling end-use energy
during the training process. consumption in the residential sector. Appl Energy Apr. 2008;85(4):271e96.
The developed model is applied to three sets of data to verify its [18] Rigamonti M, Baraldi P, Zio E, Roychoudhury I, Goebel K, Poll S. Ensemble of
optimized echo state networks for remaining useful life prediction. Neuro-
effectiveness: one simulated set and two real gas consumption computing Mar. 2018;281:121e38.
data. To test the robustness of the model, the forecasting is per- [19] Zheng J, Xu C, Zhang Z, Li X. Electric load forecasting in smart grid using long-
formed on different horizons, i.e., 1 h, 5 h and 10 h. The experi- short-term-memory based recurrent neural network electric load forecasting
in smart grid using long-short-term-memory based recurrent neural network.
mental results show that the model is capable to achieve a high In: 2017 51st annu. Conf. Inf. Sci. Syst. (CISS); 2017. p. 1e6. no. January.
forecasting accuracy even when the horizon is increased to 10 steps [20] Kamilaris A, Prenafeta-Boldú FX. Deep learning in agriculture: a survey.
ahead of the current data. The degeneration of performances at Comput Electron Agric Apr. 2018;147:70e90.
[21] Hirafuji Neiva D, Zanchettin C. Gesture recognition: a review focusing on sign
large forecasting horizons can be controlled by recursive processes
language in a mobile context. Expert Syst Appl Aug. 2018;103:159e83.
method: this will developed in the future research. [22] Marug an AP, Marquez FPG, Perez JMP, Ruiz-Hern andez D. A survey of artificial
The forecasting performance of the developed model is neural network in wind energy systems. Appl Energy Oct. 2018;228:1822e36.
[23] Nagy AM, Simon V. Survey on traffic prediction in smart cities. Pervasive Mob
compared with a three-layer-LSTM model and a Non-linear
Comput Jul. 2018;50:148e63.
Autoregressive (NAR) model. The results of the comparison show [24] Fischer T, Krauss C. “ARTICLE IN PRESS Deep learning with long short-term
that the developed model outperforms the three-layer-LSTM model memory networks for financial market predictions. Eur. J. Oper. Res. Eur. J.
and the NAR model, which indicates that the Bi-LSTM model has a Oper. Res. J. 2017;17(0). 48e0.
[25] Panapakidis IP, Dagoumas AS. Day-ahead natural gas demand forecasting
solution capacity to learn the complicated features of the actual based on the combination of wavelet transform and ANFIS/genetic algorithm/
data. neural network model. Energy 2017;118:231e45.
In the future work, we will further improve the forecasting ac- [26] Tascikaraoglu A, Sanandaji BM, Poolla K, Varaiya P. Exploiting sparsity of in-
terconnections in spatio-temporal wind speed forecasting using Wavelet
curacy via exploring different advanced methods and performing Transform. Appl Energy Mar. 2016;165:735e47.
more detailed analysis on different influencing variables. Further- [27] Bento PMR, Pombo JAN, Calado MRA, Mariano SJPS. A bat optimized neural
more, the ability of deep RNN models for natural gas demand network and wavelet transform approach for short-term price forecasting.
Appl Energy Jan. 2018;210:88e97.
forecasting on longer time horizons, e.g., days or months, will be [28] Stocchi M, Marchesi M. Fast wavelet transform assisted predictors of
explored. streaming time series. Digit Signal Process Jun. 2018;77:5e12.
[29] Zheng H, Yuan J, Chen L. Short-term load forecasting using EMD-LSTM neural
networks with a xgboost algorithm for feature importance evaluation. En-
Acknowledgements ergies 2017;10(8):1168.
[30] Asala HI, Chebeir J, Zhu W, Gupta I, Taleghani AD, Romagnoli J. A machine
learning approach to optimize shale gas supply chain networks. SPE Annu.
This work is supported by National Natural Science Foundation Tech. Conf. Exhib. 2017;(October):0e28.
of China [grant number 51134006], and the research fund provided [31] Qing X, Niu Y. Hourly day-ahead solar irradiance prediction using weather
by China University of Petroleum, Beijing [grant number forecasts by LSTM. Energy Apr. 2018;148:461e8.
[32] Cui Z, Ke R, Wang Y. Deep bidirectional and unidirectional LSTM recurrent
01JB20188428].
neural network for network-wide traffic speed prediction. 2018. p. 22e5.
[33] Mellit A, Kalogirou SA, Hontoria L, Shaari S. Artificial intelligence techniques
for sizing photovoltaic systems: a review. Renew Sustain Energy Rev Feb.
References 2009;13(2):406e19.
[34] Rahman A, Srikumar V, Smith AD. Predicting electricity consumption for
[1] Flouri M, Karakosta C, Kladouchou C, Psarras J. How does a natural gas supply commercial and residential buildings using deep recurrent neural networks.
interruption affect the EU gas security? A Monte Carlo simulation. Renew Appl Energy 2018;212.
Sustain Energy Rev 2015;44:785e96. [35] Y. Gal and Z. Ghahramani, “A theoretically grounded application of dropout in
[2] Soldo B. Forecasting natural gas consumption. Appl Energy 2012;92:26e37. recurrent neural networks.”.
[3] Tamba JG, et al. Forecasting natural gas: a literature survey. Int J Energy Econ [36] Yang J, Ma J. A structure optimization framework for feed-forward neural
Policy 2018;8(3):216e49. networks using sparse representation. Knowl Base Syst Oct. 2016;109:61e70.
[4] Szoplik J. Forecasting of natural gas consumption with artificial neural net- [37] Yu F, Xu X. A short-term load forecasting model of natural gas based on
works. Energy 2015;85:208e20. optimized genetic algorithm and improved BP neural network. Appl Energy
[5] Erdogdu E. Natural gas demand in Turkey. Appl Energy Jan. 2010;87(1): Dec. 2014;134:102e13.
211e9. [38] Askari S, Montazerin N, Zarandi MHF. Forecasting semi-dynamic response of
[6] Taşpinar F, Çelebi N, Tutkun N. Forecasting of daily natural gas consumption natural gas networks to nodal gas consumptions using genetic fuzzy systems.
on regional basis in Turkey using various computational methods. Energy Energy 2015;83:252e66.
Build 2013;56:23e31. [39] Ene S, Küçükog €
_ Aksoy A, Oztürk
lu I, N. A genetic algorithm for minimizing
[7] Potocnik P, Thaler M, Govekar E, Grabec I, Poredos A. Forecasting risks of energy consumption in warehouses. Energy 2016;114:973e80.
natural gas consumption in Slovenia. Energy Policy Aug. 2007;35(8):4271e82.
H. Su et al. / Energy 178 (2019) 585e597 597
[40] Sharma V, Yang D, Walsh W, Reindl T. Short term solar irradiance forecasting [42] Su H, Zio E, Zhang J, Yang Z, Li X, Zhang Z. A systematic hybrid method for
using a mixed wavelet neural network. Renew Energy 2016;90:481e92. real-time prediction of system conditions in natural gas pipeline networks.
[41] Bernas M, Płaczek B. Period-aware local modelling and data selection for time J Nat Gas Sci Eng Sep. 2018;57:31e44.
series prediction. Expert Syst Appl Oct. 2016;59:60e77.