Professional Documents
Culture Documents
Abstract—This paper explores the classic prediction algo- the next day. Therefore, the focus of this paper is on short-term
rithms such as Autoregressive Integrated Moving Average model load forecasting.
(ARIMA) and Long Short-Term Memory Neural Network The randomness and large scale of power data are the main
(LSTM), and combines the advantages of both, and proposes
a short-term forecast method ARIMA-LSTM fusion model. This difficulties in predicting power data. The existing forecasting
model computes the final predicted value by applying a linear methods are roughly divided into two categories, one is
correctio to the LSTM model error. Using ARIMA and LSTM the traditional regression method represented by time series
as a comparison algorithm, train and predict the next day’s load analysis, and the other is the artificial intelligence method
with ARIMA-LSTM. The root mean square error (RMSE), mean represented by neural network.
absolute percentage error (MAPE), and the worst relative error
(WRE) were used to evaluate the performance of the proposed The basic method of traditional regression prediction is to
algorithm. After testing, the RMSE of the proposed model is establish a fitting curve and mathematical model according to
0.433, while ARIMA and LSTM are 0.461 and 0.445 respectively. historical load data, take the load data as a random variable,
ARIMA-LSTM also has better results and performs well on calculate the change law of the load data through statistical
different types of power datasets. laws, deduce the mathematical expression, and use this expres-
Index Terms—forecast, power load, ARIMA, LSTM, ARIMA-
sion to carry out load forecast. The more common methods are
LSTM
autoregressive model (AR), moving average model (MA) and
autoregressive moving average model (ARMA) [1]. ARMA
I. I NTRODUCTION can only deal with stationary sequences. For non-stationary
sequences, it needs to be differentiated to obtain stationary
Orderly electricity consumption refers to staggering the sequences, and then use ARMA, which is the autoregressive
electricity consumption of different users during peak elec- integrated moving average model (ARIMA) [2], [3]. The tra-
tricity consumption periods. The precise distribution of power ditional regression prediction method only needs endogenous
resources depends on the cooperation of many aspects. If variables without the help of other exogenous variables, and
the power supply is insufficient, there will be power outages, does not require manual intervention. The algorithm speed
affecting people’s lives, factory production, and even serious is relatively fast. Its disadvantage is that it relies too much
power outages for important facilities. Conversely, if the power on historical data and seldom considers other influencing
supply is excessive, it will lead to the waste of resources. factors, which leads to its inability to achieve high accuracy,
Therefore, the government and other relevant departments especially its poor fitting effect on nonlinear data. The previous
need to rationally allocate electricity resources through orderly initial conditions possess the inherent deficiencies of having
regulation of electricity consumption. a fixed structure and poor adaptability to changing raw data.
Accurate load prediction can provide a reference for the In addition, the method of gray scale prediction can also be
orderly power consumption regulation. Relevant departments used. [4]
or institutions can understand the load law of the power grid The prediction method of artificial intelligence refers to the
through the predicted load and formulate relevant measures. At load prediction through the method of machine learning or
the same time, by comparing the actual load with the forecast deep learning. Its typical representatives are artificial neural
results, it is possible to keep abreast of changes in the grid network (ANN) [5] and recurrent neural network (RNN) [6].
load and investigate possible problems as soon as possible. The predictive performance of an ANN largely depends on
In addition, the user’s electricity load curve is an important its hyperparameters. If an ANN contains fewer connections,
basis for e-commerce companies to decide to purchase elec- it has limited learning ability and does not predict well. On
tricity from the spot market. Accurate daily load forecasting the other hand, if an ANN is given too many connections,
can better meet users’ electricity demand and reduce the risk of it may learn noise during the training phase and thus cannot
e-commerce purchasing electricity from the real-time market predict the regularity accurately [7], [8]. Compared with the
1914
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:02:33 UTC from IEEE Xplore. Restrictions apply.
TABLE II
P ERFORMANCE OF INDIVIDUAL MODELS ( POWER DATA )
A. Experimental setup
Using the electricity load data of a city, this dataset contains
96-point load curves from January 1, 2014 to December 31,
2014. Select the data from January 1, 2014 to November
30, 2014 for training, the data from December 1, 2014 to
Fig. 3. Comparison of the predicted results of each model with the actual
December 30, 2014 for validation, and predict the data from value
December 31, 2014.
ARIMA, LSTM, ARIMA-LSTM model settings are as
follows: C. Universality
TABLE I
This section uses the ElectricityLoadDiagrams20112014
ARIMA, LSTM, ARIMA-LSTM PARAMETER SETTINGS dataset, which contains the 96-point daily load curve of 370
users from January 1, 2011 to January 1, 2015, that is, from
Model Parameters
autoregressive order p: 2
00:00 to 23:45 every 15 minutes load. The data is a txt file,
ARIMA difference order d: 0 each line is the load of all users in 15 minutes.
moving average order q: 2 Select the data from January 1, 2012 to November 30, 2012
layers: 3
LSTM
hidden node: 256
for training, and the data from December 1, 2012 to December
autoregressive order p: 2 30, 2012 for validation, and forecast the data from December
difference order d: 0 31, 2012.
ARIMA moving average order q: 3
layers: 3
The parameters of the test model are still selected from the
hidden node: 256 parameters in Table I. The test data selects the power data
of user 2, user 4 and user 5 in 2012. The Fig. 4 shows the
approximate load curves for three users (values are taken at
ARIMA and LSTM are used as comparison algorithms, intervals of 500).
and they are trained and tested together with ARIMA-LSTM. Fig. 5 is a box plot of the electricity data of user 2, user 4
Adam optimizer is used in training, the mean square loss and user 5 for the whole year of 2012.
function MSE loss is used as the loss function, the learning Table III shows the mean and overall variance of the
rate is 1e-4, the batch size is 256, and the number of iterations electricity data for user 2, user 4 and user 5 for the whole
is 100 epochs. year of 2012.
B. Forecast Result
Table II shows the RMSE, MAPE, and WRE of ARIMA, TABLE III
M EAN AND POPULATION VARIANCE OF ELECTRICITY DATA FOR THREE
LSTM, and ARIMA-LSTM models on the dataset. It can be USERS .
seen that ARIMA-LSTM outperforms the other two models in
most metrics. User2 User4 User5
Mean 26.269 53.716 40.788
Fig. 3 shows the results of each model compared to the true Var. 129.582 1057.596 451.848
value.
1915
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:02:33 UTC from IEEE Xplore. Restrictions apply.
TABLE IV
P ERFORMANCE OF INDIVIDUAL MODELS (U SER 2)
TABLE V
P ERFORMANCE OF INDIVIDUAL MODELS (U SER 4)
1916
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:02:33 UTC from IEEE Xplore. Restrictions apply.
This paper proposes the ARIMA-LSTM model for short-
term load forecasting. This model improves the accuracy of
model prediction by using ARIMA to correct the model error
of LSTM. The prediction error of the ARIMA-LSTM model
on some data is usually lower than that of other models.
However, the performance of this model on the volatile dataset
needs to be improved. In addition, the lack of rules formulated
by hyperparameters is the insufficiency of this model and
is one of the directions for future research. The choice of
ARIMA parameters is also an area that can be optimized. This
shortcoming can be improved by using heuristic algorithm.
In conclusion, ARIMA-LSTM is one of the best options for
short-term electricity forecasting, and roles such as decision
makers and e-commerce retailers can benefit from it.
ACKNOWLEDGEMENT
Research in this paper is supported by the Key Research and
Fig. 6. Comparison of the predicted results of each model (User2) Development Program of Jiangxi Province (20212BBE51002)
R EFERENCES
[1] Moon J, Hossain M B, Chon K H. AR and ARMA model order se-
lection for time-series modeling with ImageNet classification[J]. Signal
Processing, 2021, 183: 108026.
[2] Tang L, Yi Y, Peng Y. An ensemble deep learning model for short-
term load forecasting based on ARIMA and LSTM[C]//2019 IEEE
International Conference on Communications, Control, and Computing
Technologies for Smart Grids (SmartGridComm). IEEE, 2019: 1-6.
[3] Satrio C B A, Darmawan W, Nadia B U, et al. Time series analysis and
forecasting of coronavirus disease in Indonesia using ARIMA model
and PROPHET[J]. Procedia Computer Science, 2021, 179: 524-532.
[4] Ding S, Hipel K W, Dang Y. Forecasting China’s electricity consumption
using a new grey prediction model[J]. Energy, 2018, 149: 314-328.
[5] Arvanitidis A I, Bargiotas D, Daskalopulu A, et al. Enhanced Short-
Term Load Forecasting Using Artificial Neural Networks[J]. Energies,
2021, 14(22): 7788.
[6] Shi H, Xu M, Li R. Deep learning for household load forecasting—A
novel pooling deep RNN[J]. IEEE Transactions on Smart Grid, 2017,
9(5): 5271-5280.
[7] Runge J, Zmeureanu R. Forecasting energy use in buildings using
artificial neural networks: A review[J]. Energies, 2019, 12(17): 3254.
[8] Mao G, Wang M, Liu J, et al. Comprehensive comparison of artificial
neural networks and long short-term memory networks for rainfall-
Fig. 7. Comparison of the predicted results of each model (User4) runoff simulation[J]. Physics and Chemistry of the Earth, Parts A/B/C,
2021, 123: 103026.
[9] Hewamalage H, Bergmeir C, Bandara K. Recurrent neural networks
for time series forecasting: Current status and future directions[J].
International Journal of Forecasting, 2021, 37(1): 388-427.
[10] Noh S H. Analysis of Gradient Vanishing of RNNs and Performance
Comparison[J]. Information, 2021, 12(11): 442.
[11] Pascanu R, Mikolov T, Bengio Y. On the difficulty of training recur-
rent neural networks[C]//International conference on machine learning.
PMLR, 2013: 1310-1318.
[12] Cui C, He M, Di F, et al. Research on power load forecasting method
based on LSTM model[C]//2020 IEEE 5th Information Technology
and Mechatronics Engineering Conference (ITOEC). IEEE, 2020: 1657-
1660.
[13] Fu R, Zhang Z, Li L. Using LSTM and GRU neural network meth-
ods for traffic flow prediction[C]//2016 31st Youth Academic Annual
Conference of Chinese Association of Automation (YAC). IEEE, 2016:
324-328.
[14] Shi X, Chen Z, Wang H, et al. Convolutional LSTM network: A machine
learning approach for precipitation nowcasting[J]. Advances in neural
information processing systems, 2015, 28.
[15] Graves A, Schmidhuber J. Framewise phoneme classification with
bidirectional LSTM networks[C]//Proceedings. 2005 IEEE International
Joint Conference on Neural Networks, 2005. IEEE, 2005, 4: 2047-2052.
1917
Authorized licensed use limited to: Institut Teknologi Sepuluh Nopember. Downloaded on August 29,2023 at 09:02:33 UTC from IEEE Xplore. Restrictions apply.