Professional Documents
Culture Documents
Load Forecasting
mahoque@iut-dhaka.edu
Abstract—To ensure a stable and reliable operation of a power genetic algorithms to optimize ANN structure [6]. Besides,
system network, load forecasting on a short-term basis is very some researchers worked on time series forecasting models like
crucial. The two most important requirements of short-term Auto Regressive Integrated Moving Average (ARIMA) [7] or
load forecasting are accurate forecasting and speed. It is really Auto Regressive Moving Average (ARMA), Seasonal ARIMA
necessary to study as well as analyze the load characteristics (SARIMA) or Auto Regressive Fractionally Integrated Moving
and to find out the primary factors responsible for obstructing
Average (ARFIMA) to forecast the load [8]. The selection
accurate load forecasting. Auto-Regressive Integrated Moving
Average (ARIMA) method is most frequently used because it of these models highly depends on the variation of provided
needs only information regarding the historical loads to predict information and predicted time period.
the load and no other assumptions are required to consider. This The time series method provides a higher advantage on
paper compares the forecasting ability of ARIMA and Support
the analysis of the linear portion of the load but it cannot
Vector Machines (SVMs) model with the help of the Mean
Absolute Percentage Error (MAPE) and Mean Square Error predict the non-linear sensitive part. Moreover, the external
(MSE). After the comparison, it is found that for a non-linear factors with non-linear behavior such as the weather, days of
pattern SVM performs better than the ARIMA, whereas ARIMA the week etc. are also crucial for forecasting the load correctly.
gives a better show for the approximation of linear type of load. The Artificial Intelligent approaches like ANN and SVM have
the ability to forecast the part of the load accurately which
changes non-linearly.
Keywords—Short Term Load Forecasting (STLF), Auto Re-
gressive Integrated Moving Average (ARIMA), Support Vector But the ANN method cannot produce the perfect result
Machines (SVM), Mean Absolute Percentage Error (MAPE) and and quantitative analysis as it only focuses to minimize the
Mean Square Error (MSE) experience risk by adopting the empirical risk minimization
principle based on the Statistical Learning Theory (SLT). On
I. I NTRODUCTION the contrary, SVM works according to the Structural Risk
Minimization (SRM) technique. In instead of reducing the
The main challenge for the power system is to maintain empirical error SRM technique minimizes the generalization
the balance between the load and generation. The electricity error [9], [10]. Besides, SVM regression method, an advanced
should be generated on-demand basis because it is impossible forecasting technique works based on statistical learning theory
to store electrical energy on a substantial scale and presently, and the principle of structural risk minimization. So, it can
this method is also treated as uneconomical. Hence, to ensure be utilized to find out the best trade-off offered by the
the proper planning and reliable operation of the power sys- model between the complexity and learning capability with
tem, the load required by the consumer has to be predicted the inadequate sample information which helps to acquire
beforehand. The owner of the storage device can be benefited the best generalization ability. Compared to the experience
if the forecasted load profile has become more accurate. A risk minimization principle based artificial neural network,
positive impact can be reflected in the electricity bill of the SVM regression method can be utilized to solve the problems
consumer for reducing the forecasting errors [1]. The accurate involving features of small sample size, high dimension and
Short Term Load Forecasting (STLF) is vital in the deregulated local minima with good generalization ability. Hence, the focus
power industry due to its great usefulness for buying the power of researchers has increased more to address these issues in
in real time from the generating companies and switching of the short-term load forecasting field.
the loads, as well. Besides, avoiding congestion due to dynamic
control in the transmission line network is another key factor As the forecasting methods deal with historical loads,
to utilize STLF. the precision of the historical is treated with the highest
importance. But unfortunately, some missing values or error
In the previous literature, various fundamental STLF tech- may appear in the data while recording. It can be happened
niques can be found i.e. Kalman filtering [2], dynamic linear due to some abnormal values because of outages or load
and non-linear models [1], and various optimization methods shedding [11], [12] and random fluctuation in the demand.
for load forecasting [3]. Machine learning approaches have The above-mentioned information are termed as the Outliers.
also caught into the eyes of researchers, such as Support Hence, it is necessary to adopt an Outlier detection method for
Vector Machines (SVMs), fuzzy-neural model, Support Vector forecasting the load. In this paper, we compare the forecasting
Machines (SVMs) [4], artificial neural network (ANN) [5] and ability between SVM and ARIMA model and we found that
Authorized licensed use limited to: UNIVERSITY OF BIRMINGHAM. Downloaded on August 29,2023 at 06:02:31 UTC from IEEE Xplore. Restrictions apply.
the SVM method offers better performance than the ARIMA
model. Finally, the effectiveness of the SVM method is verified
after introducing the detail steps for short-term load forecasting
based on power.
The paper is organized as follows: section II describes
different forecasting models. In section III, the methodology
for forecasting the short-term load is illustrated and simulation
results are discussed in section IV. Finally, section V concludes
the paper.
Authorized licensed use limited to: UNIVERSITY OF BIRMINGHAM. Downloaded on August 29,2023 at 06:02:31 UTC from IEEE Xplore. Restrictions apply.
III. METHODOLOGY
Mainly, the whole experiment can be split into two steps,
firstly the model verification and secondly, the forecasting
performance demonstration using a known case. To perform
this expreiment, we use MATLAB R 2012a as the simulator
to develop the models. Daylong predicted pattern of electricity
consumption for 28 BTS are compared to calculate the perfor-
mance of the SVM and ARIMA model. We consider regression
factor, MSE and MAPE for the comparison. The electricity
consumption data from March’18 to May’18 is used as training
data for the SVM model. Later, the ARIMA model utilizes
the same data to forecast. The unit of power consumption is
considered as Whr. It is suggested in previous works that the 15
Figure 2: SVM basic Structure [13] minutes resolution is regarded as standard practice to predict
accurately.
To develop a forecasting model, the most popular model
The SVM algorithm is popularly used to overcome two is SVM with single hidden layer. Figure 2 illustrates the rela-
types of problems; they are classification problem and regres- tionship between inputs and outputs of SVM model. However,
sion estimation. The analysis of STLF has a connection with the number of hidden layers and inputs are the major factors
the latter one that is called Support Vector Machine Regression to perform the SVM model properly. We consider the logical
(SVR) which uncovers the relationship between the output day and the electricity consumption as inputs. Here, a logical
vectors and the input vectors. day is either 1 for weekdays or 2 for weekends. The input
Depending on time period and type of input data SVM and data is seperated into three different datasets. The first one is
ARIMA both models are shown to be accurate in literature. called the training dataset where 70% of the total data is used
Thus, the comparison of these two models for day-ahead load for training. In addition, the half of remaining data is utilized
forecasting is the objective of this work. Moreover, better for testing and the rest of the data is for validation.
scheduling of the storage devices depend on the accuracy of The main challenge to train the ARIMA model is to set the
the forecasted result. Different errors can be used to evaluate proper model parameters p,d, and q. Usually, these parameters
the performance of the models. In this paper, Mean Square can be found by analyzing ACF and PACF. For our case, we
Error (MSE) and Mean Absolute Percentage Error (MAPE) find out, p = 2 and q = 1 while using our data set. We use the
are used to evaluate the level of accuracy of the models. The value of differencing parameter d=1.
regression factor is another criterion to evaluate the modes,
which tries to find a linear relation between the outputs of the
network and the targets. In section IV it is discussed in detail. IV. R ESULT
According to the previous works, both ARIMA and SVM The performance of SVM is significantly better than
models manifest proper prediction based on time period and ARIMA which are shown in table1 and 2 [8]. SVM showed a
input data type. According to the previous works, both ARIMA better performance. It indicates that SVM is a suitable model
and SVM models manifest proper prediction based on time for a day ahead forecasting of electricity consumption profile
period and input data type. Hence the main focus of the than ARIMA for each consumer. The forecasting results for
paper is to compare these two models for day-ahead load consumer 10 are shown in figure 3. Here forecasted results
forecasting. Besides, the forecasted result with good accuracy differ slightly from actual consumption up to sample value of
can ensure better scheduling for the storage devices. Different 50. But the mismatch after 52 is huge, due to the behavioral
error models can be utilized to determine the performance of change of the consumer after office hour of that particular day,
the prediction models. In this paper, both MSE and MAPE where he did not have his regular evening peak. In figure 4
methods are used to evaluate the accuracy level of the models. the forecasting performance of ARIMA for the same consumer
The regression factor is another important key factor to deter- is shown. The regression curve shows the exact result in the
mine the effectiveness of these models, which tries to look for form of dotted line while the solid line (fits) indicates the best
a linear relationship between the outputs of the network and fitted linear regression line between the targets and the outputs.
the targets. In section IV, it will be discussed in detail. Sometimes the regression factor for the SVM is less than the
others indicate forecasting failure as an exception than the rule.
Figure 5, shows the regression function for ARIMA, where
1
n
M SE = (Ŷi − Yi )2 (3) no match is found between the samples of the targets. And
n i=1 mean square error is 48833 for consumer 10. The MAPE for
site 10 is also very high as 320.96 which results the significant
1 Yi − Ŷi
n
M AP E = (4) monetary losses in the context of our cases for the storage
n Yi device owner.
i=1
Figure 3 shows the forecasting ability of SVM for con-
Where, Ŷ represents the vector of n predictions and Y sumer 10. In the morning while consumption is low, there is
represents the real values of the vector. a very good match in the forecasted results and significant
Authorized licensed use limited to: UNIVERSITY OF BIRMINGHAM. Downloaded on August 29,2023 at 06:02:31 UTC from IEEE Xplore. Restrictions apply.
Table I: SVM evaluation
Consumer MSE MAPE
1. 0.224 199
2. 0.5017 141.06
3. 0.5003 172.22
4. 0.580 141.08
5. 0.549 133.47
6. 0.4080 74.22
7. 0.4237 130.32
8. 0.589 481.2
9. 0.51153 146.2
10. 0.563 186
11. 0.518 108.3
12. 0.00789 84.46
13. 0.76 227.3
14. 0.5892 101.1
15. 0.388 145
16. 0.5610 146.9
17. 0.51564 197
18. 0.54974 160.1 Figure 4: Forecasting Performance of ARIMA for consumer
19. 0.60978 171.9 10
20. 0.3941 153.03
21. 0.57928 180.53
22. 0.12644 150.78
23. 0.62837 162.38 Table II: ARIMA evaluation
24. 0.2789 186
25. 0.00778 75.7 Consumer MSE MAPE (%)
26. 0.763 539.79 1. 82584 320.53
27. 0.4899 144.299 2. 13988 149.07
28. 0.48445 134.133 3. 33589 163.85
4. 11721 183.41
5. 68198 108.15
6. 26749 83.59
7. 1217.4 105.02
8. 19667 165.06
9. 481.9 55.52
10. 48833 320.96
11. 17743 181.79
12. 6378.7 98.62
13. 468.5 85.97
14. 18445 193.75
15. 83753 244.35
16. 4295.7 86.01
17. 34156 200.95
18. 11631 263.32
19. 22769 247.46
20. 12272 151.82
21. 9560.3 125.70
22. 4457.2 696.86
23. 25294 237.52
24. 6798.1 111.74
Figure 3: Forecasting Performance of SVM for consumer 10 25. 1620.1 205.43
26. 3414.9 311.23
27. 2658.1 487.03
28. 1106 598.88
mismatch is found while estimating the peak consumption
at evening. The consumption profile perfectly identifies the
position of the peaks while size of the peaks is not estimated consumer 10 are shown in Figure 3. ARIMA shows inferior
accurately. The similar phenomenon of our simulation is results over SVM which is not suitable for short term load
observed for consumer 15 in figure 7 which indicate during forecasting where the data is not linear. Any further experiment
evening peak uses of certain electrical devices results slight may lead to convergence of forecasted data and the actual
variation while shape of the forecasted pattern remain same as demand. Similar forecasted pattern for consumer 15 in shown
the actual load for standard appliances. in figure 8.
As discussed earlier, a frustrating forecasted result of The qualitative parameter of the two forecasting model
Authorized licensed use limited to: UNIVERSITY OF BIRMINGHAM. Downloaded on August 29,2023 at 06:02:31 UTC from IEEE Xplore. Restrictions apply.
Table III: Comparison of both models for consumer 22
Forecasting Techniques MSE MAPE
SVM 0.12644 150.78
ARIMA 4457.2 696.86
V. C ONCLUSION
Authorized licensed use limited to: UNIVERSITY OF BIRMINGHAM. Downloaded on August 29,2023 at 06:02:31 UTC from IEEE Xplore. Restrictions apply.
R EFERENCES
[1] R. Sadownik and E. P. Barbosa, “Short-term forecasting of industrial
electricity consumption in brazil,” Journal of Forecasting, vol. 18, no. 3,
pp. 215–224, 1999.
[2] E. Paparoditis and T. Sapatinas, “Short-term load forecasting: the similar
shape functional time-series predictor,” IEEE Transactions on power
systems, vol. 28, no. 4, pp. 3818–3825, 2013.
[3] Z. Yu, “A temperature match based optimization method for daily load
prediction considering dlc effect,” IEEE Transactions on Power Systems,
vol. 11, no. 2, pp. 728–733, 1996.
[4] G. Y. C. Li, X. Liu, and Y. Wang, “Study of support vector machines
for short-term load forecasting,” in Proceedings of the CSEE. IEEE,
2003, pp. 2–6.
[5] F. Grimaccia, M. Mussetta, and R. E. Zich, “Advanced predictive
models towards pv energy integration in smart grid,” in Fuzzy Systems
(FUZZ-IEEE), 2012 IEEE International Conference on. IEEE, 2012,
pp. 1–6.
[6] D. W. Bunn, “Forecasting loads and prices in competitive power
markets,” Proceedings of the IEEE, vol. 88, no. 2, pp. 163–169, 2000.
[7] K. W. Hipel and A. I. McLeod, Time series modelling of water resources
and environmental systems. Elsevier, 1994, vol. 45.
[8] K. Ahmed, M. Ampatzis, P. Nguyen, and W. Kling, “Application of
time-series and artificial neural network models in short term load
forecasting for scheduling of storage devices,” in Power Engineering
Conference (UPEC), 2014 49th International Universities. IEEE, 2014,
pp. 1–6.
[9] H. Nie, G. Liu, X. Liu, and Y. Wang, “Hybrid of arima and svms for
short-term load forecasting,” Energy Procedia, vol. 16, pp. 1455–1460,
2012.
[10] Y. Wang, Q. Xia, and C. Kang, “Secondary forecasting based on
deviation analysis for short-term load forecasting,” IEEE Transactions
on Power Systems, vol. 26, no. 2, pp. 500–507, 2011.
[11] V. Margaret and K. U. Rao, “Partial load shedding using ant colony
algorithm in smart grid environment,” in TENCON 2015-2015 IEEE
Region 10 Conference. IEEE, 2015, pp. 1–5.
[12] V. Margaret, K. U. Rao, and G. Ganeshprasad, “Intelligent load shed-
ding using ant colony algorithm in smart grid environment,” in Power
Electronics and Renewable Energy Systems. Springer, 2015, pp. 1149–
1162.
[13] S. Karthika, V. Margaret, and K. Balaraman, “Hybrid short term load
forecasting using arima-svm,” in Power and Advanced Computing
Technologies (i-PACT), 2017 Innovations in. IEEE, 2017, pp. 1–7.
[14] E. Ceperic, V. Ceperic, A. Baric et al., “A strategy for short-term load
forecasting by support vector regression machines,” IEEE Transactions
on Power Systems, vol. 28, no. 4, pp. 4356–4364, 2013.
Authorized licensed use limited to: UNIVERSITY OF BIRMINGHAM. Downloaded on August 29,2023 at 06:02:31 UTC from IEEE Xplore. Restrictions apply.