Professional Documents
Culture Documents
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 1
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 2
and rapid growth in subsequent years [4]. proposed. In analytical modeling, solar power is theoretically
While the world-wide deployment of solar energy modeled as a function of different weather conditions. Then the
contributes to a more sustainable future, the intermittent and parameters in the function are regressed according to field
volatile nature of solar power poses significant challenges to the experiments. Most of existing literature focuses on the accurate
reliable and economic operation of power grids [5]. On one modeling of a PV system. However, PV modeling is quite
hand, the intermittency of large-scale solar power may cause an complex, due to nonlinear and time-varying parameters in a
imbalance between power supply and demand, influencing the practical PV system. The analytical modeling-based regression
reliability of power grids. On the other hand, more traditional might not have satisfactory performance especially on rainy or
generation resources for rapid ramping and reserve should be cloudy days when weather conditions frequently vary.
scheduled to smooth out the fluctuations in solar power, In [8], a modified current-voltage relationship for the single-
decreasing the efficiency of generation resources [6]. Thus, it is diode model is presented. The model begins with a single solar
imperative to improve the forecast accuracy of solar power. In cell and is expanded to a PV module, then to a PV array. The
existing literature, solar power forecasting methods can be parameters of the model are regressed with experimental data
classified from different perspectives. For example, based on under different levels of irradiance and temperature. In [9], an
different forecasting engines, the methods can be categorized empirical model is formulated between solar power and
into machine learning and analytical modeling [7]. weather conditions, including irradiance, ambient temperature
In machine learning methods, the relationships between solar and humidity. Then, the parameters of that model are regressed
power and weather conditions, e.g., surface irradiance, ambient using actual observations of a PV module. In addition, some
temperature and wind speed, are learned from training with a recent studies investigate irradiance and temperature modeling
large amount of historical data. Most of the existing literature of a PV array [10][11].
focuses on improving the learning ability of the forecasting Some ensemble methods have been proposed to take
engine. However, the weather conditions from the numerical advantage of both machine learning and analytical modeling
weather prediction (NWP) are directly used as the input of methods. In [20], different weather conditions are respectively
machine learning, without preprocessing the coupling among forecasted using spatial and temporal time series methods. Then
these weather conditions. solar power is calculated using the forecasted weather
In [4], a weather-based method for day-ahead hourly conditions based on an empirical model. However, the
forecasting of PV power is presented. That method comprises ensemble method in [20] still highly relies on the accuracy of
classification, training and forecasting stages and is validated PV models. In [21], an ensemble method is proposed, which has
using real-world historical data of ambient temperature, three forecasting engines: an autoregressive model, gradient
precipitation and solar irradiance. In [12], a data-driven forecast boosting regression model and an empirical PV model. The
model is proposed to improve the accuracy of a short-term PV three engines are separately used to forecast solar power.
power forecast. The autoregressive method is adopted to However, the physical knowledge provided by PV models is
forecast 1-h-ahead solar power considering solar irradiance. In not thoroughly used in the forecasting engines.
[14], a similar-day detection engine is proposed to find several The literature review is summarized in TABLE I.
days that are similar to the target forecast day considering TABLE I
Literature Review on Solar Power Forecasting Methods
irradiance, temperature and wind speed. Then, the forecast
Method Weather feature Forecasting engine PV modeling
accuracy is validated using four different forecasting engines. Machine Ambient SVM[4] [14] [15], /
In [15], support vector machine (SVM) is used to learn the learning temperature[4] [16], persistence[12][13], K-
mapping between weather conditions from satellite images and cell temperature[14], nearest neighbor
solar power. The weather conditions include wind speed, cloud precipitation[4], solar (KNN)[14], artificial
irradiance[4] [12] [15] neural network
coverage and irradiance. In [16], a higher-order Markov chain [16]
, wind speed[14], (ANN) [14], weighted
is adopted to forecast the probability distribution of PV power. surface KNN[14], Markov
Ambient temperature and solar irradiance measurements are irradiance[14], cloud chain[16]
used to classify different operating conditions of a PV system. cover[14] [15]
Analytical Ambient Nonlinear Single-diode
In [17], a novel weighted Gaussian process regression approach modeling temperature[8] [9] [11], regression[9] [10] [11] model[8] [9],
is proposed, such that data samples with higher outlier have a cell temperature , [8]
empirical
lower weight. In [18], aerosol index is considered as an solar irradiance[8] [9] model[10] [11]
[11]
additional input parameter of solar power forecast. The case , humidity[9] [10]
[11]
, precipitation[10],
studies show that the linear correlation between aerosol index cloud cover[10], wind
and solar radiation helps improve forecasting accuracy. In [19], speed[11]
a forecasting framework is proposed to explore information Ensemble Surface irradiance[20] Persistence[20], Empirical
[21]
from a grid of numerical weather predictions applied to both method , ambient autoregression[20] model[20] [21]
temperature[20] [21], [21]
, ANN[20],
wind and solar energy. Compared to a model that only considers wind speed[20] [21] compressive spatio-
one NWP point for a specific location, a grid of weather temporal
predictions can greatly reduce the forecasting errors. forecasting[20],
To explore the physical relationships between solar power gradient boosting
regression[21]
and weather conditions, analytical modeling methods are
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 3
According to existing forecasting methods, there remain module will be elaborated in Section II-B.
three issues to be addressed: i) The PV modeling is In the machine learning module, principal component
oversimplified that some important weather conditions might analysis (PCA) is used for feature extraction. K-nearest
be overlooked. For example, solar power is related to different neighbor (KNN) is employed to classify a forecasting period
irradiance components, including beam, diffuse and reflection into the historical periods with the closest weather conditions.
irradiance. ii) The weather measurements from NWP serve as To demonstrate the performance of the analytical modeling
direct inputs of machine learning methods, ignoring the integrated with different forecasting engines, three well-known
correlations between different weather measurements. iii) The classic forecasting engines [14], i.e., SVM, ANN and weighted
nonlinearity between weather conditions and solar power have KNN, are applied respectively. This module will be elaborated
not been thoroughly explored from PV analytical models. in Section II-C.
According to PV analytical modeling, some important weather In the deviation analysis module, cross-validation is used to
features and the coupling among weather conditions cannot be obtain the distribution of forecast errors. Then the forecasted
directly measured or predicted from NWP. Instead, these key solar power can be adjusted by a compensation term. This
weather factors need to be theoretically derived from PV module will be elaborated in Section II-D.
models. Without making full use of these key factors, the ability The major contribution of this paper is to improve forecasting
of machine learning methods cannot be efficiently deployed. performance by taking advantage of the key weather factors
Analytical modeling of PV systems can provide physical derived from PV analytical modeling. The forecasting engine
knowledge for solar power forecasting, and machine learning itself is not the focus of this paper. In the case studies, the solar
methods can capture the nonlinear and time-varying parameters power forecasting with and without the key weather factors will
of PV systems. To fully deploy the advantages of both methods, be compared to demonstrate the impacts brought on by PV
a novel solar forecasting approach is proposed in this paper. The analytical modeling.
proposed approach is comprised of three engines: i) analytical Weather predictions
modeling of PV systems; ii) machine learning methods to map
weather features and solar power; and iii) a deviation analysis Beam Reflect Diffuse Ambient Wind
for solar power forecast adjustment. The PV analytical Solar irradiance Cell temperature
modeling can generate key weather features, and then the Solar power = f (Weather conditions)
machine learning methods take advantage of the key features to Analytical model
improve the forecasting accuracy.
PCA Extract principal components
The major contributions of this paper are threefold:
1) The weather knowledge related to solar power is explored KNN Find closest weather conditions
from an analytical model. The relationships between solar SVM/ANN/Weighted KNN Forecast
power and key weather features are deduced. Machine learning method
2) The key weather factors provided by PV analytical
Cross Probability Compensati
modeling are used to reformulate input features, which validation distribution on term
improves the performance of the machine learning methods.
3) A deviation analysis is conducted to improve the Deviation analysis
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 4
1) Plane of Array Irradiance 𝐸𝑃𝑂𝐴 Pmp cSTD EPOA 1 cT TC T0
EPOA is comprised of three components: i) beam component
Eb k3 EGHI k4 EDHI (9)
2
(12)
of the diffuse irradiance from the sky dome, related to the EGHI t EDHI t , EDHI t Eb t , t 1, 2,..., T .
diffuse horizontal irradiance (DHI). The isotropic sky model cGR
[24] of the DHI is expressed as follows:
EGHI Eg
Ed cSKY EDHI , (5) c SK Y
where cSKY 0.5 1 cos T , array . In practice, EGHI can be f DHI EDHI
cos AOI
Ed EPOA
observed and forecasted from global satellite images. Then,
f DNI EDNI Eb
given EGHI , the values of EDNI and EDHI can be estimated
1
based on the existing research [25]. In this paper, the empirical TC
cW 0 cW 1VW
model in [26] is adopted to estimate EDNI and EDHI from cTE TA
EGHI , simplified as follows: Fig. 2. The block diagram of the irradiance and temperature calculations.
EDNI f DNI EGHI , EDHI f DHI EGHI . (6)
Therefore, the key weather features, which are highly related
Therefore, EPOA can be derived from the three irradiance to solar power, can be generated from the analytical model. In
components: practice, other weather predictions may influence the accuracy
EPOA Eb Eg Ed =Eb cGR EGHI cSKY EDHI . (7) of solar power forecast, e.g., humidity, pressure and
2) Photovoltaic Cell Temperature 𝑇𝐶 precipitation. So the key weather feature matrix XC and other
The PV cell temperature can be derived based on heat weather predictions will be together taken as the input of
transfer with the ambient environment [27], shown as follows: machine learning methods. In contrast to existing forecasting
c E methods, the key weather factors from the analytical modeling
TC TA TE POA , (8)
cW 0 cW 1VW are used to reformulate the input of the machine learning
methods, instead of directly inputting weather predictions.
where cTE is a constant factor related to the adsorption
C. Machine Learning Methods
efficiency of the PV system, cW 0 and cW 1 are the constant heat
In the machine learning module, PCA is used for feature
transfer factor and the convective heat factor, equaling 25
extraction. KNN is employed to classify the forecasting period
W/m2·K and 6.84 W/m3·s·K, respectively [27].
into the historical periods with the closest weather conditions.
3) Reformulation
Then SVM, ANN and weighted KNN, are used as the
According to the models of EPOA and TC , the block diagram forecasting engines. The schematic is shown in Fig. 3.
of the calculation is shown in Fig. 2. T N
In Fig. 3, the input feature matrix XI R is composed of
In practice, some parameters are difficult to precisely acquire
without field experiments, e.g., cGR and cTE . Therefore, key weather features XC and other available weather
according to (7) and (8), Eb , EGHI , EDHI , TA and VW are predictions. The row of X I represents each historical period
and each column represents each feature. After PCA, the
regarded as the variables instead of EPOA and TC . Then, (1)
historical and forecasting principal component matrixes
can be reformulated as follows:
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 5
X P RT L and XPF R H L are simultaneously extracted. For of K during the cross-validation for the historical datasets.
In this paper, three well-known forecasting engines are
each forecasting period from 1 to H, K nearest historical
adopted [14]. SVM is a promising tool to classify and regress
neighbors of each row of X PF are then acquired from X P .
with even small-sized training datasets [32]. The SVM
Then three forecasting engines are employed to learn the regression performs linear regressions in the high-dimension
mapping between the principal features and solar power of feature space using –insensitive loss while reducing model
these K nearest neighbors. The learned mapping will be used complexity [33]. The kernel function is selected as the radial
for forecasting solar power. basis function (RBF). The hyperparameters in SVM, e.g., the
Standardization
Calculate Manhattan distance
penalty parameter in the objective and the deviation tolerance,
Input
feature Singular value decomposition are determined by sensitivity analysis.
matrix
Extract principal features
Find K nearest neighbors Back propagation (BP) neural network is one of the most
PCA KNN widely used ANNs. BP-ANN employs a multilayered feed-
forward topology, and the basic structure is a three-layer
Principal features of K Learn Solar power of K network with the input, hidden and output layers [34].
nearest neighbors nearest neighbors
SVM/ ANN
Forecasted In this paper, KNN is used to find K nearest neighbors, and
Principal features of
the forecasting period
Learned mapping solar
power
the weighted KNN is used to calculate the forecasted solar
Weighted Manhattan distance of Weighted power by an average weighted by exponential function [31]:
KNN principal features exponential K
MD XPF t ,, XP i ,
Forecasting engines e Pmp i
ˆ t
P i 1 . (15)
Fig. 3. The schematic of the machine learning methods. mp K
MD XPF t ,, XP i ,
e
i 1
PCA is a powerful tool in dimensional reduction and
denoising for highly correlated data [28]. Since redundant data D. Deviation Analysis
and the noise influence the forecasting accuracy, PCA is used In practice, the control scheme of a PV system is aimed at
in this paper to extract the principal components of the weather controlling its inverter to work in maximum power point
features while eliminating redundancy. tracking (MPPT) mode. However, it is difficult to instantly
The first step of PCA is to standardize each column of X I in track the maximum power point on rainy, foggy and cloudy
the following manner [29]: days [35]. These volatile weather conditions can greatly
XI i, j min XI , j influence the performance of the inverters in PV systems. Thus,
XS i , j . (13)
max XI , j min XI , j system bias of solar power forecast is inevitable in the real
world. To reduce the errors caused by the system bias, a
Then, the singular value decomposition (SVD) is applied to
deviation analysis based on the cross-validation for historical
extract the principal components X P and X PF .
datasets is conducted. A compensation term is then generated
KNN is aimed at classifying a forecasting period into K ˆ .
to adjust the forecasted solar power P mp
historical periods with the closest weather conditions. This step
is known as similar-day selection [14]. The procedures of the deviation analysis for the forecasting
In existing literature, it has been widely verified that the period t are as follows:
relationships between weather conditions and solar power Procedure of Deviation Analysis for Forecasting Period t
highly depend on weather types [14]. [30] shows the 1 Randomly divide the K nearest neighbors into S equal-
normalized root mean square errors (nRMSE) of solar power sized groups;
forecast with 15-min intervals are 7.85% for sunny days, 9.12% 2 i=1;
for cloudy days, 12.40% for rainy days and 12.60% for foggy 3 Repeat
days. Therefore, it is important to classify a forecasting period 4 Take the ith group as a validation;
into a group of historical periods with closest weather 5 Take the remaining S-1 groups for training;
conditions. Manhattan distance MD is selected as the 6 Learn the mapping of the S-1 groups between the
measurement between forecasting and historical instances [31]: principal components and the solar power;
L 7 Define the forecasted deviations as the difference
MD X PF i, , X P j , X PF i, n X P j , n . (14) between the actual and forecasted solar power;
n 1
For the forecasting periods from 1 to H, K historical instances Calculate the forecasted deviations of the ith group;
8 i=i+1;
with smallest MD, will be selected from X P . It is worth
9 until i=S+1.
mentioning that the value of K in KNN has an impact on the For forecasting period t, the solar power outputted from the
forecasting accuracy. If the value of K is too small, the training
dataset will be insufficient due to the small sample size; If the forecasting engine is Pˆ t . Then, the statistical deviations of
mp
value of K is too large, irrelevant historical periods will be the forecasted power within the neighbor [ Pˆ mp t - , Pˆ mp t +
selected into the training dataset, increasing forecasting errors.
] can be obtained from the deviation distribution, where is
Thus, sensitivity analysis is conducted to determine the value
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 6
a pre-defined constant. The compensation term Pmp t is where H 2 is the number of forecasting periods, and the periods
defined as the average of the forecasted deviations. The solar with 0 power are excluded. To demonstrate the effectiveness of
power forecast is adjusted as follows: the analytical modeling, different forecasting methods with and
ˆ * t P
P ˆ t P t . (16) without the key weather factors are compared in the case studies.
mp mp mp
The forecasting methods are shown in TABLE II.
An example based on measured datasets from Australia is TABLE II
illustrated in Fig. 4. The forecasted deviations are shown in Fig. FORECASTING METHODS WITH AND WITHOUT KEY WEATHER FACTORS
4(a), and the probability distributions of the deviations are No. Forecasting Engine Key weather factors Other weather conditions
shown in Fig. 4(b) under different levels of forecasted solar S1 SVM √ √
S2 SVM × √
power. As one can observe, the forecasting deviations obey a A1 ANN √ √
certain probability distribution, therefore it is reasonable to A2 ANN × √
adjust the solar power forecast according to the statistical errors. W1 Weighted KNN √ √
√
If the forecasted solar power from SVM is P ˆ t 0.3 , W2 Weighted KNN ×
mp NR Nonlinear regression √ ×
is set to 0.01 and the neighbor will be [0.3-0.01, 0.3+0.01]. S1, A1 and W1 are the proposed methods using SVM, ANN
Then, the compensation term Pmp t is the average of the and weighted KNN as the forecasting engine, respectively. By
deviations within the neighborhood. Note that the value of comparing different methods with and without key weather
factors, the effects of the analytical modeling can be
has a significant impact on the value of Pmp t . If is too
demonstrated. NR is a typical analytical method that uses
small, the points in the neighbor are sparse and the variation is nonlinear regression to estimate the parameters in the
large. If is too large, the forecasted solar power varies in a expression (9).
large neighbor and some points are not representative.
A. Solar Power Forecast Using Data in Australia
Therefore, sensitivity analysis is conducted to determine the
value of . 1) Data Description
In this section, the measured datasets from three PV arrays in
Deviation
Australia are tested [37]. The descriptions of the three PV arrays
Deviation within
the neighbor are shown in TABLE III.
TABLE III
DESCRIPTION OF THREE PV ARRAYS IN AUSTRALIA
… … PV Latitude Longitude PC A,array T ,array
1 35°16’30’’S 149°6’49’’E 1.56 kW 38° 36°
2 35°23’32’’S 149°4’1’’E 4.94 kW 327° 35°
3 35°32’S 149°9’E 4.00 kW 31° 21°
The hourly historical weather predictions and solar power
a b
Fig. 4. The deviation distribution based on measured datasets from Australia. from April 1, 1:00, 2012 to June 30, 24:00, 2013 are taken as
the training datasets. Seven scenarios are designed to test the
III. CASE STUDIES forecasting performance, as shown in TABLE IV.
TABLE IV
To verify the accuracy of the proposed forecasting approach, FORECASTING SCENARIOS
measured datasets from PV systems in Australia are used. Four Scenario Forecasting dataset
error indices are adopted to measure the forecast accuracy [5]: Spring From October 1, 1:00, 2013 to October 31, 24:00, 2013
i) Normalized mean absolute error (nMAE) Summer From January 1, 1:00, 2014 to January 31, 24:00, 2014
Autumn From April 1, 1:00, 2014 to April 30, 24:00, 2014
1 H 2 Pmp t Pmp t
ˆ* Winter From July 1, 1:00, 2013 to July 31, 24:00, 2013
nMAE
H 2 t 1 PC
100% , (17) Sunny 30 days with highest GHI from July 1, 2013 to May 1, 2014
Cloudy 30 days with highest cloud coverage from July 1, 2013 to May 1, 2014
ii) Normalized root mean square error (nRMSE) Humid 30 days with highest relative humidity from July 1, 2013 to May 1, 2014
Pˆ t Pmp t
*
2 The weather data are from the datasets in [37]. The hourly
H2
1 weather predictions include 11 independent variables: total
mp
nRMSE 100% , (18)
PC t 1 H2 column liquid water, total column ice water, surface pressure,
iii) Normalized largest absolute error (nLAE) relative humidity, total cloud coverage, wind speed,
temperature, GHI, surface thermal radiation, top net solar
nLAE
1
PC
max Pˆ mp
*
t Pmp t , t 100% , (19) radiation Ea and total precipitation. The statistical descriptions
of the historical datasets are shown in Appendix.
iv) Energy production error (EPE) As shown in (11), the key weather factors include 12 terms,
Pˆ t P t
H2
* which can be calculated based on the weather variables. The
mp mp
t 1 input features of the proposed methods, i.e., S1, A1 and W1, are
EPE H2
100% , (20) the 11 available weather variables from the datasets and 12 key
Pmp t
t 1
weather factors. The input features of S2, A2 and W2 are only
the 11 available weather variables from the datasets. The input
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 7
features of NR are only the 12 key weather factors. Then the 3 nMAE 1.88
1.41 1.61 2.00 2.20 2.23 2.24
forecasting is carried out day by day. nRMSE 2.63
2.20 2.36 2.84 3.05 3.05 2.98
nLAE 12.4
10.9 12.2 13.7 10.3 10.6 11.7
5-fold cross validation is conducted to determine the EPE 4.83
4.63 1.40 2.54 6.08 8.64 6.72
hyperparameters in different forecasting methods [36], TABLE VII
including the value of 𝜀 in the deviation analysis, the value of K FORECAST ERRORS IN AUTUMN SCENARIO IN AUSTRALIA (%)
in KNN, the parameters in SVM and the number of the hidden PV Index S1 S2 A1 A2 W1 W2 NR
layer in ANN. 1 nMAE 4.33 5.05 4.45 5.08 5.31 5.57 5.57
nRMSE 6.52 7.37 6.62 7.53 7.39 7.78 7.82
2) Forecast Results of Four Seasons nLAE 31.4 37.1 28.8 41.3 32.4 34.4 35.0
In this subsection, the forecast results of four seasons are EPE 2.24 9.40 4.65 10.5 1.14 11.2 7.63
compared with different forecasting methods. The actual and 2 nMAE 1.50 1.78 1.63 1.89 1.90 1.94 2.19
forecasted solar power of PV 1 on typical days in four seasons nRMSE 2.26 2.51 2.34 2.61 2.63 2.64 2.93
nLAE 9.36 8.61 8.72 9.04 7.72 9.21 9.67
are compared in Fig. 5 using different methods. EPE 1.01 10.6 2.58 10.3 1.80 10.6 3.10
3 nMAE 1.76 2.01 1.96 2.21 2.19 2.27 2.11
nRMSE 2.83 2.93 2.96 3.20 3.13 3.13 3.01
nLAE 12.0 10.8 11.4 10.5 9.90 11.4. 11.7
EPE 3.39 3.65 1.14 6.09 1.32 8.63 1.17
TABLE VIII
FORECAST ERRORS IN WINTER SCENARIO IN AUSTRALIA (%)
PV Index S1 S2 A1 A2 W1 W2 NR
1 nMAE 5.63 6.06 5.51 6.51 6.31 6.46 6.53
nRMSE 8.66 9.03 8.53 9.59 9.21 9.31 8.94
nLAE 32.6 31.2 33.7 33.8 33.0 31.9 30.1
EPE 8.23 8.33 7.11 7.94 4.24 9.20 7.34
2 nMAE 1.77 1.81 1.84 1.84 2.05 2.12 2.17
nRMSE 2.50 2.71 2.51 2.67 2.72 2.92 2.85
nLAE 10.4 10.1 10.7 9.44 8.09 9.16 8.07
EPE 1.79 10.6 2.66 9.92 10.6 9.31 2.42
3 nMAE 2.04 2.35 2.19 2.57 2.56 2.73 2.62
nRMSE 3.11 3.31 3.14 3.54 3.54 3.75 3.51
nLAE 13.3 12.2 12.0 12.6 13.0 12.1 10.7
EPE 4.79 3.73 3.50 3.17 0.53 14.0 4.83
From the forecast results, a good match between the actual
and forecasted solar power is obtained using the proposed
methods. By comparing S1 with S2, A1 with A2 and W1 with
Fig. 5. Solar power of PV 1 on 18th of each month in four seasons using
different methods.
W2 in four seasons, one can observe that the proposed methods
outperform the traditional methods at nMAE and nRMSE due
The error indices of the forecast results are shown in TABLE to taking key weather factors in account. For nLAE, the
V, TABLE VI, TABLE VII and TABLE VIII. proposed methods with SVM, ANN or Weighted KNN
TABLE V generally achieves the minimum except the cases in winter. As
FORECAST ERRORS IN SPRING SCENARIO IN AUSTRALIA (%) shown in TABLE VIII, NR has the minimum nLAE in winter
PV Index S1 S2 A1 A2 W1 W2 NR while the proposed methods also have satisfactory performance.
1 nMAE 4.03 6.44 4.00 6.43 6.63 7.37 7.89
For EPE, the proposed methods have the minimum errors
nRMSE 6.26 8.99 6.60 9.36 8.80 9.65 10.3
nLAE 24.8 28.3 32.0 33.8 24.0 26.4 33.2 except for PV 1 in summer.
EPE 3.09 5.00 0.03 3.17 2.40 4.01 18.9 By comparing the forecast results in different seasons, one
2 nMAE 1.37 2.39 1.45 2.47 1.81 2.39 2.68 can observe that better forecast performance is achieved in
nRMSE 2.22 3.17 2.24 3.27 2.50 3.08 3.32 summer while the winter generally has worse forecast
nLAE 10.5 9.50 9.18 9.55 8.01 8.77 9.57
EPE 4.91 14.2 1.02 12.7 3.58 7.95 12.3 performance. For nMAE, the minimum errors of three PV
3 nMAE 1.72 2.47 1.92 2.51 2.67 2.92 2.74 arrays are 3.02% (PV 1), 1.73% (PV 2) and 1.41% (PV 3) in
nRMSE 2.77 3.44 2.91 3.61 3.41 3.73 3.65 summer while 5.51% (PV 1), 1.77% (PV 2) and 2.04% (PV 3)
nLAE 14.9 16.4 14.6 17.6 13.9 14.1 17.3 in winter. For nRMSE, the minimum errors of three PV arrays
EPE 0.67 3.33 0.94 2.90 0.24 1.90 10.5
are 5.02% (PV 1), 2.48% (PV 2) and 2.20% (PV 3) in summer
TABLE VI
FORECAST ERRORS IN SUMMER SCENARIO IN AUSTRALIA (%) while 8.53% (PV 1), 2.50% (PV 2) and 3.11% (PV 3) in winter.
PV Index S1 S2 A1 A2 W1 W2 NR This is because more sunny days are in summer than in winter.
1 nMAE 3.02 5.14 3.38 5.42 5.79 6.19 7.06 3) Forecast Results of Three Weather Types
nRMSE 5.02 7.25 5.64 7.92 7.81 8.08 9.19 In this subsection, the forecast results of three weather types
nLAE 26.1 25.6 26.8 30.8 24.1 32.2 24.7
EPE 2.28 5.49 2.09 5.05 8.06 9.50 0.79
are compared with different forecasting methods. The actual
2 nMAE 1.73 2.63 1.75 2.69 2.22 2.75 3.05 and forecasted solar power of PV 1 on typical days of three
nRMSE 2.48 3.78 3.08 3.79 3.21 3.79 3.97 weather are compared in Fig. 6 using different methods.
nLAE 15.7 15.9 15.3 15.0 13.0 13.3 13.9
EPE 5.42 17.0 4.16 16.0 10.9 17.3 16.5
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 8
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 9
Fig. 10. Forecast errors of key weather factors in 𝐗 𝐂 for PV 1 on humid days.
IV. CONCLUSION
In this paper, a novel solar power forecasting approach is
proposed by exploring the key weather factors from the
analytical modeling of PV systems. The analytical model of a
PV system can provide physical knowledge regarding solar
power, and the machine learning methods explore the nonlinear
and time-varying nature of a practical PV system. In the
analytical modeling module, key weather features are
theoretically derived from weather predictions. In contrast to
Fig. 8. Forecast errors of key weather factors in 𝐗 𝐂 for PV 1 on sunny days. existing methods, the input of the machine learning methods is
reformulated using both key weather features and available
weather predictions. To further reduce forecasting errors,
deviation analysis for the historical datasets is conducted to
adjust the forecasted solar power.
Case studies based on the measured datasets from PV
systems in Australia demonstrate that by taking advantage of
the key weather features derived from PV models, the
forecasting performance can be effectively improved. In the
future, two aspects deserve further investigations: i) The key
weather factors derived from different PV analytical models
and their contributions to solar power forecast; ii) The forecast
of the ramping events in solar power caused by sudden weather
changes. The proposed approach will possibly provide new
Fig. 9. Forecast errors of key weather factors in 𝐗 𝐂 for PV 1 on cloudy days.
insights for solar power forecasting technique.
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 10
APPENDIX [6] Z. Li, L. Ye, Y. Zhao, et al, “Short-term wind power prediction based on
extreme learning machine with error correction,” Protection and Control
The statistical descriptions of the historical datasets are of Modern Power Systems, vol. 1, no. 1, pp. 1-8, 2016.
shown in TABLE XIII, TABLE XIV and TABLE XV. Note [7] F. Almonacid, P. J. Perez-Higueras, E. F. Fernandez, et al, “A
methodology based on dynamic artificial neural network for short-term
that the periods with zero solar power output are excluded. forecasting of the power output of a PV generator,” Energy conversion
TABLE XIII and Management, vol. 85, pp. 389-398, 2014.
STATISTICAL DESCRIPTIONS OF THE HISTORICAL DATASETS OF PV 1 [8] H. Tian, F. Mancilla-David, K. Ellis, et al, “A cell-to-module-to-array
Weather feature Mean Median Standard deviation detailed model for photovoltaic panels,” Solar Energy, vol. 86, pp. 2695-
Total column liquid water (kg/m2) 0.04 0 0.11 2706, 2012.
Total column liquid water (kg/m2) 0.02 0 0.05 [9] H. S. Sahu, S. K. Nayak, “Numerical approach to estimate the maximum
Surface pressure (103 Pa) 94.67 94.68 0.58 power point of a photovoltaic array,” IET Gener. Transm. Distrib., vol.
Relative humidity (%) 57.82 56.87 19.45 10, no. 11, pp. 2670-2680, 2016.
Total cloud cover 0.43 0.36 0.40 [10] A. Shah, H. Yokoyama, N. Kakimoto, “High-precision forecasting model
Wind speed (m/s) 3.42 2.98 2.09 of solar irradiance based on grid point value data analysis for an efficient
Temperature (K) 289.24 288.85 7.00 photovoltaic system,” IEEE Trans. Sustain. Energy, vol. 6, no. 2, pp. 474-
Global horizontal irradiance (J/m2) 405.29 364.90 305.35 481, 2015.
Surface thermal radiation (J/m2) 317.77 317.74 41.34 [11] A. J. Veldhuis, A. M. Nobre, I. M. Peters, et al, “An empirical model for
Top solar radiation (J/m2) 476.33 452.46 319.58 rack-mounted PV module temperatures for southeast Asian locations
Total precipitation (10-5 m) 7.88 0 35.52 evaluated for minute time scales,” IEEE Journal of Photovoltaics, vol. 5,
no. 3, pp. 774-782, 2015.
Solar power (kW) 0.33 0.26 0.28
[12] C. Yang, A. A. Thatte, L. Xie, “Multitime-scale data-driven spatio-
TABLE XIV temporal forecast of photovoltaic generation,” IEEE Trans. Sustain.
STATISTICAL DESCRIPTIONS OF THE HISTORICAL DATASETS OF PV 2 Energy, vol. 6, no. 1, pp. 104-112, 2015.
Weather feature Mean Median Standard deviation [13] C. Wan, J. Zhao, Y. Song, et al, “Photovoltaic and solar power forecasting
Total column liquid water (kg/m2) 0.04 0 0.12 for smart grid energy management,” CSEE J. Power and Energy Syst., vol.
Total column liquid water (kg/m2) 0.02 0 0.05 1, no. 4, pp. 38-46, 2015.
Surface pressure (103 Pa) 94.12 94.13 584.20 [14] Y. Zhang, M. Beaudin, R. Taheri, et al, “Day-ahead power output
Relative humidity (%) 56.23 54.81 19.38 forecasting for small-scale solar photovoltaic electricity generators,”
Total cloud cover 0.43 0.37 0.39 IEEE Trans. Smart Grid, vol. 6, no. 5, pp. 2253-2262, 2015.
Wind speed (m/s) 3.17 2.74 1.95 [15] H. Jang, K. Bae, H. Park, et al, “Solar power prediction based on satellite
Temperature (K) 289.00 288.60 6.84 images and support vector machine,” IEEE Trans. Sustain. Energy, vol.
Global horizontal irradiance (J/m2) 409.57 370.47 302.68 7, no.3, pp. 1255-1263, 2016.
Surface thermal radiation (J/m2) 315.35 315.23 40.94 [16] M. Sanjari, H. Gooi, “Probabilistic forecast of PV power generation based
Top solar radiation (J/m2) 483.14 461.86 317.42 on higher-order Markov chain,” IEEE Trans. Power Syst., 2016.
Total precipitation (10-5 m) 7.97 0 34.98 [17] H. Sheng, J. Xiao, Y. Cheng, et al, “Short-term solar power forecasting
based on weighted Gaussian process regression,” IEEE Trans. Industrial
Solar power (kW) 0.37 0.34 0.29
Electronics, 2017.
TABLE XV [18] J. Liu, W. Fang, X. Zhang, et al, “An improved photovoltaic power
STATISTICAL DESCRIPTIONS OF THE HISTORICAL DATASETS OF PV 3 forecasting model with the assistance of aerosol index data,” IEEE Trans.
Weather feature Mean Median Standard deviation Sustain. Energy, vol. 6, no. 2, pp. 434-442, 2015.
Total column liquid water (kg/m2) 0.04 0 0.12 [19] J. R. Andrade, R. J. Bessa, “Improving renewable energy forecasting with
Total column liquid water (kg/m2) 0.02 0 0.05 a grid of numerical weather predictions,” IEEE Trans. Sustain. Energy,
Surface pressure (103 Pa) 92.47 92.49 575.63 2017.
Relative humidity (%) 57.06 55.48 19.94 [20] A. Tascikaraoglu, B. Sanandaji, G. Chicco, et al, “Compressive spatio-
Total cloud cover 0.44 0.40 0.39 temporal forecasting of meteorological quantities and photovoltaic power,”
Wind speed (m/s) 3.13 2.74 1.85 IEEE Trans. Sustain. Energy, vol. 7, no. 3, pp. 1295-1305, 2016.
Temperature (K) 288.06 287.69 6.72 [21] J. M. Filipe, R. J. Bessa, J. Sumaili, et al, “A hybrid short-term solar power
Global horizontal irradiance (J/m2) 401.88 360.55 303.91 forecasting tool,” Intelligent System Application to Power Systems, 2015
Surface thermal radiation (J/m2) 307.36 306.31 40.58 18th International Conference on, pp. 1-6, 2015.
Top solar radiation (J/m2) 475.38 452.71 320.68 [22] B. Marion, “Comparison of predictive models for photovoltaic module
performance,” Photovoltaic Specialists Conference, 33rd IEEE, pp. 1-6,
Total precipitation (10-5 m) 8.60 0 36.57
2008.
Solar power (kW) 0.38 0.35 0.30
[23] I. Reda, A. Andreas, “Solar position algorithm for solar radiation
The detailed data can be found in [37]. applications,” Solar Energy, vol. 76, no. 5, pp. 577-589, 2004.
[24] P. G. Loutzenhiser, H. Manz, C. Felsmann, et al, “Empirical validation of
models to compute solar irradiance on inclined surfaces for building
REFERENCES energy simulation,” Solar Energy, vol. 81, no. 2, pp. 254-267, 2007.
[1] Y. Ding, C. Kang, J. Wang, et al, “Foreword for the special section on [25] F. J. Batlles, M. A. Rubio, J. Tovar, et al, “Empirical modeling of hourly
power system planning and operation towards a low-carbon economy,” direct irradiance by means of hourly global irradiance,” Energy, vol. 25,
IEEE Trans. Power Syst., vol. 30, no. 2, pp. 1015-1016, 2015. no. 7, pp. 675-688, 2000.
[2] J. Wang, H. Zhong, Z. Ma, et al, “Review and prospect of integrated [26] D. T. Reindl, W. A. Beckman, J. A. Duffle, “Diffuse fraction correlations,”
demand response in the multi-energy system,” Appl. Energy, vol. 202, pp. Solar Energy, vol. 45, pp. 1-7, 1990.
772-782, 2017. [27] D. Fairman, “Assessing the outdoor operating temperature of photovoltaic
[3] Mercom Capital Group (2016) Mercom forecasts 76 GW in global solar modules,” Progress in Photovoltaics: Research and Applications, vol. 16,
installations in 2016, a 48% increase over 2015 [Online]. Available: no. 4, pp. 307-315, 2008.
http://mercomcapital.com/mercom-forecasts-76-gw-in-global-solar- [28] K. Benidis, Y. Sun, P. Babu, et al, “Orthogonal sparse PCA and
installations-in-2016-a-48-yoy-increase-over-2015 covariance estimation via Procrustes reformulation,” IEEE Trans. Signal
[4] H. Yang, C. Huang, Y. Huang, et al, “A weather-based hybrid method for Process., vol. 64, no. 23, pp. 6211-6226, 2016.
1-day ahead hourly forecasting of PV power output,” IEEE Trans. Sustain. [29] I. B. Mohamad, D. Usman, “Standardization and its effects on K-means
Energy, vol. 5, no. 3, pp. 917-926, 2014. clustering algorithm,” Research Journal of Applied Sciences, Engineering
[5] E. B. Ssekulima, M. B. Anwar, A. A. Hinai, et al, “Wind speed and solar and Technology, vol. 6, no. 17, pp. 3299-3303, 2013.
irradiance forecasting techniques for enhanced renewable energy [30] J. Shi, W. J. Lee, Y. Liu, et al, “Forecasting power output of photovoltaic
integration with the grid: a review,” IET Renew. Power Gener., vol. 10, systems based on weather classification and support vector machines,”
no. 7, pp. 885-898, 2016. IEEE Trans. Ind. Appl., vol. 48, no. 3, pp. 1064-1069, 2012.
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TSG.2017.2766022, IEEE
Transactions on Smart Grid
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 11
1949-3053 (c) 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.