Professional Documents
Culture Documents
a r t i c l e i n f o a b s t r a c t
Article history: Simulation of radiation is one of the most important issues in solar utilization. Time series models are
Received 25 August 2014 useful tools in the estimation and forecasting of solar radiation series and their changes. In this paper,
Accepted 24 December 2014 the effectiveness of autoregressive moving average (ARMA) models with various generalized autoregres-
Available online 13 January 2015
sive conditional heteroskedasticity (GARCH) processes, namely ARMAGARCH models are evaluated for
their effectiveness in radiation series. Six different GARCH approaches, which contain three different
Keywords: ARMAGARCH models and corresponded GARCH in mean (ARMAGARCH-M) models, are applied in
Models
radiation data sets from two representative climate stations in China. Multiple evaluation metrics of
Solar radiation
ARIMA
modeling sufciency are used for evaluating the performances of models. The results show that the
Generalized Autoregressive Conditional ARMAGARCH(-M) models are effective in radiation series estimation. Both in tting and prediction of
Heteroscedasticity (GARCH) radiation series, the ARMAGARCH(-M) models show better modeling sufciency than traditional mod-
Prediction els, while ARMAEGARCH-M models are robustness in two sites and the ARMAsGARCH-M models
appear very competitive. Comparisons of statistical diagnostics and model performance clearly show that
the ARMAGARCH-M models make the mean radiation equations become more sufcient. It is recom-
mended the ARMAGARCH(-M) models to be the preferred method to use in the modeling of solar radi-
ation series.
2015 Elsevier Ltd. All rights reserved.
1. Introduction There are a number of solar energy models presented in the lit-
erature, which include mathematical linear [2,79], nonlinear
Solar energy is a part of the suns energy that falls at the earths functions [1012], Support Vector Machines (SVM) [13], articial
surface for a wide range of applications in engineering, meteorol- neural network [1418], fuzzy logic [19,20], and some hybrid
ogy, agricultural sciences, as well as in many elds of the natural approaches [2125]. The most popular model was the linear model
sciences [1]. Solar energy is secure, clean, and available on the by Angstrom-Prescott [7,17] and its adjusted approaches [26,27],
earth through the year and is considered to be very important to which establishes a linear relationship between global radiation
the world to solve the high fuel costs and the critical situation of and sunshine duration besides the knowledge of extraterrestrial
the atmosphere resulting from fossil fuel applications [2]. It is solar radiation and the theoretical maximum daily solar hours.
increasing to make better use of the solar energy in China to meet However, the maximum difculty to obtain an accurate solar
the requirement of a growing industry. However, in the utilization energy prediction model is the availability of a long term weather
of solar radiation, one critical difculty is modeling solar radiation data [28]. Time-series regression models were employed by
[3]. For example, the electricity generated by solar power systems another group of investigators [6,29], e.g. Zaharim et al. [30] use
is not as stable as those generated by other sources, which means the ARMA BoxJenkins method to model global solar radiation data
that it is challenging to integrate solar energy into traditional elec- in Malaysia. Although accurate prediction of the mean solar radia-
tricity systems [4]. Therefore, gathering accurate information of tion can be provided by these methods, the turbulence (volatility
solar radiation and its volatility has been a crucial issue [5,6]. or heteroscedasticity) of solar radiation is often lost. In other
words, although the linearity in the mean or the rst moment of
solar radiation can be captured by linear time series models, the
Corresponding author at: School of Hydropower and Information Engineering,
variance that may be responsible for nonlinearity is rarely consid-
Huazhong University of Science and Technology, Wuhan 430072, China. Tel.: +86 27
ered in solar radiation modeling. Autoregressive integrated moving
87543992 (O), +86 34 1960 0578 (cell).
E-mail address: huaiweisun@whu.edu.cn (H. Sun). average (ARIMA) models with generalized autoregressive
http://dx.doi.org/10.1016/j.enconman.2014.12.072
0196-8904/ 2015 Elsevier Ltd. All rights reserved.
386 H. Sun et al. / Energy Conversion and Management 92 (2015) 385395
conditional heteroskedastic (GARCH) [31,32] processes are the are both equal to 1, the GARCH model is also called the standard
widely used approaches to modeling the mean and volatility of GARCH model, denoted as SGARCH in this paper.
time series. GARCH models have been applied widely to a range
of time series, especially in nance and nancial decisions [33], 2.2. Typical variants of GARCH models
however, these models have not yet received much attention from
the energy community except for wind speed forecasting [3436]. There are some typical variants of the GARCH family of models,
As such, it is necessary to have an attempt to apply these models in which are established in an attempt to account for nonlinear
radiation modeling and give a comprehensively evaluation to effects [36]. A brief introduction to these GARCH models used in
them. This paper is written to show our effect in this aspect and this study is provided below.
concerns the development of GARCH models in radiation model- The gjrGARCH model, which was developed by Glosten et al.
ing. Specically, six different ARMAGARCH(-M) models are used [37], is used to model the asymmetry in the GARCH process. The
in this study to investigate their performances as applied to two mathematical form of gjrGARCH model with autoregressive terms
representative data sets collected from China. and moving averaging terms is dened as follows,
The rest of the paper is organized as follows. Section 2 reviews X
p X
q
the general GARCH methodology and their typical variants and yt l /i yti hj etj et
presents different metrics related to model performances. Section i1 j1
X
N series, the rst 240 monthly mean total daily solar radiation values
Si Oi 2 are used to develop models, and then the rest of data set (120 val-
i1
NSE 1 8 ues) in each site is used to test the models performance. During
X
N
2
Oi O the study, modeling and programming are written in R code and
i1 four packages namely forecast [41], rugarch [42], fBasics [43], and
hydroGOF [44] are used.
0 12 Also, Fig. 1 is drawn to further illustrate the variation features of
X N solar radiation over time. According to Table 1 and Fig. 1, it can be
B Oi OSi S C
B C seen that the solar radiation changes in the range of 3.139
B C
R B
2
Bv vC
i1
C 9 25.581 MJ/m2 in Urumqi and 4.89222.481 MJ/m2 in Beijing. The
Bu X
u N
uX C mean values of two solar radiation series are 1.381 MJ/m2 and
@u
N
t O O2 t S S2 A
i i
1.352 MJ/m2, with standard deviations being 6.871 MJ/m2 and
i1 i1
4.514 MJ/m2, respectively. Radiation data in two stations are nega-
The commonly used criteria of mean absolute error (MAE), tively skewed and leptokurtic. In addition, the gures show that
which records in real units the level of overall agreement between solar radiation series exhibit high volatility over time.
the observed and modeled datasets, is calculated by using Eq. (6).
In Eq. (8), the NashSutcliffe efciency (NSE), which determines
the relative magnitude of the residual variance (noise) compared 4. Result and discussion
to the measured data variance (information), is dened [40]. NSE
indicates how well the plot of observed versus simulated data ts 4.1. Estimation of GARCH models
the 1:1 line, as a value of 1 corresponds to a perfect match of mod-
eled to the observed data and the model is more accurate while the Before building ARMAGARCH(-M) models for the rst 20 years
value closer to 1. The commonly used criteria of RMSE and R2 were (240 values) of radiation data sets obtained from Urumqi and Bei-
calculated by Eqs. (7) and (9), respectively. Also, a criteria namely jing Stations, time-series ARMA models were developed as the rst
bR2, which is obtained as R2 multiplied by the slope of the regres- stage to identify the underlying model structures governing the
sion line between simulated and observed values, was used in this mean solar radiation. Firstly, the time series were checked for sta-
study as accounting for the discrepancy in the magnitude of two tionary to verify the identication of an appropriate BoxJenkins
signals. By using these criteria, we illustrate the application of models [45]. The free library package forecast [41] was used in this
the above methods for two monthly mean total daily solar radia- step, the results indicated that no differences were required for sta-
tion series from two different climate areas. These two radiation tionary series in two sites. Secondly, the plots of the Auto-Correla-
data sets are selected to investigate and compare the GARCH tion Function (ACF) and Partial Auto-Correlation Function (PACF)
model efciency and advantage for modeling the radiation series. were drawn to give help in identifying the ARMA model that best
describes the resulting stationary time-series. As shown in Fig. 2,
the ACFs of the series appear to have a seasonal pattern and the
3. Radiation data from two study sites values trend to 0 in 3 lags in Urumqi and Beijing, respectively. In
order to simply the t process of the best ARIMA model to radia-
The daily record of solar radiation series data at two represen- tion time series, the stepwise algorithm outlined in [41] was used
tative climate stations were selected in this study to evaluate the in this study. The orders of the autoregressive and moving average
approaches. These included Beijing (39.8N, 116.5E) in the North terms are determined according to the minimum Akaike Informa-
China Plain with a temperate monsoon climate, and Urumqi tion Criterion (AIC). As the results also indicate no seasonal or dif-
(43.5N, 87.4E) in the Northwest with an arid climate in the center ference terms needed in the model structures, the constructed
of the Eurasian continent. ARMA model in Urumqi has the autoregressive order of 3 and
The measured period was from January 1, 1983 to December 31, the moving average order of 3, thereby expressed as ARMA (3, 3)
2012, with measurements of total daily solar radiation, daily extra- while an ARMA (2, 3) was used in Beijing.
terrestrial solar radiation, and sunshine duration records included Due to its simplicity and robustness, the GARCH (1, 1) structure
in the dataset. The data were obtained from the National Meteoro- is adopted for the GARCH component. After the model structure is
logical Information Center and the China Meteorological Adminis- determined, the parameter values in different GARCH models of
tration. Preliminary quality control tests were conducted by the Urumqi and Beijing are reported in Tables 2 and 3, respectively.
suppliers. For these station, all the instruments utilized in this There are both the estimated parameter values and the evaluation
study were calibrated periodically and were made following the statistics of model sufciency. For estimated parameter values,
World Meteorological Organization guidelines. Specially, the mea- there are two parts of the results, the values of the model param-
sure of solar radiation is taken by pyranometer. The researchers eters and standard errors which are in parenthesis are provided. It
further checked the data and the basic statistics of the radiation is clearly shown in Tables that the parameters are signicant and
series data are summarized in Table 1. In total, there are 10958 the models are valid. Apart from the statistical signicance of the
observation daily records for solar radiation. It is carried out a fre- parameters, an important test is that the residuals of the estimated
quency conversion from the daily data into the monthly average model are white noise. Ljung-Box (1978), also called Q-test, was
values, and 360 values obtained for each site. In the radiation used as a convenient test for white noise. In Tables 2 and 3, the
Table 1
Geographical coordinates and general statistics of radiation data sets in two sites.
Fig. 1. Monthly solar radiation series in 19832012 at two sites (a, Urumqi; b, Beijing).
Fig. 2. Plots of ACF and PACF of radiation series in two sites (a, ACF in Urumqi; b, PACF in Urumqi; c, ACF in Beijing; d, PACF in Beijing).
results from the Q-tests indicate that the models built for Urumqi 4.2. Total radiation prediction
and Beijing become signicant. In the bottom of the table, the
results of Log-likelihood, AIC, and BIC are also provided. In the sta- The rest of radiation data (120 values) are used to test the pre-
tistical signicance testing, the signicance levels normally used diction accuracy of the tted models based on the initial 240 val-
are 1%, 5%, and 10%. By examining the tables, it can be seen that ues. The metrics of MAE, RMSE, NSE, R2, and bR2 are used to
the parameters in equations are nearly all less than 0.01 which evaluate the prediction accuracy. The hydroGOF package [44] was
point out that the tted models are highly signicant and the mod- used to calculate these metrics. By using the ARMA model for the
els are appropriate. testing dataset in Urumqi, the values of MAE, RMSE, NSE, R2, and
The comparisons of observed and estimated radiation values in bR2 are obtained to be 3.321, 3.680, 0.710, 0.892, and 0.794, respec-
different models are shown in Figs. 3 and 4. In Urumqi, the R2 val- tively. As shown in Table 5, these values are 1.785, 2.190, 0.758,
ues between observed and estimated values vary within a narrow 0.787, and 0.766 in Beijing and also suggesting that the tted
band of 0.9494 and 0.9611 depending on models. The tted R2 val- ARMA models have the capability of accurate prediction. Then,
ues are higher in the ARMAGARCH-M model than other models. we compared the obtain ARMAGARCH(-M) models with the
For the other site, namely Beijing station, the R2 values are between ARMA model, as well as ANN and AR-ANN model. The ANN and
0.8589 and 0.8759 in Beijing and the ARMAGARCH-M models AR-ANN model are included because they are usually useful in
have better performance than other models expect for ARMAgjr- short-term forecasting. As a comparison, we only use feed-forward
GARCH-M model though the difference is small. Expert for the neural networks with a single hidden layer for forecasting radia-
ARMAgjrGARCH-M model in Beijing, the p-values of parameter tion time series. It can be seen from Tables 4 and 5, it is shown that
k0 in the ARMAGARCH-M models are all less than 0.01 which the ARMAGARCH models have better performance than ARMA
indicates the strengthened explanatory power of mean models is models as the largest value of NSE equal to 0.956 obtained by
necessary. All these results of signicance testing basically state ARMAsGARCH-M model for Urumqi and 0.878 obtained by
that the ARMAGARCH-M models make the mean radiation equa- ARMAEGARCH-M model for Beijing. Also, the ANN and AR-ANN
tions become more sufcient. model have very different performances in radiation predictions.
H. Sun et al. / Energy Conversion and Management 92 (2015) 385395 389
Table 2
Parameter estimation and statistical diagnostics of different models used in Urumqi.
Notes:
*
Means signicant at the 10% level, two-sided test.
**
Means signicant at the 5% level, two-sided test.
***
Means signicant at the 1% level, two-sided test.
Table 3
Parameter estimation and statistical diagnostics of different models used in Beijing.
Notes:
*
Means signicant at the 10% level, two-sided test.
**
Means signicant at the 5% level, two-sided test.
***
Means signicant at the 1% level, two-sided test.
In Urumqi, the AR term can improve the NSE of radiation modeling EGARCH, ARMAgjrGARCH, and ARMA model in Urumqi and
from 0.107 to 0.799, and the values also changed from 0.164 to ARMAEGARCH-M, ARMAsGARCH-M, ARMAsGARCH, ARMA
0.473 in Beijing. These performances may be improved by using gjrGARCH, ARMAEGARCH, ARMAgjrGARCH-M, and ARMA model
coupled method or advanced algorithm. In the related literature, in Beijing. The maximum performance differences in mean abso-
Flores et al. [46] have used an evolutionary loop to improve the lute error are 2.296 MJ/m2 and 0.562 MJ/m2 in Urumqi and Beijing,
performance of ANN models in different climate series and got a respectively. Also, the ARMAGARCH-M models have the best per-
better prediction than ARMA models. Also, Chattopadhyay et al. formance both in Urumqi and Beijing. The results are expected and
[47] have used a coupled method of AR and ANN to get a better somewhat consistent with the ndings in literatures [33,34]. The
result than ARMA models. Due to the scope of this study, we only ARMAGARCH-M models capture both the conditional mean and
selected the ordinary methods of ANN as a comparison. As the ANN conditional heteroscedasticity, which contributes to the better per-
models have poorer performance than ARMAGARCH(-M) models formance of ARMAGARCH-M models than other models.
and cannot offer the volatility information, these models will no
longer be consider in the remaining work for comparison purpose. 4.3. Further discussion on predicting total radiation output
Except for two performance measures (R2 and bR2), the perfor-
mance ranking is constant among the models as shown in Tables 4 The volatility information can be captured by ARMAGARCH
and 5, namely in the descending order of ARMAsGARCH-M, models, which are different to the traditional point forecasting
ARMAEGARCH-M, ARMAgjrGARCH-M, ARMAsGARCH, ARMA models such as ARMA, neural network, SVM, and hybrid models.
390 H. Sun et al. / Energy Conversion and Management 92 (2015) 385395
Fig. 3. Comparisons of observed and estimated radiation values in Urumqi (a, ARMA; b, ARMAsGARCH; c, ARMAgjrGARCH; d, ARMAEGARCH; e, ARMAsGARCH-M;
f, ARMAgjrGARCH-M; g, ARMAEGARCH-M).
Figs. 5 and 6 show the predictions which implying the trend of estimation of radiation is accurate and reliable. Also, the ARMA
actual radiation can be well captured by the predictions and the EGARCH-M model has the best performance to give an interval
change characteristics of radiation series. As shown in Figs. 5(a) estimation of radiation simulation in Beijing. In literatures,
and 6(a), the gaps between the measured radiation points and pre- ARCH/GARCH models are believed to have better performances in
dicted values obtained by ARMA models are larger than other non-constant volatility of the time series data [48,49]. If we bal-
models. ance the predication accuracy and interval estimation of models,
The interval estimation of radiation value is conducted at 95% the ARMAEGARCH-M model appears to be the one which is pre-
condence interval. It is obvious that although the solar total radi- ferred in radiation modeling. This is because the model structure
ation dramatically changes over years, the estimated intervals of has the advantage of robustness, and it has the best performance
ARMAGARCH(-M) models keep embracing it well. The numerical in radiation modeling in Beijing and the second ranking in model-
calculation shows that there are 119 observation, namely 99.2% of ing radiation series in Urumqi. Also, the ARMAEGARCH-M model
the 120 observations in the testing dataset, which are scattered in has overall good interval estimation accuracy. Certainly, the
the respective estimated intervals of ARMAsGARCH-M model in ARMAsGARCH-M model is very competitive as it produces the
Urumqi (Table 6). This result empirically veries that the interval lowest MAE and RMSE among this group of 6 models in modeling
H. Sun et al. / Energy Conversion and Management 92 (2015) 385395 391
Fig. 4. Comparisons of observed and estimated radiation values in Urumqi (a, ARMA; b, ARMAsGARCH; c, ARMAgjrGARCH; d, ARMAEGARCH; e, ARMAsGARCH-M;
f, ARMAgjrGARCH-M; g, ARMAEGARCH-M).
Table 4
Predicting performance of different models used in Urumqi.
Prediction performance ARMA ARMA ARMA ARMA ARMA ARMA ARMA ANN AR-
index sGARCH gjrGARCH EGARCH sGARCH-M gjrGARCH-M EGARCH-M ANN
MAE 3.321 1.829 1.977 1.904 1.025 1.305 1.167 6.380 2.656
RMSE 3.680 2.459 2.614 2.541 1.44 1.851 1.526 7.190 3.075
NSE 0.710 0.871 0.854 0.862 0.956 0.927 0.950 0.107 0.799
R2 0.892 0.907 0.890 0.904 0.956 0.928 0.954 0.004 0.908
bR2 0.794 0.900 0.887 0.898 0.953 0.905 0.923 0.003 0.814
392 H. Sun et al. / Energy Conversion and Management 92 (2015) 385395
Table 5
Predicting performance of different models used in Beijing.
Prediction performance ARMA ARMA ARMA ARMA ARMA ARMA ARMA ANN AR-
index sGARCH gjrGARCH EGARCH sGARCH-M gjrGARCH-M EGARCH-M ANN
MAE 1.785 1.266 1.284 1.302 1.259 1.308 1.223 4.079 2.774
RMSE 2.190 1.665 1.690 1.715 1.646 1.723 1.556 4.801 3.250
NSE 0.758 0.860 0.856 0.852 0.863 0.850 0.878 0.164 0.473
R2 0.787 0.864 0.865 0.859 0.874 0.859 0.879 0.002 0.827
bR2 0.766 0.860 0.863 0.857 0.873 0.859 0.875 0.002 0.795
Fig. 5. Comparison of measured data and the 95% condence interval by models in Urumqi (a, ARMA; b, ARMAsGARCH; c, ARMAgjrGARCH; d, ARMAEGARCH; e, ARMA
sGARCH-M; f, ARMAgjrGARCH-M; g, ARMAEGARCH-M).
H. Sun et al. / Energy Conversion and Management 92 (2015) 385395 393
Fig. 6. Comparison of measured data and the 95% condence interval by models in Beijing (a, ARMA; b, ARMAsGARCH; c, ARMAgjrGARCH; d, ARMAEGARCH; e, ARMA
sGARCH-M; f, ARMAgjrGARCH-M; g, ARMAEGARCH-M).
Table 6
The numbers of the predicted points outside the 95% condence interval.
[35] Payne JE, Carroll B. Modeling wind speed and time-varying turbulence in [42] Ghalanos A. rugarch: Univariate GARCH models. R package 2014.
geographically dispersed wind energy markets in China. Energy Sour Part A [43] Diethelm W. fBasics: Rmetrics-Markets and Basic Statistics. R package 2009.
Recov Utiliz Environ Eff 2009;31(19):175969. [44] Zambrano-Bigiarini M. hydroGOF: Goodness-of-t functions for comparison of
[36] Liu H, Erdem E, Shi J. Comprehensive evaluation of ARMAGARCH(-M) simulated and observed hydrological time series. R package 2011.
approaches for modeling the mean and volatility of wind speed. Appl Energy [45] Box GEP, Jenkins GM, Reinsel GC. Time series analysis: forecasting and
2011;88(3):72432. control. Prentice Hall: John Wiley & Sons; 2013.
[37] Glosten LR, Jagannathan R, Runkle DE. On the relationship between the [46] Flores JJ, Mario G, Rodriguez H. Evolutive design of ARMA and ANN models for
expected value and the volatility of the nominal excess returns on stocks. J time series forecasting. Renew Energy 2012;44:22530.
Finance 1993;48:1779801. [47] Chattopadhyay G, Chattopadhyay S. Autoregressive forecast of monthly total
[38] Nelson DB. Conditional heteroskedasticity in asset returns: a new approach. ozone concentration: a neurocomputing approach. Comput Geosci
Econometrica 1991;59:34770. 2009;35(9):192532.
[39] Nelson DB, Cao CQ. Inequality constraints in the univariate GARCH model. J [48] Sparks JJ, Yurova YV. Comparative performance of ARIMA and ARCH/GARCH
Business Econ Stat 1992;10:22935. models on time series of daily equity prices for large companies. Unive Illinois
[40] Nash JE, Sutcliffe JV. River ow forecasting through conceptual models part I Chicago 2006:56373.
A discussion of principles. J Hydrol 1970;10(3):28290. [49] Zheng M, Miao J. Comparing the forecastability of alternative quantitative
[41] Hyndman RJ, Khandakar Y. Automatic time series forecasting: the forecast models: a trading simulation approach in nancial engineering. Syst Eng
package for R. J Stat Softw 2008;26:122. Procedia 2012;4:359.