Professional Documents
Culture Documents
Tao Hong
• What I’ve been doing recently
– Analytical Consultant, SAS Institute
– BKS Instructor, SAS Institute
– Founding Chair, IEEE Working Group on Energy Forecasting
• Interest
– Forecasting, optimization
– Energy economics, system planning, power engineering
• Education
– Ph.D., O.R., E.E., NC State University, Raleigh
– M.S., E.E., O.R., I.E., NC State University, Raleigh
– B.Eng., Automation, Tsinghua University, Beijing
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 3 of 47
Introduction Tao Hong
Numerical and Graphical Methods Objectives
Regression Approach What’s electric load forecasting?
Box-Jenkins Approach Why do we need load forecasts?
Beyond This Workshop How to forecast electricity demand?
Objectives
• Understand an important application of analytics in electric
utility industry – electric load forecasting
• Learn how to
– discover salient features of electric load series using graphical methods
– apply regression analysis and time series analysis to short term load
forecasting
– develop general linear models and time series models to forecast one-
day ahead hourly loads
Notes, data, and reading materials can be downloaded from the lecture webpage:
http://courses.drhongtao.com/smelf
• Energy purchasing
1990s
Deregulation
• Rate case
• Modeling process
– Statistics: linear regression, Box-Jenkins, nonparametric regression
Descriptive Statistics
• Numerical methods
– Max, Min
– Mean, Median
• Graphical methods
– Line plot
– Box plot
– Scatter plot
Develop INTUITION
Numerical Methods
• When describing the load of a utility
– Annual Peak
– Average Load
– Load factor = Average Load / Annual Peak
• When describing the size of a utility
– Annual Peak
– Number of customers
– Number of feeders Understand BUSINESS
– Length of transmission lines
– Size of service territory
– Revenue, growth rate, etc.
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 12 of 47
Introduction Descriptive statistics
Numerical and Graphical Methods Numerical methods
Regression Approach Line plot
Box-Jenkins Approach Box plot
Beyond This Workshop Scatter plot
Line Plot
• Load, 2005 – 2008 • Temperature, 2005 – 2008
Line Plot
• 8-12, Jun 2008 • 16-21, Nov 2008
Box Plot
• Trend
Box Plot
• Seasonality – Month
Box Plot
• Seasonality – weekday vs. weekend
Box Plot
• Seasonality – weekly load profile
Scatter Plot
• Load – temperature relationship
Scatter Plot
• Load – temperature relationship by month
Scatter Plot
• Load – temperature relationship by hour
Interaction Regression
• Interactions between/among
– Quantitative variables
• A new quantitative variable (not covered)
– Quantitative variables and qualitative variables
• Month and temperature
• Hour and temperature
– Qualitative variables
• Weekday and hour
Interaction Regression
• Temperature vs. calendar variables
– High during the day, low at night
– High in summer, low in winter
– No apparent interactions across 7 days of a week
• Interactions (or cross effects)
– “Temperature” × Hour
– “Temperature” × Month
“Temperature”: T, T2, T3
E(Load) = β0 + β1 * Trend + β2 * Day * Hour + β3 * Month
+ β4 * Month * T + β5 * Month * T2 + β6 * Month * T3
+ β7 * Hour * T + β8 * Hour * T2 + β9 * Hour * T3
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 28 of 47
Introduction Simple linear regression
Numerical and Graphical Methods Multiple linear regression
Regression Approach Interaction regression
Box-Jenkins Approach Lagged regression
Beyond This Workshop Dynamic regression
Lagged Regression
• Recency effect
– The most recently presented items or experiences will most likely be
remembered best. If you hear a long list of words, it is more likely that
you will remember the words you heard last (at the end of the list)
than words that occurred in the middle.
• Lagged regression models
– To forecast an output series yt, given a present and past of an input
series xt, or lagged independent variables.
• Load = f(T, T(t-1), T(t-2), …)
– Load refers to Load(t), T refers to T(t)
– i.e., Load = f(trend, calendar variables, T, T(t-1))
Lagged Regression
E(Load) = β0 + β1 * Trend + β2 * Day * Hour + β3 * Month
+ β4 * Month * T + β5 * Month * T2 + β6 * Month * T3
+ β7 * Hour * T + β8 * Hour * T2 + β9 * Hour * T3
+ β10 * Month * T(t-1) + β11 * Month * T(t-1)2 + β12 * Month * T(t-1)3
+ β13 * Hour * T(t-1) + β14 * Hour * T(t-1)2 + β15 * Hour * T(t-1)3
Dynamic Regression
• Lagged dependent variable
– Example: Load = f(trend, calendar variables, T, T(t-1), Load(t-1))
• Notice that during the forecasting stage
– to forecast Load(t+20), we need Load(t+19), Load(t+18), Load(t+17)…
– at time t, we don’t have actual Load(t+19), Load(t+18), …
E(Load) = β0 + β1 * Trend + β2 * Day * Hour + β3 * Month
+ β4 * Month * T + β5 * Month * T2 + β6 * Month * T3
+ β7 * Hour * T + β8 * Hour * T2 + β9 * Hour * T3
+ β10 * Month * T(t-1) + β11 * Month * T(t-1)2 + β12 * Month * T(t-1)3
+ β13 * Hour * T(t-1) + β14 * Hour * T(t-1)2 + β15 * Hour * T(t-1)3
+ β16 * Load(t-1)
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 31 of 47
Introduction ARIMA
Numerical and Graphical Methods Seasonal ARIMA
Regression Approach Temperature forecasting
Box-Jenkins Approach Intervention Analysis
Beyond This Workshop Warning
ARIMA
• REG + ARIMA
Step 1: Apply regression analysis to
forecast the original series. Then
^
Load = LoadREG + r calculate the residual series.
^
= Load + ^r
REG ARIMA + e Step 2: Use ARIMA to forecast
the residual series.
^
= Load + e
Step 3: The final forecast is the ARIMA based
residual forecast plus the regression based
load forecast.
Seasonal ARIMA
• Now you have another tool to model seasonal patterns
• Seasonal ARIMA can be used to forecast load series directly
– Univariate method
Temperature is not needed
Limited accuracy as forecasting horizon becomes longer
Temperature Forecasting
• Seasonal ARIMA can be used to forecast temperature series
directly
• Regression model with SARIMA based temperature forecast
Intervention Analysis
• Holidays
• Big outages due to severe weather
– Hurricanes
– Snow storm
• Outliers
Warning
• You are now well equipped to perform load forecasting
Forecast
Submitted
History
Forecast
Morning
of today
End of End of
today tomorrow
Warning
• The more techniques you learn
– The more freedom and options you have
– The more powerful you should be
• However
– You can be easily buried by hundreds of
“combinations” of techniques
• ALWAYS keep in mind the principle of KISS
– Keep
– It
– Simple
– Stupid
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 37 of 47
Introduction
Numerical and Graphical Methods
Regression Approach
Box-Jenkins Approach
Beyond This Workshop
Exercise
• To forecast the loads of a given day, you can only use the loads up to two
days prior to the given day. For instance, to forecast the loads of 2007/8/3,
you can use the loads up to the last hour of 2007/8/1.
– When you are using dynamic regression models for forecast 2007/8/3, you can only plug
in predicted values of the loads after the last hour of 2007/8/1
• You will use the loads starting from the first hour of 2004/1/1 for the first
two problems.
• The load of the first hour of Day Light Saving Time (DST) of each year (in
spring) may be shown as zero, while the last hour of DST period (in fall) may
show the sum of the loads of two hours. You don’t need to make any
adjustment when doing the first two problems. If you make adjustment for
the 3rd problem, please indicate your strategy.
• Submission instructions are listed in the project assignment
Exercise
• Using the naive MLR benchmarking model disscussed in page 28 to do ex
post forecasting for 2007/8/3 and 2007/11/22, report the MAPE and MAE
for each day.
• Using the lagged regression model discussed in page 30 to do ex post
forecasting for 2007/8/3, and 2007/11/22, report the MAPE and MAE for
each day.
• Can you further improve the forecasting accuracy of 2007/11/22? Report
the methodology you used and the best MAPE and MAE you can get.
• Hints
– The first two problems mean to have you “calibrate” the tool. We expect the same
answers from each one of you
– The third problem is tricky. After solving the first two problems, you probably find the
forecasting error of 11/22 is much larger than 8/3. Thank about why, and how to improve
it, and try to improve it. Remember to use your intuition.
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 39 of 47
Introduction Theory
Numerical and Graphical Methods Reading
Regression Approach Forecaster’s job
Box-Jenkins Approach Networking
Beyond This Workshop Project assignment
Theory
• Seasonal ARIMA
• Intervention analysis
• Time series regression
• Artificial Neural Networks
• Clustering
Reading
• Tao Hong, “Short Term Electric Load Forecasting”, PhD
dissertation, NCSU
• Ott and Longnecker, “An Introduction to Statistical Methods
and Data Analysis”
• Kutner, et. al., “Applied Linear Statistical Models”
• Brockleblank and Dickey, “SAS for Forecasting Time Series”
Forthcoming
• Hyndman and Athanasopoulos, “Forecasting: Principles and
Practice”
• Hong and Dickey, “Electric Load Forecasting”
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 41 of 47
Introduction Theory
Numerical and Graphical Methods Reading
Regression Approach Forecaster’s job
Box-Jenkins Approach Networking
Beyond This Workshop Project assignment
Forecaster’s Job
Day ahead forecasting for the 24 hours with hour ending:
9/13/2011 1am to 9/14/2011 12am
• Review the past forecasting performance
• Alter forecasting methodology if necessary
• Develop temperature forecast
• Calculate load forecast for the next day
• Adjust forecast if necessary
• Submit load forecast
By 9/12/2011 morning, i.e. 8am.
Networking
• IEEE working group on energy forecasting
– Utility application oriented energy forecasting
– Focus on practical needs of utilities
– Data: outlier detection, data cleansing, selection of weather station(s)
– Inputs: temperature forecast, wind forecast, solar forecast, EV load
forecast, customer behavior, demand response activities, loss
evaluation
– Modeling: hierarchical (system-substation-feeder) forecasting, multi-
region forecasting
– Applications (outputs): price forecasting, demand response analysis,
planning of demand side management, risk management, weather
normalization, loss evaluation
Statistical Models for Electric Load Forecasting
© 2011 Tao Hong, PhD (hongtao01@gmail.com) Page 43 of 47
Introduction Theory
Numerical and Graphical Methods Reading
Regression Approach Forecaster’s job
Box-Jenkins Approach Networking
Beyond This Workshop Project assignment
Overview
• A medium-large utility would like to improve their one-day
ahead load forecasting practice. You are hired to develop
statistical models for this utility
• You will be working with your group members to conduct this
project
• At the end, each group will make a presentation about the
findings and results of the study
Objectives
• Develop and sharpen the following skills
– Data analysis
– Statistical modeling
– SAS programming
– Strategic planning
– Teamwork
– Project management
– Reporting
– Presentation
– Documentation
Thank You
• Questions / comments?
Notes, data, and reading materials can be downloaded from the lecture webpage:
http://courses.drhongtao.com/smelf