Professional Documents
Culture Documents
FMG XVIII A
FORE SCHOOL OF MANAGEMENT
Managerial Economics Final Project Report September 2009 2
Preface
T
he Indian power sector is such a magnanimous entity in itself that deciphering the
intricate nuances of the sector becomes synonymous to fishing in the dark. Never
the less, this project is an earnest attempt made to unravel the dilemma of the
Indian Power Sector.
India is such a diverse country both in terms of geography and demography that
narrowing down to a few parameters that are responsible for the change/ alteration of
the demand of power is next to impossible. Hence, a better approach in determining
the power demand across India is to take into account the various seasonal variations
as well as the influences of the different regions in India.
Acknowledgement
The group members would sincerely like to thank Dr. Ravikesh Srivastava for his unending
support and motivation towards the execution of the project. Without his guidance, the
report would not have taken the shape that it has taken at this moment.
We’d also like to thank all the fellow students of FMG XVIII section A for their constant
pep-talk and encouragement.
Thanking one and all for all the help vented to the execution of the project.
Gourab Kundu
Samarth Gulati
Cijil Diclause
Akshat Vaid
Harshdeep Garg
Table of Contents
1. Introduction
2. Project Objective
3. Project Methodology
5. Simple Regression
a. Per capita annual income in Rupees
b. Population
c. Price index with '93 as index of 100
d. Gross national Product
6. Multiple Regression Model
a. Degree of Correlation
b. Multiple Regression Model
c. Multiple Regression Model Obtained
d. Prediction according to multiple regression model of power in India
7. Moving Averages
a. 3 year moving average
b. 5 year moving average
8. Exponential Smoothing
a. Dumping Factor/ Weight: 0.3
b. Dumping Factor/ Weight: 0.5
9. Seasonal Trend Analysis
a. Region wise Analysis
b. Equation for Regression
c. Why Seasonal Variations?
d. Trend analysis in North India
e. Trend analysis in South India
f. Trend analysis in North India
g. Trend Analysis Regions/Seasons
h. Power Trading
11. Annexure
1. Introduction
Indian power sector has attained insurmountable importance because of the huge
boom in India’s economy. In fact, the visionary in Dr. Manmohan Singh has
acknowledged the fact that it is electricity that is a driving factor for India’s economy.
He was so forthright that he laid no stone unturned to have India sign the Indo-US
Nuclear Civil Agreement. Perhaps power has become the most important commodity
that needs to be taken into account for the estimation of India’s GDP.
But a lot needs to be done in the Indian Power Sector. As a matter of fact just 44% of the
rural households have access to electricity. This statement is both a danger and an
opportunity for India. A lot of uncaptured demand for power can be tapped into
hence, an industry as large as the Power sector awaits expansion of magnanimous
proportions.
Some facts and figures about the Indian Power Sector:
Due to India's economic upturn, the demand for energy in India has grown at an
average of 3.6% per annum in the past 30 years.
The theft of electricity, a common phenomenon in most parts of urban India
amounts to about 1.5% of India's total GDP.
India is world's 6th largest energy consumer, accounting for 3.4% of global energy
consumption.
Electricity losses in India during transmission and distribution are extremely high
and vary between 30 to 45%.
Electricity demand outstripping supply by 7-11%, is a trend in the past few years.
The various kinds of projects that are/will be undertaken in the coming months/years in
India:
Hydro Projects
Ultra Mega Power Projects
Nuclear Power
Rural Electrification
Renewable Sources of Energy
2. Project Objective
The project objectives have been classified as follows:
Forecasting the demand of power in the coming years 2009-10, 2010-11, 2011-12.
Understanding the intricacies of the Indian Power Sector by looking into the
various geographical regions.
Collating data that establishes the influence of seasons on the demand of power
in India.
3. Project Methodology
The following tools have been used to determine the demand of power in India:
Simple Regression
Multiple Regression Model
Moving Averages
Exponential Smoothing
Seasonal Trend Analysis (by Region)
Auto Regression Integrated Moving Averages (ARIMA)
Parameters considered:
Per Capita Income in India in Rupees
Population of India
Price Index of electricity with 1993 as Index = 100
(GNP) Gross National Product in Rupees
5. Simple Regression
In statistics, linear regression refers to any approach to modeling the relationship
between one or more variables denoted y and one or more variables denoted X, such
that the model depends linearly on the unknownparameters to be estimated from
the data. Such a model is called a "linear model." Most commonly, linear regression
refers to a model in which the conditional mean of y given the value of X is an affine
function of X. Less commonly, linear regression could refer to a model in which
the median, or some other quantile of the conditional distribution of y given X is
expressed as a linear function of X. Like all forms of regression analysis,linear
regression focuses on the conditional probability distribution of y given X, rather than on
the joint probability distribution of y and X, which is the domain of multivariate analysis.
Linear regression was the first type of regression analysis to be studied rigorously, and to
be used extensively in practical applications. The reason for this is that models that
depend linearly on their unknown parameters are easier to fit than models that are
related non-linearly to their parameters and the statistical properties of the resulting
estimators are easier to determine.
Linear regression has many practical uses. Most applications of linear regression fall into
one of the following two broad categories:
If we have a variable y and a number of variables X1, ..., Xp that may be related to y,
we can use linear regression analysis to quantify the strength of the relationship
between y and theXj, to assess which Xj may have no relationship with y at all, and to
identify which subsets of the Xj contain redundant information about y, so that once
one of them is known, the others are no longer informative.
Linear regression models are often fit using the least squares approach, but may also be
fit in other ways, such as by minimizing the "lack of fit" in some other norm, or by
minimizing a penalized version of the least squares loss function as in ridge regression.
Conversely, the least squares approach can be used to fit models that are not linear
models. Thus, while the terms "least squares" and linear model are closely linked, they
are not synonymous.
b. Population
Intercept (a) -35404582017
Year (b) 18211019.86
Y* = a+ b X*
Projected Values
for 2009 (X*) 1181356876
for 2010 1199567896
for 2011 1217778916
a. Degree of Correlation
Coefficients of correlation (r) Calculated by doing a regression using the power
demand (Y) and individual parameters(X). The value of R2 (coefficient of determination
is thus determined)
Square root of R2
Per Capita Income = 0.9544
Population of India = 0.9955
Price Index of power = 0.9719
Gross National Product = 0.9706
Where,
Y* = Estimated demand a = coefficient of Regression b1 = coefficient of X1
b2 = coefficient of X2 b3 = coefficient of X3 b4 = coefficient of X4
X1 = Per Capita Income X2 = Population X3 = Price Index X4 = GNP
7. Moving Averages
In statistics, a moving average, also called rolling average, rolling mean or running
average, is a type of finite impulse response filter used to analyze a set of data points by
creating a series of averages of different subsets of the full data set. A moving average
is not a single number, but it is a set of numbers, each of which is the average of the
corresponding subset of a larger set of data points. A moving average may also use
unequal weights for each data value in the subset to emphasize particular values in the
subset.
A moving average is commonly used with time series data to smooth out short-term
fluctuations and highlight longer-term trends or cycles. The threshold between short-
term and long-term depends on the application, and the parameters of the moving
average will be set accordingly. For example, it is often used in technical analysis of
financial data, like stock prices, returns or trading volumes. It is also used in economics to
examine gross domestic product, employment or other macroeconomic time series.
Mathematically, a moving average is a type of convolution and so it is also similar to
the low-pass filter used in signal processing. When used with non-time series data, a
moving average simply acts as a generic smoothing operation without any specific
connection to time, although typically some kind of ordering is implied.
A simple moving average (SMA) is the unweighted mean of the previous n data points.
For example, a 10-day simple moving average of closing price is the mean of the
previous 10 days' closing prices. If those prices are then the
formula is
When calculating successive values, a new value comes into the sum and an old
value drops out, meaning a full summation each time is unnecessary,
8. Exponential Smoothing
In statistics, exponential smoothing is a technique that can be applied to time
series data, either to produce smoothed data for presentation, or to make forecasts.
The time series data themselves are a sequence of observations. The observed
phenomenon may be an essentially random process, or it may be an orderly, but noisy,
process. Whereas in the simple moving average the past observations are weighted
equally, exponential smoothing assigns exponentially decreasing weights as the
observation get older.
Exponential smoothing is commonly applied to financial market and economic data,
but it can be used with any discrete set of repeated measurements. The raw data
sequence is often represented by {xt}, and the output of the exponential smoothing
algorithm is commonly written as {st } which may be regarded as our best estimate of
what the next value of x will be. When the sequence of observations begins at
time t = 0, the simplest form of exponential smoothing is given by the formulas
where α is the smoothing factor, and 0 < α < 1. In other words, the smoothed statistic st is
a simple weighted average of the latest observation xt and the previous smoothed
statistic st−1. Simple exponential smoothing is easily applied, and it produces a smoothed
statistic as soon as two observations are available.
Values of α close to one have less of a smoothing effect and give greater weight to
recent changes in the data, while values of α closer to zero have a greater smoothing
effect and are less responsive to recent changes. There is no formally correct procedure
for choosing α. Sometimes the statistician's judgment is used to choose an appropriate
factor.
(Here a month has been taken as a benchmark for a season) the months taken into
account are:
January
April
July
October
We did by ratio to trend method and is based on the assumptions that past trends and
seasonal patterns in data will persist.
h. Power Trading
Power trading inherently means a transaction where the price of power is
negotiable and options exist about whom to trade with and for what quantum.
In India, power trading is in an evolving stage and the volumes of exchange are
not huge.
The Electricity Act, 2003, mandated development of power markets by
appropriate commissions through enabling regulations
In India, while there is a huge section of consumers, who are power deprived,
there are a lot of Power Plants that are under utilized
The emerging trends will help in proper flow of power from surplus regions to
deficit regions and thus try to bring about a balance in the power sector
PTC India Ltd. (PTC), is the leading provider of power trading solutions in India.
timeseries: y
Method: Nonlinear Least Squares (Levenberg-Marquardt)
date: 09-07-09 time: 19:40
Included observations: 55
p = 2 - q = 5 - no constant - autoselection (AIC)
Mean
R-squared 0.999081 dependent var 133354.323273
Adjusted R- S.D.
squared 0.998966 dependent var 128580.853224
Akaike info
S.E. of regression 4134.981337 criterion 19.258835
Sum squared Schwarz
resid 820707391.547770 criterion 19.514314
Durbin-Watson
Log likelihood -522.617969 stat 1.845670
But even in this graph we had one portion where the model did not fit very well instead the
spike over their showed huge deviation from the actual value. This caused deviation not just in
the portion where it was present, but the deviation it caused deviation in the whole of the
model. This is shown below.
Residualplot
15000
10000
5000
0
0 10 20 30 40 50 60 Residual
-5000
-10000
-15000
-20000
To remove this error we followed outlier deletion as per which the value which had caused an
unacceptable deviation is removed from the data and the same process is again applied
upon. The results were encouraging as shown below.
Residualplot
20000
15000
10000
5000
0 Residual
0 10 20 30 40 50 60
-5000
-10000
-15000
-20000
We just removed one entry from the 57 that we had taken initially [1.75% sampling] and in the
process the MAPE reduced from 0.077 to 0.05 and at the same time the Median of Absol ute
residuals {MAR} reduced from 2224 to 1308.
On the basis of the new model we obtained the following forecasts for the next 10 years.
Table
Period IR Forecast
1 1.000000 504900.515272
2 1.656811 560753.674775
3 2.202814 621694.158683
4 2.536284 685205.377056
5 2.592833 750279.007608
6 2.411722 814653.053112
7 1.972840 877917.986331
8 1.367065 939074.264170
9 0.632118 998877.677111
10 -0.116531 1057891.283728
These values don’t fit very well if seen in light of the day as too steep an increase is predicted as
per the model. The reason for this lies in the fact that ARIMA as a tool takes only the past values
of a function into consideration. Any other external factor which may have a bearing on the
actual results are not considered in the forecasts.
11. Annexure
power Gross
demand in per capita price with '93 National
year India GWH income annual population as Index = 100 product
1981 90245 3456 683329097 31.4 61099
1982 95589 3598 698362337 35 68959
1983 102344 3740 713726308 37.9 79875
1984 144068 4024 729428287 39.2 86543
1985 123099 4308 745475709 43.9 99876
1986 135952 4592 761876175 48.2 117812
1987 145613 4876 778637451 52.4 196814
1988 160196 5160 795767475 55.5 328004
1989 175419 5444 813274359 59 459194
1990 190357 5728 831166395 63.1 590384
1991 207645 6012 846421039 70 721574
1992 220674 6440.25 864534449 78.2 852764
1993 238569 7698 883035486 100 983954
1994 259629 8955.75 901932445 113.6 1115144
1995 277029 10213.5 921233800 127.8 1246334
1996 280143 11471.25 940948203 133.5 1377524
1997 296749 12729 961084495 151.8 1508714
1998 309734 14682 981651703 157.2 1639904
1999 312841 16635 1002659050 168.9 1771094
2000 316600 18588 1024115953 200 1902284
2001 322459 20541 1028737436 224.8 2077658
2002 339598 22494 1048797816 238 2244725
2003 360937 24447 1069249373 248.8 2519921
2004 386134 26400 1090099736 253 2855331
2005 411887 28353 1111356681 263.4 3249554
2006 425748 30306 1133028136 271.7 3643777
2007 440774 32259 1155122185 273 4038000
2008 453800 34212 1169266890 275 4432223
Regression Statistics
Multiple R 0.997742312
R Square 0.995489721
Adjusted R Square 0.994705325
Standard Error 9025.442084
Observations 28
ANOVA
Significanc
df SS MS F eF
Regressio 4.13522E+ 1.0338E+1 1269.11
n 4 11 1 6 1.31E-26
187354791 81458604.
Residual 23 1 8
4.15395E+
Total 27 11
Regression Statistics
Multiple R 0.948678
R Square 0.899991
Adjusted R
Square 0.896144
Standard
Error 3240.844
Observatio
ns 28
ANOVA
Significanc
df SS MS F eF
2.46E+0 233.976
Regression 1 9 2.46E+09 3 1.63E-14
2.73E+0
Residual 26 8 10503071
2.73E+0
Total 27 9
Uppe
Standard Uppe Lower r
Coefficients Error t Stat P-value Lower 95% r 95% 95.0% 95.0%
-
- -
1988 261054 198885
Intercept -2299700 151226 -15.207 1.87E-14 -2610549 8 9 0
1315. 1003.9 1315.6
year 1159.778 75.82089 15.29628 1.63E-14 1003.926 2 6 3
RESIDUAL OUTPUT
Predicted per
capita income Standard
Observation annual Residuals Residuals
1 -2179.76 5635.76 1.772105
2 -1019.98 4617.982 1.452076
3 139.7958 3600.204 1.132046
4 1299.574 2724.426 0.856667
5 2459.352 1848.648 0.581288
6 3619.129 972.8706 0.305909
7 4778.907 97.09278 0.03053
8 5938.685 -778.685 -0.24485
9 7098.463 -1654.46 -0.52023
10 8258.241 -2530.24 -0.79561
11 9418.019 -3406.02 -1.07099
12 10577.8 -4137.55 -1.30101
13 11737.57 -4039.57 -1.2702
14 12897.35 -3941.6 -1.2394
15 14057.13 -3843.63 -1.20859
16 15216.91 -3745.66 -1.17778
17 16376.69 -3647.69 -1.14698
18 17536.46 -2854.46 -0.89756
19 18696.24 -2061.24 -0.64814
20 19856.02 -1268.02 -0.39872
21 21015.8 -474.797 -0.14929
22 22175.57 318.4251 0.100125
23 23335.35 1111.647 0.349546
24 24495.13 1904.869 0.598966
25 25654.91 2698.092 0.848386
26 26814.69 3491.314 1.097807
27 27974.46 4284.536 1.347227
28 29134.24 5077.758 1.596648
population
y
SUMMARY
OUTPUT
Regression Statistics
Multiple R 0.999281057
R Square 0.998562631
Adjusted R
Square 0.998507347
Standard
Error 5791797.75
Observatio
ns 28
ANOVA
Significan
df SS MS F ce F
Regressio 6.06E+1 18062.
n 1 7 6.06E+17 6 1.73E-38
8.72E+1
Residual 26 4 3.35E+13
6.07E+1
Total 27 7
RESIDUAL OUTPUT OF
PREDICTED
POPULATION IN INDIA
Predicted Standard
Observation population Residuals Residuals
1 671448320.3 11880777 2.090387
2 689659340.2 8702997 1.531266
3 707870360.1 5855948 1.030336
4 726081379.9 3346907 0.588878
5 744292399.8 1183310 0.2082
6 762503419.6 -627244 -0.11036
7 780714439.5 -2076988 -0.36544
8 798925459.3 -3157984 -0.55564
9 817136479.2 -3862120 -0.67953
10 835347499.1 -4181104 -0.73565
11 853558518.9 -7137480 -1.25582
12 871769538.8 -7235090 -1.27299
13 889980558.6 -6945072 -1.22196
14 908191578.5 -6259133 -1.10128
15 926402598.3 -5168798 -0.90943
16 944613618.2 -3665415 -0.64492
17 962824638.1 -1740143 -0.30617
18 981035657.9 616045.4 0.108391
19 999246677.8 3412372 0.600397
20 1017457698 6658256 1.1715
21 1035668717 -6931281 -1.21954
22 1053879737 -5081921 -0.89415
23 1072090757 -2841384 -0.49993
24 1090301777 -202041 -0.03555
25 1108512797 2843884 0.500373
26 1126723817 6304320 1.109226
27 1144934837 1018748 1.792433
28 1163145856 6121034 1.076977
Predicted Price
y
Regression Statistics
Multiple R 0.97567
R Square 0.951932
Adjusted R
Square 0.950083
Standard Error 19.88692
Observations 28
ANOVA
df SS MS F Significance F
514.90
Regression 1 203638.8 203638.8 3 1.16E-18
Residual 26 10282.73 395.4897
Total 27 213921.6
RESIDUAL OUTPUT
Standar
Predicted d
Observatio price with '93 Residual Residual
n as Index = 100 s s
1 -6.30123 37.70123 1.931893
2 4.256267 30.74373 1.575376
3 14.81377 23.08623 1.182989
4 25.37126 13.82874 0.708615
5 35.92876 7.971237 0.408464
6 46.48626 1.713738 0.087816
7 57.04376 -4.64376 -0.23796
8 67.60126 -12.1013 -0.62009
9 78.15876 -19.1588 -0.98174
10 88.71626 -25.6163 -1.31263
11 99.27375 -29.2738 -1.50005
12 109.8313 -31.6313 -1.62085
13 120.3888 -20.3888 -1.04476
14 130.9463 -17.3463 -0.88886
15 141.5037 -13.7037 -0.70221
16 152.0612 -18.5612 -0.95112
17 162.6187 -10.8187 -0.55438
18 173.1762 -15.9762 -0.81866
19 183.7337 -14.8337 -0.76011
20 194.2912 5.708758 0.292529
21 204.8487 19.95126 1.022346
22 215.4062 22.59376 1.157754
23 225.9637 22.83626 1.17018
24 236.5212 16.47876 0.844408
25 247.0787 16.32126 0.836337
26 257.6362 14.06377 0.720658
27 268.1937 4.806267 0.246284
28 278.7512 -3.75123 -0.19222
Regression Statistics
Multiple R 0.957543
R Square 0.916889
Adjusted R
Square 0.913692
Standard
Error 379752.6
Observations 28
ANOVA
Significan
df SS MS F ce F
286.833
Regression 1 4.14E+13 4.14E+13 9 1.46E-15
Residual 26 3.75E+12 1.44E+11
Total 27 4.51E+13
RESIDUAL OUTPUT
demand of electricity in
north india in OCTOBERin forecasted value using
MW regression value actual/forecasted
24049 25017.6 0.961283257
27608 26939.5 1.024814863
30290 28861.4 1.049498638
29795 30783.3 0.96789493
32565 32705.2 0.99571322
AVAERAGE OF
ACTUAL/FORECASTED 0.999840982
Intercept -3826470
X Variable 1 1921.9
demand of electricity in
north india in JANUARY in forecasted value using regression
MW value actual/forecasted
24997 25037 0.998402365
27095 27146.7 0.998095533
29173 29256.4 0.997149342
31848 31366.1 1.015363721
33169 33475.8 0.99083517
AVAERAGE OF
ACTUAL/FORECASTED 0.999969226
Intercept -4202801.8
X Variable 1 2109.7