34235.pdf

An Expert System Approach to Short-Term Load
Forecasting For Reliance Energy Limited, Mumbai
M. S. S. Rao, S. A. Soman, B. L. Menezes, Pradeep Chawande, P. Dipti, and T. Ghanshyam
Abstract— Economically efficient generation scheduling requires accurate forecasting of load. In this paper, we propose
a Short Term Load Forecasting program for Reliance Energy
Limited (REL) in Mumbai region. The method is based on a
similar day approach. The development of forecast engine involves
4-steps. The first step involves discussion with domain experts
(Utility Engineers) to extract and learn the rules regarding system
behaviour. In the next step, these rules are refined by statistical
analysis. A linear prediction model for each day of week is
developed. The third step involves an adaptive implementation of
the rules. The parameters of the linear model are learned from
previous data by solving an optimization problem. Quadratic
Programming is used with redundancy factor 2. The final step
involves fine-tuning of forecast by re-shaping the forecast as
the reference day using Fast Fourier Transform, filtering and
smoothening by 3-point moving average technique. Normalization
is done using dc component of reference day.
Since the parameters are learnt from past few weeks data, the
seasonal variations due to change in season like winter, summer
are better modeled. Detailed study of the results of the forecast
program, the overall Mean Absolute Percentage Error (MAPE)
of the forecasted data is 2.89 over an interval from Aug’04 to
May’05 which is reasonable.
Index Terms— Expert System Approach, Quadratic Programming, Short-Term Load Forecasting.
T
I. I NTRODUCTION
HE main aim of a power system is to produce electric
power as to satisfy consumer’s requirements at all times
and at reasonable cost. Short Term Load Forecasting (STLF)
plays an important role in electric power system operation and
planning. It is fundamental to provide economic generation,
system security, load management and planning. Forecast of
the total load demand is necessary for Unit Commitment
and Economic Dispatch of generating units. Short-term load
forecasting is also essential for system security and load
management. Western Region Electricity Board (WREB) has
implemented the Availability Based Tariff (ABT) [1]. Under
this tariff structure, the utilities are penalized for the unscheduled interchange i.e. for the over drawl and under drawl and
M. S. S. Rao is Ph D student at Indian Institute of Technology Bombay,
Mumbai (email:[email protected])
S A Soman is with Department of Electrical Engineering, Indian Institute
of Technology - Bombay, (e-mail: [email protected]), Phone No: 91-2225767435, Fax:91-22-25723707
B. L. Menezes is an Associate Professor in KReSIT, Indian Institute of
Technology - Bombay, (e-mail:[email protected])
Pradeep Chawande is a Senior Manager, Reliance Energy Limited,
Mumbai, (e-mail: [email protected])
P. Dipti (e-mail: [email protected]) and T. Ghanshyam (e-mail:
[email protected]), are Senior Engineers, Reliance Energy Limited, Mumbai
is frequency dependent. Accurate forecasting not only reduces
the generation cost in a power system, but also provides a good
principle of effective operation and increase the profits of the
electricity markets in the context of ABT.
This paper proposes a rule-based expert system for STLF
on Reliance Energy Limited Mumbai network.
This paper is organized as follows. In section II, the
parameters which affects the system load is discussed. Section
III deals with the data analysis. Section IV discuss the expert
system method for STLF and the rules for expert system.
Section V gives the results and Section VI concludes the paper.
II. T HE SYSTEM LOAD
The system load is the sum of all the individual demands
at all the nodes of the power system. In principle, one
could determine the system load pattern if each individual
consumption pattern were known. However, the demand or
usage pattern of an individual load of a customer is quite
random and highly unpredictable. Also, there is a very broad
diversity of individual usage patterns in a typical utility. These
factors make it impossible to predict the system demand
levels by extrapolating the estimated individual usage patterns.
Fortunately, however the totality of the individual loads results
in a distinct consumption pattern which can be statistically
predicted.
The system load behavior is influenced by a number of
factors [2]. These factors can be classified into four major
categories.
Economic factors
Time factors
Weather factors
Random disturbances.
To model the system load, one needs to understand the impact
of each class of factors on the electricity consumption patterns.
A. Economic Factors
The economic environment in which the utility operates has
a clear effect on the electric demand consumption patterns.
Factors, such as the service area demographics, levels of
industrial activity, changes in the farming sector, the nature
and level of penetration of the appliances in the population,
developments in the regulatory climate and more generally,
economic trends have significant impacts on the system load
growth/decline trend. In addition, utility-initiated programs,
such as changes in rate design and demand management
programs, also influence the load. Typically, these economic
0-7803-9525-5/06/$20.00 c 2006 IEEE
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on December 2, 2008 at 01:56 from IEEE Xplore. Restrictions apply.
factors operate with considerably longer time constants than
one week. It is important to account for these factors in the
updation of forecasting modes from one year to the next or
possible from one season to another. The economic factors
are not, however, explicitly represented in the short term
load forecasting models because of the longer time scales
associated with them.
B. Time Factors
The main time factors that plays an important role in
influencing the load pattern are seasonal effects, weekly-daily
cycle, and public and religious holidays. The seasonal changes
determine whether a utility is summer or winter peaking.
Certain changes in the load pattern occur gradually in response
to seasonal variation such as the number of daylight hours
and the changes in temperature. On the other hand, there are
seasonal events which brings abrupt, but important structural
modifications in the electricity consumption pattern. These
are the shifts to and from Day-light Time, changes in the
rate structure(time-of-day or seasonal demand), start of school
year, and significant reductions of activities during vacation
periods.
The weekly- daily periodicity of the load is a consequence
of the work-rest pattern of the service area population. There
are well-defined load patterns for “typical” seasonal weeks.
The existence of statutory and religious holidays has the
general effect of significantly lowering the load values to
levels well below “normal”. Moreover, on days preceding
or following holidays, modifications in the electricity usage
pattern are observed due to the tendency of creating “long
weekends.”
C. Weather Factors
Meteorological conditions are responsible for significant
variation in the load patterns. This is true because most utilities
have large components of weather-sensitive load, such as
those due to space heating, air conditioning, and agriculture
irrigation.
In many systems, temperature is the most important weather
variable in terms of its effects on the load. For any given
day, the deviation of the temperature variable from a normal
value may cause significant changes as to require major
modifications in the unit commitment pattern. Moreover, past
temperatures also affects the load profile. For example, a string
of high-temperature days may result in such heat buildup
throughout the system as to create a new system peak. For a
system with nonuniform geography and climate, several temperature or several areas may need to be considered to account
for variations in the system load. Humidity is a factor that may
affect the system load in a manner similar to temperature,
particularly in hot and humid areas. Thunderstorms also have
a strong effect on the load due to the change in temperature
that they induce. Other factors that impact on load behavior
are wind speed, precipitation, and cloud cover/light intensity.
D. Random Disturbances
The power system is continuously subject to random disturbance reflecting the fact that the system load is a composite
of a large number of individual demands. In addition to a
large number of very small disturbances, there are large loadssteel mills, synchrotron, wind tunnels- whose operation can
cause large variations in electricity usage. Since the hours of
operations of these large devices are usually unknown to utility
dispatchers, they represent large unpredictable disturbances.
There are also certain events such as widespread strikes, shutdown of industrial facilities and special television programs
whose occurrence is known priori, but whose effect on the
load is uncertain.
III. DATA A NALYSIS
The data analysis is carried out on data provided by Reliance
Energy Limited (REL) in the Mumbai region. The REL is one
of the major distribution companies in Mumbai having peak
load of 1200MW and having its own generation of 500MW.
Forecast for the next day implies forecast from 00:00 hrs to
23:45 hrs of the next day. Forecast has to be given at 15:00 hrs
(previous day). Usually, accuracy of forecast improves as the
gap between time of forecast and forecast hours is reduced. In
the analysis phase, the load curves are drawn and the variation
of the load with temperature is studied.
A. Load Curves
The load data contains half hourly load data and minimum
and maximum temperature values of the day. The load curve
for the month of July-2004 is shown in Fig. 1. The observations from the load curves are as follows:
1) The shape of the load curve is almost the identical on
same week days, if the day is not a holiday.
2) The load shape is affected by holidays.
3) Average demand is almost same on the same week days.
4) The Fig. 1 clearly shows the weekly seasonality.
5) The Fig. 2 reveals that the shape of the load curve is
almost same on the same week day, but it scales up and
down.
6) Days are classified based on their load into the following
categories.
a) Normal
working day
Saturday
b) c) Holiday during the week
d) Holiday preceded or followed by weekend
e) Special days
f) Outliers
B. Variation of the load with Temperature
Temperature is the most important weather variable in terms
of its effects on the load. For any given day, the deviation of
the temperature variable from a normal value may cause significant load changes. Fig. 3 and 4 show the relation between
load and temperature. The graph shows the positive correlation
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on December 2, 2008 at 01:56 from IEEE Xplore. Restrictions apply.
Fig. 1.
Load curve for the month of July 2004.
Fig. 4.
A plot between the average load and average temperature.
TABLE I
Sun
Mon
Tue
Wed
Thu
Fri
Sat
B EST CORRELATED PREVIOUS DAY
best
best
best
Pre Sun
2 weeks back
3 weeks back
Sunday
Sunday
Thu
Fri
Mon
Thu
Mon
Tue
Tue
Thu
Mon
Tue
Wed
Thu
Thu
Wed
Tue
Thu
Wed
Tue
C. Correlation Analysis
Fig. 2.
Load curves of all Wednesdays in the months May to August 2004.
between load and temperature i.e. demand increases with
temperature. Fig. 3 is the plot between average temperature
and maximum demand. Fig. 4 is the plot between average load
and average temperature. Average temperature here is simply
the mean of maximum and minimum temperature of the day.
In the plots dot represents the actual values and the solid line
is the best fit curve.
The above load curves show good correlation on week days.
The correlation coefficient is used to study the correlation of
load data between the two days. The correlation coefficient
between two days is calculated using the equation 1. Here
the correlation coefficient is studied between particular day
and the last seven days, the corresponding day two weeks and
three weeks back. The same information is studied over an
year data for the other week days also. The best correlated
days for a particular day were given in the Table I.
(
"!
"!
#$&%
!
)
*,+
%'
!
&%
%-,+.
0/ +
(1)
)
From this correlation analysis we can find the best correlated
days and we can select them as reference days.
D. Auto Correlation Analysis
Fig. 3.
A plot between the average load and maximum temperature.
Autocorrelation Function(ACF) is the term used to describe
the association or mutual dependence between values of the
same time series at different time periods. This can be com!
!
puted using the general equation,
2
$! 8 9 2
:
6 8 132 54 7
(2)
+
8 4
The Fig. 5 show the autocorrelation function at different
time
lags. In this figure we can see that ACF of 48 time lag
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on December 2, 2008 at 01:56 from IEEE Xplore. Restrictions apply.
is high. This means that
there is
a seasonality and a good
correlation between 1 and 48 point which are 24 hours
apart. In other words today’s load has good correlation with
yesterday’s load at the same time.
Here is a constant, is the forecasted mean
temperature of target day and is the reference
day’s mean temperature.
average temperature is given by
"!$#&% 9 + "!('*) The
. The parameter ‘ ’ is optimized
for every month. Typical value of ‘ ’ for Sunday is 30. The
forecast data in the formula form is given by
547698
+-, 103:
0:00 to 6:00
2 ;=
<>? From
/.
From 06:15 to 23:45
:; is the reference day load.
Where
(4)
B. Monday model
Fig. 5.
Autocorrelation of load data at different time lags.
IV. A N E XPERT S YSTEM BASED M ETHOD FOR STLF
An expert system [3], [4] is one which can reason, explain
and have its knowledge. The expert systems represent the
expert knowledge as data or rules within the computer. These
rules and data can be called upon when needed to solve
problems. This rule based algorithm consists of the following:
Logical and syntactical relation between weather and
daily load shapes.
Set of relations between changes in the load and change
in the natural and forced factors.
Rule base consists of all rules taking the IF-THEN form
and mathematical expressions.
After a detailed study of data, we concluded that the load
data has correlation with the previous day load data on normal
week days (Tuesday to Friday) provided that the day is not
a holiday and there is not much variation in temperature on
that day. The load on Sundays is having high correlation with
previous Sundays. We use these conclusions to develop the
rule based algorithm.
A. Sunday model
The load curves and the correlation analysis shows that
Sunday is having good correlation with previous Sundays.
Thus the previous Sunday has been selected as reference day
(RD) if it is a normal day, else previous to previous, else
three weeks back Sunday is selected as reference day if it is a
normal day. Forecast for Sunday load from 0:00hrs to 6:00hrs
is the actual load of previous day, i.e the Saturday’s load. After
6:00hrs the forecast load is the reference day’s load times the
Temperature Correction Factor (TCF).
(3)
The best correlated days for Monday are previous Thursday,
Friday and Monday. The same order is used to select the
reference day for Monday if the target day and reference day
both are normal days. Forecast of the load from 0:00hrs to
03:00hrs is the actual load of previous day, i.e the Sunday’s
load. From 3:15hrs to 8:00hrs reference day is previous
). The
Tuesday multiplied by the load correction factor ( @
reason for choosing Tuesday is that the early Monday morning
load does not pick up in the same manner as early Sunday
morning load. After 08:00hrs the forecast load is the reference
day’s load times the Temperature Correction Factor (TCF).
47698
0:00 to 3:00
+-, ACB EF GI
D HKJJ L 747698 < @ From
From 03:15 to 08:00
:?;M<>?
From 08:15 to 23:45
(5)
=ratio of the average load on Sunday and Tuesday
Where @
represents the normalbetween 03:00 hrs to 08:00hrs. @
ization of one week back data with this week data.
C. Tuesday model
Since the load curves and the correlation analysis shows that
the Tuesday has good correlation with the previous Monday,
Tuesday, Thursday and Friday, a linear model can formed
using these days as in equation 6.
+-, 8
.
\[^]_
>< J 7476N8O Q< PSR L 476N8K T >< U J 4V698
+
W <>U J +NX(Y Z < 8
(6)
.
we
have some additional constraints like
W also
I`a"b
. If `a"b Sc , then the forecast load will
However
4
be the convex combination of good correlated days. However
in a generic frame work, the parameter `a"b can also be
optimized.This leads to a Linear Programming (LP) problem.
dZ
represents a temperature dependent offset parameter.
In the above problem the coefficients to Z can be
found using Quadratic Program (QP). Our aim is to learn
the best values of coefficients to Z using previous data.
The learning is adaptive and the coefficients are upgraded
each week based upon the last 10 weeks data. The MATLAB
inbuilt function “QuadProg” is used to solve for to Z and
substituted in the equation 6 for forecast on Tuesday. Most
recent 10 week data is used in the learning process. We would
like to claim that every half an hour interval leads to a different
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on December 2, 2008 at 01:56 from IEEE Xplore. Restrictions apply.
LP problem and hence, to Z are not average values on a
day.
Similarly, the models for Wednesday, Thursday, Friday and
Saturday have been developed and they summarized below.
D. Wednesday model
+-, 8 >< J 4V698 Q< PSR L 4V698 T < , 47698
+
W <>U J +NX(Y Z < 8 (7)
TABLE II
M EAN , M IN AND M AX MAPE S OVER THE PERIOD AUG ’04 TO M AY ’05.
Mean MAPE
Sun
Mon
Tue
Wed
Thu
Fri
Sat
All Data
2.9369
3.3787
2.8529
2.6366
2.6376
2.8558
2.9807
2.8944
Minimum
MAPE
0.88499
0.87954
0.95412
1.0543
0.65905
0.8773
0.6767
0.65905
Maximum
MAPE
9.5171
8.1652
7.4056
8.4326
6.9554
7.1751
10.244
10.244
E. Thursday model
+-, .
< , 47698 +
< J 476N8 T < V. R ESULTS
.
(8)
F. Friday model
+-, (6
<?
U J 476N8V
+
< , 476N87 T < $6
All the rules that are mentioned in section IV have been
coded in the MATLAB. The results are compared using Mean
Absolute Percent Error (MAPE), defined
as follows:
!
P
(9)
c
a R
J
)
,
+ R 1 V , R ,
V J R , 2
<
c
]]
(11)
G. Saturday model
+-, <U J 74 698V
+
<
1
476N87 T < A. MAPE Analysis
(10)
All the coefficients were found using QP and substituted
in equations 7 to 10 to get the forecast for Wednesday,
Thursday, Friday and Saturday respectively.
Remark 1. The above models are used upto 15:00 hrs when
the operator has to submit the load forecast for the next day.
Post 15:00 hr forecast can not use data after 15:00hrs of the
previous day as it is yet to happen. Hence, similar models
but with reduced information content are used for forecasting
after 15:00hrs.
The MAPE analysis shows that the mean, minimum and
maximum MAPEs on overall data are 2.8979, 0.659 and
10.24 respectively. Large MAPEs usually represents out-liers.
Handling the outliers requiring much more data and analysis,
which is beyond scope of this paper. Table II shows the values
of Mean MAPE, minimum MAPE, maximum MAPE for each
week day and overall data.
The pie chart in Fig. 6 shows that 81% of the days have
MAPE less than 4%, 8% of the times the MAPE is in between
4 and 5, and 11% of the times MAPE is greater than 5 which
is quite acceptable.
H. Holiday Forecasting
The data analysis reveals that the load on public holidays
(on week days) resembles that of Sunday. This is because that
the industries and offices are closed. Therefore, reference day
to forecast for holidays is taken as Sunday. The forecast data
for holiday is reference day times TCF.
I. Smoothening and Reshaping of the forecasted load curve
Since the forecast methods have different models for different time slots in a day, there may occur discrete jumps in the
forecast load, which is not desirable. To eliminate the changes
in the forecasted load, we have performed smoothing of raw
forecast. Smoothing is done in two steps.
1) Fast Fourier Transform (FFT) Smoothing.
2) 3-Point moving average smoothing.
By doing FFT of the forecasted load, we will get the frequency
domain data. By eliminating the higher harmonics which cause
the discrete jumps in the forecasted load, we can smooth
the variations in the forecast. Subsequently, a simple 3-point
moving average is used.
Fig. 6.
range.
Pie chart showing the percentage days having MAPE in the given
B. Study the Variation of coefficient
`a"b
As a part of reducing the error, the MAPEs are studied by
varying the `Vb . The Fig. 7 shows the MAPE as a function of
`a"b . The best fitted `Vb values are given in the Table III.
Reason for the variation of best `a"b :
1) The best fitted `a"b on Monday is 0.975 because Monday’s actual load is less than Thursday which is used for
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on December 2, 2008 at 01:56 from IEEE Xplore. Restrictions apply.
TABLE III
B EST FITTED Sun
1.00
Mon
0.975
Tue
1.01
Wed
1.00
1300
Thu
0.995
Fri
1.00
Sat
0.975
Load curves of the actual and forecasted on 15−Sep−05
1200
Load in MVA
1100
sensitivity of MAPE with beq
6.5
Sun
6
Mon
5.5
Tue
MAPE
Forecasted
Actual
900
800
Wed
5
1000
Thu
700
Fri
4.5
Sat
4
600
0
3.5
5
15
10
Time of the day
20
25
3
2.5
0.95
Fig. 7.
Fig. 8.
0.96
0.97
0.98
0.99
1
beq
1.01
1.02
1.03
1.04
MAPE Vs .
VII. ACKNOWLEDGMENT
the forecast of the Monday’s load. This implies that we
should not use different linear combination of models.
2) The best fitted `a"b on Tuesday is 1.01 because the
Tuesday’s load is greater than Monday’s load which is
used to forecast Tuesday’s load.
3) The best fitted `a"b for Saturday is 0.975 because Thursday’s load is greater than Saturday’s load.
C. Best values of coefficient of “ ”
was find out by
The best values of coefficient ‘ ’ in
trial and error. The best values of coefficient ‘ ’ for each day is
given in the Table IV. These values of ‘ ’ are usually updated
every month, so as to follows the gross changes in the system.
Fig. 8 shows the actual load curve and forecasted load curve
for 15-Sept-05. It is seen that forecast follows the actual load
pattern closely.
VI. C ONCLUSIONS
This paper proposed an expert system approach to STLF
applied to REL Mumbai, region. The basic idea is to use
similar day approach but the coefficients for linear model are
learnt by solving a quadratic programming problem over a
most recent few weeks data. This makes learning adaptive.
Results obtained over one year during Aug’04 to May’05 are
very encouraging. We feel that the proposed methodology is
generic enough to be applied to forecasting problem of other
distribution companies (DISCOM) / utilities.
TABLE IV
B EST FITTED Sun
30
Actual and forecasted load curve on 15-Sept-05.
1.05
Mon
22
Tue
36
Wed
34
The authors would like to thank Reliance Energy Limited
for their invaluable co-operation and suggestions.
VIII. R EFERENCES
[1] (2004) Kalki communicataion technologies. [Online]. Available:
http://www.kalkitech.com/downindex/IntroductionToABT.pdf
[2] G. Gross and F. D. Galiana, “Short-term load term forecasting,” Proceeding of IEEE, pp. 1558 – 1570, Dec. 1987.
[3] S. Rahman and R. Bhatnagar, “An expert system based algorithm for
short term load forecast,” IEEE Trans. Power Syst., pp. 392 – 398, 1988.
[4] K. L. Ho, “Short term load forecasting of Taiwan Power System using
a knowledge-based expert system,” IEEE Trans. Power Syst., pp. 1214 –
1221, 1990.
IX. B IOGRAPHIES
M. S. S. Rao received the M.Tech degree in Energy Systems Engineering,
from Indian Institute of Technology, Bombay, India, in 2005. Currently he is
pursuing Ph.D at IITB, Mumbai.
S. A. Soman is an Associate Professor in Department of Electrical Engineering, IITB, Mumbai, India. His research areas include Power System
Computation, Power System Protection, and Object Oriented Programming.
He has authored a book titled “Computational Methods for Large Power
System Analysis: An Object Oriented Approach”, published by Kluwer
Academic in 2001.
B. L. Menezes is an Associate Professor in KReSIT at the Indian Institute
of Technology, Bombay, India. His research areas include Forecasting, Data
Mining, and Network Security.
Pradeep Chawande received the M. Tech from Indian Institute of Bombay,
Mumbai, India in 1998, working as a Senior Manager in Reliance Energy
Limited. His research interests include Power System Protection, Numerical
Relays, Distribution Automation, and SCADA.
P. Dipti working as a Senior Engineer in Reliance Energy Limited, Mumbai.
Thu
40
Fri
22
Sat
5
T. Ghansyam working as a Senior Engineer in Reliance Energy Limited,
Mumbai.
Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on December 2, 2008 at 01:56 from IEEE Xplore. Restrictions apply.