ShortTermLoadForecastingusingMultipleregression
ShortTermLoadForecastingusingMultipleregression
net/publication/224309206
CITATIONS READS
170 6,522
3 authors, including:
All content following this page was uploaded by Cuneyt Suheyl Ozveren on 01 July 2015.
Abstract
In this paper we present an investigation for the short term (up 24 hours) load forecasting of the
demand for the South Sulewesi’s (Sulewesi Island – Indonesia) Power System, using a Multiple
Linear Regression (MLR) method. After a brief analytical discussion of the technique, the usage
of polynomial terms and the steps to compose the MLR model will be explained. Report on
TM
implementation of MLR algorithm using commercially available tool such as Microsoft EXCEL
will also be discussed. As a case study, historical data consisting of hourly load demand and
temperatures of South Sulawesi electrical system will be used, to forecast the short term load.
The results will be presented and analysed potential for improvement using alternative methods is
also discussed.
In the multiple linear regression method, the load y1 1 x11 x12 L x1k
is found in terms of explanatory (independent) y 1 x x 22 L x 2 k
variable such as weather and other variables Y = 2 And X = 21
y = β 0 + β 1 x1 + β 2 x 2 + ... + β k x k + ε (1)
Where y is the load, xi is the affecting factors, After parameters are calculated, this model
can be used for prediction. Assuming that all
βi is regression parameters with respect to xi , the independent variables have been correctly
identified and therefor the standard error will
and ε is an error term. be small. The standard error is obtained by the
equation below:
Load (MW)
Temp (C)
25
SSE = ∑ ( yi (t ) − yˆ i (t ) )
2 200,0
(5) 150,0
20
15
100,0 10
50,0 5
0,0 0
y i (t ) : observed, yˆ i (t ) : estimated 1 3 5 7 9 11 13 15 17 19 21 23
Time
n
∑ ( y (t ) − yˆ (t ))
i i
2
R2 = 1− t =1
n
(6) Figure 1 Daily load and temperature curve
∑ ( y (t ) − y (t ))
t =1
i i
2
For this forecasting study, data during the
rainy season and dry season were used
separately. In the MLR application, the hourly
y i (t ) : the average value of yi (t ) load is modelled as: (i) Intercept component
which is assumed constant for different time
Experience about the load to be modelled helps intervals of the day, (ii) Time of observation
an initial identification of the candidate of which is represent the load characteristic of
explanatory (independent) variables. To determine the day. (iii) Temperature sensitive component
the significance of regression parameters, the F which is function of difference of temperature
statistical test is performed and to determine the at time t and the average temperature in time
significance of each of these coefficients; is intervals. The relationship between the
performed by calculating t ratios. A goodness of fit temperature sensitive component, time of
measurement is represented by the R2 statistic observation and the load fluctuation are not
which ranges from 0 to 1 and indicates the linear but in form of polynomial term.
proportion of the total variation in the dependent
variable Y around its average that is counted for The model that is used divides the load curve
by the independent variable in the estimated into 3 intervals, 1 to 6 am, 7am to 17pm and
2 18 to 24 pm.
regression function. The closer the R statistic to
the value 1, the better the estimated regression
function fits the data. For the interval 1 to 6 am, 1 ≤ t ≤ 6 , the
hourly load can be modelled as follows
yˆ i (t ) = b0 + b1 (Ti (t ) − Ti (t − 1))
3 IMPLEMENTATIONS
+ b2 (Ti (t − 1) − Ti (t − 2)) (7)
The MLR technique as has been discussed + b3t + b4 t 2
previously is implemented to predict the hourly
load of South Sulawesi Electricity system.
For interval 7am to 17 pm, 7 ≤ t ≤ 17 , the
South Sulawesi is one of the provinces of Republic load curve is modelled by the form:
of Indonesia. Indonesia is located on the equator
with tropical weather, and have only dry and rainy
season throughout the year. The energy sales
composition of the consumer base in 2005 was
49% residential, 16% commercial, 26% Industrial
and 9% public, with the typical daily load (in MW)
and temperature curve as follows:
Where:
Table 2. Weekday MLR rainy season model
yˆ i (t ) = predicted load at hour t in the interval i of parameter estimates for different time intervals
of the day
the day
Tavb = Tava lagged 3 hours, (0C) For this review, the model is derived both for
the dry season and the rainy season, with
parameters shown in table 1 and 2
t = Time of observation. respectively. These parameters regression
have been calculated using weekday hourly
data for every time interval. The division of the
L o a d (M W )
200,0
n
yt − yˆ t Figure 3. Rainy season load curve
∑
t =1 yt
MAPE = 100 ⋅ (10) Table 3. Actual, forecast load and mean
n average percentage error (MAPE) for dry
and rainy season
-The MAPE in the dry season, is 3,52 %
Dry Season Rainy Season
-The MAPE in the rainy season is 4,34 % Time Actual Forecast Error Actual Forecast Error
1 190,1 186,7 1,78 194,5 193,7 0,41
Figure 2 and 3 show the actual and predicted 2 176,5 166,4 5,75 181,5 177,6 2,15
curve in dry season and rainy season respectively. 3 168,3 171,9 2,10 176,4 166,7 5,49
Table 3 describes the actual load, forecast and 4 163,3 165,7 1,43 174,9 161,1 7,94
5 182,2 164,2 9,92 182,4 170,9 6,32
MAPE for dry and rainy season,
6 179,1 178,2 0,50 192,7 188,3 2,28
7 167,9 169,3 0,85 171,3 165,3 3,48
Real 5Oct04 Forecast 05Oct04 8 185,4 174,6 5,85 187,7 181,8 3,17
350 9 203,4 186,3 8,42 192,8 205,4 6,51
10 214,1 200,6 6,30 207,9 218,5 5,09
300
11 213,2 207,0 2,92 215,5 212,2 1,52
250 12 212,0 218,6 3,10 212,2 210,2 0,96
13 213,1 220,8 3,63 212,6 213,0 0,19
Load (MW)
This model is very sensitive to the fluctuation of A pre analysis of the load is necessary for
temperature. It needs a very accurate temperature successful MLR forecast as the result of the
forecast, as a small change of temperature will analysis showed, because in South Sulawesi
cause a significant change in load prediction. This system, the load is not only affected by the
forecasting model uses the next day temperature fluctuation of temperature but also by other
forecast as an input which will introduce further weather factors such as humidity and cloud
errors, As there were no temperature forecast data cover. It is the daylight change that changes
available, temperature forecast was not included the lighting levels in residential and
in this regression analysis. Further studies should commercial spaces and demand of consumers
include this in real life applications. according to the brightness of the day.
The load characteristic is not only affected by the In this paper we have reported our preliminary
temperature, but also affected by the other investigations the use of MLR for STLF of the
weather factors such as humidity, cloud cover and South Sulawesi Electrical System, and in our
brightness of the day. Knowledge based on pre- studies to date we have only modelled
analysis of the load, identified that in South weekdays in wet and dry seasons.
Sulawesi system, the dominant portion of the load Investigations are underway to model
is residential (lighting) and this load is dependent weekend and holidays.
mostly on the levels and duration of the daylight.
It can be seen from load curve that the load rises
sharply at 18 pm. Therefore the choice of time
observation (t) as an independent variable in 7 ACKNOWLEDGEMENTS
equation (7), (8) and (9) of the model, increased
the accuracy of the forecasting. The author would like to thank PT. PLN
(persero), the Indonesian state electricity
company for financing the study in University
of Abertay Dundee
6 CONCLUSIONS
AUTHOR ADDRESS
Email: 0604719@abertay.ac.uk