Hourly solar irradiance forecasting based on machine learning

Hourly solar irradiance forecasting based on machine learning
models (IEEE ICMLA 2016)
Young Statisticians Meeting 2017
F-N. Melzi, T. Touati, A. Samé and L. Oukhellou
[email protected]
French institute of sciences and technology for transport, development and networks
(IFSTTAR)
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
1 / 18
Summary
Summary
1
Introduction
2
Dataset Description
3
Forecast models
Time series models
Machine learning models
4
Results and Discussion
5
Conclusion and Current Work
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
2 / 18
Introduction
Global context
Augmentation of population in cities, Increasing of energy consumption
Scale up of infrastructures in terms of transport, telecommunications and public
equipment
In France 2015 : Nuclear 76.4 %, Hydro 10.8 % Other Renewable Energy 6.6 %
Challenge : increase the part of renewable energy production
Emergence of smart cities (Technologies)
Better monitoring of the renewable energy (photovoltaic, eolian,...)
Improve the quality of life, Respect of environment
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
3 / 18
Introduction
Context and Motivation
Large amount of data collected from smart devices, smart meters and sensors
(Meteorological Sensors)
Development of technologies in terms of data storage
Need of Data analytics in the energy domain
Smart City Energy analytics (SCE) project launch, both academic and industrial
partners (GE, ENGIE, Alstom Transport, IFSTTAR and others)
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
4 / 18
Introduction
Goals
Build decision tools based on Advanced Data Analytics in the Energy domain
Case study : Photovoltaic energy (One of the most used renewable energy
worldwide), dependence on the solar irradiance and weather condition
Focus
Forecast the hourly solar irradiance using only irradiance data
Machine learning techniques and Times series models.
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
5 / 18
Dataset Description
Data set
Hourly solar irradiance of the Paris suburb of Alfortville provided by Reuniwatt
company
The irradiance is collected for a duration of 144 months (from 1st January 2004 to
31st December 2015)
Each curve consists of a daily solar irradiance
Figure – Solar irradiance curves during the year 2015
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
6 / 18
Dataset Description
Figure – Evolution of the number of hours of sunshine over 12 years
The progression of the number of hours of sunshine is the same for one leap year to
another and even for the non-leap years
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
7 / 18
Dataset Description
(a)
(b)
Figure – (a) Distribution of the number of hours of sunshine during a leap year (2012) ; (b) non
leap year (2015)
The year is divided into 9 groups of days
The number of hours of sunshine varies between 9 and 17 hours
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
8 / 18
Forecast models
Figure – Hourly solar irradiance forcasting models
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
9 / 18
Forecast models
Time series models
Time series models
Naive model
The predicted value is equal to the last observed
ch = irh−1
ir
Autoregressive Moving Average (ARMA)
Stochastic process describing each time series Xt∈N by the folowing : .
φ(L)Xt = Θ(L)t
such that φ and Θ are polynomials of degree p and q respectively, L is the
lag-operator, and t∈N is white noise independent from Xt∈N
The solar irradiance time series is not stationary (ADF-test)
Calculation of the clear sky index k (Lauret et al 2015) a :
k = X /Xclear
The clear sky index time series is fitted by an ARMA
The solar irradiance is given by :
h
ch = kbh × Xclear
ir
a. http ://www.soda-is.com/eng/index.html
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
10 / 18
Forecast models
Machine learning models
Machine learning models
Figure – Methodology
Learning on the days having the same number of hours as the predicted day
Taking into account the previous hours of the forecasted day
Notation
{xih , yih }ni=1 → training data set
(x∗h , y∗h ) → test data
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
11 / 18
Forecast models
Machine learning models
Support Vector Machine Regression (SVR)
Maximizing the margin between the hyperplane and the data
Minimizing the empirical risk
ch =
ir
n
X
αi K (x∗h , xih ) + b
i=1
Neural Network (NN)
Non linear parameterized mapping from an input xi to an output yi
The model is designed by h − 1 inputs, m hidden neurons and one single output
(hour to forecast)
h−1
m
X
X
ch =
ir
wj f (
wji x∗h + b1 ) + b2 ,
j=1
i=1
Similarity
Distance between the actual observation and the past observations in the history
n
X
ch =
ir
wih yih
i=1
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
12 / 18
Results and Discussion
Evaluation of the models
Leave-one-out
Two years (2014 and 2015) are considered to train and evaluate the models
The training is performed on the n − 1 past observations
The test is carried out on the nth observation
The operation is repeated n times from 1st January 2015 to 31st December 2015
Normalized Root Mean Square Error (NRMSE)
→ Evaluation of the accuracy of each model
v
u
H
u 1 X
ch − irh )2
RMSE = t
(ir
H −1
h=2
NRMSE =
Taieb Touati (IFSTTAR,UPMC)
RMSE
¯
ir
RDJS 2017
April 03-07, 2017
13 / 18
Results and Discussion
Error indicator
NRMSE
Naive
0.545
ARMA
0.208
Models
SIM
0.252
SVR
0.203
NN
0.16
Table – Normalized Root Mean Square Error for the hourly forecasting models
Better performance of ARMA, SIM, SVR and NN comparing to Naive
NRMSE of ARMA is comparable to that of Machine Learning methods.
NN is the best forecasting model
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
14 / 18
Measured
Naive
ARMA
SIM
SVM
NN
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
400
600
−
−
−
−
−
−
●
●
●
●
●
●
●
●
●
●
●
●
●
200
Irradiance (W/m2)
800
Results and Discussion
●
●
●
●
●
●
0
●
●
●
●
●
●
●
●
●
1
2
3
4
5
6
●
●
●
●
●
●
●
●
7
8
9
●
●
●
●
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
Time (hour)
400
−
−
−
−
−
−
Measured
Naive
ARMA
SIM
SVM
NN
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
200
Irradiance (W/m2)
600
(a)
0
●
●
●
●
●
●
●
●
●
1
2
3
4
5
●
6
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
7
8
9
●
●
●
●
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
Time (hour)
(b)
Figure – (a) Measured and forecasted values for a sunny day ; (b) cloudy day
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
15 / 18
Results and Discussion
(a)
(b)
(c)
(d)
Figure – (a) Evolution of the NRMSE according to the year 2015 for ARMA ; (b) SIM ; (c) SVR ;
(d) NN
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
16 / 18
Conclusion and Current Work
Conclusion
An hourly solar irradiance forecasting is carried out
Three machine learning techniques are used (SIM, SVR and NN)
Comparison with time series methods (ARMA), Non stationarity of raw data.
Satisfactory results during the sunny days for machine learning methods and lesser
accuracy when dealing with cloudy days
Further Work
Test Hybrid models with meteorological variables
Perform daily forecasting
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
17 / 18
Conclusion and Current Work
Thank you for your attention !
Taieb Touati (IFSTTAR,UPMC)
RDJS 2017
April 03-07, 2017
18 / 18