SlideShare a Scribd company logo
Smooth Pinball based Quantile Network:
Neural Probabilistic Forecasting
By Kostas Hatalis
hatalis@gmail.com
Dept. of Electrical & Computer Engineering
Lehigh University, Bethlehem, PA
2018
Kostas Hatalis Quantile Neural Network 2018 1 / 21
Introduction
Inspired by SVQR we want to expand the investigation of nonlinear
nonparametric probabilistic forecasting using deep learning.
1 We propose and investigate a new objective function for neural
networks.
2 We introduce a way to prevent the quantile crossover problem.
3 We showcase how a multiple quantile based neural network can be
used for probabilistic forecasting of wind.
4 We design experiments using publicly available data from the Global
Energy Forecasting Competition 2014.
5 We show our method improves the skill, reliability, and sharpness over
various benchmarks.
Kostas Hatalis Quantile Neural Network 2018 2 / 21
Smooth Pinball Function
Pinball Loss Function
ρτ (u) =
τu if u ≥ 0
(τ − 1)u if u < 0
Quantile regression optimization problem:
min
W ,b
1
N
N
t=1
ρτ (yt − ˆq
(τ)
t ) (1)
The smooth approximation of the pinball function:
Sτ,α(u) = τu + α log 1 + exp −
u
α
Smooth quantile regression:
min
W ,b
1
N
N
t=1
Sτ,α(yt − ˆq
(τ)
t )
Kostas Hatalis Quantile Neural Network 2018 3 / 21
Smooth Pinball Function
Kostas Hatalis Quantile Neural Network 2018 4 / 21
Smooth Pinball Neural Network (SPNN)
The objective function for SPNN is then given by
E =
λ1
2NM
W [1] 2
F +
λ2
2NM
W [2] 2
F +
1
NM
N
t=1
M
m=1
...
τm(yt − ˆq
(τm)
t ) + α log 1 + exp −
yt − ˆq
(τm)
t
α
.
Kostas Hatalis Quantile Neural Network 2018 5 / 21
Smooth Pinball Neural Network (SPNN)
Gradient descent with backpropagation can be used to train SPNN.
Example gradients for a 2 layer SPNN:
∂Et
∂W [2]
=
λ2
M
W [2]
+
∂Et
∂ ˆQt
·
∂ ˆQt
∂Z
[2]
t
·
∂Z
[2]
t
∂W [2]
=
λ2
M
W [2]
+
1
M

 1
1 + exp yt − ˆQt
α
− T

 Ht
With respect to the weights of the first layer W[1] as follows
∂Et
∂W [1]
=
λ1
M
W
[1]
+


∂Et
∂ ˆQt
·
∂ ˆQt
∂Z
[2]
t
·
∂Z
[2]
t
∂Ht

 ·
∂Ht
∂Z
[1]
t
·
∂Z
[1]
t
∂W [1]
=
λ1
M
W
[1]
+
1
M




1
1 + exp
yt − ˆQt
α
− T



 W
[2]
1 − H
2
t Xt
.
Kostas Hatalis Quantile Neural Network 2018 6 / 21
Smooth Pinball Neural Network (SPNN)
Kostas Hatalis Quantile Neural Network 2018 7 / 21
Kostas Hatalis Quantile Neural Network 2018 8 / 21
Noncrossing Quantiles - Smooth Penalty Constraint
The condition 0 < τ1 < ... < τM are defined as the orders of M
conditional quantiles to be estimated.
To ensure these quantiles do not cross each other the following
constraint is needed q
(τ1)
t ≤ ... ≤ q
(τM )
t , ∀t.
We define the non-crossing quantile penalty term p as follows
penalty = c
N
t=1
M
m=1
max 0, − ˆq
(τm−1)
t − ˆq
(τm)
t
2
where ˆq
(τ0)
t = 0, is the least amount that the two quantile should differ
by, and c is the penalty parameter with a high value.
Kostas Hatalis Quantile Neural Network 2018 9 / 21
Empirical Evaluation
Datasets
Publicly available Global Energy Forecasting Competition 2014
(GEFCom2014)
Evaluation Metrics
QS, IS, ACE, and Sharpness
Benchmark Methods
Uniform, Climatology, Persistence, SVQR, QR
Case Studies Both studies we test SPNN having one and two hidden
layers denoted as SPNN1 and SPNN2. Only use raw data, no feature
engineering!
1. Estimate prediction intervals with nominal coverage from 10% to
90% in increments of 10%. Two wind farms. Also looked at QVSS
score.
2. We estimate 99 quantiles across all 10 wind farms.
Kostas Hatalis Quantile Neural Network 2018 10 / 21
Quantile Verification Skill Score (QVSS)
Quantile verification skill score (QVSS):
QVSS = 1 −
QSfor
QSref
QSfor is QS for the forecast method of interest (SPNN in our case)
QSref is the QS value for the reference forecast of a benchmark
method, which we use linear quantile regression.
Measures relative performance between methods.
Kostas Hatalis Quantile Neural Network 2018 11 / 21
Example Forecast
Kostas Hatalis Quantile Neural Network 2018 12 / 21
Sensitivity Analysis
Partial Derivatives (PaD) Method:
dt,i,M =
∂ ˆQτm
t
∂xt,i
=
∂ ˆQτm
t
∂Z
[2]
t
·
∂Z
[2]
t
∂Ht
·
∂Ht
∂Z[1],t
·
∂Z
[1]
t
∂xt,i
Sum of Square Derivatives (SSD):
SSDi,M =
N
t=1
(dt,i,M)
Kostas Hatalis Quantile Neural Network 2018 13 / 21
Sensitivity Analysis
Kostas Hatalis Quantile Neural Network 2018 14 / 21
Sensitivity Analysis, partial derivative of U100 for 90%
Kostas Hatalis Quantile Neural Network 2018 15 / 21
Case Study 1: Zone 1
Kostas Hatalis Quantile Neural Network 2018 16 / 21
Case Study 1: Zone 2
Kostas Hatalis Quantile Neural Network 2018 17 / 21
Case Study 2: ACE and Sharpness
Kostas Hatalis Quantile Neural Network 2018 18 / 21
Case Study 2: QS and IS
Kostas Hatalis Quantile Neural Network 2018 19 / 21
Case Study 2: GEFCom2014
Kostas Hatalis Quantile Neural Network 2018 20 / 21
Work Done
Probabilistic Forecasting by Deep Learning
[1] Kostas Hatalis, Alberto J Lamadrid, Katya Scheinberg, and Shalinee
Kishore. Multi quantile estimation based neural network for probabilistic
forecasting of wind power. IEEE Transactions on Neural Networks and
Learning Systems (In-Submission).
Kostas Hatalis Quantile Neural Network 2018 21 / 21

More Related Content

PPTX
Probability distributions & expected values
PPTX
Binomial probability distribution
DOC
Ordinary least squares linear regression
PDF
Moving Average
PPTX
Lesson 5 arima
PDF
PDF
Maximum Likelihood Estimation
PDF
Lecture: Monte Carlo Methods
Probability distributions & expected values
Binomial probability distribution
Ordinary least squares linear regression
Moving Average
Lesson 5 arima
Maximum Likelihood Estimation
Lecture: Monte Carlo Methods

What's hot (20)

PPTX
Probability Distribution
PPTX
Conditional-Probability-Powerpoint.pptx
PPTX
Quadratic Programming : KKT conditions with inequality constraints
PPTX
Maxima and minima
PPT
stochastic processes-2.ppt
PPTX
Hypergeometric probability distribution
PDF
Module 4_Machine Learning_Evaluating Hyp
PPT
The wave equation
PPTX
COVARIANCE IN PROBABILITY
PPTX
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
ODP
Introduction to Principle Component Analysis
PDF
02 Machine Learning - Introduction probability
PDF
Nonlinear component analysis as a kernel eigenvalue problem
PPTX
Monte carlo simulation
PPTX
taylors theorem
DOC
Covariance and correlation
PPTX
Random forest algorithm
PPTX
Numerical method-Picards,Taylor and Curve Fitting.
PPTX
Statistics for data science
Probability Distribution
Conditional-Probability-Powerpoint.pptx
Quadratic Programming : KKT conditions with inequality constraints
Maxima and minima
stochastic processes-2.ppt
Hypergeometric probability distribution
Module 4_Machine Learning_Evaluating Hyp
The wave equation
COVARIANCE IN PROBABILITY
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
Introduction to Principle Component Analysis
02 Machine Learning - Introduction probability
Nonlinear component analysis as a kernel eigenvalue problem
Monte carlo simulation
taylors theorem
Covariance and correlation
Random forest algorithm
Numerical method-Picards,Taylor and Curve Fitting.
Statistics for data science
Ad

Similar to Smooth Pinball based Quantile Neural Network (20)

PPTX
Restricted Boltzman Machine (RBM) presentation of fundamental theory
PDF
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
PDF
Time Delayed Recurrent Neural Network for Multi-Step Prediction
PDF
Clustering coefficients for correlation networks
PDF
Maneuvering target track prediction model
PDF
Survadapt-Webinar_2014_SLIDES
PPTX
Advanced_cluster_analysis.pptx
PDF
BNL_Research_Poster
PDF
Unbiased Bayes for Big Data
PDF
CLIM: Transition Workshop - A Notional Framework for a Theory of Data Systems...
PDF
A common fixed point theorem for two random operators using random mann itera...
PDF
CLIM: Transition Workshop - Sea Ice, Unstable Subspace, Model Error: Three To...
PDF
AINL 2016: Strijov
PDF
Streaming multiscale anomaly detection
PDF
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
PDF
[IJCT-V3I2P19] Authors: Jieyin Mai, Xiaojun Li
PDF
presentation_2.pdf
PDF
MVPA with SpaceNet: sparse structured priors
PDF
DATA STRUCTURES & ALGORITHMS MINIMUM SPANNING TREE
PDF
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
Restricted Boltzman Machine (RBM) presentation of fundamental theory
Constrained Support Vector Quantile Regression for Conditional Quantile Estim...
Time Delayed Recurrent Neural Network for Multi-Step Prediction
Clustering coefficients for correlation networks
Maneuvering target track prediction model
Survadapt-Webinar_2014_SLIDES
Advanced_cluster_analysis.pptx
BNL_Research_Poster
Unbiased Bayes for Big Data
CLIM: Transition Workshop - A Notional Framework for a Theory of Data Systems...
A common fixed point theorem for two random operators using random mann itera...
CLIM: Transition Workshop - Sea Ice, Unstable Subspace, Model Error: Three To...
AINL 2016: Strijov
Streaming multiscale anomaly detection
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
[IJCT-V3I2P19] Authors: Jieyin Mai, Xiaojun Li
presentation_2.pdf
MVPA with SpaceNet: sparse structured priors
DATA STRUCTURES & ALGORITHMS MINIMUM SPANNING TREE
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
Ad

Recently uploaded (20)

PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Computer network topology notes for revision
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Lecture1 pattern recognition............
PPT
Quality review (1)_presentation of this 21
PDF
annual-report-2024-2025 original latest.
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Supervised vs unsupervised machine learning algorithms
Miokarditis (Inflamasi pada Otot Jantung)
Galatica Smart Energy Infrastructure Startup Pitch Deck
Clinical guidelines as a resource for EBP(1).pdf
Business Acumen Training GuidePresentation.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Foundation of Data Science unit number two notes
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Computer network topology notes for revision
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Introduction-to-Cloud-ComputingFinal.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Lecture1 pattern recognition............
Quality review (1)_presentation of this 21
annual-report-2024-2025 original latest.
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
1_Introduction to advance data techniques.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Supervised vs unsupervised machine learning algorithms

Smooth Pinball based Quantile Neural Network

  • 1. Smooth Pinball based Quantile Network: Neural Probabilistic Forecasting By Kostas Hatalis hatalis@gmail.com Dept. of Electrical & Computer Engineering Lehigh University, Bethlehem, PA 2018 Kostas Hatalis Quantile Neural Network 2018 1 / 21
  • 2. Introduction Inspired by SVQR we want to expand the investigation of nonlinear nonparametric probabilistic forecasting using deep learning. 1 We propose and investigate a new objective function for neural networks. 2 We introduce a way to prevent the quantile crossover problem. 3 We showcase how a multiple quantile based neural network can be used for probabilistic forecasting of wind. 4 We design experiments using publicly available data from the Global Energy Forecasting Competition 2014. 5 We show our method improves the skill, reliability, and sharpness over various benchmarks. Kostas Hatalis Quantile Neural Network 2018 2 / 21
  • 3. Smooth Pinball Function Pinball Loss Function ρτ (u) = τu if u ≥ 0 (τ − 1)u if u < 0 Quantile regression optimization problem: min W ,b 1 N N t=1 ρτ (yt − ˆq (τ) t ) (1) The smooth approximation of the pinball function: Sτ,α(u) = τu + α log 1 + exp − u α Smooth quantile regression: min W ,b 1 N N t=1 Sτ,α(yt − ˆq (τ) t ) Kostas Hatalis Quantile Neural Network 2018 3 / 21
  • 4. Smooth Pinball Function Kostas Hatalis Quantile Neural Network 2018 4 / 21
  • 5. Smooth Pinball Neural Network (SPNN) The objective function for SPNN is then given by E = λ1 2NM W [1] 2 F + λ2 2NM W [2] 2 F + 1 NM N t=1 M m=1 ... τm(yt − ˆq (τm) t ) + α log 1 + exp − yt − ˆq (τm) t α . Kostas Hatalis Quantile Neural Network 2018 5 / 21
  • 6. Smooth Pinball Neural Network (SPNN) Gradient descent with backpropagation can be used to train SPNN. Example gradients for a 2 layer SPNN: ∂Et ∂W [2] = λ2 M W [2] + ∂Et ∂ ˆQt · ∂ ˆQt ∂Z [2] t · ∂Z [2] t ∂W [2] = λ2 M W [2] + 1 M   1 1 + exp yt − ˆQt α − T   Ht With respect to the weights of the first layer W[1] as follows ∂Et ∂W [1] = λ1 M W [1] +   ∂Et ∂ ˆQt · ∂ ˆQt ∂Z [2] t · ∂Z [2] t ∂Ht   · ∂Ht ∂Z [1] t · ∂Z [1] t ∂W [1] = λ1 M W [1] + 1 M     1 1 + exp yt − ˆQt α − T     W [2] 1 − H 2 t Xt . Kostas Hatalis Quantile Neural Network 2018 6 / 21
  • 7. Smooth Pinball Neural Network (SPNN) Kostas Hatalis Quantile Neural Network 2018 7 / 21
  • 8. Kostas Hatalis Quantile Neural Network 2018 8 / 21
  • 9. Noncrossing Quantiles - Smooth Penalty Constraint The condition 0 < τ1 < ... < τM are defined as the orders of M conditional quantiles to be estimated. To ensure these quantiles do not cross each other the following constraint is needed q (τ1) t ≤ ... ≤ q (τM ) t , ∀t. We define the non-crossing quantile penalty term p as follows penalty = c N t=1 M m=1 max 0, − ˆq (τm−1) t − ˆq (τm) t 2 where ˆq (τ0) t = 0, is the least amount that the two quantile should differ by, and c is the penalty parameter with a high value. Kostas Hatalis Quantile Neural Network 2018 9 / 21
  • 10. Empirical Evaluation Datasets Publicly available Global Energy Forecasting Competition 2014 (GEFCom2014) Evaluation Metrics QS, IS, ACE, and Sharpness Benchmark Methods Uniform, Climatology, Persistence, SVQR, QR Case Studies Both studies we test SPNN having one and two hidden layers denoted as SPNN1 and SPNN2. Only use raw data, no feature engineering! 1. Estimate prediction intervals with nominal coverage from 10% to 90% in increments of 10%. Two wind farms. Also looked at QVSS score. 2. We estimate 99 quantiles across all 10 wind farms. Kostas Hatalis Quantile Neural Network 2018 10 / 21
  • 11. Quantile Verification Skill Score (QVSS) Quantile verification skill score (QVSS): QVSS = 1 − QSfor QSref QSfor is QS for the forecast method of interest (SPNN in our case) QSref is the QS value for the reference forecast of a benchmark method, which we use linear quantile regression. Measures relative performance between methods. Kostas Hatalis Quantile Neural Network 2018 11 / 21
  • 12. Example Forecast Kostas Hatalis Quantile Neural Network 2018 12 / 21
  • 13. Sensitivity Analysis Partial Derivatives (PaD) Method: dt,i,M = ∂ ˆQτm t ∂xt,i = ∂ ˆQτm t ∂Z [2] t · ∂Z [2] t ∂Ht · ∂Ht ∂Z[1],t · ∂Z [1] t ∂xt,i Sum of Square Derivatives (SSD): SSDi,M = N t=1 (dt,i,M) Kostas Hatalis Quantile Neural Network 2018 13 / 21
  • 14. Sensitivity Analysis Kostas Hatalis Quantile Neural Network 2018 14 / 21
  • 15. Sensitivity Analysis, partial derivative of U100 for 90% Kostas Hatalis Quantile Neural Network 2018 15 / 21
  • 16. Case Study 1: Zone 1 Kostas Hatalis Quantile Neural Network 2018 16 / 21
  • 17. Case Study 1: Zone 2 Kostas Hatalis Quantile Neural Network 2018 17 / 21
  • 18. Case Study 2: ACE and Sharpness Kostas Hatalis Quantile Neural Network 2018 18 / 21
  • 19. Case Study 2: QS and IS Kostas Hatalis Quantile Neural Network 2018 19 / 21
  • 20. Case Study 2: GEFCom2014 Kostas Hatalis Quantile Neural Network 2018 20 / 21
  • 21. Work Done Probabilistic Forecasting by Deep Learning [1] Kostas Hatalis, Alberto J Lamadrid, Katya Scheinberg, and Shalinee Kishore. Multi quantile estimation based neural network for probabilistic forecasting of wind power. IEEE Transactions on Neural Networks and Learning Systems (In-Submission). Kostas Hatalis Quantile Neural Network 2018 21 / 21