SlideShare a Scribd company logo
Ilab METIS
Optimization of Energy Policies
Olivier Teytaud + Inria-Tao + Artelys
TAO project-team
INRIA Saclay Île-de-France
O. Teytaud, Research Fellow,
olivier.teytaud@inria.fr
http://www.lri.fr/~teytaud/
Outline
Who we are
What we solve
Methodologies
Ilab METIS
www.lri.fr/~teytaud/metis.html
Ilab METIS
www.lri.fr/~teytaud/metis.html
● Metis = Tao + Artelys
● TAO tao.lri.fr, Machine Learning & Optimization
● Joint INRIA / CNRS / Univ. Paris-Sud team
● 12 researchers, 17 PhDs, 3 post-docs, 3 engineers
● Artelys www.artelys.com SME
- France / US / Canada
- 50 persons
==> collaboration through common platform
● Activities
● Optimization (uncertainties, sequential)
● Application to power systems
Fundings
● Inria team Tao
● Lri (Univ. Paris-Sud, Umr Cnrs 8623)
● FP7 european project (city/factory scale)
● Ademe Bia(transcontinental stuff)
● Ilab (with Artelys)
● Indema (associate team with Taiwan)
● Maybe others, I get lost in fundings
Outline
Who we are
What we solve
Methodologies
Industrial application
● Building power systems is expensive
power plants, HVDC links, networks...
● Non trivial planning questions
● Compromise: should we move solar power to the
south and build networks ?
● Is a HVDC connection “x ↔ y” a good idea ?
● What we do:
● Simulate the operational level of a given power
system (this involves optimization of operational
decisions)
● Optimize the investments
● Planning/control
● Pluriannual planning: evaluate marginal costs of hydroelectricity
● Taking into account stochasticity and uncertainties
==> IOMCA (ANR)
● High scale investment studies (e.g. Europe+North Africa)
● Long term (2030 - 2050)
● Huge (non-stochastic) uncertainties
● Investments: interconnections, storage, smart grids, power plants...
==> POST (ADEME)
● Moderate scale (Cities, Factories)
● Master plan optimization
● Stochastic uncertainties
==> Citines project (FP7)
Specialization on Power Systems
Example: interconnection studies
(demand levelling, stabilized supply)
The POST project – supergrids
simulation and optimization
European subregions:
- Case 1 : electric corridor France / Spain / Marocco
- Case 2 : south-west
(France/Spain/Italiy/Tunisia/Marocco)
- Case 3 : maghreb – Central West Europe
==> towards a European supergrid
Related
ideas in Asia
Mature technology:HVDC links
(high-voltage direct current)
Investment decisions through simulations
● Issues
– Demand varying in time, limited previsibility
– Transportation introduces constraints
– Renewable ==> variability ++
● Methods
– Markovian assumptions ==> wrong
– Simplified models ==> Model error >> optimization error
● Our approach
● Machine Learning on top of Mathematical Programming
Outline
Who we are
What we solve
Methodologies
A few milestones
● Linear programming is fast
● Bellman decomposition: we can split
short term reward + long term reward
● Folklore result: direct policy search
==> we use all of them
Hybridization reinforcement learning /
mathematical programming
● Math programming
– Nearly exact solutions for a simplified problem
– High-dimensional constrained action space
– But small state space & not anytime
● Reinforcement learning
– Unstable
– Small model bias
– Small / simple action space
– But high dimensional state space & anytime
Errors
● Statistical error: due to finite samples (e.g.
weather data = archive), possibly with bias (climate
change)
● Statistical model error: due to the error in the
model of random processes
● Model error: due to system modelling
● Anticipativity error: due to assuming perfect
forecasts
● Monoactor: due to neglecting interactions
between actor (social welfare)
● Optim. error: due to imperfect optimization
Plenty of tools
● Dynamic programming based ==> bad
modelization of long term dependencies
● Direct policy search: difficult to handle
constraints ==> bad modelization of systems
● Model predictive control: bad modelization of
randomness
==> we use combined tools
I love Direct Policy Search
● What is DPS ?
● Implement a simulator
● Implement a policy / controller
● Replace constants in the policy by free parameters
● Optimize these parameters on simulations
● Why I love it
● Pragmatic, benefits from human expertise
● The best in terms of model error
● But ok it is sometimes slow
● Not always that convenient for constraints
We propose specialized DPS
● A special structure for plenty of constraints
● After all, you can use DPS on top of everything,
just by defining a “good” controller
● DP-based tools have a great representation
● Let us use DP-representations in DPS
Dynamic programming tools
Decision at time T = argmax of
reward over the T next time steps
+ V'(state) x StateAt(t0+T)
with V computed backwards
Direct Value Search
Decision at time T = argmax of
reward over the T next time steps
+ f(, state) x StateAt(t0+T)
with  optimized through Direct Policy Search
and f a general function approximator (e.g.
neural)
Using
forecasts
as in MPC
As in DPstyle
Summary
● Model error: often more important than optim
error (whereas most works on optim error)
● We propose methodologies
● Compliant with constraints
● More expensive than MPC
● But not more expensive than DP-tools
● Smallest model error
● User-friendly (human expertise)
What we propose
● Is ok for correctly specified problems
● Uncertainties which can be modelized by
probabilities
● Less model error, more optim. error
● Optim. error reduced by big clusters
● Takes into account the challenges in new power
systems
● Stochastic effects (increased by renewables)
● High scale actions (demand-side management)
● High scale models (transcontinental grids)
What we propose
● Open source ?
● Algorithms are public
● Tools are not
● Data/models are not
● Want to join ?
● Room for mathematics
● Room for geeks
● Room for people who like applications
Our tools
● Tested on real problems
● Include investment levels
– There are operational decisions
– There are investment decisions
● Parallel
● Expensive
Further work
● Nothing on multiple actors (national
independence ? intern. risk ?)
● Non stochastic uncertainties: how do we
modelize non-probabilistic uncertainties on
scientific breakthroughs ? (Wald criterion,
Savage, Nash, Regret...)
Bibliography
● Dynamic Programming and Suboptimal Control: A Survey from
ADP to MPC. Bertsekas, 2005. (MPC = deterministic forecasts)
● “Newave vs Odin”: why MPC survives in spite of theoretical
shortcomings
● Dallagi et Simovic (EDF R&D) : "Optimisation des actifs
hydrauliques d'EDF : besoins métiers, méthodes actuelles et
perspectives", PGMO (importance of precise simulations)
● Ernst: The Global Grid, 2013
● Renewable energy forecasts ought to be probabilistic! Pinson,
2013 (wipfor talk)
● Training a neural network with a financial criterion rather than a
prediction criterion. Bengio, 1997
● Direct Model Predictive Control, Decock et al, 2014 (combining
DPS and MPC)

More Related Content

ODP
Ilab Metis: we optimize power systems and we are not afraid of direct policy ...
ODP
Planning for power systems
PDF
Automated Machine Learning
PPTX
What if computers invigilate examinations - Cypher 2018
PDF
A Kaggle Talk
PDF
Basic Problems and Solving Algorithms
PPTX
An Introduction to Reinforcement Learning - The Doors to AGI
PDF
Online Machine Learning: introduction and examples
Ilab Metis: we optimize power systems and we are not afraid of direct policy ...
Planning for power systems
Automated Machine Learning
What if computers invigilate examinations - Cypher 2018
A Kaggle Talk
Basic Problems and Solving Algorithms
An Introduction to Reinforcement Learning - The Doors to AGI
Online Machine Learning: introduction and examples

What's hot (20)

PDF
Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
PPTX
Day 2 (Lecture 5): A Practitioner's Perspective on Building Machine Product i...
PPT
Numerical Algorithms
PDF
Data science as career
PDF
Data Workflows for Machine Learning - Seattle DAML
PPTX
Kaggle Days Milan - March 2019
PDF
Europython - Machine Learning for dummies with Python
PDF
Parametric & Non-Parametric Machine Learning (Supervised ML)
PPTX
Asymptotic analysis of parallel programs
PDF
林守德/Practical Issues in Machine Learning
PPTX
Parametric and nonparametric
PDF
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
PDF
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
PPTX
Improving the accuracy and reliability of data analysis code
PDF
A General Overview of Machine Learning
PPTX
Problem Formulation in Artificial Inteligence Projects
PDF
Optimization Problems Solved by Different Platforms Say Optimum Tool Box (Mat...
PPTX
MATLAB Project Ideas Engineering Research Assistance
PDF
Pybcn machine learning for dummies with python
PPSX
Ds03 algorithms jyoti lakhani
Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
Day 2 (Lecture 5): A Practitioner's Perspective on Building Machine Product i...
Numerical Algorithms
Data science as career
Data Workflows for Machine Learning - Seattle DAML
Kaggle Days Milan - March 2019
Europython - Machine Learning for dummies with Python
Parametric & Non-Parametric Machine Learning (Supervised ML)
Asymptotic analysis of parallel programs
林守德/Practical Issues in Machine Learning
Parametric and nonparametric
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Improving the accuracy and reliability of data analysis code
A General Overview of Machine Learning
Problem Formulation in Artificial Inteligence Projects
Optimization Problems Solved by Different Platforms Say Optimum Tool Box (Mat...
MATLAB Project Ideas Engineering Research Assistance
Pybcn machine learning for dummies with python
Ds03 algorithms jyoti lakhani
Ad

Viewers also liked (16)

ODP
Uncertainties in large scale power systems
ODP
Meta Monte-Carlo Tree Search
ODP
ODP
Energy Management Forum, Tainan 2012
ODP
Machine learning 2016: deep networks and Monte Carlo Tree Search
ODP
Introduction to the TAO Uct Sig, a team working on computational intelligence...
ODP
Theories of continuous optimization
ODP
Noisy optimization --- (theory oriented) Survey
ODP
Tools for artificial intelligence
ODP
Statistics 101
ODP
Stochastic modelling and quasi-random numbers
ODP
Multimodal or Expensive Optimization
ODP
Combining UCT and Constraint Satisfaction Problems for Minesweeper
ODP
Inteligencia Artificial y Go
ODP
Complexity of planning and games with partial information
ODP
Computers and Killall-Go
Uncertainties in large scale power systems
Meta Monte-Carlo Tree Search
Energy Management Forum, Tainan 2012
Machine learning 2016: deep networks and Monte Carlo Tree Search
Introduction to the TAO Uct Sig, a team working on computational intelligence...
Theories of continuous optimization
Noisy optimization --- (theory oriented) Survey
Tools for artificial intelligence
Statistics 101
Stochastic modelling and quasi-random numbers
Multimodal or Expensive Optimization
Combining UCT and Constraint Satisfaction Problems for Minesweeper
Inteligencia Artificial y Go
Complexity of planning and games with partial information
Computers and Killall-Go
Ad

Similar to Dynamic Optimization without Markov Assumptions: application to power systems (20)

ODP
Tools for Discrete Time Control; Application to Power Systems
ODP
Optimization of power systems - old and new tools
ODP
Power systemsilablri
ODP
Artificial intelligence for power systems
PDF
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
PPT
Tetiana Bogodorova "Data Science for Electric Power Systems"
PDF
Smart optimization techniques for virtual power plants
PDF
Master's_Thesis_XuejiaoHAN
PDF
Overview of the FlexPlan project. Focus on EU regulatory analysis and TSO-DSO...
ODP
Bias correction, and other uncertainty management techniques
PDF
Wanted!: Open M&S Standards and Technologies for the Smart Grid - Introducing...
PDF
Meetup Luglio - Operations Research.pdf
PDF
Learning to Run a Power Network - a design challenge - TAILOR Conference - Pr...
PDF
Modelling & Control of Drinkable Water Networks
PPTX
KTH SmarTS Lab - An Introduction to our Research Group and Activities
PPT
Concepts of predictive control
PDF
Universal approximators for Direct Policy Search in multi-purpose water reser...
PDF
GREDOR Presentation
PDF
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
PDF
Using the black-box approach with machine learning methods in ...
Tools for Discrete Time Control; Application to Power Systems
Optimization of power systems - old and new tools
Power systemsilablri
Artificial intelligence for power systems
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Tetiana Bogodorova "Data Science for Electric Power Systems"
Smart optimization techniques for virtual power plants
Master's_Thesis_XuejiaoHAN
Overview of the FlexPlan project. Focus on EU regulatory analysis and TSO-DSO...
Bias correction, and other uncertainty management techniques
Wanted!: Open M&S Standards and Technologies for the Smart Grid - Introducing...
Meetup Luglio - Operations Research.pdf
Learning to Run a Power Network - a design challenge - TAILOR Conference - Pr...
Modelling & Control of Drinkable Water Networks
KTH SmarTS Lab - An Introduction to our Research Group and Activities
Concepts of predictive control
Universal approximators for Direct Policy Search in multi-purpose water reser...
GREDOR Presentation
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
Using the black-box approach with machine learning methods in ...

Recently uploaded (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
1. Introduction to Computer Programming.pptx
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Approach and Philosophy of On baking technology
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
A Presentation on Touch Screen Technology
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
Programs and apps: productivity, graphics, security and other tools
Getting Started with Data Integration: FME Form 101
Assigned Numbers - 2025 - Bluetooth® Document
Building Integrated photovoltaic BIPV_UPV.pdf
1. Introduction to Computer Programming.pptx
OMC Textile Division Presentation 2021.pptx
DP Operators-handbook-extract for the Mautical Institute
Approach and Philosophy of On baking technology
TLE Review Electricity (Electricity).pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Heart disease approach using modified random forest and particle swarm optimi...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A Presentation on Artificial Intelligence
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Enhancing emotion recognition model for a student engagement use case through...
A novel scalable deep ensemble learning framework for big data classification...
Digital-Transformation-Roadmap-for-Companies.pptx
A Presentation on Touch Screen Technology
Web App vs Mobile App What Should You Build First.pdf
Zenith AI: Advanced Artificial Intelligence
Programs and apps: productivity, graphics, security and other tools

Dynamic Optimization without Markov Assumptions: application to power systems

  • 1. Ilab METIS Optimization of Energy Policies Olivier Teytaud + Inria-Tao + Artelys TAO project-team INRIA Saclay Île-de-France O. Teytaud, Research Fellow, olivier.teytaud@inria.fr http://www.lri.fr/~teytaud/
  • 2. Outline Who we are What we solve Methodologies
  • 3. Ilab METIS www.lri.fr/~teytaud/metis.html Ilab METIS www.lri.fr/~teytaud/metis.html ● Metis = Tao + Artelys ● TAO tao.lri.fr, Machine Learning & Optimization ● Joint INRIA / CNRS / Univ. Paris-Sud team ● 12 researchers, 17 PhDs, 3 post-docs, 3 engineers ● Artelys www.artelys.com SME - France / US / Canada - 50 persons ==> collaboration through common platform ● Activities ● Optimization (uncertainties, sequential) ● Application to power systems
  • 4. Fundings ● Inria team Tao ● Lri (Univ. Paris-Sud, Umr Cnrs 8623) ● FP7 european project (city/factory scale) ● Ademe Bia(transcontinental stuff) ● Ilab (with Artelys) ● Indema (associate team with Taiwan) ● Maybe others, I get lost in fundings
  • 5. Outline Who we are What we solve Methodologies
  • 6. Industrial application ● Building power systems is expensive power plants, HVDC links, networks... ● Non trivial planning questions ● Compromise: should we move solar power to the south and build networks ? ● Is a HVDC connection “x ↔ y” a good idea ? ● What we do: ● Simulate the operational level of a given power system (this involves optimization of operational decisions) ● Optimize the investments
  • 7. ● Planning/control ● Pluriannual planning: evaluate marginal costs of hydroelectricity ● Taking into account stochasticity and uncertainties ==> IOMCA (ANR) ● High scale investment studies (e.g. Europe+North Africa) ● Long term (2030 - 2050) ● Huge (non-stochastic) uncertainties ● Investments: interconnections, storage, smart grids, power plants... ==> POST (ADEME) ● Moderate scale (Cities, Factories) ● Master plan optimization ● Stochastic uncertainties ==> Citines project (FP7) Specialization on Power Systems
  • 8. Example: interconnection studies (demand levelling, stabilized supply)
  • 9. The POST project – supergrids simulation and optimization European subregions: - Case 1 : electric corridor France / Spain / Marocco - Case 2 : south-west (France/Spain/Italiy/Tunisia/Marocco) - Case 3 : maghreb – Central West Europe ==> towards a European supergrid Related ideas in Asia Mature technology:HVDC links (high-voltage direct current)
  • 10. Investment decisions through simulations ● Issues – Demand varying in time, limited previsibility – Transportation introduces constraints – Renewable ==> variability ++ ● Methods – Markovian assumptions ==> wrong – Simplified models ==> Model error >> optimization error ● Our approach ● Machine Learning on top of Mathematical Programming
  • 11. Outline Who we are What we solve Methodologies
  • 12. A few milestones ● Linear programming is fast ● Bellman decomposition: we can split short term reward + long term reward ● Folklore result: direct policy search ==> we use all of them
  • 13. Hybridization reinforcement learning / mathematical programming ● Math programming – Nearly exact solutions for a simplified problem – High-dimensional constrained action space – But small state space & not anytime ● Reinforcement learning – Unstable – Small model bias – Small / simple action space – But high dimensional state space & anytime
  • 14. Errors ● Statistical error: due to finite samples (e.g. weather data = archive), possibly with bias (climate change) ● Statistical model error: due to the error in the model of random processes ● Model error: due to system modelling ● Anticipativity error: due to assuming perfect forecasts ● Monoactor: due to neglecting interactions between actor (social welfare) ● Optim. error: due to imperfect optimization
  • 15. Plenty of tools ● Dynamic programming based ==> bad modelization of long term dependencies ● Direct policy search: difficult to handle constraints ==> bad modelization of systems ● Model predictive control: bad modelization of randomness ==> we use combined tools
  • 16. I love Direct Policy Search ● What is DPS ? ● Implement a simulator ● Implement a policy / controller ● Replace constants in the policy by free parameters ● Optimize these parameters on simulations ● Why I love it ● Pragmatic, benefits from human expertise ● The best in terms of model error ● But ok it is sometimes slow ● Not always that convenient for constraints
  • 17. We propose specialized DPS ● A special structure for plenty of constraints ● After all, you can use DPS on top of everything, just by defining a “good” controller ● DP-based tools have a great representation ● Let us use DP-representations in DPS
  • 18. Dynamic programming tools Decision at time T = argmax of reward over the T next time steps + V'(state) x StateAt(t0+T) with V computed backwards
  • 19. Direct Value Search Decision at time T = argmax of reward over the T next time steps + f(, state) x StateAt(t0+T) with  optimized through Direct Policy Search and f a general function approximator (e.g. neural) Using forecasts as in MPC As in DPstyle
  • 20. Summary ● Model error: often more important than optim error (whereas most works on optim error) ● We propose methodologies ● Compliant with constraints ● More expensive than MPC ● But not more expensive than DP-tools ● Smallest model error ● User-friendly (human expertise)
  • 21. What we propose ● Is ok for correctly specified problems ● Uncertainties which can be modelized by probabilities ● Less model error, more optim. error ● Optim. error reduced by big clusters ● Takes into account the challenges in new power systems ● Stochastic effects (increased by renewables) ● High scale actions (demand-side management) ● High scale models (transcontinental grids)
  • 22. What we propose ● Open source ? ● Algorithms are public ● Tools are not ● Data/models are not ● Want to join ? ● Room for mathematics ● Room for geeks ● Room for people who like applications
  • 23. Our tools ● Tested on real problems ● Include investment levels – There are operational decisions – There are investment decisions ● Parallel ● Expensive
  • 24. Further work ● Nothing on multiple actors (national independence ? intern. risk ?) ● Non stochastic uncertainties: how do we modelize non-probabilistic uncertainties on scientific breakthroughs ? (Wald criterion, Savage, Nash, Regret...)
  • 25. Bibliography ● Dynamic Programming and Suboptimal Control: A Survey from ADP to MPC. Bertsekas, 2005. (MPC = deterministic forecasts) ● “Newave vs Odin”: why MPC survives in spite of theoretical shortcomings ● Dallagi et Simovic (EDF R&D) : "Optimisation des actifs hydrauliques d'EDF : besoins métiers, méthodes actuelles et perspectives", PGMO (importance of precise simulations) ● Ernst: The Global Grid, 2013 ● Renewable energy forecasts ought to be probabilistic! Pinson, 2013 (wipfor talk) ● Training a neural network with a financial criterion rather than a prediction criterion. Bengio, 1997 ● Direct Model Predictive Control, Decock et al, 2014 (combining DPS and MPC)