Hi
Deep Reinforcement Learning
Azzeddine CHENINE
AI Research Engineer @instadeepai
An introduction workshop
What? How? What’s hot about it 🔥?
But what if…
…your task consists of a series of decisions made to reach or maintain optimal performance?
Reinforcement Learning
…What?
• Building agents that are able to learn an optimal policy to perform a task within a Markovian environment
• In a Markovian environment, the next state depends only on the current state and the action performed by the agent
• The task can be episodic or continuing
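
A minimal sketch of the two ideas above, assuming a made-up three-state environment. The state names, the actions and the `TRANSITIONS` table are illustrative and do not come from the workshop. The next state is looked up from the current state and the action only, and the task is episodic because one state is terminal.

```python
# Hypothetical toy environment: states and actions are invented for illustration.
TRANSITIONS = {
    ("start", "right"): "middle",
    ("start", "left"): "start",
    ("middle", "right"): "goal",
    ("middle", "left"): "start",
}
TERMINAL_STATES = {"goal"}  # an episodic task ends when a terminal state is reached


def next_state(state, action):
    """Markov property: the result depends only on (state, action), not on history."""
    return TRANSITIONS[(state, action)]


def run_episode(actions, state="start"):
    """Apply actions in order and stop early if a terminal state is reached."""
    visited = [state]
    for action in actions:
        if state in TERMINAL_STATES:
            break
        state = next_state(state, action)
        visited.append(state)
    return visited
```

A continuing task would simply have no terminal states, so the loop would only end when no more actions are supplied.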
Reinforcement Learning
…How?
Diagram: the agent-environment loop. The agent sends an action to the environment, and the environment returns a reward and a new state.
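
The interaction loop can be sketched as follows. `CountingEnv` and `random_agent` are hypothetical stand-ins written for this example. The `reset`/`step` method names imitate the style of OpenAI Gym, but real Gym environments return extra values from `step`, so treat this as an illustration of the loop rather than of the library.

```python
import random


class CountingEnv:
    """Hypothetical stand-in environment; not part of any library.

    The state is the number of steps taken so far. Action 1 earns a reward of 1,
    action 0 earns nothing. The episode ends after `max_steps` steps.
    """

    def __init__(self, max_steps=5):
        self.max_steps = max_steps
        self.steps = 0

    def reset(self):
        self.steps = 0
        return self.steps  # initial state

    def step(self, action):
        self.steps += 1
        reward = 1.0 if action == 1 else 0.0
        done = self.steps >= self.max_steps
        return self.steps, reward, done  # new state, reward, end-of-episode flag


def random_agent(state):
    """Placeholder agent: ignores the state and picks an action at random."""
    return random.choice([0, 1])


def run_episode(env, agent):
    """One pass through the loop: state -> action -> reward and new state."""
    state = env.reset()
    total_reward, done = 0.0, False
    while not done:
        action = agent(state)
        state, reward, done = env.step(action)
        total_reward += reward
    return total_reward
```

Calling `run_episode(CountingEnv(), random_agent)` plays one episode and returns the summed reward. A learning agent would replace `random_agent` and use the rewards to improve its choices.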
• Reach an optimal policy π
• π can be deterministic or stochastic
• A deterministic version of π can be derived from the action value function Q(s, a)
• You are free to choose your policy type
What’s hot 🔥 about DeepRL
• Reinforcement Learning has existed since the early 1980s
• Before the Deep Learning hype, Reinforcement Learning relied on dynamic programming and tabular algorithms
• Monte Carlo, Sarsa (not salsa 💃), Q-learning, Expected Sarsa, etc.
• Data structures (lookup tables) that hold the action values for each state
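
To make the table idea concrete, here is a minimal sketch of tabular Q-learning. The action set, the state names and the values of `alpha` (learning rate) and `gamma` (discount factor) are assumptions chosen for illustration.

```python
from collections import defaultdict

ACTIONS = ["left", "right"]  # hypothetical action set


def make_q_table():
    """Table mapping state -> {action: value}, with every value starting at 0."""
    return defaultdict(lambda: {a: 0.0 for a in ACTIONS})


def q_learning_update(q_table, state, action, reward, next_state, done,
                      alpha=0.5, gamma=0.9):
    """One Q-learning step: move Q(s, a) toward r + gamma * max_a' Q(s', a')."""
    best_next = 0.0 if done else max(q_table[next_state].values())
    td_target = reward + gamma * best_next
    q_table[state][action] += alpha * (td_target - q_table[state][action])
    return q_table[state][action]
```

Sarsa uses the same table but replaces the `max` with the value of the action actually taken in the next state. The table needs one row per state, which is what becomes impractical once the state space is large.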
Example domains: Bio, Stocks, Games, Robots
• Modern environments present complex action and state spaces
• Deep Neural Networks are able to extract features from different state types
• Deep Neural Networks are able to approximate functions that map an observation to
a desired output space
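
A rough sketch of what "mapping an observation to an output space" means for Q-learning: a small, untrained two-layer network that takes an observation vector and returns one value per action. It is written in plain Python so it stays self-contained. The layer sizes (4 observation features, 8 hidden units, 2 actions) are assumptions, and in practice a framework such as Keras or PyTorch would build and train the network.

```python
import math
import random


def make_layer(n_inputs, n_outputs, rng):
    """Random weights and zero biases for one fully connected layer."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
               for _ in range(n_outputs)]
    biases = [0.0] * n_outputs
    return weights, biases


def dense(inputs, layer, activation=None):
    """Compute activation(W @ inputs + b) with plain Python lists."""
    weights, biases = layer
    outputs = [sum(w * x for w, x in zip(row, inputs)) + b
               for row, b in zip(weights, biases)]
    return [activation(o) for o in outputs] if activation else outputs


def q_network(observation, hidden_layer, output_layer):
    """Map an observation vector to one estimated Q-value per action."""
    hidden = dense(observation, hidden_layer, activation=math.tanh)
    return dense(hidden, output_layer)


rng = random.Random(0)
hidden_layer = make_layer(n_inputs=4, n_outputs=8, rng=rng)  # 4 observation features
output_layer = make_layer(n_inputs=8, n_outputs=2, rng=rng)  # 2 actions
q_values = q_network([0.1, -0.2, 0.05, 0.3], hidden_layer, output_layer)
```

Unlike a lookup table, the same weights are shared across all observations, so the network can produce values for states it has never seen.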
DeepRL workshop
• Inspecting a tabular, dynamic-programming-style version of Q-learning
• Inspecting its limitations and the use case for deep neural networks
• Implementing Deep Q-learning with the TensorFlow Keras API and PyTorch
• Getting introduced to OpenAI Gym for reinforcement learning environments
• Visualizing the training and inference of a DQN agent
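
The workshop code itself is not included in these slides. As a hedged sketch, two ingredients commonly found in Deep Q-learning implementations are a replay buffer and the computation of training targets. `ReplayBuffer`, `td_targets` and the `q_function` callable are hypothetical names for this example. In a real implementation, `q_function` would be a Keras or PyTorch model.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) transitions."""

    def __init__(self, capacity):
        self.memory = deque(maxlen=capacity)  # oldest transitions are dropped first

    def add(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        return rng.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


def td_targets(batch, q_function, gamma=0.99):
    """Regression targets for the network: r + gamma * max_a' Q(s', a')."""
    targets = []
    for _state, _action, reward, next_state, done in batch:
        bootstrap = 0.0 if done else max(q_function(next_state))
        targets.append(reward + gamma * bootstrap)
    return targets
```

The network is then trained so that its output for the taken action moves toward these targets. Many DQN implementations compute the targets with a separate, periodically updated copy of the network, which this sketch leaves out.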
Other hot topics
• Multi-agent reinforcement learning
• Imitation learning and behavioural cloning
• The problem of generalization in Deep RL
• Policy-based methods: PPO, A2C, A3C…
• DeepRL frameworks: RLlib, TF-Agents…
Resources
• Berkeley DeepRL Bootcamp on YouTube
• Reinforcement Learning: An Introduction (Sutton and Barto)
• Udacity DeepRL Nanodegree, if possible
• RL course by David Silver on YouTube
• OpenAI Gym documentation
Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day
