Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

Deep Reinforcement Learning
Azzeddine CHENINE
AI Research Engineer @instadeepai
An introduction workshop

Deep Reinforcement Learning
Azzeddine CHENINE
AI Research Engineer @instadeepai
What? How? What’s hot about it 🔥?

But What if…..
…Your task is manifested by a series of decisions to
reach or keep an optimal performance
10

Reinforcement Learning
• Building agents that are able to learn an optimal policy to preform a task within a
Markovian environment
…What ?
11

• In a Markovian environment the next state depends only on the current state and the
agent that will be preformed by the agent
…What ?
12

• In a Markovian environment the next state depends only on the current state and the
agent that will be preformed by the agent
…What ?
• This task can be episodic or continues
13

…How ?
Environment
Agent
14

…How ?
Environment
Agent
State
15

…How ?
Environment
Agent
Action
State
16

…How ?
Environment
Agent
Reward New State Action
17

…How ?
Environment
Agent
Reward
New State
Action
• Reach an optimal policy
𝝿
•
𝝿
can be deterministic or stochastic
• A deterministic version of
𝝿
can be derived from the
action value function Q(S,a)
• You are free to choose your policy type
18

What’s hot 🔥about DeepRL
• Reinforcement Learning existed since the early 80s
19

• Reinforcement Learning before the hype of Deep Learning used to rely on Dynamic
programing Algorithms
20

• Monte-carlo, Sarsa (not salsa 💃), Q-learning, expected Sarsa…etc
21

• Monte-carlo, Sarsa (not salsa 💃), Q-learning, expected Sarsa…etc
• Data structures to hold reference for the actions values of each state
22

Bio
Stocks Games
Robots
• Modern environments present complex action and state spaces
23

Bio
Stocks Games
Robots
• Deep Neural Networks are able to extract features from different state types
24

Bio
Stocks Games
Robots
• Deep Neural Networks are able to approximate functions that map an observation to
a desired output space
25
• Deep Neural Networks are able to extract features from different state types

DeepRL workshop
• Inspecting a dynamic programing version of Q-learning
• Inspecting limitation and Deep Neural network use case
• Implementing Deep Q-learning with Tensor
fl
ow Keras API and Pytorch
• Getting introduced to OpenAI GYM for reinforcement learning environments
• Visualizing the training and inference of a DQN agents
26

Other hot topics
• Multi-agent reinforcement learning
• Imitation learning and behaviour cloning
• The problem of generation in Deep RL
• Policy based methods: PPO, A2C, A3C…
• DeepRL frameworks: RLLib, TF Agents…
27

Resources
• Berkeley DeepRL Bootcamp on Youtube
• Reinforcement Learning, an introduction
• Udacity DeepRL Nanodegree if possible
• RL course by David silver on Youtube
• Open AI gym documentation
28

Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day

More Related Content

Similar to Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day (20)

Recently uploaded (20)

Introduction to Deep Reinforcement Learning workshop at School of Ai: AI Day