DeepMind Technologies
Playing Atari with Deep Reinforcement Learning
Contents
 Abstract
 Introduction
 Background
 Deep Reinforcement Learning
 Experiments
 Conclusion
Abstract
 First deep learning model to successfully learn control policies directly from high-dimensional sensory input (pixels) using reinforcement learning
 CNN trained with a variant of Q-learning
• Input: raw pixels
• Output: a value function estimating future rewards
 Applied to seven Atari 2600 games with no adjustment of the architecture or learning algorithm
• Outperforms all previous approaches on six of the games
• Surpasses a human expert on three of them
Introduction
 Learning to control agents directly from high-dimensional sensory input is a long-standing challenge of RL
 Most successful RL applications rely on hand-crafted features
 Recent advances in deep learning make it possible to extract high-level features from raw sensory data
• Breakthroughs in computer vision and speech recognition
 Neural architectures used with supervised and unsupervised learning
• Convolutional networks
• Multilayer perceptrons
• Restricted Boltzmann machines
• Recurrent neural networks
Problems to solve : Motivation
 Most DL requires large amounts of hand-labeled training data
• RL must learn from a scalar reward signal
• The reward signal is often sparse, noisy, and delayed
• The delay between actions and resulting rewards can be thousands of time steps
• Addressed with a CNN trained by a variant of Q-learning
 Most DL assumes data samples are independent
• RL encounters sequences of highly correlated states
• Addressed with experience replay
Remaining challenges
Background
Agent and Environment
State
Major Components of an RL Agent
Policy
On-policy learning vs Off-policy learning
http://www.modulabs.co.kr/RL_library/2621
Value Function
Optimal Value Functions
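The slide body did not survive extraction, so as a reference point here is the optimal action-value function and its Bellman equation as usually written for this setting. The symbols ($R_t$ for the discounted return, $\gamma$ for the discount factor, $\mathcal{E}$ for the environment, $\pi$ for a policy) are assumptions taken from the paper's notation, not from the slide text.

```latex
Q^*(s, a) = \max_{\pi} \, \mathbb{E}\left[ R_t \mid s_t = s,\ a_t = a,\ \pi \right]

Q^*(s, a) = \mathbb{E}_{s' \sim \mathcal{E}}\left[ r + \gamma \max_{a'} Q^*(s', a') \,\middle|\, s, a \right]
```

The second line says that the best achievable value of taking action $a$ in state $s$ equals the immediate reward plus the discounted value of acting optimally from the next state $s'$.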
Q-Networks
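A Q-network approximates the action-value function with a neural network with weights $\theta$, so that $Q(s, a; \theta) \approx Q^*(s, a)$. The loss below is the form given in the paper and is included as a sketch of what this slide most likely showed; $\rho$ (the behaviour distribution over states and actions) and the iteration index $i$ are notation assumed from the paper.

```latex
L_i(\theta_i) = \mathbb{E}_{s, a \sim \rho(\cdot)}\left[ \left( y_i - Q(s, a; \theta_i) \right)^2 \right]

y_i = \mathbb{E}_{s' \sim \mathcal{E}}\left[ r + \gamma \max_{a'} Q(s', a'; \theta_{i-1}) \,\middle|\, s, a \right]
```

The target $y_i$ is computed with the weights from the previous iteration, $\theta_{i-1}$, which are held fixed while $\theta_i$ is optimized.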
Q-Learning
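To make the update rule concrete, here is a minimal tabular sketch of one Q-learning step. It is an illustration only: the function name `q_learning_update`, the toy states and actions, and the values of `alpha` and `gamma` are hypothetical, and the paper itself replaces the table with a neural network.

```python
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, actions, done,
                      alpha=0.1, gamma=0.99):
    """Apply one tabular Q-learning step in place and return the TD error."""
    # Bootstrap from the best next action; a terminal transition has no future value.
    best_next = 0.0 if done else max(q[(next_state, a)] for a in actions)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


# Toy usage: the table starts at zero for every (state, action) pair.
q = defaultdict(float)
actions = ["left", "right"]
td_first = q_learning_update(q, "s0", "right", 1.0, "s1", actions, done=False)
td_second = q_learning_update(q, "s1", "left", 0.0, "s0", actions, done=False,
                              alpha=0.5, gamma=0.9)
```

Because the target uses the maximum over next actions rather than the action actually taken, the update is off-policy: the agent can explore while still learning about the greedy policy.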
Model
Deep Reinforcement Learning
Deep Q-Networks : Experience Replay
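Experience replay stores the agent's transitions and trains on random minibatches drawn from that store, which breaks up the correlation between consecutive states. The buffer below is a minimal sketch under that description; the class name `ReplayBuffer`, its method names, and the tiny capacity are hypothetical choices for illustration, not the authors' implementation.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-capacity store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity, seed=None):
        # A bounded deque evicts the oldest transition once capacity is reached.
        self.buffer = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement from the stored transitions.
        return self.rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


# Toy usage: push five transitions into a buffer that holds only three.
buffer = ReplayBuffer(capacity=3, seed=0)
for t in range(5):
    buffer.push(t, 0, 0.0, t + 1, False)
batch = buffer.sample(2)
```

Each stored transition can be reused in many updates, which also makes learning more data-efficient than training on every sample once in the order it arrived.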
Model Architecture
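The architecture figure is missing from this transcript. As a rough guide, the paper reports an 84x84x4 input (four stacked preprocessed frames), a convolution with 16 filters of size 8x8 and stride 4, a convolution with 32 filters of size 4x4 and stride 2, a fully connected layer of 256 units, and one output per valid action. Those numbers are taken from the paper rather than from the slide text, and the helper `conv_output_size` is a hypothetical name used only to check the shape arithmetic.

```python
def conv_output_size(size, kernel, stride):
    """Spatial size after a convolution with no padding."""
    return (size - kernel) // stride + 1


# Layer sizes as reported in the paper (an assumption here, not in the slides).
side = 84                                          # input is 84 x 84 x 4
side = conv_output_size(side, kernel=8, stride=4)  # first conv layer, 16 filters
conv1_shape = (side, side, 16)
side = conv_output_size(side, kernel=4, stride=2)  # second conv layer, 32 filters
conv2_shape = (side, side, 32)

# Number of features fed into the 256-unit fully connected layer.
flattened = conv2_shape[0] * conv2_shape[1] * conv2_shape[2]
```

The network outputs a Q-value for every action in a single forward pass, so choosing the greedy action does not require one pass per action.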
Experiments
Conclusion
http://leehyekang.com
THANK YOU
