Deep Deterministic Policy Gradient
DDPG
History
ML methods
Supervised vs Unsupervised
Supervised process
Supervised uses
Unsupervised
Neural network types
Gradient Descent
Reinforcement learning
Grid worlds
Value function vs Policy
Actor critic
Actor critic method
DDPG
- Continuous state and action space
- Replay buffer
- Soft updates
- Exploration noise
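
The three mechanisms listed above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the presenter's implementation: the names `ReplayBuffer`, `soft_update`, `OUNoise`, and `noisy_action` are hypothetical, weights are modelled as flat lists of floats instead of network parameters, and the slides do not say which noise process was used. Ornstein-Uhlenbeck noise with theta=0.15 and sigma=0.2 is assumed here because it is the choice made in the original DDPG paper.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # A bounded deque drops the oldest transition once capacity is reached.
        self.storage = deque(maxlen=capacity)

    def add(self, transition):
        self.storage.append(transition)

    def sample(self, batch_size, rng=random):
        # Uniform sampling breaks the correlation between consecutive steps.
        return rng.sample(list(self.storage), batch_size)

    def __len__(self):
        return len(self.storage)


def soft_update(target, online, tau):
    """Move each target weight a fraction tau toward the online weight."""
    return [(1.0 - tau) * t + tau * w for t, w in zip(target, online)]


class OUNoise:
    """Ornstein-Uhlenbeck process: temporally correlated exploration noise."""

    def __init__(self, size, theta=0.15, sigma=0.2, mu=0.0, rng=None):
        self.theta, self.sigma, self.mu = theta, sigma, mu
        self.rng = rng or random.Random()
        self.state = [mu] * size

    def sample(self):
        # Each component drifts back toward mu and receives a Gaussian kick.
        self.state = [
            x + self.theta * (self.mu - x) + self.sigma * self.rng.gauss(0.0, 1.0)
            for x in self.state
        ]
        return list(self.state)


def noisy_action(action, noise, low=-1.0, high=1.0):
    """Add noise to a deterministic action and clip it to the valid range."""
    return [min(high, max(low, a + n)) for a, n in zip(action, noise)]
```

In a training loop, the agent would call `noisy_action` on the actor's output at every step, store the resulting transition with `add`, draw minibatches with `sample` once the buffer holds enough data, and apply `soft_update` to the target actor and target critic after each gradient step. A small `tau` (the original paper uses 0.001) keeps the targets changing slowly.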
Pitfalls
- Designing a reward function is very hard
- Tends to get stuck in local optima
- Training is unstable
- Needs lots of training samples
Driving in simulator
