SlideShare a Scribd company logo
Reinforcement Learning
ujava.org Workshop
2015-06-27
www.idosi.com
CEO 강신동
Shindong KANG
(주)지능도시
www.idosi.comujava.org
www.idosi.comspaceapi.org
www.idosi.comReinforcement Learning for Brick Game
www.idosi.comReinforcement Learning for Brick Game
www.idosi.comTo Flip Pancake
www.idosi.comCrawling Robot on Carpet
www.idosi.comPavlov's Dog
www.idosi.comPavlov
www.idosi.comReinforcement (강화)
www.idosi.comMarkov Chain
www.idosi.comMarkov Process
www.idosi.comMarkov Decision Process (MDP))
www.idosi.comNon-Deterministic Search
www.idosi.comGrid World
www.idosi.comGoal
www.idosi.comAction
www.idosi.comMDP
www.idosi.comMarkov Property
www.idosi.comPolicy
www.idosi.comOptimal Policy
www.idosi.comRacing's Probability
www.idosi.comRacing's Reward
www.idosi.comSearch Tree
www.idosi.comQ-state
www.idosi.comDiscounting
www.idosi.comDiscounting
www.idosi.comPolicy with Discouting
www.idosi.comDiscouting Factor
www.idosi.comDiscouting Factor
www.idosi.comDiscouting Factor
www.idosi.comReinforcement
www.idosi.comSum of Rewards
www.idosi.comOptimal Quantities
www.idosi.comValues of States
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comMDP
www.idosi.comReinforcement Learning
www.idosi.comMDP of all infos
www.idosi.comRL of no infos
www.idosi.comMDP vs. RL
www.idosi.comModel-Based Learning (RL)
www.idosi.comObserved Episodes
www.idosi.comLearned Model
www.idosi.comDirect Evaluation
www.idosi.comProblems with Direct Evaluation
www.idosi.comTemporal Difference Learning
www.idosi.comTemporal Difference Learning
www.idosi.comTemporal Difference Learning
www.idosi.comExpoential Moving Average
www.idosi.comQ-Value Iteration
www.idosi.comQ-Learning
www.idosi.comQ-Learning Demo
Thank you !
(주)지능도시
Intelligent City Ltd.
강신동
Shindong KANG
www.idosi.com
ceo@idosi.com

More Related Content

PDF
PDF
PDF
aug11
PDF
aug13
PDF
PDF
PDF
aug12
PDF
July02

Viewers also liked (19)

PDF
PDF
July17
PPTX
Learning coordination strategies using reinforcement learning myriam z abrams...
PPTX
collective bargaining
PPTX
Reinforcement Learning
PPTX
General Equilibrium
PDF
Advanced Microeconomics - Lecture Slides
PDF
Lect 8-auctions
PDF
Bargaining and Ethnicity
PDF
Equilibrium in Nash’s mind (with references)
PPTX
02 significance of rational decision making
PDF
Introduction to Reinforcement Learning
PPT
Reinforcement learning 7313
PPTX
5. decision making
PDF
John Nash Ppt
PDF
Introduction to Auction Theory
PPT
Chapter 1: Social Welfare, Past and Present
PDF
MAS Course - Lect10 - coordination
July17
Learning coordination strategies using reinforcement learning myriam z abrams...
collective bargaining
Reinforcement Learning
General Equilibrium
Advanced Microeconomics - Lecture Slides
Lect 8-auctions
Bargaining and Ethnicity
Equilibrium in Nash’s mind (with references)
02 significance of rational decision making
Introduction to Reinforcement Learning
Reinforcement learning 7313
5. decision making
John Nash Ppt
Introduction to Auction Theory
Chapter 1: Social Welfare, Past and Present
MAS Course - Lect10 - coordination
Ad

More from 신동 강 (18)

PDF
Graph Convolutional Neural Networks
ODP
Recurrent Neural Network tutorial (2nd)
ODP
ujava.org workshop : Reinforcement Learning with Thompson Sampling
ODP
ujava.org Reinforcement Learning (2nd)
ODP
Quantum Computer for Deep Learning
ODP
ujava.org Drone Scenario & Drone Airport Systems
ODP
ujava.org Drone Physics
ODP
Recursive Neural Network : ujava.org 12th deep learning workshop
ODP
NN Models with DL4J for Deep Learning
PDF
RBM with DL4J for Deep Learning
ODP
Deep Learning for Java (DL4J)
PDF
Ujava.org tensor-analysis
PDF
Tensor Physics for Deep Learning
PDF
ujava.org Deep Learning with Convolutional Neural Network
PDF
Recurrent Neural Network, Fractal for Deep Learning
ODP
ujava.org workshop : Deep Learning [2015-03-08]
PPT
IoT & Machine Learning
PPT
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
Graph Convolutional Neural Networks
Recurrent Neural Network tutorial (2nd)
ujava.org workshop : Reinforcement Learning with Thompson Sampling
ujava.org Reinforcement Learning (2nd)
Quantum Computer for Deep Learning
ujava.org Drone Scenario & Drone Airport Systems
ujava.org Drone Physics
Recursive Neural Network : ujava.org 12th deep learning workshop
NN Models with DL4J for Deep Learning
RBM with DL4J for Deep Learning
Deep Learning for Java (DL4J)
Ujava.org tensor-analysis
Tensor Physics for Deep Learning
ujava.org Deep Learning with Convolutional Neural Network
Recurrent Neural Network, Fractal for Deep Learning
ujava.org workshop : Deep Learning [2015-03-08]
IoT & Machine Learning
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
Ad

Recently uploaded (20)

PPT
Reliability_Chapter_ presentation 1221.5784
PDF
annual-report-2024-2025 original latest.
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Computer network topology notes for revision
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
Lecture1 pattern recognition............
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
Reliability_Chapter_ presentation 1221.5784
annual-report-2024-2025 original latest.
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
[EN] Industrial Machine Downtime Prediction
SAP 2 completion done . PRESENTATION.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
climate analysis of Dhaka ,Banglades.pptx
Computer network topology notes for revision
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
IB Computer Science - Internal Assessment.pptx
Lecture1 pattern recognition............
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Introduction to Knowledge Engineering Part 1
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Miokarditis (Inflamasi pada Otot Jantung)
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction-to-Cloud-ComputingFinal.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu

Ujava.org reinforcement-learning