Reinfocement learning

© 2016. SNU CSE Biointelligence Lab., http://bi.snu.ac.kr
Introduction of Reinforcement Learning
1
곽동현
서울대학교 바이오지능 연구실

Background
• 기존의 강화학습(Reinforcement Learning)에서 Q function을
DNN 혹은 CNN으로 근사하여 문제를 해결하는 시도가 최근
Google DeepMind를 필두로 활발히 연구가 되고 있다.
• 최근 연구에서는 Atari 2600, 바둑을 인간보다 더 잘 플레이하
는 수준의 경이적인 성과를 보이고 있으며, 나아가 3D 게임이
나 로봇 컨트롤 문제에도 적용되고 있다.
2

What is AI? ML?
3https://www.linkedin.com/pulse/deep-dive-venture-landscape-ai-ajit-nazre-rahul-garg-nazre

Various Field with ML
4https://www.linkedin.com/pulse/how-exceed-your-goals-2016-dr-travis-bradberry-1

Various Algorithm in ML
5

Function Approximation
6http://arxiv.org/pdf/1411.4555.pdf https://people.mpi-inf.mpg.de/~kkim/supres/supres.htm

What is Deep Learning?
7

Machine Learning
• Supervised Learning :
y = f(x)
• Unsupervised Learning :
x ~ p(x) , x = f(x)
• Reinforcement Learning :
??
8

Agent-Environment Interaction
• Objective : Maximize the expected sum of future rewards
• Algorithms
1) Planning : Dynamic Programming Based
2) Reinforcement Learning : Machine Learning Based
9

Example of Supervised
Learning
10

Polynomial Curve Fitting
11
Microsoft Excel 2007의 추세선

Example of
Unupervised Learning
12

Clustering
13
http://www.frankichamaki.com/data-driven-market-segmentation-more-effective-marketing-to-
segments-using-ai/

Example of
Reinforcement Learning
14

Videos
• A crawling robot: a Q-learning example
https://www.youtube.com/watch?v=2iNrJx6IDEo
• Deep Reinforcement Learning for Robotic
Manipulation
https://youtu.be/ZhsEKTo7V04?t=1m27s
15

THANK YOU
16

Reinfocement learning

More Related Content

Viewers also liked (16)

Similar to Reinfocement learning (7)

Recently uploaded (20)

Reinfocement learning

Editor's Notes