SlideShare a Scribd company logo
3
Most read
4
Most read
5
Most read
Basics of
Reinforcement Learning
Spotle.ai Study Material
Spotle.ai/Learn
Spotle.ai Study Material
Spotle.ai/Learn
Let’s play chess!
I just don’t make any possible move
without thinking what my opponent’s
move can be to counter my move.
I try to consider all possible moves that
are safe. And then choose the one that I
feel is the best move among all.
Machines can learn this way. And this
learning is called reinforcement machine
learning.
Spotle.ai Study Material
Spotle.ai/Learn
What is reinforcement learning?
First, a particular situation in which the learning will be applicable.
You start at a point, you go through several steps to reach a level.
In the process you earn a reward point for every correct step and you lose a reward point
for every wrong step.
Finally, you choose the path with the highest reward point in that particular situation.
Agent Environment
State
Reward
Action
Spotle.ai Study Material
Spotle.ai/Learn
Terminologies
Agent: The learner and the decision maker.
Environment: Where the agent learns and decides what actions to perform.
Action: A set of actions which the agent can perform.
State: The state of the agent in the environment.
Reward: For each action selected by the agent the environment provides a reward.
Usually a scalar value.
Agent Environment
State
Reward
Action
In supervised learning the training data has the output, that is, the answer in it. Here
the model is trained with the correct answer. But in case of reinforcement learning,
there is no answer given. The reinforcement agent decides the action to perform based
on the maximum reward it receives. There is no training data in reinforcement
learning. The machine learns from its experience.
Supervised learning? No
Spotle.ai Study Material
Spotle.ai/Learn
Training
data
Not available
Spotle.ai Study Material
Spotle.ai/Learn
Reinforcing your learning
Which one to choose?
Give reward to all
possible ones step by step
Choose the one with the
maximum reward.Topic A Topic B Topic C
Spotle.ai Study Material
Spotle.ai/Learn
Pavlov Experiment
TRIAL 1
In the first trial Pavlov
gives meat to his dog and
the dog starts salivating.
Spotle.ai Study Material
Spotle.ai/Learn
Pavlov Experiment
TRIAL 2
In the second trial Pavlov
does not give meat to his
dog but rings a bell.
Without seeing the meat
the dog does not start
salivating.
Spotle.ai Study Material
Spotle.ai/Learn
Pavlov Experiment
TRIAL 3
In trial 3 Pavlov rings the
bell and gives meat to his
dog and seeing meat the
dog starts salivating.
Spotle.ai Study Material
Spotle.ai/Learn
Pavlov Experiment
TRIAL 4
In trial 4 Pavlov rings the
bell and at this his dog
starts salivating, hoping
that meat will follow the
ringing of the bell. This is
learning by reinforcement.
The dog was rewarded
with meat after the
ringing of the bell.
Summarizing
❖ The input is an initial stage from which the machine starts learning.
❖ There are more than one possible output in a particular problem.
❖ Each output state is given a reward or punishment.
❖ The output with maximum reward is selected to be performed.
❖ The reinforcement learning process is continuous.
Spotle.ai Study Material
Spotle.ai/Learn
#HappyLearning
#BeCareerReady
That’s all for today.

More Related Content

PDF
Bjj roy harris - escape from the si
PPTX
Reinforcement learning
PPTX
Reinforcement learning slides
PPTX
Introduction to reinforcement learning
PDF
Real-world Reinforcement Learning
PPTX
CS3013 -MACHINE LEARNING.pptx
PDF
Real-world Reinforcement Learning
PDF
What is Reinforcement Learning.pdf
Bjj roy harris - escape from the si
Reinforcement learning
Reinforcement learning slides
Introduction to reinforcement learning
Real-world Reinforcement Learning
CS3013 -MACHINE LEARNING.pptx
Real-world Reinforcement Learning
What is Reinforcement Learning.pdf

Similar to Basics of Reinforcement Learning (20)

PPTX
Reinforcement learning.pptx
PDF
Reinforcement learning
PPTX
What is Reinforcement Learning in Machine Learning
PPTX
Reinforcement learning
PDF
reinforcement-learning-141009013546-conversion-gate02.pdf
PPTX
reinforcement-learning-141009013546-conversion-gate02.pptx
PPTX
semi supervised Learning and Reinforcement learning (1).pptx
PDF
Reinforcement Learning for Financial Markets
PDF
A Review on Introduction to Reinforcement Learning
PDF
An introduction to reinforcement learning
PDF
Intro rl
PPTX
applications of reinforcement learning 1
PDF
Machine Learning , deep learning module imp
PDF
"Reinforcement Learning: Pioneering the Next Evolution in Artificial Intellig...
PPTX
Survey of Modern Reinforcement Learning
PDF
DRL 1 Course Introduction Reinforcement.ppt
PDF
Lecture 1 - introduction.pdf
PDF
Reinforcement Learning 1. Introduction
PDF
20180520 MLPHS
PDF
Reinforcement Learning
Reinforcement learning.pptx
Reinforcement learning
What is Reinforcement Learning in Machine Learning
Reinforcement learning
reinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pptx
semi supervised Learning and Reinforcement learning (1).pptx
Reinforcement Learning for Financial Markets
A Review on Introduction to Reinforcement Learning
An introduction to reinforcement learning
Intro rl
applications of reinforcement learning 1
Machine Learning , deep learning module imp
"Reinforcement Learning: Pioneering the Next Evolution in Artificial Intellig...
Survey of Modern Reinforcement Learning
DRL 1 Course Introduction Reinforcement.ppt
Lecture 1 - introduction.pdf
Reinforcement Learning 1. Introduction
20180520 MLPHS
Reinforcement Learning
Ad

More from Spotle.ai (20)

PDF
Spotle AI-thon - AI For Good Business Plan Showcase - Team IIM Indore - AI Ro...
PDF
Spotle AI-thon - AI For Good Business Plan Showcase - Cummins College
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Elit...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India- Ankur chat...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team La c...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Temp...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Zer...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Shivam Gi...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Cyber Pun...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Tech Owls...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Jar...
PDF
Artificial intelligence in fintech
PDF
Semi-supervised Machine Learning
PDF
Tableau And Data Visualization - Get Started
PDF
Artificial Intelligence in FinTech
PDF
Supervised and Unsupervised Machine Learning
PDF
Growing-up With AI
PDF
AI And Cyber-security Threats
PDF
Robotic Process Automation With Blue Prism
PDF
Get started with Microsoft Azure
Spotle AI-thon - AI For Good Business Plan Showcase - Team IIM Indore - AI Ro...
Spotle AI-thon - AI For Good Business Plan Showcase - Cummins College
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Elit...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India- Ankur chat...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team La c...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Temp...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Zer...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Shivam Gi...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Cyber Pun...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Tech Owls...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Jar...
Artificial intelligence in fintech
Semi-supervised Machine Learning
Tableau And Data Visualization - Get Started
Artificial Intelligence in FinTech
Supervised and Unsupervised Machine Learning
Growing-up With AI
AI And Cyber-security Threats
Robotic Process Automation With Blue Prism
Get started with Microsoft Azure
Ad

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PDF
Approach and Philosophy of On baking technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
Teaching material agriculture food technology
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
KodekX | Application Modernization Development
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Modernizing your data center with Dell and AMD
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation theory and applications.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
Big Data Technologies - Introduction.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Electronic commerce courselecture one. Pdf
Approach and Philosophy of On baking technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Teaching material agriculture food technology
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
20250228 LYD VKU AI Blended-Learning.pptx
KodekX | Application Modernization Development
Understanding_Digital_Forensics_Presentation.pptx
Machine learning based COVID-19 study performance prediction
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Modernizing your data center with Dell and AMD
Encapsulation_ Review paper, used for researhc scholars
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation theory and applications.pdf
NewMind AI Weekly Chronicles - August'25 Week I
NewMind AI Monthly Chronicles - July 2025
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Big Data Technologies - Introduction.pptx
Empathic Computing: Creating Shared Understanding
Diabetes mellitus diagnosis method based random forest with bat algorithm

Basics of Reinforcement Learning

  • 1. Basics of Reinforcement Learning Spotle.ai Study Material Spotle.ai/Learn
  • 2. Spotle.ai Study Material Spotle.ai/Learn Let’s play chess! I just don’t make any possible move without thinking what my opponent’s move can be to counter my move. I try to consider all possible moves that are safe. And then choose the one that I feel is the best move among all. Machines can learn this way. And this learning is called reinforcement machine learning.
  • 3. Spotle.ai Study Material Spotle.ai/Learn What is reinforcement learning? First, a particular situation in which the learning will be applicable. You start at a point, you go through several steps to reach a level. In the process you earn a reward point for every correct step and you lose a reward point for every wrong step. Finally, you choose the path with the highest reward point in that particular situation. Agent Environment State Reward Action
  • 4. Spotle.ai Study Material Spotle.ai/Learn Terminologies Agent: The learner and the decision maker. Environment: Where the agent learns and decides what actions to perform. Action: A set of actions which the agent can perform. State: The state of the agent in the environment. Reward: For each action selected by the agent the environment provides a reward. Usually a scalar value. Agent Environment State Reward Action
  • 5. In supervised learning the training data has the output, that is, the answer in it. Here the model is trained with the correct answer. But in case of reinforcement learning, there is no answer given. The reinforcement agent decides the action to perform based on the maximum reward it receives. There is no training data in reinforcement learning. The machine learns from its experience. Supervised learning? No Spotle.ai Study Material Spotle.ai/Learn Training data Not available
  • 6. Spotle.ai Study Material Spotle.ai/Learn Reinforcing your learning Which one to choose? Give reward to all possible ones step by step Choose the one with the maximum reward.Topic A Topic B Topic C
  • 7. Spotle.ai Study Material Spotle.ai/Learn Pavlov Experiment TRIAL 1 In the first trial Pavlov gives meat to his dog and the dog starts salivating.
  • 8. Spotle.ai Study Material Spotle.ai/Learn Pavlov Experiment TRIAL 2 In the second trial Pavlov does not give meat to his dog but rings a bell. Without seeing the meat the dog does not start salivating.
  • 9. Spotle.ai Study Material Spotle.ai/Learn Pavlov Experiment TRIAL 3 In trial 3 Pavlov rings the bell and gives meat to his dog and seeing meat the dog starts salivating.
  • 10. Spotle.ai Study Material Spotle.ai/Learn Pavlov Experiment TRIAL 4 In trial 4 Pavlov rings the bell and at this his dog starts salivating, hoping that meat will follow the ringing of the bell. This is learning by reinforcement. The dog was rewarded with meat after the ringing of the bell.
  • 11. Summarizing ❖ The input is an initial stage from which the machine starts learning. ❖ There are more than one possible output in a particular problem. ❖ Each output state is given a reward or punishment. ❖ The output with maximum reward is selected to be performed. ❖ The reinforcement learning process is continuous.