2
Most read
3
Most read
15
Most read
Gym & Universe
Ashish Kumar
LinkedIn: https://www.linkedin.com/in/ashkmr1
Twitter : @ashish_fagna
e-mail : ashish.fagna@gmail.com
OpenAI - Introduction
● OpenAI is a non-profit artificial
intelligence (AI) research company.
● It aims to promote and develop
friendly AI in a way to benefit
humanity as a whole.
● It aims to "freely collaborate" by
making its patents and research open
to the public.
2
Latest in News
● OpenAI Five vs Dota 2
● Event was streamed online
● OpenAI Five is a set of five neural networks
3
Source: https://medium.com/deep-math-machine-learning-ai/different-types-of-machine-learning-and-their-types-34760b9128a2
4
ReInforcement Learning
● One of the most important type of Machine Learning,
● An agent learns how to behave in a environment by performing actions and
seeing the results.
5
ReInforcement Learning
There are two basic concepts in reinforcement learning:
1. Environment (namely, the outside world) and
2. Agent (namely, the algorithm you are writing).
The agent sends actions to the environment, and the environment replies with
observations and rewards (that is, a score).
6
Example : ReInforcement Learning
Imagine you’re a child in a living room.
Action1 : You see a fireplace, and you approach it. It’s warm
(Positive Reward +1).
Action 2: But when you try to touch the fire. It burns your hand
(Negative reward -1).
Learning 1 : fire is positive when you are a sufficient distance
away, because it produces warmth.
Learning 2 : But getting too close to it and you will be burned.
Image Source : https://medium.freecodecamp.org/an-introduction-to-reinforcement-learning-4339519de419
That’s how humans learn, through interaction.
Reinforcement Learning is just a computational approach of learning from action.
7
Reinforcement Learning
Actions influence the state, which determines reward.
Image Source : https://medium.freecodecamp.org/an-introduction-to-reinforcement-learning-4339519de419
8
Reinforcement Learning Process (Super Mario)
Let’s imagine an agent learning to play Super Mario.
The Reinforcement Learning (RL) process can be modeled as a loop that works
like this:
● Our Agent receives state S0 from the Environment (In our case we receive
the first frame of our game (state) from Super Mario (environment))
● Based on that state S0, agent takes an action A0 (our agent will move right)
● Environment transitions to a new state S1 (new frame)
● Environment gives some reward R1 to the agent (not dead: +1)
This RL loop outputs a sequence of state, action and reward.
9
The Reinforcement Learning process
● The goal of the agent is to maximize the expected cumulative reward.
● By running more and more loops, the agent will learn to play better and
better.
10
OpenAI Gym
● Gym is a toolkit for Researching
(developing and comparing)
reinforcement learning algorithms.
● It supports teaching agents everything
from walking to playing games like Pong
or Pinball.
● Gym Envs:
https://gym.openai.com/envs/#mujoco
Ant-v2
Make a 3D four-legged robot walk. 11
OpenAI Universe
● Platform for measuring and training an AGI across games, websites and other
applications.
● Makes it possible for any existing program to become an OpenAI Gym
environment, without needing special access to the program's internals,
source code, or APIs.
● It does this by packaging the program into a Docker container, and presenting
the AI with the same interface a human uses: sending keyboard and mouse
events, and receiving screen pixels.
● Contains over 1,000 environments in which an AI agent can take actions and
gather observations. 12
Command: Start Docker Container via Conda
● Conda is an open source package management system and environment
management system that runs on Windows, macOS and Linux.
● Conda quickly installs, runs and updates packages and their dependencies.
● Conda easily creates, saves, loads and switches between environments on
your local computer.
Command:
conda create --name universe-starter-agent python=3.5
source activate universe-starter-agent
13
OpenAI Universe Demo
14
Gym vs Universe
● OpenAI Universe is like a much bigger OpenAI Gym.
● OpenAI Gym’s got some basic tasks, like pole balancing, and pendulum
uprighting, and some more difficult ones like basic Atari games like Space
Invaders.
● like an enclosed world, or a “gym” to exercise and develop RL algorithms.
● OpenAI Universe has a much wider variety of tasks, and is more involved in
giving RL networks/algorithms the ability to interact with the real world:
playing games, using an actual (virtual) keyboard and mouse to interact with
buttons and sliders on webpages, etc.
● Universe is based on Gym
15
Use Cases
Environments for doing various tasks, like
● Sending an email,
● Doing some mouse clicking, keyboard events,
● More and more environments are being added
16
Thank You
Ashish Kumar
LinkedIn: https://www.linkedin.com/in/ashkmr1
Twitter : @ashish_fagna
E-mail: ashish.fagna@gmail.com
17

More Related Content

PPTX
Open ai openpower
PPTX
How does ChatGPT work: an Information Retrieval perspective
PPTX
openai.pptx
PPTX
Behind the Scenes of ChatGPT.pptx
PPTX
Breaking down the AI magic of ChatGPT: A technologist's lens to its powerful ...
PPTX
Open AI Chat GPT-4-3.pptx
PDF
Developing Apps with GPT-4 and ChatGPT_ Build Intelligent Chatbots, Content G...
PPTX
openai-chatgpt sunumu
Open ai openpower
How does ChatGPT work: an Information Retrieval perspective
openai.pptx
Behind the Scenes of ChatGPT.pptx
Breaking down the AI magic of ChatGPT: A technologist's lens to its powerful ...
Open AI Chat GPT-4-3.pptx
Developing Apps with GPT-4 and ChatGPT_ Build Intelligent Chatbots, Content G...
openai-chatgpt sunumu

What's hot (20)

PDF
Reinforcement Learning using OpenAI Gym
PPTX
Generative models
PPTX
Future of AI - 2023 07 25.pptx
PDF
An Introduction to Generative AI
PPTX
Reinforcement learning
PDF
What is chat gpt
PDF
Machine learning Summer Training report
PPTX
An introduction to reinforcement learning
PPTX
Web scraping
PDF
GAN in medical imaging
PDF
Introduction to Deep Learning (NVIDIA)
PPTX
AI Hardware Landscape 2021
PPTX
Artificial Intelligence in Gaming
PDF
Artificial Intelligence Notes Unit 2
PDF
A brief overview of Reinforcement Learning applied to games
PPTX
Presentation-Open AI Chat GPT-4 3.pptx
PDF
Transfer Learning
PDF
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
PPTX
Tic Tac Toe using Mini Max Algorithm
DOCX
Industrial Training report on java
Reinforcement Learning using OpenAI Gym
Generative models
Future of AI - 2023 07 25.pptx
An Introduction to Generative AI
Reinforcement learning
What is chat gpt
Machine learning Summer Training report
An introduction to reinforcement learning
Web scraping
GAN in medical imaging
Introduction to Deep Learning (NVIDIA)
AI Hardware Landscape 2021
Artificial Intelligence in Gaming
Artificial Intelligence Notes Unit 2
A brief overview of Reinforcement Learning applied to games
Presentation-Open AI Chat GPT-4 3.pptx
Transfer Learning
AI Vs ML Vs DL PowerPoint Presentation Slide Templates Complete Deck
Tic Tac Toe using Mini Max Algorithm
Industrial Training report on java
Ad

Similar to OpenAI Gym & Universe (20)

PDF
Autonomous Systems for Optimization and Control
PDF
Autonomous Control AI Training from Data
PPTX
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
PDF
AbadIA: the abbey of the crime AI - GDG Cloud London 2018
PPTX
Reinforcement Learning - Mindbridge.ai
PPT
Recent trends in Artificial intelligence and Machine learning
PDF
Reinforcement learning in a nutshell
PDF
Deep Reinforcement Learning and Its Applications
PPTX
Introduction to Generative AI refers to a subset of artificial intelligence
PDF
Introduction to reinforcement learning
PPTX
An Introduction to Reinforcement Learning (December 2018)
PDF
Aaa ped-24- Reinforcement Learning
PPTX
Designing an AI that gains experience for absolute beginners
PDF
RL presentation
PPTX
Machine Learning and its types with application
PPTX
Introduction-to-Artificial Intelligence.pptx.pptx
PDF
A General Overview of Machine Learning
PDF
Deep Reinforcement Learning An Introduction
PDF
Data Science, Machine Learning and Neural Networks
PDF
Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission
Autonomous Systems for Optimization and Control
Autonomous Control AI Training from Data
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
AbadIA: the abbey of the crime AI - GDG Cloud London 2018
Reinforcement Learning - Mindbridge.ai
Recent trends in Artificial intelligence and Machine learning
Reinforcement learning in a nutshell
Deep Reinforcement Learning and Its Applications
Introduction to Generative AI refers to a subset of artificial intelligence
Introduction to reinforcement learning
An Introduction to Reinforcement Learning (December 2018)
Aaa ped-24- Reinforcement Learning
Designing an AI that gains experience for absolute beginners
RL presentation
Machine Learning and its types with application
Introduction-to-Artificial Intelligence.pptx.pptx
A General Overview of Machine Learning
Deep Reinforcement Learning An Introduction
Data Science, Machine Learning and Neural Networks
Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission
Ad

More from Entrepreneur / Startup (13)

PDF
R-FCN : object detection via region-based fully convolutional networks
PPTX
You only look once (YOLO) : unified real time object detection
PPTX
Machine Learning Algorithms in Enterprise Applications
PPTX
Build a Neural Network for ITSM with TensorFlow
PPTX
Understanding Autoencoder (Deep Learning Book, Chapter 14)
PPTX
Build an AI based virtual agent
PPTX
Building Bots Using IBM Watson
PDF
Building chat bots using ai platforms (wit.ai or api.ai) in nodejs
PPTX
Building mobile apps using meteorJS
PPTX
Building iOS app using meteor
PPTX
Understanding angular meteor
PPTX
Introducing ElasticSearch - Ashish
PPTX
Meteor Introduction - Ashish
R-FCN : object detection via region-based fully convolutional networks
You only look once (YOLO) : unified real time object detection
Machine Learning Algorithms in Enterprise Applications
Build a Neural Network for ITSM with TensorFlow
Understanding Autoencoder (Deep Learning Book, Chapter 14)
Build an AI based virtual agent
Building Bots Using IBM Watson
Building chat bots using ai platforms (wit.ai or api.ai) in nodejs
Building mobile apps using meteorJS
Building iOS app using meteor
Understanding angular meteor
Introducing ElasticSearch - Ashish
Meteor Introduction - Ashish

Recently uploaded (20)

PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Hybrid model detection and classification of lung cancer
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Unlock new opportunities with location data.pdf
PPT
Geologic Time for studying geology for geologist
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PPTX
Modernising the Digital Integration Hub
PDF
Five Habits of High-Impact Board Members
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PPTX
Tartificialntelligence_presentation.pptx
PPT
What is a Computer? Input Devices /output devices
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
The various Industrial Revolutions .pptx
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Getting started with AI Agents and Multi-Agent Systems
Univ-Connecticut-ChatGPT-Presentaion.pdf
Hybrid model detection and classification of lung cancer
Hindi spoken digit analysis for native and non-native speakers
Unlock new opportunities with location data.pdf
Geologic Time for studying geology for geologist
sustainability-14-14877-v2.pddhzftheheeeee
O2C Customer Invoices to Receipt V15A.pptx
Modernising the Digital Integration Hub
Five Habits of High-Impact Board Members
Web Crawler for Trend Tracking Gen Z Insights.pptx
Tartificialntelligence_presentation.pptx
What is a Computer? Input Devices /output devices
observCloud-Native Containerability and monitoring.pptx
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
Final SEM Unit 1 for mit wpu at pune .pptx
The various Industrial Revolutions .pptx
A novel scalable deep ensemble learning framework for big data classification...
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Module 1.ppt Iot fundamentals and Architecture
Getting started with AI Agents and Multi-Agent Systems

OpenAI Gym & Universe

  • 1. Gym & Universe Ashish Kumar LinkedIn: https://www.linkedin.com/in/ashkmr1 Twitter : @ashish_fagna e-mail : ashish.fagna@gmail.com
  • 2. OpenAI - Introduction ● OpenAI is a non-profit artificial intelligence (AI) research company. ● It aims to promote and develop friendly AI in a way to benefit humanity as a whole. ● It aims to "freely collaborate" by making its patents and research open to the public. 2
  • 3. Latest in News ● OpenAI Five vs Dota 2 ● Event was streamed online ● OpenAI Five is a set of five neural networks 3
  • 5. ReInforcement Learning ● One of the most important type of Machine Learning, ● An agent learns how to behave in a environment by performing actions and seeing the results. 5
  • 6. ReInforcement Learning There are two basic concepts in reinforcement learning: 1. Environment (namely, the outside world) and 2. Agent (namely, the algorithm you are writing). The agent sends actions to the environment, and the environment replies with observations and rewards (that is, a score). 6
  • 7. Example : ReInforcement Learning Imagine you’re a child in a living room. Action1 : You see a fireplace, and you approach it. It’s warm (Positive Reward +1). Action 2: But when you try to touch the fire. It burns your hand (Negative reward -1). Learning 1 : fire is positive when you are a sufficient distance away, because it produces warmth. Learning 2 : But getting too close to it and you will be burned. Image Source : https://medium.freecodecamp.org/an-introduction-to-reinforcement-learning-4339519de419 That’s how humans learn, through interaction. Reinforcement Learning is just a computational approach of learning from action. 7
  • 8. Reinforcement Learning Actions influence the state, which determines reward. Image Source : https://medium.freecodecamp.org/an-introduction-to-reinforcement-learning-4339519de419 8
  • 9. Reinforcement Learning Process (Super Mario) Let’s imagine an agent learning to play Super Mario. The Reinforcement Learning (RL) process can be modeled as a loop that works like this: ● Our Agent receives state S0 from the Environment (In our case we receive the first frame of our game (state) from Super Mario (environment)) ● Based on that state S0, agent takes an action A0 (our agent will move right) ● Environment transitions to a new state S1 (new frame) ● Environment gives some reward R1 to the agent (not dead: +1) This RL loop outputs a sequence of state, action and reward. 9
  • 10. The Reinforcement Learning process ● The goal of the agent is to maximize the expected cumulative reward. ● By running more and more loops, the agent will learn to play better and better. 10
  • 11. OpenAI Gym ● Gym is a toolkit for Researching (developing and comparing) reinforcement learning algorithms. ● It supports teaching agents everything from walking to playing games like Pong or Pinball. ● Gym Envs: https://gym.openai.com/envs/#mujoco Ant-v2 Make a 3D four-legged robot walk. 11
  • 12. OpenAI Universe ● Platform for measuring and training an AGI across games, websites and other applications. ● Makes it possible for any existing program to become an OpenAI Gym environment, without needing special access to the program's internals, source code, or APIs. ● It does this by packaging the program into a Docker container, and presenting the AI with the same interface a human uses: sending keyboard and mouse events, and receiving screen pixels. ● Contains over 1,000 environments in which an AI agent can take actions and gather observations. 12
  • 13. Command: Start Docker Container via Conda ● Conda is an open source package management system and environment management system that runs on Windows, macOS and Linux. ● Conda quickly installs, runs and updates packages and their dependencies. ● Conda easily creates, saves, loads and switches between environments on your local computer. Command: conda create --name universe-starter-agent python=3.5 source activate universe-starter-agent 13
  • 15. Gym vs Universe ● OpenAI Universe is like a much bigger OpenAI Gym. ● OpenAI Gym’s got some basic tasks, like pole balancing, and pendulum uprighting, and some more difficult ones like basic Atari games like Space Invaders. ● like an enclosed world, or a “gym” to exercise and develop RL algorithms. ● OpenAI Universe has a much wider variety of tasks, and is more involved in giving RL networks/algorithms the ability to interact with the real world: playing games, using an actual (virtual) keyboard and mouse to interact with buttons and sliders on webpages, etc. ● Universe is based on Gym 15
  • 16. Use Cases Environments for doing various tasks, like ● Sending an email, ● Doing some mouse clicking, keyboard events, ● More and more environments are being added 16
  • 17. Thank You Ashish Kumar LinkedIn: https://www.linkedin.com/in/ashkmr1 Twitter : @ashish_fagna E-mail: ashish.fagna@gmail.com 17