SlideShare a Scribd company logo
Hosting Reinforcement Learning Challenges on
EvalAI
Deepesh Pathak
Deshraj Yadav
Kartik Verma
Rishabh Bharadwaj
Rishabh Jain
Project Goal
Build feature that will help users to host Reinforcement
Learning Challenges easily on EvalAI and evaluate the agents
at Scale.
The Flow
Submission
1. Participant submits agent image
using EvalAI-CLI, it will Poll API
for various details
2. The Agent image will be pushed to a
registry
3. The EvalAI CLI will send submission
details to API
Manage Submissions
1. The Submission details received at
API will be pushed to Challenge
Specific Queue.
2. RL Submission worker will be polling
this queue and will get details of
new submissions.
RL Worker
Using the Submission details, RL Worker will deploy the logical
entity of environment and agent containers on Kubernetes.
Deployment Logs from rl-worker.py
Polling the API for new
submissions
Processing Submission and
deploying
Deployment
Evaluation
1. Multi Container Policy
2. gRPC for interaction between Agent
and Environment.
Demo
1. A pole is attached by an un-actuated joint
to a cart, which moves along a frictionless
track.
2. The system is controlled by applying a
force of +1 or -1 to the cart. The pendulum
starts upright, and the goal is to prevent
it from falling over.
3. A reward of +1 is provided for every
timestep that the pole remains upright.
Plan for Phase 3
1. Hosting challenges that have environments and agents
coupled tightly. e.g. Habitat Challenge
2. Resource Optimization using GPU sharing
What I learnt?
Next steps after GSoC
Keep Contributing and help
other contributors
Improve EvalAI
Thank You!

More Related Content

PPTX
PDF
Postman 101 for Students
PDF
Integrations: Using Postman in Your Workflows
PDF
POST/CON 2019 Workshop: Testing, Automated Testing, and Reporting APIs with P...
PDF
Real-time Automation Result in Slack Channel
PPTX
RESTful API Testing using Postman, Newman, and Jenkins
PDF
Getting Ahead of Engineering
PPTX
Do you really need to reload?
Postman 101 for Students
Integrations: Using Postman in Your Workflows
POST/CON 2019 Workshop: Testing, Automated Testing, and Reporting APIs with P...
Real-time Automation Result in Slack Channel
RESTful API Testing using Postman, Newman, and Jenkins
Getting Ahead of Engineering
Do you really need to reload?

What's hot (20)

PDF
Using Postman to Test OAuth/OIDC
PDF
The Most Common Errors That Aren’t Caught
PDF
Postman: An Introduction for API Ops Professionals
PDF
Automation, Integration, and Orchestration for Better Engineering Operations
PDF
Postman Webinar: “Continuous Testing with Postman”
PDF
How Gear4Music Went from 0-1000+ API Tests
PDF
Using Postman to Automate API On-Boarding
PPTX
Postman 101 for developers
PDF
TAUS Quality Dashboard: Use Cases and Integrations -Nigel Watson (SDL), Danie...
PDF
Upgrade Your Collections
PDF
POST/CON Keynote 2019
PDF
Enterprise E-commerce Webinar #2 - Deploy & Monitor APIs
PDF
AllTheTalks 2020: "The Past, Present, and Future of Cloud Native API Gateways"
PDF
Postman Galaxy Tour: San Francisco - Workshop Presentation
PDF
Emulators as an Emerging Best Practice for API providers
PPTX
Renan Dias: Using Alexa to deploy applications to Kubernetes
PPTX
Postman Enterprise Webinar
PDF
Postman Galaxy Tour - Keynote Presentation
PPTX
It meet up 1 0 лебедева, лоханов
PPTX
API Test Automation Tips and Tricks
Using Postman to Test OAuth/OIDC
The Most Common Errors That Aren’t Caught
Postman: An Introduction for API Ops Professionals
Automation, Integration, and Orchestration for Better Engineering Operations
Postman Webinar: “Continuous Testing with Postman”
How Gear4Music Went from 0-1000+ API Tests
Using Postman to Automate API On-Boarding
Postman 101 for developers
TAUS Quality Dashboard: Use Cases and Integrations -Nigel Watson (SDL), Danie...
Upgrade Your Collections
POST/CON Keynote 2019
Enterprise E-commerce Webinar #2 - Deploy & Monitor APIs
AllTheTalks 2020: "The Past, Present, and Future of Cloud Native API Gateways"
Postman Galaxy Tour: San Francisco - Workshop Presentation
Emulators as an Emerging Best Practice for API providers
Renan Dias: Using Alexa to deploy applications to Kubernetes
Postman Enterprise Webinar
Postman Galaxy Tour - Keynote Presentation
It meet up 1 0 лебедева, лоханов
API Test Automation Tips and Tricks
Ad

Similar to Code Upload Challenge on EvalAI (20)

PPTX
Quantum API 1.1 proposal
PDF
Puppet Integration Adapter - Datasheet
PDF
Rehosting apps between k8s clusters and automating deployment using crane c...
PPTX
Rehosting apps between k8s clusters and automating deployment using crane
PPT
Hp Quick Test Professional
PDF
Hrishikesh_iitg_internship_report
PPTX
OpenStack Nova - Developer Introduction
PPTX
Shipping Code like a keptn: Continuous Delivery & Automated Operations on k8s
PPTX
Observability and Orchestration of your GitOps Deployments with Keptn
PDF
MuleSoft Certified Platform Architect Exam Dumps 2023.pdf
PPTX
Vulnerability Discovery in the Cloud
PDF
005281271.pdf
PPTX
Anypoint connector Roadmap
PPTX
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
PDF
Hands-On Lab: Application Performance Management Command Center: Automate Mon...
DOC
kowsalyamanickam_resume_OIM
PPTX
Inside Logic Apps
PPTX
Under the Trenchcoat: Neutron Agent Extensions
PDF
Karim Fanadka
PPT
Remics experiences(berlin) brian
Quantum API 1.1 proposal
Puppet Integration Adapter - Datasheet
Rehosting apps between k8s clusters and automating deployment using crane c...
Rehosting apps between k8s clusters and automating deployment using crane
Hp Quick Test Professional
Hrishikesh_iitg_internship_report
OpenStack Nova - Developer Introduction
Shipping Code like a keptn: Continuous Delivery & Automated Operations on k8s
Observability and Orchestration of your GitOps Deployments with Keptn
MuleSoft Certified Platform Architect Exam Dumps 2023.pdf
Vulnerability Discovery in the Cloud
005281271.pdf
Anypoint connector Roadmap
Don't Deploy Into the Dark: DORA Metrics for your K8s GitOps Deployments
Hands-On Lab: Application Performance Management Command Center: Automate Mon...
kowsalyamanickam_resume_OIM
Inside Logic Apps
Under the Trenchcoat: Neutron Agent Extensions
Karim Fanadka
Remics experiences(berlin) brian
Ad

Recently uploaded (20)

PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
Construction Project Organization Group 2.pptx
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
Artificial Intelligence
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
Geodesy 1.pptx...............................................
PDF
composite construction of structures.pdf
PPTX
web development for engineering and engineering
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
UNIT 4 Total Quality Management .pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Construction Project Organization Group 2.pptx
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Artificial Intelligence
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
Foundation to blockchain - A guide to Blockchain Tech
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
Operating System & Kernel Study Guide-1 - converted.pdf
Fundamentals of safety and accident prevention -final (1).pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Geodesy 1.pptx...............................................
composite construction of structures.pdf
web development for engineering and engineering

Code Upload Challenge on EvalAI

  • 1. Hosting Reinforcement Learning Challenges on EvalAI Deepesh Pathak Deshraj Yadav Kartik Verma Rishabh Bharadwaj Rishabh Jain
  • 2. Project Goal Build feature that will help users to host Reinforcement Learning Challenges easily on EvalAI and evaluate the agents at Scale.
  • 4. Submission 1. Participant submits agent image using EvalAI-CLI, it will Poll API for various details 2. The Agent image will be pushed to a registry 3. The EvalAI CLI will send submission details to API
  • 5. Manage Submissions 1. The Submission details received at API will be pushed to Challenge Specific Queue. 2. RL Submission worker will be polling this queue and will get details of new submissions.
  • 6. RL Worker Using the Submission details, RL Worker will deploy the logical entity of environment and agent containers on Kubernetes. Deployment Logs from rl-worker.py Polling the API for new submissions Processing Submission and deploying
  • 8. Evaluation 1. Multi Container Policy 2. gRPC for interaction between Agent and Environment.
  • 9. Demo 1. A pole is attached by an un-actuated joint to a cart, which moves along a frictionless track. 2. The system is controlled by applying a force of +1 or -1 to the cart. The pendulum starts upright, and the goal is to prevent it from falling over. 3. A reward of +1 is provided for every timestep that the pole remains upright.
  • 10. Plan for Phase 3 1. Hosting challenges that have environments and agents coupled tightly. e.g. Habitat Challenge 2. Resource Optimization using GPU sharing
  • 12. Next steps after GSoC Keep Contributing and help other contributors Improve EvalAI

Editor's Notes

  • #3: What is the problem? Other Class of problems not solved by evalai Host RL Types of
  • #4: What is Env, and what is agent and how they are entities? Challenge creator and participant.
  • #5: Explain
  • #9: The Project can be used for Multi Container Policy as well as Single Container Policy. The advantage of Multi Container Policy is that we decouple things that are not related making the approach more generalized. Environment and Agent: Code on https://github.com/vkartik97/rl-grpc Images on https://hub.docker.com/r/vkartik97/agent and https://hub.docker.com/r/vkartik97/env
  • #11: 1 - example: Habitat
  • #12: GPU Sharing