SlideShare a Scribd company logo
MASTERTHESISPROPOSALComparison of Reinforcement Learning Frameworks
MASTERTHESISPROPOSAL
Reinforcement Learning (RL) is a class of machine learning algorithms in which an agent interacts by trial-and-error in an
environment.
RL in conjunction with Deep Learning has obtained outstanding results in Atari video games, the Go board-game and a
more complex environment like StarCraft II.
Recently many open source RL frameworks has been released by software companies in order to easily train and test new
RL algorithms.
The goal of the thesis is to benchmark the most promising RL frameworks, to study the new algorithms proposed and to
evaluate their performance on research environments.
Planned Activities
1.Acquire strong theoretical basis on Deep Reinforcement
Learning;
2.Install and compare the different RL frameworks;
3.Adapt and apply the best framework to a real application.


Required Skills:
• Experience with Linux or Unix based OS;
• Proficiency in at least one programming language (Python,
Lua, Matlab, C++, Java);
• Basic knowledge of machine learning;
• Good knowledge of linear algebra.
Competencies to be acquired:
• Experience with the application of Machine Learning to
complex systems.
• Expertise on the most recent Deep Reinforcement Learning
algorithms;
• Proficient use of the most advanced development
frameworks with a software engineering approach.

add-for.com
Who we’re looking for
Students that are about to get their Master Degree in:
computer science, computer engineering, mechatronic
engineering, mathematical engineering, physics of
complex systems, mathematics, physics or stochastics
and data science.
Duration of this Projects: 5-6 months
How to contact us
Directly by email to: sonia.cannavo@add-for.com
By LinkedIn: linkedin.com/in/cannavò-sonia-66a95467
Check these Links before moving on[1] Reinforcement Learning:An Introduction
http://incompleteideas.net/book/the-book.html
[2] Deep-Q-Network
https://www.nature.com/articles/nature14236
[3] AlphaGo
https://deepmind.com/research/alphago/
[4] AlphaStar https://bit.ly/2B5YrKh
[5] RL frameworks:
https://github.com/deepmind/trfl
https://github.com/facebookresearch/Horizon
https://github.com/google/dopamine
https://github.com/NervanaSystems/coach
https://github.com/openai/spinningup
[6] OpenAI gym
https://gym.openai.com/
[7] Mujoco
http://mujoco.org/
[8] CARLA Simulator
https://github.com/carla-simulator/carla

More Related Content

PDF
Deep reinforcement learning&Robotics
PDF
Unearthing The Power Of IBM – Rational Functional Tester 7.0 - RFT
DOCX
PDF
PowerShell Defcon for Cybersecurity Topics
PDF
视觉效果制作行业的工业语言——Python
PDF
What to Look for When Hiring a Rust Software Developer in 2025?
DOCX
Php developer
PPTX
Introduction to matlab for medical doctors and biologists (call slides)
Deep reinforcement learning&Robotics
Unearthing The Power Of IBM – Rational Functional Tester 7.0 - RFT
PowerShell Defcon for Cybersecurity Topics
视觉效果制作行业的工业语言——Python
What to Look for When Hiring a Rust Software Developer in 2025?
Php developer
Introduction to matlab for medical doctors and biologists (call slides)

Similar to Master's Thesis - comparison of reinforcement learning frameworks (20)

DOC
Satyam_Singh_cv
PDF
Machine learning scientist
DOCX
Dusty Parrott Resume
DOCX
Software Engineer Resume
DOCX
prathibha resume
PPTX
DSA unpluggedEventByGDGOncampusAtMedcaps.pptx
PDF
Lead developer position
PPT
Programming Paradigms
DOCX
RAGHUNATH_GORLA_RESUME
DOC
Chiranjeevi_QA Engg.
DOC
Mannu_Kumar_CV
PDF
WTFAST Crack Latest Version FREE Downlaod 2025
PDF
uTorrent Pro Crack Latest Version free 2025
PDF
Adobe Master Collection CC Crack 2025 FREE
PDF
AOMEI Partition Assistant Crack 2025 FREE
PDF
K7 Total Security 16.0.1260 Crack + License Key Free
PPTX
Evolving Scala, Scalar conference, Warsaw, March 2025
DOCX
Job Title: Senor/Research Fellow Job Function: This candidate will ...
PDF
Software developer in test sdet
Satyam_Singh_cv
Machine learning scientist
Dusty Parrott Resume
Software Engineer Resume
prathibha resume
DSA unpluggedEventByGDGOncampusAtMedcaps.pptx
Lead developer position
Programming Paradigms
RAGHUNATH_GORLA_RESUME
Chiranjeevi_QA Engg.
Mannu_Kumar_CV
WTFAST Crack Latest Version FREE Downlaod 2025
uTorrent Pro Crack Latest Version free 2025
Adobe Master Collection CC Crack 2025 FREE
AOMEI Partition Assistant Crack 2025 FREE
K7 Total Security 16.0.1260 Crack + License Key Free
Evolving Scala, Scalar conference, Warsaw, March 2025
Job Title: Senor/Research Fellow Job Function: This candidate will ...
Software developer in test sdet
Ad

More from Enrico Busto (18)

PDF
IBM Prague ai - real life experiences in engaging customers and do business...
PDF
20181210 Super Resolution
PDF
Master's Thesis - inverse reinforcement learning for autonomous driving
PDF
Master's Thesis - deep genomics: harnessing the power of deep neural networks...
PDF
Master's degree thesis testing algorithms for image & video understanding
PDF
20180509 energy - v001
PDF
Why join the navy - Addfor prsentation
PDF
Meetup IBM Rome October 24th 2018
PDF
Ai business innovator v001
PDF
Imaging automotive 2015 addfor v002
PDF
PaSSED - IBM Power AI - Addfor
PDF
NVIDIA DGX-1 Community-Based Benchmark
PDF
ARTIFICIAL INTELLIGENCE AT WORK
PDF
Performance Traction Control (PTC)
PDF
SideSlip Angle Estimator (SSE)
PPTX
Wiki stage 20151128 - v001
PDF
Automotive Virtual Sensors - Motorsport Applications
PDF
Imaging automotive 2015 addfor v002
IBM Prague ai - real life experiences in engaging customers and do business...
20181210 Super Resolution
Master's Thesis - inverse reinforcement learning for autonomous driving
Master's Thesis - deep genomics: harnessing the power of deep neural networks...
Master's degree thesis testing algorithms for image & video understanding
20180509 energy - v001
Why join the navy - Addfor prsentation
Meetup IBM Rome October 24th 2018
Ai business innovator v001
Imaging automotive 2015 addfor v002
PaSSED - IBM Power AI - Addfor
NVIDIA DGX-1 Community-Based Benchmark
ARTIFICIAL INTELLIGENCE AT WORK
Performance Traction Control (PTC)
SideSlip Angle Estimator (SSE)
Wiki stage 20151128 - v001
Automotive Virtual Sensors - Motorsport Applications
Imaging automotive 2015 addfor v002
Ad

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Cloud computing and distributed systems.
PDF
Machine learning based COVID-19 study performance prediction
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
sap open course for s4hana steps from ECC to s4
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
MIND Revenue Release Quarter 2 2025 Press Release
Big Data Technologies - Introduction.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Encapsulation_ Review paper, used for researhc scholars
Chapter 3 Spatial Domain Image Processing.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
“AI and Expert System Decision Support & Business Intelligence Systems”
Per capita expenditure prediction using model stacking based on satellite ima...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Understanding_Digital_Forensics_Presentation.pptx
sap open course for s4hana steps from ECC to s4
The Rise and Fall of 3GPP – Time for a Sabbatical?
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
The AUB Centre for AI in Media Proposal.docx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Reach Out and Touch Someone: Haptics and Empathic Computing
MIND Revenue Release Quarter 2 2025 Press Release

Master's Thesis - comparison of reinforcement learning frameworks

  • 1. MASTERTHESISPROPOSALComparison of Reinforcement Learning Frameworks MASTERTHESISPROPOSAL Reinforcement Learning (RL) is a class of machine learning algorithms in which an agent interacts by trial-and-error in an environment. RL in conjunction with Deep Learning has obtained outstanding results in Atari video games, the Go board-game and a more complex environment like StarCraft II. Recently many open source RL frameworks has been released by software companies in order to easily train and test new RL algorithms. The goal of the thesis is to benchmark the most promising RL frameworks, to study the new algorithms proposed and to evaluate their performance on research environments. Planned Activities 1.Acquire strong theoretical basis on Deep Reinforcement Learning; 2.Install and compare the different RL frameworks; 3.Adapt and apply the best framework to a real application. 
 Required Skills: • Experience with Linux or Unix based OS; • Proficiency in at least one programming language (Python, Lua, Matlab, C++, Java); • Basic knowledge of machine learning; • Good knowledge of linear algebra. Competencies to be acquired: • Experience with the application of Machine Learning to complex systems. • Expertise on the most recent Deep Reinforcement Learning algorithms; • Proficient use of the most advanced development frameworks with a software engineering approach.
 add-for.com Who we’re looking for Students that are about to get their Master Degree in: computer science, computer engineering, mechatronic engineering, mathematical engineering, physics of complex systems, mathematics, physics or stochastics and data science. Duration of this Projects: 5-6 months How to contact us Directly by email to: sonia.cannavo@add-for.com By LinkedIn: linkedin.com/in/cannavò-sonia-66a95467 Check these Links before moving on[1] Reinforcement Learning:An Introduction http://incompleteideas.net/book/the-book.html [2] Deep-Q-Network https://www.nature.com/articles/nature14236 [3] AlphaGo https://deepmind.com/research/alphago/ [4] AlphaStar https://bit.ly/2B5YrKh [5] RL frameworks: https://github.com/deepmind/trfl https://github.com/facebookresearch/Horizon https://github.com/google/dopamine https://github.com/NervanaSystems/coach https://github.com/openai/spinningup [6] OpenAI gym https://gym.openai.com/ [7] Mujoco http://mujoco.org/ [8] CARLA Simulator https://github.com/carla-simulator/carla