SlideShare a Scribd company logo
Distributed deep rl on spark   strata singapore
SKYMIND INTELLIGENCE LAYER (SKIL)
REFERENCE ARCHITECTURE
Overview
● Why am I up here?
● Reinforcement Learning
● Use cases
● Demo!
● Deep Reinforcement Learning
● Rl4j
● Dl4j
● Spark/RL - why?
Why am I up
here?
Wrote this -->
Book Giveaway!
Reinforcement
Learning
● Learn a “policy” with repeated trial
and error
● An agent explores a search space
● Learns from rewards and penalties
each time it takes a step
● Think of win/lose scenarios
● Rewards/punishment set by an
“environment”
Credit:
http://ai.berkeley.edu/reinforcement.ht
ml
Use cases (not
games!)
● Risk analysis (loans)
● Network Intrusion
● Learning patterns from
simulations (MCMC)
Demo!
Cartpole (Hello
world of RL)
Deep
Reinforcement
Learning
● Teach a neural net from environment
● Policy determines gradient descent steps
● Most work has been based on raw frames
from games (pixel input)
● Various techniques (A3C,Policy Gradients,Deep
Q,..)
● Core idea: Neural net has a softmax
(probability distribution) mapped to actions to
take in an environment
RL4j
● Deep Reinforcement Learning
library for Java
● Openai Gym Intregration
● Deep Reinforcement Learning
with DL4j
● Implementations of A3C,DeepQ,
Policy Gradients
● Openai Gym Java Bindings
Dl4j
Dl4j
● Import keras models
● Focus on running in production
● Integrate with existing big data ecosystem
● Transparent usage of cpus and gpus
● End to end ecosystem for building data
products (not just algorithms!)
Spark/RL Why?
● Spark is distributed compute
● A lot of simulations and
environments to run
● Distributed workers running
experiments in parallel
● Data Parallelism with neural nets
Summary
● Spark for orchestrating simulations
● Spark for distributed training
● Integrated storage with HDFS
● Orchestrate GPU based spark jobs
● Easy to hook in to production (java/scala)
● Great streaming ecosystem for incremental
updates
Distributed deep rl on spark   strata singapore

More Related Content

PDF
Wrangleconf Big Data Malaysia 2016
PPTX
Introduction of CTF and CGC
PPTX
Pen Testing, Red Teaming, and More
PDF
Capture The Flag
PDF
KringleCon 3 Providing Value in Offensive Security
PDF
So you want to be a red teamer
PPTX
Pen Testing Development
PDF
Big Data Analytics Tokyo
Wrangleconf Big Data Malaysia 2016
Introduction of CTF and CGC
Pen Testing, Red Teaming, and More
Capture The Flag
KringleCon 3 Providing Value in Offensive Security
So you want to be a red teamer
Pen Testing Development
Big Data Analytics Tokyo

Viewers also liked (20)

PDF
Deep Learning with GPUs in Production - AI By the Bay
PDF
Deep learning in production with the best
PDF
SKIL - Dl4j in the wild meetup
PPTX
Dl4j in the wild
PPTX
Anomaly detection in deep learning (Updated) English
PPTX
Productionizing dl from the ground up
PPTX
Hadoop summit 2016
PPTX
Recurrent nets and sensors
PDF
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
PPTX
Future of ai on the jvm
PPTX
Deep Learning using Spark and DL4J for fun and profit
PDF
Anomaly Detection in Deep Learning (Updated)
PDF
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
PDF
Strata Beijing - Deep Learning in Production on Spark
PDF
H2O World - Top 10 Deep Learning Tips & Tricks - Arno Candel
PPTX
August 2016 HUG: Open Source Big Data Ingest with StreamSets Data Collector
PPTX
Suneel Marthi - Deep Learning with Apache Flink and DL4J
PPTX
Apache Hadoop 3.0 What's new in YARN and MapReduce
PDF
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
PPTX
Dynamic and Static Modeling
Deep Learning with GPUs in Production - AI By the Bay
Deep learning in production with the best
SKIL - Dl4j in the wild meetup
Dl4j in the wild
Anomaly detection in deep learning (Updated) English
Productionizing dl from the ground up
Hadoop summit 2016
Recurrent nets and sensors
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
Future of ai on the jvm
Deep Learning using Spark and DL4J for fun and profit
Anomaly Detection in Deep Learning (Updated)
The Enterprise and Connected Data, Trends in the Apache Hadoop Ecosystem by A...
Strata Beijing - Deep Learning in Production on Spark
H2O World - Top 10 Deep Learning Tips & Tricks - Arno Candel
August 2016 HUG: Open Source Big Data Ingest with StreamSets Data Collector
Suneel Marthi - Deep Learning with Apache Flink and DL4J
Apache Hadoop 3.0 What's new in YARN and MapReduce
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
Dynamic and Static Modeling
Ad

Similar to Distributed deep rl on spark strata singapore (20)

PDF
Sequential Decision Making in Recommendations
PPT
Ai architectureand designpatternsgdc2009
PDF
Memory-based Reinforcement Learning
PPTX
Exploring DeepSeek A Hands-On Dive & How to Adapt the AI Surge.pptx
PPTX
Exploring DeepSeek A Hands-On Dive & How to Adapt the AI Surge.pptx
PDF
Is Production RL at a tipping point?
PDF
The Risks of YOLOing-2.pdf
PDF
How DeepMind Mastered The Game Of Go
PPTX
Session_2_Introduction_to_Deep_Learning.pptx
PDF
Self-supervised Learning Lecture Note
PDF
Session_2_Introduction_to_Deep_Learning.pdf
PPTX
Building a deep learning ai.pptx
PDF
Rewrite the whole damn thing
PPTX
Neural networks with python
PDF
Avoiding GraphQL insecurities with OWASP SKF - OWASP HU meetup
PDF
Performance optimization techniques for Java code
PDF
"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...
PDF
Salt Identification Challenge
PDF
OISF - Continuous Skills Improvement for Everyone
PDF
A brief overview of Reinforcement Learning applied to games
Sequential Decision Making in Recommendations
Ai architectureand designpatternsgdc2009
Memory-based Reinforcement Learning
Exploring DeepSeek A Hands-On Dive & How to Adapt the AI Surge.pptx
Exploring DeepSeek A Hands-On Dive & How to Adapt the AI Surge.pptx
Is Production RL at a tipping point?
The Risks of YOLOing-2.pdf
How DeepMind Mastered The Game Of Go
Session_2_Introduction_to_Deep_Learning.pptx
Self-supervised Learning Lecture Note
Session_2_Introduction_to_Deep_Learning.pdf
Building a deep learning ai.pptx
Rewrite the whole damn thing
Neural networks with python
Avoiding GraphQL insecurities with OWASP SKF - OWASP HU meetup
Performance optimization techniques for Java code
"Deep Reinforcement Learning for Optimal Order Placement in a Limit Order Boo...
Salt Identification Challenge
OISF - Continuous Skills Improvement for Everyone
A brief overview of Reinforcement Learning applied to games
Ad

More from Adam Gibson (17)

PDF
End to end MLworkflows
PDF
World Artificial Intelligence Conference Shanghai 2018
PPTX
Deploying signature verification with deep learning
ODP
Self driving computers active learning workflows with human interpretable ve...
PDF
Anomaly Detection and Automatic Labeling with Deep Learning
PDF
Strata Beijing 2017: Jumpy, a python interface for nd4j
PPTX
Boolan machine learning summit
PDF
Advanced deeplearning4j features
PPTX
Skymind - Udacity China presentation
PDF
Anomaly detection in deep learning
PPTX
Brief introduction to Distributed Deep Learning
PPTX
Advanced spark deep learning
PPTX
Skymind Open Power Summit ISV Round Table
PPTX
Nd4 j slides.pptx
PPTX
Deep learning on Hadoop/Spark -NextML
PDF
Skymind & Deeplearning4j: Deep Learning for the Enterprise
PPTX
Sf data mining_meetup
End to end MLworkflows
World Artificial Intelligence Conference Shanghai 2018
Deploying signature verification with deep learning
Self driving computers active learning workflows with human interpretable ve...
Anomaly Detection and Automatic Labeling with Deep Learning
Strata Beijing 2017: Jumpy, a python interface for nd4j
Boolan machine learning summit
Advanced deeplearning4j features
Skymind - Udacity China presentation
Anomaly detection in deep learning
Brief introduction to Distributed Deep Learning
Advanced spark deep learning
Skymind Open Power Summit ISV Round Table
Nd4 j slides.pptx
Deep learning on Hadoop/Spark -NextML
Skymind & Deeplearning4j: Deep Learning for the Enterprise
Sf data mining_meetup

Recently uploaded (20)

PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Foundation of Data Science unit number two notes
PDF
Launch Your Data Science Career in Kochi – 2025
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PDF
Introduction to Business Data Analytics.
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Global journeys: estimating international migration
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
Moving the Public Sector (Government) to a Digital Adoption
Business Ppt On Nestle.pptx huunnnhhgfvu
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
STUDY DESIGN details- Lt Col Maksud (21).pptx
Foundation of Data Science unit number two notes
Launch Your Data Science Career in Kochi – 2025
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Introduction-to-Cloud-ComputingFinal.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Business Data Analytics.
oil_refinery_comprehensive_20250804084928 (1).pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
IBA_Chapter_11_Slides_Final_Accessible.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Global journeys: estimating international migration
Introduction to Knowledge Engineering Part 1
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Quality review (1)_presentation of this 21
Major-Components-ofNKJNNKNKNKNKronment.pptx
Moving the Public Sector (Government) to a Digital Adoption

Distributed deep rl on spark strata singapore

  • 2. SKYMIND INTELLIGENCE LAYER (SKIL) REFERENCE ARCHITECTURE
  • 3. Overview ● Why am I up here? ● Reinforcement Learning ● Use cases ● Demo! ● Deep Reinforcement Learning ● Rl4j ● Dl4j ● Spark/RL - why?
  • 4. Why am I up here? Wrote this --> Book Giveaway!
  • 5. Reinforcement Learning ● Learn a “policy” with repeated trial and error ● An agent explores a search space ● Learns from rewards and penalties each time it takes a step ● Think of win/lose scenarios ● Rewards/punishment set by an “environment” Credit: http://ai.berkeley.edu/reinforcement.ht ml
  • 6. Use cases (not games!) ● Risk analysis (loans) ● Network Intrusion ● Learning patterns from simulations (MCMC)
  • 8. Deep Reinforcement Learning ● Teach a neural net from environment ● Policy determines gradient descent steps ● Most work has been based on raw frames from games (pixel input) ● Various techniques (A3C,Policy Gradients,Deep Q,..) ● Core idea: Neural net has a softmax (probability distribution) mapped to actions to take in an environment
  • 9. RL4j ● Deep Reinforcement Learning library for Java ● Openai Gym Intregration ● Deep Reinforcement Learning with DL4j ● Implementations of A3C,DeepQ, Policy Gradients ● Openai Gym Java Bindings
  • 10. Dl4j
  • 11. Dl4j ● Import keras models ● Focus on running in production ● Integrate with existing big data ecosystem ● Transparent usage of cpus and gpus ● End to end ecosystem for building data products (not just algorithms!)
  • 12. Spark/RL Why? ● Spark is distributed compute ● A lot of simulations and environments to run ● Distributed workers running experiments in parallel ● Data Parallelism with neural nets
  • 13. Summary ● Spark for orchestrating simulations ● Spark for distributed training ● Integrated storage with HDFS ● Orchestrate GPU based spark jobs ● Easy to hook in to production (java/scala) ● Great streaming ecosystem for incremental updates