SlideShare a Scribd company logo
Deep Learning and Recurrent Neural
Networks in the Enterprise
StampedeCon
St. Louis 2016
Josh Patterson, Skymind
Presenter: Josh Patterson
Past
Research in Swarm Algorithms: Real-time optimization techniques in
mesh sensor networks
TVA / NERC: Smartgrid, Sensor Collection, and Big Data
Cloudera: Principal SA, Working with Fortune 500
Patterson Consulting: Working with Fortune 500 on Big Data, ML
Today
Skymind, Director Field Engineering
josh@skymind.io / @jpatanooga
DL4J Co-creator,
Co-Author on Upcoming Oreilly Book
“Deep Learning: A Practitioner’s Approach”
Topics
• What is Deep Learning?
• DL4J
• Recurrent Neural Network Applications
WHAT IS DEEP LEARNING?
Defining Deep Learning
• Higher neuron counts than in previous
generation neural networks
• Different and evolved ways to connect layers
inside neural networks
• More computing power to train
• Automated Feature Learning
Automated Feature Learning
• Deep Learning can be thought of as workflows
for automated feature construction
– From “feature construction” to “feature learning”
• As Yann LeCun says:
– “machines that learn to represent the world”
Deep Learning and Recurrent Neural Networks in the Enterprise
Deep Learning and Recurrent Neural Networks in the Enterprise
These are the features learned at each neuron in a Restricted Boltzmann Machine
(RBMS)
These features are passed to higher levels of RBMs to learn more complicated things.
Part of the
“7” digit
Unreasonable Effectiveness:
Benchmark Records
1. Text-to-speech synthesis (Fan et al., Microsoft, Interspeech 2014)
2. Language identification (Gonzalez-Dominguez et al., Google, Interspeech 2014)
3. Large vocabulary speech recognition (Sak et al., Google, Interspeech 2014)
4. Prosody contour prediction (Fernandez et al., IBM, Interspeech 2014)
5. Medium vocabulary speech recognition (Geiger et al., Interspeech 2014)
6. English to French translation (Sutskever et al., Google, NIPS 2014)
7. Audio onset detection (Marchi et al., ICASSP 2014)
8. Social signal classification (Brueckner & Schulter, ICASSP 2014)
9. Arabic handwriting recognition (Bluche et al., DAS 2014)
10. TIMIT phoneme recognition (Graves et al., ICASSP 2013)
11. Optical character recognition (Breuel et al., ICDAR 2013)
12. Image caption generation (Vinyals et al., Google, 2014)
13. Video to textual description (Donahue et al., 2014)
14. Syntactic parsing for Natural Language Processing (Vinyals et al., Google, 2014)
15. Photo-real talking heads (Soong and Wang, Microsoft, 2014).
Four Major Architectures
• Deep Belief Networks
• Convolutional Neural Networks
• Recurrent Neural Networks
• Recursive Neural Networks
Quick Usage Guide
• If I have Timeseries or Audio Input
– I should use a Recurrent Neural Network
– Examples: Fraud Detection, Anomaly Detection
• If I have Image input
– I should use a Convolutional Neural Network
• If I have Video input
– I should use a hybrid Convolutional + Recurrent
Architecture!
Convolutional Generated Art
The More Things Change…
• Deep Learning is still trying to answer the
same fundamental questions such as:
– “is this image a face?”
• The difference is Deep Learning makes hard
questions easier to answer with better
architectures and more computing power
– We do this by matching the correct architecture
w the right problem
DL4J
Building Deep Neural Networks with
DL4J
• “The Hadoop of Deep Learning”
– Java, Scala, and Python APIs
– ASF 2.0 Licensed
• Java implementation
– Parallelization (Yarn + Spark)
– GPU support
• Also Supports multi-GPU per host
• Runtime Neutral
– Local
– Hadoop / YARN + Spark
• https://github.com/deeplearning4j/deeplearning4j
DL4J Workflow Toolchain
ETL
(DataVec)
Vectorization
(DataVec)
Modeling
(DL4J)
Evaluation
(Arbiter)
Execution Platforms: Spark, Single Machine
ND4J - Linear Algebra Runtime: CPU, GPU
ND4J: The Need for Speed
• Javacpp (cython for java)
– Auto generate JNI bindings for C++ by parsing classes
– Allows for easy maintenance and deployment of c++
binaries in java
• CPU backends
– Openmp (multithreading within native operations)
– Openblas or MKL (BLAS operations)
– SIMD-extensions
• GPU backends
– DL4J supports Cuda 7.5 at the moment, and will support
8.0 support as soon as it comes out.
– Leverages cudnn as well
Prepping Data is Time Consuming
http://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says/#633ea7f67f75
Preparing Data for Modeling is Hard
DataVec
• DataVec is a tool for machine learning ETL
(Extract, Transform, Load) operations.
– Spark-Enabled and focused on Supporting DL4J
• Also performs vectorization
– Image, CSV, Sequences (timeseries), more
• Open Source, ASF 2.0 Licensed
– https://github.com/deeplearning4j/DataVec
RECURRENT NEURAL NETWORK
APPLICATIONS
Using DL4J for
Source: IDC White Paper - sponsored by EMC.
As the Economy Contracts, the Digital Universe Expands. May 2009.
.
Transactional Data Explosion
• 2,500 exabytes of new information in 2012 with Internet as primary driver
• Digital universe grew by 62% last year to 800K petabytes and will grow to 1.2
“zettabytes” this year
Relational
Transactional
(Logs, Sensors)
(You)
NERC Sensor Data Collection
openPDC PMU Data Collection circa 2009
• 120 Sensors
• 30 samples/second
• 4.3B Samples/day
• Housed in Hadoop
Sensor Timeseries Classification with RNNs
• Recurrent Neural Networks have the ability to
model change of input over time
• Older techniques (mostly) do not retain time
domain
– Hidden Markov Models do…
• but are more limited
• Key Takeaway:
– For working with Timeseries data, RNNs will be
more accurate
RNN Architectures
Standard
supervised
learning
Image
captioning
Sentiment
analysis
Video captioning,
Natural language
translation
Part of speech
tagging
Generative mode
for text
Anomaly Detection
• Model the normal patterns in the data
• Autoencoders give us the ability to look at
data that it hasn’t seen before
– Find anomalous patterns in sequences
– Can also use RNNs for pattern classification
• Interesting Industry Applications
– Telecom
– Financial Services
Audio Applications
• Text-to-Speech
• Recognize specific songs / audio
• Enables natural language interfaces
“Google is living a few years in the
future and sending the rest of us
messages”
-- Doug Cutting in 2013
• However
– Most organizations are not built like Google
• (and Jeff Dean does not work at your company…)
• Anyone building Next-Gen infrastructure has
to consider these things
Certified on Two Hadoop Distributions
• Running Spark on Hadoop via YARN gives us
– Sharing cluster resources between heterogeneous
workloads concurrently
– Access to the yarn scheduler capabilities
– Better control of executors in Spark
– Kerberos support for security
• Certified on CDH 5.4
• Certified on HDP 2.4
– [ Coming later this month ]
Questions?
Thank you for your time and attention
“Deep Learning: A Practitioner’s Approach”
(Oreilly, October 2016)
Running DL4J Workflows on Spark
• DataVec is built to scale out via Spark RDDs
– RDD<LabeledPoint>
– RDD<DataSet>
• DL4J Uses same MultiLayerConfiguration as
single host version
– Uses SparkDl4jMultiLayer to drive the training on spark
– Performs Parameter Averaging
spark-submit --class
io.skymind.spark.dl4j.datavec.BasicDataVecExample --master yarn --
num-executors 1 --properties-file ./spark_extra.props
./Skymind_spark-1.0-SNAPSHOT.jar

More Related Content

PPTX
Deep learning with DL4J - Hadoop Summit 2015
PPTX
How to Build Deep Learning Models
PPTX
Deep Learning: DL4J and DataVec
PPTX
Georgia Tech cse6242 - Intro to Deep Learning and DL4J
PPTX
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
PPTX
Building Deep Learning Workflows with DL4J
PDF
DL4J at Workday Meetup
PPTX
Enterprise Deep Learning with DL4J
Deep learning with DL4J - Hadoop Summit 2015
How to Build Deep Learning Models
Deep Learning: DL4J and DataVec
Georgia Tech cse6242 - Intro to Deep Learning and DL4J
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
Building Deep Learning Workflows with DL4J
DL4J at Workday Meetup
Enterprise Deep Learning with DL4J

What's hot (20)

PPTX
Smart Data Conference: DL4J and DataVec
PPTX
Deep learning on Hadoop/Spark -NextML
PDF
DeepLearning4J and Spark: Successes and Challenges - François Garillot
PPTX
Vectorization - Georgia Tech - CSE6242 - March 2015
ODP
Deep Learning for Java (DL4J)
PPTX
Deep Learning on Qubole Data Platform
PDF
Snorkel: Dark Data and Machine Learning with Christopher Ré
PPTX
Machine Learning Exposed!
PPTX
Amazon Deep Learning
PPTX
Hadoop Summit 2014 - San Jose - Introduction to Deep Learning on Hadoop
PDF
H2O with Erin LeDell at Portland R User Group
PDF
Distributed Inference on Large Datasets Using Apache MXNet and Apache Spark ...
PDF
Searching Chinese Patents Presentation at Enterprise Data World
PPTX
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
PDF
Building a Lightweight Discovery Interface for China's Patents@NYC Solr/Lucen...
PDF
Deploying Enterprise Deep Learning Masterclass Preview - Enterprise Deep Lea...
PDF
High Performance Machine Learning in R with H2O
PDF
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
PDF
Introduction to GPUs for Machine Learning
PDF
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Smart Data Conference: DL4J and DataVec
Deep learning on Hadoop/Spark -NextML
DeepLearning4J and Spark: Successes and Challenges - François Garillot
Vectorization - Georgia Tech - CSE6242 - March 2015
Deep Learning for Java (DL4J)
Deep Learning on Qubole Data Platform
Snorkel: Dark Data and Machine Learning with Christopher Ré
Machine Learning Exposed!
Amazon Deep Learning
Hadoop Summit 2014 - San Jose - Introduction to Deep Learning on Hadoop
H2O with Erin LeDell at Portland R User Group
Distributed Inference on Large Datasets Using Apache MXNet and Apache Spark ...
Searching Chinese Patents Presentation at Enterprise Data World
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Building a Lightweight Discovery Interface for China's Patents@NYC Solr/Lucen...
Deploying Enterprise Deep Learning Masterclass Preview - Enterprise Deep Lea...
High Performance Machine Learning in R with H2O
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
Introduction to GPUs for Machine Learning
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Ad

Similar to Deep Learning and Recurrent Neural Networks in the Enterprise (20)

PDF
Introduction to parallel iterative deep learning on hadoop’s next​ generation...
PDF
Deeplearning on Hadoop @OSCON 2014
PPTX
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
PPTX
Deep Learning on Hadoop
PPTX
Hadoop Summit 2014 Distributed Deep Learning
PPT
Introduction_to_DEEP_LEARNING.ppt machine learning that uses data, loads ...
PDF
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
PPT
Introduction_to_DEEP_LEARNING ppt 101ppt
PPT
Introduction_to_DEEP_LEARNING.ppt
PPT
Introduction_to_DEEP_LEARNING.sfsdafsadfsadfsdafsdppt
PDF
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
PPTX
Suneel Marthi - Deep Learning with Apache Flink and DL4J
PPTX
Deep Learning with Apache Spark: an Introduction
PPTX
Understanding deep learning
PPTX
Applied Deep Learning with Spark and Deeplearning4j
PDF
Hands on image recognition with scala spark and deep learning4j
PPTX
Deep Learning and Watson Studio
PDF
DeepLearning4J and Spark: Successes and Challenges - François Garillot
PDF
DeepLearning4J and Spark: Successes and Challenges - François Garillot
PDF
Big Data Malaysia - A Primer on Deep Learning
Introduction to parallel iterative deep learning on hadoop’s next​ generation...
Deeplearning on Hadoop @OSCON 2014
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Deep Learning on Hadoop
Hadoop Summit 2014 Distributed Deep Learning
Introduction_to_DEEP_LEARNING.ppt machine learning that uses data, loads ...
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
Introduction_to_DEEP_LEARNING ppt 101ppt
Introduction_to_DEEP_LEARNING.ppt
Introduction_to_DEEP_LEARNING.sfsdafsadfsadfsdafsdppt
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Suneel Marthi - Deep Learning with Apache Flink and DL4J
Deep Learning with Apache Spark: an Introduction
Understanding deep learning
Applied Deep Learning with Spark and Deeplearning4j
Hands on image recognition with scala spark and deep learning4j
Deep Learning and Watson Studio
DeepLearning4J and Spark: Successes and Challenges - François Garillot
DeepLearning4J and Spark: Successes and Challenges - François Garillot
Big Data Malaysia - A Primer on Deep Learning
Ad

More from Josh Patterson (13)

PPTX
Patterson Consulting: What is Artificial Intelligence?
PPTX
What is Artificial Intelligence
PPTX
Modeling Electronic Health Records with Recurrent Neural Networks
PPTX
Chattanooga Hadoop Meetup - Hadoop 101 - November 2014
PPTX
Intro to Vectorization Concepts - GaTech cse6242
PPTX
MLConf 2013: Metronome and Parallel Iterative Algorithms on YARN
PPTX
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
PPTX
Knitting boar atl_hug_jan2013_v2
PPTX
Knitting boar - Toronto and Boston HUGs - Nov 2012
PPTX
LA HUG Dec 2011 - Recommendation Talk
PPTX
Oct 2011 CHADNUG Presentation on Hadoop
PPTX
Machine Learning and Hadoop
PPTX
Classification with Naive Bayes
Patterson Consulting: What is Artificial Intelligence?
What is Artificial Intelligence
Modeling Electronic Health Records with Recurrent Neural Networks
Chattanooga Hadoop Meetup - Hadoop 101 - November 2014
Intro to Vectorization Concepts - GaTech cse6242
MLConf 2013: Metronome and Parallel Iterative Algorithms on YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Knitting boar atl_hug_jan2013_v2
Knitting boar - Toronto and Boston HUGs - Nov 2012
LA HUG Dec 2011 - Recommendation Talk
Oct 2011 CHADNUG Presentation on Hadoop
Machine Learning and Hadoop
Classification with Naive Bayes

Recently uploaded (20)

PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
1_Introduction to advance data techniques.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Global journeys: estimating international migration
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
Introduction to Business Data Analytics.
PDF
Foundation of Data Science unit number two notes
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
Supervised vs unsupervised machine learning algorithms
Major-Components-ofNKJNNKNKNKNKronment.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
1_Introduction to advance data techniques.pptx
IB Computer Science - Internal Assessment.pptx
Business Acumen Training GuidePresentation.pptx
Global journeys: estimating international migration
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Quality review (1)_presentation of this 21
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Clinical guidelines as a resource for EBP(1).pdf
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Introduction to Business Data Analytics.
Foundation of Data Science unit number two notes
IBA_Chapter_11_Slides_Final_Accessible.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Introduction-to-Cloud-ComputingFinal.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu

Deep Learning and Recurrent Neural Networks in the Enterprise

  • 1. Deep Learning and Recurrent Neural Networks in the Enterprise StampedeCon St. Louis 2016 Josh Patterson, Skymind
  • 2. Presenter: Josh Patterson Past Research in Swarm Algorithms: Real-time optimization techniques in mesh sensor networks TVA / NERC: Smartgrid, Sensor Collection, and Big Data Cloudera: Principal SA, Working with Fortune 500 Patterson Consulting: Working with Fortune 500 on Big Data, ML Today Skymind, Director Field Engineering josh@skymind.io / @jpatanooga DL4J Co-creator, Co-Author on Upcoming Oreilly Book “Deep Learning: A Practitioner’s Approach”
  • 3. Topics • What is Deep Learning? • DL4J • Recurrent Neural Network Applications
  • 4. WHAT IS DEEP LEARNING?
  • 5. Defining Deep Learning • Higher neuron counts than in previous generation neural networks • Different and evolved ways to connect layers inside neural networks • More computing power to train • Automated Feature Learning
  • 6. Automated Feature Learning • Deep Learning can be thought of as workflows for automated feature construction – From “feature construction” to “feature learning” • As Yann LeCun says: – “machines that learn to represent the world”
  • 9. These are the features learned at each neuron in a Restricted Boltzmann Machine (RBMS) These features are passed to higher levels of RBMs to learn more complicated things. Part of the “7” digit
  • 10. Unreasonable Effectiveness: Benchmark Records 1. Text-to-speech synthesis (Fan et al., Microsoft, Interspeech 2014) 2. Language identification (Gonzalez-Dominguez et al., Google, Interspeech 2014) 3. Large vocabulary speech recognition (Sak et al., Google, Interspeech 2014) 4. Prosody contour prediction (Fernandez et al., IBM, Interspeech 2014) 5. Medium vocabulary speech recognition (Geiger et al., Interspeech 2014) 6. English to French translation (Sutskever et al., Google, NIPS 2014) 7. Audio onset detection (Marchi et al., ICASSP 2014) 8. Social signal classification (Brueckner & Schulter, ICASSP 2014) 9. Arabic handwriting recognition (Bluche et al., DAS 2014) 10. TIMIT phoneme recognition (Graves et al., ICASSP 2013) 11. Optical character recognition (Breuel et al., ICDAR 2013) 12. Image caption generation (Vinyals et al., Google, 2014) 13. Video to textual description (Donahue et al., 2014) 14. Syntactic parsing for Natural Language Processing (Vinyals et al., Google, 2014) 15. Photo-real talking heads (Soong and Wang, Microsoft, 2014).
  • 11. Four Major Architectures • Deep Belief Networks • Convolutional Neural Networks • Recurrent Neural Networks • Recursive Neural Networks
  • 12. Quick Usage Guide • If I have Timeseries or Audio Input – I should use a Recurrent Neural Network – Examples: Fraud Detection, Anomaly Detection • If I have Image input – I should use a Convolutional Neural Network • If I have Video input – I should use a hybrid Convolutional + Recurrent Architecture!
  • 14. The More Things Change… • Deep Learning is still trying to answer the same fundamental questions such as: – “is this image a face?” • The difference is Deep Learning makes hard questions easier to answer with better architectures and more computing power – We do this by matching the correct architecture w the right problem
  • 15. DL4J Building Deep Neural Networks with
  • 16. DL4J • “The Hadoop of Deep Learning” – Java, Scala, and Python APIs – ASF 2.0 Licensed • Java implementation – Parallelization (Yarn + Spark) – GPU support • Also Supports multi-GPU per host • Runtime Neutral – Local – Hadoop / YARN + Spark • https://github.com/deeplearning4j/deeplearning4j
  • 17. DL4J Workflow Toolchain ETL (DataVec) Vectorization (DataVec) Modeling (DL4J) Evaluation (Arbiter) Execution Platforms: Spark, Single Machine ND4J - Linear Algebra Runtime: CPU, GPU
  • 18. ND4J: The Need for Speed • Javacpp (cython for java) – Auto generate JNI bindings for C++ by parsing classes – Allows for easy maintenance and deployment of c++ binaries in java • CPU backends – Openmp (multithreading within native operations) – Openblas or MKL (BLAS operations) – SIMD-extensions • GPU backends – DL4J supports Cuda 7.5 at the moment, and will support 8.0 support as soon as it comes out. – Leverages cudnn as well
  • 19. Prepping Data is Time Consuming http://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says/#633ea7f67f75
  • 20. Preparing Data for Modeling is Hard
  • 21. DataVec • DataVec is a tool for machine learning ETL (Extract, Transform, Load) operations. – Spark-Enabled and focused on Supporting DL4J • Also performs vectorization – Image, CSV, Sequences (timeseries), more • Open Source, ASF 2.0 Licensed – https://github.com/deeplearning4j/DataVec
  • 23. Source: IDC White Paper - sponsored by EMC. As the Economy Contracts, the Digital Universe Expands. May 2009. . Transactional Data Explosion • 2,500 exabytes of new information in 2012 with Internet as primary driver • Digital universe grew by 62% last year to 800K petabytes and will grow to 1.2 “zettabytes” this year Relational Transactional (Logs, Sensors) (You)
  • 24. NERC Sensor Data Collection openPDC PMU Data Collection circa 2009 • 120 Sensors • 30 samples/second • 4.3B Samples/day • Housed in Hadoop
  • 25. Sensor Timeseries Classification with RNNs • Recurrent Neural Networks have the ability to model change of input over time • Older techniques (mostly) do not retain time domain – Hidden Markov Models do… • but are more limited • Key Takeaway: – For working with Timeseries data, RNNs will be more accurate
  • 26. RNN Architectures Standard supervised learning Image captioning Sentiment analysis Video captioning, Natural language translation Part of speech tagging Generative mode for text
  • 27. Anomaly Detection • Model the normal patterns in the data • Autoencoders give us the ability to look at data that it hasn’t seen before – Find anomalous patterns in sequences – Can also use RNNs for pattern classification • Interesting Industry Applications – Telecom – Financial Services
  • 28. Audio Applications • Text-to-Speech • Recognize specific songs / audio • Enables natural language interfaces
  • 29. “Google is living a few years in the future and sending the rest of us messages” -- Doug Cutting in 2013 • However – Most organizations are not built like Google • (and Jeff Dean does not work at your company…) • Anyone building Next-Gen infrastructure has to consider these things
  • 30. Certified on Two Hadoop Distributions • Running Spark on Hadoop via YARN gives us – Sharing cluster resources between heterogeneous workloads concurrently – Access to the yarn scheduler capabilities – Better control of executors in Spark – Kerberos support for security • Certified on CDH 5.4 • Certified on HDP 2.4 – [ Coming later this month ]
  • 31. Questions? Thank you for your time and attention “Deep Learning: A Practitioner’s Approach” (Oreilly, October 2016)
  • 32. Running DL4J Workflows on Spark • DataVec is built to scale out via Spark RDDs – RDD<LabeledPoint> – RDD<DataSet> • DL4J Uses same MultiLayerConfiguration as single host version – Uses SparkDl4jMultiLayer to drive the training on spark – Performs Parameter Averaging spark-submit --class io.skymind.spark.dl4j.datavec.BasicDataVecExample --master yarn -- num-executors 1 --properties-file ./spark_extra.props ./Skymind_spark-1.0-SNAPSHOT.jar