SlideShare a Scribd company logo
PRODUCTIONIZING
DEEP LEARNING FROM
THE GROUND UP
Adam Gibson
O P E N
D A T A
S C I E N C E
C O N F E R E N C E_
BOSTON 2015
@opendatasci
Open DataSciCon May 2015
Productionizing
Deep Learning
From the Ground Up
Overview
● What is Deep Learning?
● Why is it hard?
● Problems to think about
● Conclusions
What is Deep Learning?
Pattern
recognition on
unlabeled &
unstructured
data.
What is Deep Learning?
● Deep Neural Networks >= 3 Layers
● For media/unstructured data
● Automatic Feature Engineering
● Benefits From Complex Architectures
● Computationally Intensive
● Accelerates With Special Hardware
Get why it’s hard yet?
Deep Networks >= 3 Layers
● Backpropagation and Old School ANNs = 3
Deep Networks
● Neural Networks themselves as hidden
Layers
● Different Types of Layers can be
Interchanged/stacked
● Multiple Layer Types, each with own
Hyperparameters and Loss Functions
What Are Common Layer Types?
Feedforward
1. MLPs
2. AutoEncoders
3. RBMs
Recurrent
1. MultiModal
2. LSTMs
3. Stateful
Convolutional
Lenet: Mixes convolutional & subsampling layers
Recursive/Tree
Uses a parser to form a tree structure
Other kinds
● Memory Networks
● Deep Reinforcement Learning
● Adversarial Architectures
● New recursive ConvNet variant to come in
2016?
● Over 9,000 Layers? (22 is already pretty
common)
Automatic Feature Engineering
Automatic Feature Engineering (TSNE)
Visualizations are crucial:
Use TSNE to render different kinds of data:
http://lvdmaaten.github.io/tsne/
deeplearning4j.org
presentation@
Google, Nov. 17 2014
“TWO PIZZAS SITTING ON A STOVETOP”
Benefits from Complex Architectures
Google’s result combined:
● LSTMs (learning captions)
● Word Embeddings
● Convolutional features from images (aligned
to be same size as embeddings)
Computationally Intensive
● One iteration of ImageNet (1k label dataset
and over 1MM examples) takes 7 hours on
GPUs
● Project Adam
● Google Brain
Special Hardware required
Unlike most solutions, multiple GPUs are used
today
(Not common in Java-based stacks!)
Software Engineering Concerns
● Pipelines to deal with messy data,
not canned problems...
(Real life is not Kaggle, people.)
● Scale/Maintenance (Clusters of GPUs aren’t
done well today.)
● Different kinds of parallelism (model and
data)
Model vs Data Parallelism
● Model is sharding model across servers
(HPC style)
● Data is mini batch
Vectorizing unstructured data
● Data is stored in different databases
● Different kinds of files (raw)
● Deep Learning works well on mixed signal
Parallelism
● Model (HPC)
● Data (Mini batch param averaging)
Production Stacks today
● Hadoop/Spark not enough
● GPUs not friendly to average programmer
● Cluster management of GPUs as a resource
not typically done
● Many frameworks don’t work well in a
distributed env (getting better, though)
Problems With Neural Nets
● Loss functions
● Scaling data
● Mixing different neural nets
● Hyperparameter tuning
Loss Functions
● Classification
● Regression
● Reconstruction
Scaling Data
● Zero mean and unit variance
● Zero to 1
● Other forms of preprocessing relative to
distribution of data
● Processing can also be columnwise
(categorical?)
Mixing and Matching Neural Networks
● Video: ConvNet + Recurrent
● Convolutional RBMs?
● Convolutional -> Subsampling -> Fully
Connected
● DBNs: Different hidden and visible units for
each layer
Hyperparameter tuning
● Underfit
● Overfit
● Overdescribe (your hidden layers)
● Layerwise interactions
● What activation function? (Competing?
Relu? Good ol’ Sigmoid?)
Hyperparameter Tuning (2)
● Grid search for neural nets (Don’t do it!)
● Bayesian (Getting better. There are at least
priors here.)
● Gradient-based approaches (Your hyper-
parameters are a neural net, so there are
neural nets optimizing your neural nets...)
Questions?
Twitter: @agibsonccc
Github: agibsonccc
LinkedIn: /in/agibsonccc
Email: adam@skymind.io (combo breaker!)
Web: deeplearning4j.org

More Related Content

PPTX
Productionizing dl from the ground up
PPT
How To Do A Project
PDF
Николай Хлебинский — Предсказательная аналитика в E-commerce: есть ли жизнь п...
PPTX
Fail Conference
PPT
Understanding the Chief Data Officer
PDF
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
PDF
Materialism for UX Design
PPTX
Feature Engineering
Productionizing dl from the ground up
How To Do A Project
Николай Хлебинский — Предсказательная аналитика в E-commerce: есть ли жизнь п...
Fail Conference
Understanding the Chief Data Officer
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Materialism for UX Design
Feature Engineering

Viewers also liked (6)

PPTX
[Uxtokyojam]2014 final public
PDF
Can We Automate Predictive Analytics
PDF
Rethinking hx uxtokyo_jam
PDF
Experience Design Out of Screen:これからのエクスペリエンスデザイナーの生きる道@UX TOKYO Jam
PDF
UX TOKYO Jam 2014 Closing Keynote
PPTX
UX0,1,100
[Uxtokyojam]2014 final public
Can We Automate Predictive Analytics
Rethinking hx uxtokyo_jam
Experience Design Out of Screen:これからのエクスペリエンスデザイナーの生きる道@UX TOKYO Jam
UX TOKYO Jam 2014 Closing Keynote
UX0,1,100
Ad

Similar to Productionizing Deep Learning From the Ground Up (20)

PDF
Deep Learning Demystified
PPTX
Introduction to deep learning
PDF
Neural Networks, Spark MLlib, Deep Learning
PDF
Phx dl meetup
PPTX
Introduction to deep learning workshop
PPTX
Visualization of Deep Learning
DOCX
Title_ Deep Learning Explained_ What You Should Be Aware of in Data Science a...
PDF
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
PDF
Top 10 deep learning algorithms you should know in
PPTX
Deep learning in automation industry
PPTX
No BS Guide to Deep Learning in the Enterprise
PPTX
Deep Learning on Qubole Data Platform
PDF
Deep learning: Cutting through the Myths and Hype
PDF
Big Data Malaysia - A Primer on Deep Learning
PPTX
Deep Learning Tutorial
PPTX
Deep learning tutorial 9/2019
PDF
Deep Learning And Business Models (VNITC 2015-09-13)
PDF
Machine Learning with JavaScript
PPT
Notes from 2016 bay area deep learning school
PPTX
10 Things I Wish I Dad Known Before Scaling Deep Learning Solutions
Deep Learning Demystified
Introduction to deep learning
Neural Networks, Spark MLlib, Deep Learning
Phx dl meetup
Introduction to deep learning workshop
Visualization of Deep Learning
Title_ Deep Learning Explained_ What You Should Be Aware of in Data Science a...
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
Top 10 deep learning algorithms you should know in
Deep learning in automation industry
No BS Guide to Deep Learning in the Enterprise
Deep Learning on Qubole Data Platform
Deep learning: Cutting through the Myths and Hype
Big Data Malaysia - A Primer on Deep Learning
Deep Learning Tutorial
Deep learning tutorial 9/2019
Deep Learning And Business Models (VNITC 2015-09-13)
Machine Learning with JavaScript
Notes from 2016 bay area deep learning school
10 Things I Wish I Dad Known Before Scaling Deep Learning Solutions
Ad

More from odsc (20)

PPTX
Machine-In-The-Loop for Knowledge Discovery
PPT
API Driven Development
PPTX
Mobile technology Usage by Humanitarian Programs: A Metadata Analysis
PPT
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
PPTX
Think Breadth, Not Depth
PPT
Data Science at Dow Jones: Monetizing Data, News and Information
PDF
Spark, Python and Parquet
PPTX
Building a Predictive Analytics Solution with Azure ML
PPT
Beyond Names
PPT
How Woman are Conquering the S&P 500
PPTX
Domain Expertise and Unstructured Data
PPTX
Kaggle The Home of Data Science
PPT
Open Source Tools & Data Science Competitions
PPT
Machine Learning with scikit-learn
PPT
Bridging the Gap Between Data and Insight using Open-Source Tools
PDF
Top 10 Signs of the Textpocalypse
PPTX
The Art of Data Science
PPTX
Frontiers of Open Data Science Research
PDF
Agile Data
PPT
Using your powers for good: Data science in the social sector
Machine-In-The-Loop for Knowledge Discovery
API Driven Development
Mobile technology Usage by Humanitarian Programs: A Metadata Analysis
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
Think Breadth, Not Depth
Data Science at Dow Jones: Monetizing Data, News and Information
Spark, Python and Parquet
Building a Predictive Analytics Solution with Azure ML
Beyond Names
How Woman are Conquering the S&P 500
Domain Expertise and Unstructured Data
Kaggle The Home of Data Science
Open Source Tools & Data Science Competitions
Machine Learning with scikit-learn
Bridging the Gap Between Data and Insight using Open-Source Tools
Top 10 Signs of the Textpocalypse
The Art of Data Science
Frontiers of Open Data Science Research
Agile Data
Using your powers for good: Data science in the social sector

Recently uploaded (20)

PPTX
A Presentation on Artificial Intelligence
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
Teaching material agriculture food technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Approach and Philosophy of On baking technology
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
NewMind AI Weekly Chronicles - August'25-Week II
A Presentation on Artificial Intelligence
gpt5_lecture_notes_comprehensive_20250812015547.pdf
cloud_computing_Infrastucture_as_cloud_p
SOPHOS-XG Firewall Administrator PPT.pptx
Group 1 Presentation -Planning and Decision Making .pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Programs and apps: productivity, graphics, security and other tools
A comparative study of natural language inference in Swahili using monolingua...
Digital-Transformation-Roadmap-for-Companies.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Teaching material agriculture food technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Unlocking AI with Model Context Protocol (MCP)
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Approach and Philosophy of On baking technology
Mobile App Security Testing_ A Comprehensive Guide.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Encapsulation_ Review paper, used for researhc scholars
NewMind AI Weekly Chronicles - August'25-Week II

Productionizing Deep Learning From the Ground Up