Deep Learning in the Wild
H2O Meetup
Mountain View, 3/11/2015
Arno Candel, H2O.ai
Who am I?
PhD in Computational Physics, 2005, from ETH Zurich, Switzerland
6 years at SLAC - Accelerator Physics Modeling
2 years at Skytree - Machine Learning
15 months at H2O.ai - Machine Learning
15 years in Supercomputing & Modeling
• Named “2014 Big Data All-Star” by Fortune Magazine
• http://www.kdnuggets.com/tag/arno-candel
@ArnoCandel
H2O Deep Learning, @ArnoCandel
Outline
Introduction (5 mins)
Methods & Implementation (5 mins)
Results and Live Demos (20 mins)
MNIST handwritten digits
Higgs boson classification
eBay text classification
h2o-dev Outlook: Flow, Python
Teamwork at H2O.ai
Java, Apache v2 Open-Source
#1 Java Machine Learning on GitHub
Join the community!
H2O: Open-Source (Apache v2)
Predictive Analytics Platform
H2O Architecture - Designed for speed, scale, accuracy & ease of use
Key technical points:
• distributed JVMs + REST API
• no Java GC issues (data in byte[], Double)
• lossless number compression
• Hadoop integration (v1, YARN)
• R package (CRAN)
Pre-built fully featured algos: K-Means, NB, PCA, CoxPH, GLM, RF, GBM, DeepLearning
What is Deep Learning?
Wikipedia: Deep learning is a set of algorithms in machine learning that attempt to model high-level abstractions in data by using architectures composed of multiple non-linear transformations.
Example: Facebook DeepFace
Input: Image
Output: User ID
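The phrase "multiple non-linear transformations" can be made concrete with a minimal sketch. It assumes plain Python lists, a Rectifier (ReLU) non-linearity, and hand-picked weights; the names `relu`, `dense`, and `forward` are illustrative and not part of H2O.

```python
def relu(values):
    """Rectifier non-linearity: negative inputs become zero."""
    return [max(0.0, v) for v in values]


def dense(inputs, weights, biases):
    """Affine transformation: one weighted sum plus bias per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def forward(inputs, layers):
    """Apply each hidden layer (affine + ReLU), then a linear output layer."""
    hidden = inputs
    for weights, biases in layers[:-1]:
        hidden = relu(dense(hidden, weights, biases))
    weights, biases = layers[-1]
    return dense(hidden, weights, biases)


# Hand-picked (hypothetical) weights: two hidden layers, one output unit.
layers = [
    ([[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0]),  # hidden layer 1: 2 units
    ([[1.0, 1.0]], [0.0]),                     # hidden layer 2: 1 unit
    ([[2.0]], [1.0]),                          # linear output layer
]

print(forward([3.0, 1.0], layers))
print(forward([1.0, 3.0], layers))
```

Both inputs produce the same output because the stacked layers compute 2·|x1 - x2| + 1, which a single linear layer cannot represent.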
What is NOT Deep
Linear models are not deep
(by definition)
Neural nets with 1 hidden layer are not deep
(only 1 layer - no feature hierarchy)
SVMs and Kernel methods are not deep
(2 layers: kernel + linear)
Classification trees are not deep
(operate on original input space, no new features generated)
H2O Deep Learning
1970s multi-layer feed-forward Neural Network
(stochastic gradient descent with back-propagation)
+ distributed processing for big data
(fine-grain in-memory MapReduce on distributed data)
+ multi-threaded speedup
(async fork/join worker threads operate at FORTRAN speeds)
+ smart algorithms for fast & accurate results
(automatic standardization, one-hot encoding of categoricals, missing value imputation, weight & bias initialization, adaptive learning rate, momentum, dropout/L1/L2 regularization, grid search, N-fold cross-validation, checkpointing, load balancing, auto-tuning, model averaging, etc.)
= powerful tool for (un)supervised machine learning on real-world data
all 320 cores maxed out
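Three of the preprocessing steps named above (standardization, missing value imputation, one-hot encoding) can be sketched in a few lines. This is a minimal illustration, not H2O's implementation: mean imputation, the population standard deviation, and the helper names `standardize` and `one_hot` are all assumptions.

```python
import math


def standardize(column):
    """Scale a numeric column to zero mean and unit variance.

    Missing values (None) are imputed with the column mean, which becomes
    0.0 after standardization. Mean imputation is an assumption here.
    """
    present = [v for v in column if v is not None]
    mean = sum(present) / len(present)
    variance = sum((v - mean) ** 2 for v in present) / len(present)
    std = math.sqrt(variance) or 1.0  # guard against constant columns
    return [0.0 if v is None else (v - mean) / std for v in column]


def one_hot(column):
    """Expand a categorical column into one 0/1 indicator column per level."""
    levels = sorted(set(column))
    rows = [[1.0 if value == level else 0.0 for level in levels]
            for value in column]
    return levels, rows


print(standardize([1.0, None, 3.0]))
print(one_hot(["b", "a", "b"]))
```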
Adaptive learning rate - ADADELTA (Google)
Automatically set learning rate for each neuron based on its training history
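A minimal sketch of the published ADADELTA update rule, applied to a single weight, shows how the step size follows from the training history. The values of `rho` and `epsilon` are illustrative assumptions, and the function name `adadelta_step` is hypothetical; the sketch tracks history per weight.

```python
import math


def adadelta_step(weight, gradient, state, rho=0.99, epsilon=1e-8):
    """One ADADELTA update for a single weight.

    state holds two running averages: squared gradients and squared updates.
    """
    avg_sq_grad, avg_sq_update = state
    avg_sq_grad = rho * avg_sq_grad + (1.0 - rho) * gradient ** 2
    # Step size is the ratio of the two root-mean-square values.
    update = -(math.sqrt(avg_sq_update + epsilon)
               / math.sqrt(avg_sq_grad + epsilon)) * gradient
    avg_sq_update = rho * avg_sq_update + (1.0 - rho) * update ** 2
    return weight + update, (avg_sq_grad, avg_sq_update)


# Minimize f(w) = w^2 (gradient 2w), starting from w = 1.
weight, state = 1.0, (0.0, 0.0)
for _ in range(50):
    weight, state = adadelta_step(weight, 2.0 * weight, state)
print(weight)
```

No global learning rate appears anywhere: each step is scaled only by the two running averages, which is what makes the rate adaptive.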
Grid Search and Checkpointing
Run a grid search to scan many hyper-parameters, then continue training the most promising model(s)
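The scan-then-continue workflow can be sketched with a toy trainer. Everything here is an assumption made for illustration: the quadratic loss, the grid values, and the `train` function, whose returned dictionary stands in for a model checkpoint. It does not use the H2O API.

```python
import itertools


def train(params, checkpoint=None, epochs=5):
    """Toy trainer: gradient descent on loss(w) = (w - 3)^2 with an L2 term.

    Passing a checkpoint resumes from its weight instead of starting at 0.
    """
    w = 0.0 if checkpoint is None else checkpoint["w"]
    for _ in range(epochs):
        gradient = 2.0 * (w - 3.0) + params["l2"] * w
        w -= params["rate"] * gradient
    return {"w": w, "loss": (w - 3.0) ** 2, "params": params}


# Hypothetical grid: every combination of learning rate and L2 strength.
grid = {"rate": [0.01, 0.1, 0.4], "l2": [0.0, 0.1]}
candidates = [train(dict(zip(grid, combo)))
              for combo in itertools.product(*grid.values())]

# Keep the most promising model and continue training it from its checkpoint.
best = min(candidates, key=lambda model: model["loss"])
resumed = train(best["params"], checkpoint=best, epochs=20)
print(best["params"], best["loss"], resumed["loss"])
```

Short runs across the whole grid are cheap; only the winner receives the long training budget.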
Regularization
L1: penalizes non-zero weights
L2: penalizes large weights
Dropout: randomly ignore certain inputs
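The three regularizers can be written down directly. This is a hedged sketch: the penalty uses the plain sums of |w| and w² (some libraries add a factor of 1/2 to the L2 term), and the dropout shown is the "inverted" variant that rescales surviving inputs, which is an assumption and not necessarily how H2O implements it.

```python
import random


def penalty(weights, l1=0.0, l2=0.0):
    """Regularization term added to the training loss."""
    l1_term = l1 * sum(abs(w) for w in weights)  # grows with any non-zero weight
    l2_term = l2 * sum(w * w for w in weights)   # grows fastest for large weights
    return l1_term + l2_term


def dropout(values, ratio, rng):
    """Zero each value with probability `ratio`; rescale the survivors.

    Rescaling by 1 / (1 - ratio) is the "inverted dropout" variant, chosen
    here for simplicity; it is an assumption, not necessarily H2O's scheme.
    """
    keep = 1.0 - ratio
    return [v / keep if rng.random() < keep else 0.0 for v in values]


print(penalty([1.0, -2.0], l1=0.1, l2=0.01))
print(dropout([1.0, 2.0, 3.0, 4.0], ratio=0.5, rng=random.Random(42)))
```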
Hogwild!: intentional race conditions
Distributed mode: weight averaging
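Weight averaging in distributed mode can be sketched as an element-wise mean over the per-node weight vectors. The node weights below are hypothetical, and the lock-free (Hogwild!) multi-threaded updates within each node are not shown.

```python
def average_weights(node_weights):
    """Element-wise mean of the weight vectors held by each node."""
    count = len(node_weights)
    return [sum(column) / count for column in zip(*node_weights)]


# Hypothetical weight vectors after three nodes trained on their own data shard.
node_weights = [
    [0.9, -0.2, 0.4],
    [1.1, -0.4, 0.2],
    [1.0, -0.3, 0.3],
]
print(average_weights(node_weights))
```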
“Secret” Sauce to Higher Accuracy
MNIST: digits classification
MNIST = Digitized handwritten digits database (Yann LeCun)
Data: 28x28=784 pixels with (gray-scale) values in 0…255
Train: 60,000 rows, 784 integer columns, 10 classes
Test: 10,000 rows, 784 integer columns, 10 classes
Standing world record: Without distortions or convolutions, the best-ever published error rate on test set: 0.83% (Microsoft)
Yann LeCun: “Yet another advice: don't get fooled by people who claim to have a solution to Artificial General Intelligence. Ask them what error rate they get on MNIST or ImageNet.”
H2O Deep Learning beats MNIST
Standard 60k/10k data
No distortions
No convolutions
No unsupervised training
No ensemble
10 hours on 10 16-core nodes
World-record!
0.83% test set error
http://learn.h2o.ai/content/hands-on_training/deep_learning.html
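To put the 0.83% figure in perspective, the sketch below computes a test set error rate and converts the percentage into a count of misclassified digits. The toy labels are hypothetical; only the 0.83% rate and the 10,000-row test set come from the slides.

```python
def error_rate(predicted, actual):
    """Fraction of rows whose predicted class differs from the true class."""
    wrong = sum(1 for p, a in zip(predicted, actual) if p != a)
    return wrong / len(actual)


# Toy labels (hypothetical): one mistake in four predictions.
print(error_rate([7, 2, 1, 0], [7, 2, 1, 4]))

# A 0.83% error on the 10,000-row MNIST test set means 83 misread digits.
print(round(0.0083 * 10_000))
```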
POJO Model Export for Production Scoring
Plain old Java code is auto-generated to take your H2O Deep Learning models into production!
Parallel Scalability
(for 64 epochs on MNIST, with “0.83%” parameters)
[Figure: speedup (axis 0 to 40) versus number of H2O nodes (1, 2, 4, 8, 16, 32, 63); 4 cores per node, 1 epoch per node per MapReduce]
[Figure: training time in minutes (axis 0 to 100) versus number of H2O nodes (1, 2, 4, 8, 16, 32, 63); annotated “2.7 mins”]
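Speedup and parallel efficiency, the quantities behind scalability plots like these, follow from two training times. The timings in this sketch are hypothetical placeholders and are not read off the plots.

```python
def speedup(time_one_node, time_n_nodes):
    """How many times faster the run is on n nodes than on one node."""
    return time_one_node / time_n_nodes


def efficiency(time_one_node, time_n_nodes, nodes):
    """Speedup divided by node count; 1.0 would be perfect linear scaling."""
    return speedup(time_one_node, time_n_nodes) / nodes


# Hypothetical timings in minutes, chosen only to illustrate the formulas.
t1, t8 = 80.0, 12.5
print(speedup(t1, t8))
print(efficiency(t1, t8, nodes=8))
```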
MNIST: Unsupervised Anomaly Detection with Deep Learning (Autoencoder)
[Figure panels: The good, The bad, The ugly]
Download the script and run it yourself!
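Autoencoder-based anomaly detection typically ranks rows by reconstruction error: rows the model reconstructs poorly are the anomalies. The sketch below assumes the reconstructions already exist (training the autoencoder is not shown) and uses hypothetical 3-pixel rows; reading "good", "bad", and "ugly" as low, middle, and high error is an interpretation of the slide.

```python
def reconstruction_error(original, reconstructed):
    """Mean squared error between a row and its autoencoder reconstruction."""
    return (sum((o - r) ** 2 for o, r in zip(original, reconstructed))
            / len(original))


def rank_by_error(rows, reconstructions):
    """Return row indices sorted from best to worst reconstructed."""
    errors = [reconstruction_error(o, r)
              for o, r in zip(rows, reconstructions)]
    return sorted(range(len(rows)), key=lambda i: errors[i])


# Hypothetical 3-pixel "images" and the outputs of some trained autoencoder.
rows = [[0.0, 1.0, 0.0], [1.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
reconstructions = [[0.0, 0.9, 0.1], [0.2, 0.3, 0.9], [0.1, 0.2, 0.8]]
print(rank_by_error(rows, reconstructions))
```

The last index in the ranking is the strongest anomaly candidate.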
Application: Higgs Boson Classification
Higgs vs Background
Large Hadron Collider: Largest experiment of mankind!
$13+ billion, 16.8 miles long, 120 MegaWatts, -456F, 1PB/day, etc.
Higgs boson discovery (July ’12) led to 2013 Nobel prize!
http://arxiv.org/pdf/1402.4735v2.pdf
Images courtesy CERN / LHC
HIGGS UCI Dataset:
21 low-level features AND
7 high-level derived features (physics formulae)
Train: 10M rows, Valid: 500k, Test: 500k rows
Higgs Particle Detection with H2O
Algorithm                      Paper’s* low-level AUC  H2O low-level AUC  H2O all-features AUC  Parameters (not heavily tuned), H2O running on 10 nodes
Generalized Linear Model       -                       0.596              0.684                 default, binomial
Random Forest                  -                       0.764              0.840                 50 trees, max depth 50
Gradient Boosted Trees         0.73                    0.753              0.839                 50 trees, max depth 15
Neural Net 1 layer             0.733                   0.760              0.830                 1x300 Rectifier, 100 epochs
Deep Learning 3 hidden layers  0.836                   0.850              -                     3x1000 Rectifier, L2=1e-5, 40 epochs
Deep Learning 4 hidden layers  0.868                   0.869              -                     4x500 Rectifier, L1=L2=1e-5, 300 epochs
Deep Learning 5 hidden layers  0.880                   0.871              -                     5x500 Rectifier, L1=L2=1e-5
Deep Learning on low-level features alone beats everything else!
Prelim. H2O results compare well with paper’s results* (TMVA & Theano)
*Nature paper: http://arxiv.org/pdf/1402.4735v2.pdf
HIGGS UCI Dataset:
21 low-level features AND
7 high-level derived features
Train: 10M rows, Test: 500k rows
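The AUC values reported for the Higgs models measure how often a classifier scores a signal row above a background row. A minimal sketch of that pairwise definition follows; the labels and scores are hypothetical, and the quadratic loop is for illustration only, since it would be far too slow for 500k test rows.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the positive
    row has the higher score; ties count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))


# Hypothetical labels (1 = Higgs, 0 = background) and model scores.
print(auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

An AUC of 0.5 corresponds to random guessing and 1.0 to perfect separation.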
Text Classification
Goal: Predict the item from seller’s text description
Train: 578,361 rows, 8,647 cols, 467 classes
Test: 64,263 rows, 8,647 cols, 143 classes
Example: “Vintage 18KT gold Rolex 2 Tone in great condition”
Data: Bag of words vector 0,0,1,0,0,0,0,0,1,0,0,0,1,…,0 (the set positions mark the words “vintage”, “gold”, and “condition”)
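The bag-of-words encoding of a seller's description can be sketched as follows. The five-word vocabulary is hypothetical (the slide's data has 8,647 columns), and splitting on whitespace after lowercasing is an assumed tokenization.

```python
def bag_of_words(text, vocabulary):
    """Binary vector: 1 where a vocabulary word occurs in the text, else 0."""
    tokens = set(text.lower().split())
    return [1 if word in tokens else 0 for word in vocabulary]


# Hypothetical five-word vocabulary; the slide's data has 8,647 columns.
vocabulary = ["condition", "gold", "rolex", "silver", "vintage"]
description = "Vintage 18KT gold Rolex 2 Tone in great condition"
print(bag_of_words(description, vocabulary))
```

Words outside the vocabulary (here "18KT", "2", "Tone", "in", "great") are simply dropped, which keeps the number of columns fixed.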
Text Classification
Out-Of-The-Box: 11.6% test set error after 10 epochs!
Predicts the correct class (out of 143) 88.4% of the time!
Note 1: H2O columnar-compressed in-memory store only needs 60 MB to store 5 billion values (dense CSV needs 18 GB)
Note 2: No tuning was done (results are for illustration only)
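The storage figures in Note 1 can be checked with a little arithmetic. The row and column counts and the 60 MB and 18 GB sizes come from the slides; treating MB and GB as decimal units is an assumption.

```python
rows, cols = 578_361, 8_647  # training set shape from the slide
values = rows * cols
print(values)

# Sizes from the slide, assuming decimal units (1 MB = 10**6 bytes).
compressed_bytes = 60 * 10**6
csv_bytes = 18 * 10**9

print(csv_bytes // compressed_bytes)  # compression factor
print(csv_bytes / values)             # bytes per value as dense CSV
print(8 * compressed_bytes / values)  # bits per value in memory
```

The product is about 5.0 billion values, which matches the slide. The compressed store therefore spends roughly a tenth of a bit per value, which is plausible only because the bag-of-words matrix is almost entirely zeros.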
H2O GitBooks
https://leanpub.com/u/h2oai
Re-Live H2O World!
Watch the Videos
http://h2o.ai/h2o-world/
http://learn.h2o.ai
Day 1
• Hands-On Training
• Supervised
• Unsupervised
• Advanced Topics
• Marketing Use Case
• Product Demos
• Hacker-Fest with Cliff Click (CTO, Hotspot)
Day 2
• Speakers from Academia & Industry
• Trevor Hastie (ML)
• John Chambers (S, R)
• Josh Bloch (Java API)
• Many use cases from customers
• 3 Top Kaggle Contestants (Top 10)
• 3 Panel discussions
H2O Kaggle Starter R Scripts
Final ranking: #26 out of 1604
Currently Ongoing Challenge
New h2o-dev: Flow-based GUI
h2o-dev: iPython Notebooks
Sparkling Water: Spark+H2O
Key Take-Aways
H2O is an open source predictive analytics platform for data scientists and business analysts who need scalable and fast machine learning.
H2O Deep Learning is ready to take your advanced analytics to the next level - Try it on your data!
Join our Community and Meetups!
https://github.com/h2oai
h2ostream community forum
www.h2o.ai
@h2oai
Thank you!
