Driverless AI
Webinar
Arno Candel, PhD
Chief Technology Officer
H2O.ai, Inc.
@arnocandel
86 and growing!
Driverless AI - Intro + Interactive Hands-on Lab
Shortage of Data Scientists
Mistake Correction
Automation needed to avoid human error
The “Secret Sauce” of Driverless AI: Feature Engineering
https://www.youtube.com/watch?v=VMTKcT1iHww
H2O.ai Webinar on Feature Engineering
Hours for Driverless AI — Weeks for grandmasters
single run, fully automated: 6h on 3 GPUs
Driverless AI: 18th place in private LB (out of 2926)
Driverless AI: top 1% in BNP Paribas Kaggle competition
Copyright 2018 H2O.ai Inc. All Rights Reserved.
Driverless AI: top 5% in Amazon Kaggle competition
Driverless AI: 80th place in private LB (out of 1687, top 5%)
With a little bit of stacking: 20th place (top 1.5%)
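The leaderboard claims can be checked with simple arithmetic. The sketch below recomputes the "top N%" figures from the ranks and team counts quoted on the slides; the helper name `top_percent` and the dictionary labels are illustrative and not part of the original deck.

```python
# Leaderboard placements quoted on the slides: (rank, number of teams).
placements = {
    "BNP Paribas, single run": (18, 2926),
    "Amazon, single run": (80, 1687),
    "Amazon, with stacking": (20, 1687),
}


def top_percent(rank: int, teams: int) -> float:
    """Share of the leaderboard at or above `rank`, in percent."""
    return 100.0 * rank / teams


for name, (rank, teams) in placements.items():
    print(f"{name}: rank {rank} of {teams} = top {top_percent(rank, teams):.2f}%")
```

The three results come out at roughly 0.6%, 4.7%, and 1.2%, which agrees with the rounded "top 1%", "top 5%", and "top 1.5%" figures on the slides.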
Driverless AI produces feature engineering pipeline (“more columns”) for downstream use
https://www.youtube.com/watch?v=qtUNyJlAID0&t=11s
https://github.com/kaz-Anova/Competitive_Dai
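To make the "more columns" idea concrete, here is a minimal sketch of a feature engineering step whose output is a wider table that any downstream model could consume. It is not Driverless AI's actual pipeline: the function `add_engineered_columns`, the two transformations (frequency encoding and a pairwise product), and the toy data are all assumptions chosen for illustration.

```python
from collections import Counter


def add_engineered_columns(rows, cat_col, num_cols):
    """Return copies of `rows` with two derived columns appended.

    Hypothetical helper: frequency-encodes one categorical column and
    multiplies two numeric columns. The original rows are not modified.
    """
    counts = Counter(row[cat_col] for row in rows)
    a, b = num_cols
    out = []
    for row in rows:
        new_row = dict(row)
        # Frequency encoding: fraction of rows sharing this category value.
        new_row[f"{cat_col}_freq"] = counts[row[cat_col]] / len(rows)
        # Pairwise interaction of two numeric columns.
        new_row[f"{a}_x_{b}"] = row[a] * row[b]
        out.append(new_row)
    return out


# Toy input table (made-up values).
rows = [
    {"city": "SF", "age": 30, "income": 5.0},
    {"city": "SF", "age": 40, "income": 7.0},
    {"city": "NY", "age": 50, "income": 6.0},
]
wide = add_engineered_columns(rows, "city", ("age", "income"))
print(sorted(wide[0]))  # original column names plus the two new ones
```

The widened rows can then be passed to a separate model or a stacking stage, which is the "downstream use" the slide refers to.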
Automatic Visualization
Scalable outlier detection
(no sampling)
Contains novel statistical algorithms that show only the “relevant” aspects of the data

(coming soon: automated data cleaning)
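As a point of reference for outlier detection over the full data set without sampling, the sketch below applies the classical Tukey fence rule to every value. This is a textbook method substituted for illustration; it is not the novel algorithm the slide mentions, and the function name `tukey_outliers` and the sample data are assumptions.

```python
import statistics


def tukey_outliers(values, k=1.5):
    """Return the values outside the Tukey fences [Q1 - k*IQR, Q3 + k*IQR].

    Every value is inspected; no sampling is performed.
    """
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Toy data with one obviously extreme value.
data = [10, 12, 11, 13, 12, 11, 10, 95]
print(tukey_outliers(data))
```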
Machine Learning Interpretation
Gain confidence in models before deploying them!
MOJO: Pure Java Production Deployment
• feature engineering and model scoring logic
• auto-generated human-readable representation
• minimal platform-independent storage format
• scoring backend can be in any language (C/Java/C#/Go/etc.)
Driverless AI Roadmap
Feature Now Q1 2018 Q2 2018 Q3 2018
AutoDL Feature Engineering Recipe
Supervised Structured Data, CSV, Text
Overfitting and Leakage Prevention
Machine Learning Interpretation
Automatic Visualization
GUI
Python client API
Python scoring API HTTP
Thrift Scoring API
Multi-GPU (shared data)
Scoring MOJO (100% Java or C)
Data connectors: HDFS, SQL
User Management: LDAP, Kerberos
TensorFlow Deep Learning NLP Recipes
Time Series Recipes
Multi-GPU (sharded data) - optimized for DGX Volta
UDR (User-Defined Recipes), Verticals
Multi-Node Multi-GPU - optimized for DGX Volta
Sparkling Water Backend for Driverless AI
http://h2o.ai
Webinars
Hands-on Lab

