SlideShare a Scribd company logo
Industrial Machine
Learning
Applied Artificial Intelligence in the New Industrial Revolution, 13 April 2017 (SF)
Josh Bloom
@profjsb
COPYRIGHT 2012-2017, WISE.IO INC.
• Brief Background/Introduction: Me & Wise.io
• Industrial Machine Learning (IML) Opportunities
• ML as a Systems Engineering Challenge
• IML Applications at GE
Agenda
Teaching
‣ Python Bootcamps
200+ undergrad/grad
‣ Python for Data
Science
graduate course
Industry
‣ML Applications
Company
Code / Repos
Q4’16
CTO, Co-founder Professor, UC Berkeley
Research
Gordon & Betty
Moore Foundation
Data-Driven Investigator
‣ Automated 

Data-driven
Discovery &
Inference in the
Time Domain
‣300+ refereed
articles
COPYRIGHT 2012-2017, WISE.IO INC.
“Intelligent applications in Production”
Customer Support Product
○Intelligent Routing/Triage
○Response Recommendation
○Auto-Response
○Knowledge-base Deflection
○Federated Search
○Spam Filtering
○Sentiment Prediction
○IoT/proactive support
Enhancing Decisions in Human-centric Workflows
• Currently serving dozens of customers in production
• Our customers: mid-sized, 5k-5M interactions/month,
charged on a per ticket basis
COPYRIGHT 2012-2017, WISE.IO INC.
Wise.io @ GE
Build & deploy SaaS-based production-grade scalable
intelligent IIoT applications for end business users
Leveraging the data, horizontal edge-to-cloud platform (Predix), &
industry relationships already at GE
IIoT: Beyond “Smart” Thermostats, Fitbits, and Self-driving cars…
COPYRIGHT 2012-2017, WISE.IO INC.
Consumer Internet Industrial Internet
Data Management Day’s worth of Twitter: 500 GB
Single flight:
1 TB
Connectivity
Biggest cell phone complaint:
dropped calls
Mission critical, rough & remote
Device
Support
Average wearables lifetime: 6
months
Lifetime of a Turbine:
20+ years
Security Time to Hack most devices: minutes 24/7 Mission Critical
Privacy
Privacy is no longer a “social norm” -
Zuck
HIPAA, ITAR, …
IIoT: The Internet of Really Important Things
Industrial Machine Learning as a Systems Challenge
What are we optimizing for?
Component What
Algorithm/Model
Learning rate, convexity, error
bounds, scaling, …
+ Software/Hardware
Accuracy, Memory usage, Disk
usage, CPU needs, time to learn,
time to predict
+ Project Staff
time to implement, people/
resource costs, reliability,
maintainability, experimentability
+ Consumers
direct value, useability,
explainability, actionability,
security, privacy
+ Society indirect value, ethics
- multi-axis optimizations in a given
component
- highly coupled optimization
considerations between components
- myopic view can be costly further up
the stack
All ML in production is a
Systems Challenge
Copyright 2012-2017, wise.io inc.
10
One ML Algorithmic Trade-Off
High
Low
Low High
Interpretability
Accuracy
Linear/Logistic
Regression
Naive Bayes
Decision Trees
SVMs
Bagging
Boosting
Decision Forests
Neural Nets
Deep Learning
Nearest
Neighbors
Gaussian/
Dirichlet
Processes
Splines
* on real-world data sets
Lasso
Warning
Unscientific &
opinionated!
11
>$50k Prize
<$50k Prize
Netflix
winning
metric
best
benchmark
many teams get within
~few % of optimum
so which is easier to
put into production?
Leaderboard data from Kaggle & Netflix
Optimization Metric
12
“We evaluated some of the new methods
offline but the additional accuracy gains
that we measured did not seem to justify the
engineering effort needed to bring them into
a production environment.”
Xavier Amatriain and Justin Basilico (April 2012)
On the Prize
http://research.google.com/pubs/pub43146.html
• Complex models erode abstraction
boundaries
• Data dependencies cost more than
code dependencies: weak contracts
• System-level Spaghetti
• Changing External World
“It may be surprising to the
academic community to know
that only a fraction of the code
… is actually doing ‘machine
learning’. A mature system
might end up being (at most)
5% machine learning code
and (at least) 95% glue code.”
see also, Bottou (Facebook) ICML
Prediction API
in-houseas a service
experimental/sandbox
production/scale ready
watsonAPI
Prediction API
in-houseas a service
experimental/sandbox
production/scale ready
watsonAPI
time & cost to implement
cost to maintain
COPYRIGHT 2012-2017, WISE.IO INC.
Wise Architecture: Leveraging Cloud-based Services
Services Oriented, Leveraging PaaS Managed Services
Microscaling: Dockerized
templated workflows for CPU/
GPU build/predict end-points
Macro scaling: compute clusters
load-balance
RESTful contracts between
services
Build on the AWS stack;
Instantiated with terraform
End-user Transactional
Systems
Embedded
UI


Wise App SDK Use Case Specific Middleware
Auth
Monitoring/
Alerting
Admin
Dashboard
Reporting

Wise Factory
Wise Template (Learn/Prediction/Feedback)
Transaction
DB
Model Storage / Management
FrontendMiddlewareMLbackend
Example
Industrial Machine Learning
Application
Inline Pipeline
Inspection
Technology ▶ Action
+
seam detected
Crack
Terabytes of
Inspection
data
Aggregate
historic data
to enable
learning from
experience
Advanced machine
learning generates
more accurate
insights
Surfaced to analysts
to improve
performance, drive
consistency, &
repeatability
Our Goal: drive Zero-Pipeline-Failure
Industrial Machine Learning (at GE)
Industrial Machine Learning (at GE)
The Power of a
1% Gain in Efficiency
$27B
$30B
$63B
$66B
$90B
Rail
Aviation
Healthcare
Power
Oil & Gas
Source: “Industrial Internet Pushing Boundaries of Minds & Machines” GE, 2012
Industrial Machine
Learning
Applied Artificial Intelligence in the New Industrial Revolution, 13 April 2017 (SF)
Josh Bloom
@profjsb
Thanks!
(and yes, we’re hiring…)

More Related Content

PDF
Industrial Machine Learning (SIGKDD17)
PDF
Industrial Machine Learning
PDF
Counter Intuitive Machine Learning for the Industrial Internet of Things
PPTX
Industrial training ppt
PDF
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
PPT
Data Mining and Knowledge Discovery in Business Databases
PPTX
PDF
Architecting AI Applications
Industrial Machine Learning (SIGKDD17)
Industrial Machine Learning
Counter Intuitive Machine Learning for the Industrial Internet of Things
Industrial training ppt
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
Data Mining and Knowledge Discovery in Business Databases
Architecting AI Applications

What's hot (20)

PDF
Practical Challenges ML Workflows
PDF
AI for Science
PPTX
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
PDF
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
PDF
Machine Learning and Internet of Things
PDF
Data Science versus Artificial Intelligence: a useful distinction
PDF
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
PPTX
IoT 2014 Value Creation Workshop: SDIL
PDF
DN18 | From Counting to Connecting: A Networked and Data-Driven Approach to M...
PDF
Data Science for Business Managers - Trends and Evolutions
PDF
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
PPTX
Data science applications and usecases
PDF
Getting Ready For 3rd Generation Platform
PDF
An empirical investigation on big data analytics (BDA) and innovation perform...
PDF
MLSEV Virtual. ML: Business Perspective
PPTX
Big Data Helsinki v 3 | "Federated Learning and Privacy-preserving AI" - Oguz...
PDF
Vertex perspectives artificial intelligence
PPTX
Ai open powermeetupmarch25th
 
PPTX
Data Science & Artificial Intelligence for ALL
PDF
DN18 | Applied Machine Learning in Cybersecurity: Detect malicious DGA Domain...
Practical Challenges ML Workflows
AI for Science
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Machine Learning and Internet of Things
Data Science versus Artificial Intelligence: a useful distinction
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
IoT 2014 Value Creation Workshop: SDIL
DN18 | From Counting to Connecting: A Networked and Data-Driven Approach to M...
Data Science for Business Managers - Trends and Evolutions
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
Data science applications and usecases
Getting Ready For 3rd Generation Platform
An empirical investigation on big data analytics (BDA) and innovation perform...
MLSEV Virtual. ML: Business Perspective
Big Data Helsinki v 3 | "Federated Learning and Privacy-preserving AI" - Oguz...
Vertex perspectives artificial intelligence
Ai open powermeetupmarch25th
 
Data Science & Artificial Intelligence for ALL
DN18 | Applied Machine Learning in Cybersecurity: Detect malicious DGA Domain...
Ad

Similar to Industrial Machine Learning (at GE) (20)

PPTX
2019 BioIt World - Post cloud legacy edition
PDF
Building Data Science Ecosystems for Smart Cities and Smart Commerce
PDF
Towards the Industrialization of AI
PDF
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
PDF
Where the Warehouse Ends: A New Age of Information Access
PDF
The Evolving Role of the Data Engineer - Whitepaper | Qubole
PDF
Using Algorithmia to leverage AI and Machine Learning APIs
PDF
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
PDF
Future of Data Strategy (ASEAN)
PPT
Practical Artificial Intelligence: Deep Learning Beyond Cats and Cars
PPTX
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
PDF
Neo4j on Microsoft Azure
PDF
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
PPTX
Is IIOT Right for You?
PDF
The coding portion of Data Science
PDF
Towards A Reference Architecture for BIG DATA.pdf
PDF
"Deep Learning Beyond Cats and Cars: Developing a Real-life DNN-based Embedde...
PDF
What is the future of data strategy?
PDF
Understanding the New World of Cognitive Computing
PDF
Architecting for Data Science
2019 BioIt World - Post cloud legacy edition
Building Data Science Ecosystems for Smart Cities and Smart Commerce
Towards the Industrialization of AI
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
Where the Warehouse Ends: A New Age of Information Access
The Evolving Role of the Data Engineer - Whitepaper | Qubole
Using Algorithmia to leverage AI and Machine Learning APIs
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Future of Data Strategy (ASEAN)
Practical Artificial Intelligence: Deep Learning Beyond Cats and Cars
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Neo4j on Microsoft Azure
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
Is IIOT Right for You?
The coding portion of Data Science
Towards A Reference Architecture for BIG DATA.pdf
"Deep Learning Beyond Cats and Cars: Developing a Real-life DNN-based Embedde...
What is the future of data strategy?
Understanding the New World of Cognitive Computing
Architecting for Data Science
Ad

More from Joshua Bloom (8)

PDF
Autoencoding RNN for inference on unevenly sampled time-series data
PDF
Data Science Education: Needs & Opportunities in Astronomy
PDF
PyData 2015 Keynote: "A Systems View of Machine Learning"
PDF
Computational Training for Domain Scientists & Data Literacy
PDF
Large-Scale Inference in Time Domain Astrophysics
PDF
Data Science at Berkeley
PDF
Computational Training and Data Literacy for Domain Scientists
KEY
Joshua Bloom: Machine Learning and Classification in the Synoptic Survey Era
Autoencoding RNN for inference on unevenly sampled time-series data
Data Science Education: Needs & Opportunities in Astronomy
PyData 2015 Keynote: "A Systems View of Machine Learning"
Computational Training for Domain Scientists & Data Literacy
Large-Scale Inference in Time Domain Astrophysics
Data Science at Berkeley
Computational Training and Data Literacy for Domain Scientists
Joshua Bloom: Machine Learning and Classification in the Synoptic Survey Era

Recently uploaded (20)

PDF
Foundation of Data Science unit number two notes
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Launch Your Data Science Career in Kochi – 2025
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Mega Projects Data Mega Projects Data
Foundation of Data Science unit number two notes
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Database Infoormation System (DBIS).pptx
Business Acumen Training GuidePresentation.pptx
Reliability_Chapter_ presentation 1221.5784
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Introduction to Knowledge Engineering Part 1
IB Computer Science - Internal Assessment.pptx
.pdf is not working space design for the following data for the following dat...
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Quality review (1)_presentation of this 21
Data_Analytics_and_PowerBI_Presentation.pptx
Launch Your Data Science Career in Kochi – 2025
STUDY DESIGN details- Lt Col Maksud (21).pptx
Mega Projects Data Mega Projects Data

Industrial Machine Learning (at GE)

  • 1. Industrial Machine Learning Applied Artificial Intelligence in the New Industrial Revolution, 13 April 2017 (SF) Josh Bloom @profjsb
  • 2. COPYRIGHT 2012-2017, WISE.IO INC. • Brief Background/Introduction: Me & Wise.io • Industrial Machine Learning (IML) Opportunities • ML as a Systems Engineering Challenge • IML Applications at GE Agenda
  • 3. Teaching ‣ Python Bootcamps 200+ undergrad/grad ‣ Python for Data Science graduate course Industry ‣ML Applications Company Code / Repos Q4’16 CTO, Co-founder Professor, UC Berkeley Research Gordon & Betty Moore Foundation Data-Driven Investigator ‣ Automated 
 Data-driven Discovery & Inference in the Time Domain ‣300+ refereed articles
  • 4. COPYRIGHT 2012-2017, WISE.IO INC. “Intelligent applications in Production” Customer Support Product ○Intelligent Routing/Triage ○Response Recommendation ○Auto-Response ○Knowledge-base Deflection ○Federated Search ○Spam Filtering ○Sentiment Prediction ○IoT/proactive support Enhancing Decisions in Human-centric Workflows • Currently serving dozens of customers in production • Our customers: mid-sized, 5k-5M interactions/month, charged on a per ticket basis
  • 5. COPYRIGHT 2012-2017, WISE.IO INC. Wise.io @ GE Build & deploy SaaS-based production-grade scalable intelligent IIoT applications for end business users Leveraging the data, horizontal edge-to-cloud platform (Predix), & industry relationships already at GE
  • 6. IIoT: Beyond “Smart” Thermostats, Fitbits, and Self-driving cars…
  • 7. COPYRIGHT 2012-2017, WISE.IO INC. Consumer Internet Industrial Internet Data Management Day’s worth of Twitter: 500 GB Single flight: 1 TB Connectivity Biggest cell phone complaint: dropped calls Mission critical, rough & remote Device Support Average wearables lifetime: 6 months Lifetime of a Turbine: 20+ years Security Time to Hack most devices: minutes 24/7 Mission Critical Privacy Privacy is no longer a “social norm” - Zuck HIPAA, ITAR, … IIoT: The Internet of Really Important Things
  • 8. Industrial Machine Learning as a Systems Challenge
  • 9. What are we optimizing for? Component What Algorithm/Model Learning rate, convexity, error bounds, scaling, … + Software/Hardware Accuracy, Memory usage, Disk usage, CPU needs, time to learn, time to predict + Project Staff time to implement, people/ resource costs, reliability, maintainability, experimentability + Consumers direct value, useability, explainability, actionability, security, privacy + Society indirect value, ethics - multi-axis optimizations in a given component - highly coupled optimization considerations between components - myopic view can be costly further up the stack All ML in production is a Systems Challenge
  • 10. Copyright 2012-2017, wise.io inc. 10 One ML Algorithmic Trade-Off High Low Low High Interpretability Accuracy Linear/Logistic Regression Naive Bayes Decision Trees SVMs Bagging Boosting Decision Forests Neural Nets Deep Learning Nearest Neighbors Gaussian/ Dirichlet Processes Splines * on real-world data sets Lasso Warning Unscientific & opinionated!
  • 11. 11 >$50k Prize <$50k Prize Netflix winning metric best benchmark many teams get within ~few % of optimum so which is easier to put into production? Leaderboard data from Kaggle & Netflix Optimization Metric
  • 12. 12 “We evaluated some of the new methods offline but the additional accuracy gains that we measured did not seem to justify the engineering effort needed to bring them into a production environment.” Xavier Amatriain and Justin Basilico (April 2012) On the Prize
  • 13. http://research.google.com/pubs/pub43146.html • Complex models erode abstraction boundaries • Data dependencies cost more than code dependencies: weak contracts • System-level Spaghetti • Changing External World “It may be surprising to the academic community to know that only a fraction of the code … is actually doing ‘machine learning’. A mature system might end up being (at most) 5% machine learning code and (at least) 95% glue code.” see also, Bottou (Facebook) ICML
  • 14. Prediction API in-houseas a service experimental/sandbox production/scale ready watsonAPI
  • 15. Prediction API in-houseas a service experimental/sandbox production/scale ready watsonAPI time & cost to implement cost to maintain
  • 16. COPYRIGHT 2012-2017, WISE.IO INC. Wise Architecture: Leveraging Cloud-based Services Services Oriented, Leveraging PaaS Managed Services Microscaling: Dockerized templated workflows for CPU/ GPU build/predict end-points Macro scaling: compute clusters load-balance RESTful contracts between services Build on the AWS stack; Instantiated with terraform End-user Transactional Systems Embedded UI 
 Wise App SDK Use Case Specific Middleware Auth Monitoring/ Alerting Admin Dashboard Reporting 
Wise Factory Wise Template (Learn/Prediction/Feedback) Transaction DB Model Storage / Management FrontendMiddlewareMLbackend
  • 19. + seam detected Crack Terabytes of Inspection data Aggregate historic data to enable learning from experience Advanced machine learning generates more accurate insights Surfaced to analysts to improve performance, drive consistency, & repeatability Our Goal: drive Zero-Pipeline-Failure
  • 22. The Power of a 1% Gain in Efficiency $27B $30B $63B $66B $90B Rail Aviation Healthcare Power Oil & Gas Source: “Industrial Internet Pushing Boundaries of Minds & Machines” GE, 2012
  • 23. Industrial Machine Learning Applied Artificial Intelligence in the New Industrial Revolution, 13 April 2017 (SF) Josh Bloom @profjsb Thanks! (and yes, we’re hiring…)