SlideShare a Scribd company logo
Continuous Deployment
for Deep Learning
—
Nick Pentreath
Principal Engineer
@MLnick
About
IBM Developer / © 2019 IBM Corporation
– @MLnick on Twitter & Github
– Principal Engineer, IBM CODAIT (Center for
Open-Source Data & AI Technologies)
– Machine Learning & AI
– Apache Spark committer & PMC
– Author of Machine Learning with Spark
– Various conferences & meetups
2
CODAIT
Improving the Enterprise AI Lifecycle in Open Source
Center for Open Source
Data & AI Technologies
IBM Developer / © 2019 IBM Corporation 3
CODAIT aims to make AI solutions dramatically
easier to create, deploy, and manage in the
enterprise.
We contribute to and advocate for the open-source
technologies that are foundational to IBM’s AI
offerings.
30+ open-source developers!
Agenda
IBM Developer / © 2019 IBM Corporation
– Overview of Continuous Integration &
Deployment
– The Machine Learning Workflow
– How is CI/CD for ML Different & Challenges
– Model Asset Exchange
– Conclusion
4
Continuous Integration
& Deployment
IBM Developer / © 2019 IBM Corporation 5
Testing
IBM Developer / © 2019 IBM Corporation 6
App
Data Store
Public Front-end Services Back-end Services
Service
Remote
Service
Remote
Service
Testing
IBM Developer / © 2019 IBM Corporation 7
App
Data Store
Public Front-end Services Back-end Services
Service
Remote
Service
Remote
Service
Unit testing
Testing
IBM Developer / © 2019 IBM Corporation 8
App
Data Store
Public Front-end Services Back-end Services
Service
Remote
Service
Remote
Service
Integration testing
Testing
IBM Developer / © 2019 IBM Corporation 9
App
Data Store
Public Front-end Services Back-end Services
Service
Remote
Service
Remote
Service
Post-deploy / Acceptance testing
Development Process
IBM Developer / © 2019 IBM Corporation 10
Build Test Merge Deploy
Dependencies
Starting Point
IBM Developer / © 2019 IBM Corporation 11
Build Test Merge
Dependencies
Manual
Automated
Deploy
Continuous Integration
IBM Developer / © 2019 IBM Corporation 12
Build Test Merge
Dependencies
Manual
Automated
Deploy
Continuous Deployment
IBM Developer / © 2019 IBM Corporation 13
Build Test Merge
Dependencies
Manual
Automated
Deploy
Source of changes
IBM Developer / © 2019 IBM Corporation 14
Build Test Merge Deploy
Dependencies
Changes come from:
• Our code
• Internal dependencies
• 3rd party dependencies
The Machine Learning
Workflow
IBM Developer / © 2019 IBM Corporation 15
Perception
IBM Developer / © 2019 IBM Corporation 16
In reality the
workflow spans teams …
IBM Developer / © 2019 IBM Corporation 17
… and tools …
IBM Developer / © 2019 IBM Corporation 18
… and is a small (but critical!)
piece of the puzzle
*Source: Hidden Technical Debt in Machine Learning Systems
IBM Developer / © 2019 IBM Corporation 19
What is a “model”?
IBM Developer / © 2019 IBM Corporation 20
Deep learning pipeline
IBM Developer / © 2019 IBM Corporation 21
beagle: 0.82
basset: 0.09
bluetick: 0.07
...
Input image Image pre-processing Prediction
Decode image
Resize
Normalization
Convert types / format
Inference Post-processing
[0.2, 0.3, … ]
(label, prob)
Sort
Label map
PIL, OpenCV, tf.image,
…
Custom
Python
* Logos trademarks of their respective projects
Pipelines, not Models
– Deploying (and testing) just the model
part of the workflow is not enough
– Entire pipeline must be taken into
account
• Data transforms
• Feature extraction & pre-processing
• DL / ML model
• Prediction transformation
– Even ETL is part of the pipeline!
– Pipelines in frameworks
• scikit-learn
• Spark ML pipelines
• TensorFlow Transform
• pipeliner (R)
IBM Developer / © 2019 IBM Corporation 22
Continuous Integration
for Machine Learning
IBM Developer / © 2019 IBM Corporation 23
Source of changes
IBM Developer / © 2019 IBM Corporation 24
Build Test Merge Deploy
Dependencies
Changes come from:
• Our code
• Internal dependencies
• 3rd party dependencies
• Data
• Model
• Time
Data Model
Data
IBM Developer / © 2019 IBM Corporation
– Data types
• Images, video, audio, raw text,
unstructured
– Size
– Schemas
* Logos trademarks of their respective projects
25
Models
IBM Developer / © 2019 IBM Corporation
– Size of models
– Resource requirements
– Hardware
• CPU, GPU, TPU, Mobile, Edge
– Need to manage and bridge many
different languages, frameworks
– Formats
– State of the art is changing very
rapidly
* Logos trademarks of their respective projects
26
Monitoring & Feedback
over Time
IBM Developer / © 2019 IBM Corporation 27
Monitoring
Performance Business
Monitoring
Software
Traditional software monitoring
Latency, throughput, resource usage, etc
Model performance metrics
Traditional ML evaluation measures
(accuracy, prediction error, AUC etc)
Business metrics
Impact of predictions on business
outcomes
• Additional revenue - e.g. uplift from recommender
• Cost savings – e.g. value of fraud prevented
• Metrics implicitly influencing these – e.g. user
engagement
IBM Developer / © 2019 IBM Corporation
Feedback
Data
Transform
TrainDeploy
Feedback
Adapt
An intelligent system must automatically
learn from & adapt to the world around it
Continual learning
Retraining, online learning,
reinforcement learning
Feedback loops
Explicit: models create or directly
influence their own training data
Implicit: predictions influence behavior
in longer-term or indirect ways
Humans in the loop
IBM Developer / © 2019 IBM Corporation
AI Fairness,
Transparency &
Security
30IBM Developer / © 2019 IBM Corporation
AI Fairness,
Transparency &
Security
31IBM Developer / © 2019 IBM Corporation
https://github.com/IBM/AIX360/
https://github.com/IBM/AIF360
https://github.com/IBM/adversarial-robustness-toolbox
32
Model
Asset
eXchange
(MAX)
ibm.biz/model-exchange
IBM Developer / © 2019 IBM Corporation
What is
MAX?
33
- One place for state-of-art open
source deep learning models
- Wide variety of domains
- Tested code and IP
- Free and open source
- Both trainable and trained versions
IBM Developer / © 2019 IBM Corporation
Deployable asset on MAX
IBM Developer / © 2019 IBM Corporation 34
Data ExpertiseComputeModel
REST API specificationPre Trained ModelI/O processing
Deployable Asset on Model Asset
Exchange
MetadataInferenceSwagger
Specification
Deployable asset on MAX
IBM Developer / © 2019 IBM Corporation 35
MetadataInferenceSwagger
Specification
Deployable asset on MAX
IBM Developer / © 2019 IBM Corporation 36
Hosted resources
IBM Developer / © 2019 IBM Corporation 37
Model
Service
Web App
Kubernetes
Trainable asset on MAX
IBM Developer / © 2019 IBM Corporation 38
Data ExpertiseComputeModel
REST API specificationPre Trained ModelI/O processing
Deployable Asset on Model Asset
Exchange
Trainable asset on MAX
IBM Developer / © 2019 IBM Corporation 39
Data ExpertiseComputeModel
REST API specificationPre Trained ModelI/O processing
Deployable Asset on Model Asset
Exchange
Standardized scripts & wrappers
MAX and CI/CD
IBM Developer / © 2019 IBM Corporation 40
– TravisCI for GitHub repositories
– Jenkins for more complex workflows
• Deploying long-running instances to
Kubernetes
• Periodic jobs
• Trainable model testing (using Watson
Machine Learning service)
Conclusion
– ML Pipelines, not just models
– Need to take into account
• Data
• Models
• Variation over time
• Fairness, explainability,
transparency
– Containers / Kube?
• KubeFlow, MLFlow
• Tekton
IBM Developer / © 2019 IBM Corporation 41
Thank you
IBM Developer / © 2019 IBM Corporation
Sign up for IBM Cloud and try Watson Studio: https://ibm.biz/BdznGk
codait.org
twitter.com/MLnick
github.com/MLnick
developer.ibm.com
42
Model Asset Exchange:
https://ibm.biz/model-exchange
IBM Developer / © 2019 IBM Corporation 43

More Related Content

PDF
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
PDF
Hamburg Data Science Meetup - MLOps with a Feature Store
PDF
Challenges of Operationalising Data Science in Production
PPTX
Feature store: Solving anti-patterns in ML-systems
PDF
“Houston, we have a model...” Introduction to MLOps
PDF
Multi runtime serving pipelines for machine learning
PPTX
Magdalena Stenius: MLOPS Will Change Machine Learning
PDF
Scaling up Deep Learning by Scaling Down
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
Hamburg Data Science Meetup - MLOps with a Feature Store
Challenges of Operationalising Data Science in Production
Feature store: Solving anti-patterns in ML-systems
“Houston, we have a model...” Introduction to MLOps
Multi runtime serving pipelines for machine learning
Magdalena Stenius: MLOPS Will Change Machine Learning
Scaling up Deep Learning by Scaling Down

What's hot (20)

PDF
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
PDF
Serverless machine learning operations
PDF
Metaflow: The ML Infrastructure at Netflix
PPTX
Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...
PPTX
Productionizing Spark ML Pipelines with the Portable Format for Analytics
PPTX
Machine Learning In Production
PDF
Monitoring AI with AI
PPTX
Grokking Techtalk #40: AWS’s philosophy on designing MLOps platform
PDF
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
PDF
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
PDF
Seamless MLOps with Seldon and MLflow
PDF
Machine learning model to production
PPTX
Data Science Salon: A Journey of Deploying a Data Science Engine to Production
PDF
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
PDF
ML Infra for Netflix Recommendations - AI NEXTCon talk
PDF
High Performance Transfer Learning for Classifying Intent of Sales Engagement...
PPTX
Sundar Ranganathan, NetApp + Vinod Iyengar, H2O.ai - Driverless AI integratio...
PPTX
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
PDF
How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
PDF
TensorFlow 16: Building a Data Science Platform
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Serverless machine learning operations
Metaflow: The ML Infrastructure at Netflix
Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...
Productionizing Spark ML Pipelines with the Portable Format for Analytics
Machine Learning In Production
Monitoring AI with AI
Grokking Techtalk #40: AWS’s philosophy on designing MLOps platform
Model Parallelism in Spark ML Cross-Validation with Nick Pentreath and Bryan ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Seamless MLOps with Seldon and MLflow
Machine learning model to production
Data Science Salon: A Journey of Deploying a Data Science Engine to Production
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
ML Infra for Netflix Recommendations - AI NEXTCon talk
High Performance Transfer Learning for Classifying Intent of Sales Engagement...
Sundar Ranganathan, NetApp + Vinod Iyengar, H2O.ai - Driverless AI integratio...
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
TensorFlow 16: Building a Data Science Platform
Ad

Similar to Continuous Deployment for Deep Learning (20)

PDF
IBM Cloud Private and IBM Power Systems: Overview and Real-World Scenarios
PDF
Deploying End-to-End Deep Learning Pipelines with ONNX
PDF
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
PPTX
Scaling up deep learning by scaling down
PDF
Ai pipelines powered by jupyter notebooks
PDF
IBM Connect AD206 IBM Domino XPages – Embrace, Extend, Integrate
PPTX
Inteligencia artificial, open source e IBM Call for Code
PDF
Dev ops for z
PDF
IBM Z for the Digital Enterprise - DevOps for Z
PDF
Open Source AI - News and examples
PPTX
End-to-End Deep Learning Deployment with ONNX
PDF
150330 verse - uffe sorensen posted
PDF
MQLight for WebSphere Integration user group June 2014
PDF
IBM MQ Light Service for Bluemix
PDF
IBM Enterprise Social Solutions on Bluemix (XPages and Connections)
PDF
Optimize your CI/CD with GitLab and AWS
PDF
App Modernization
PDF
Ibm business partner connect 2015 long fong yee v1 (read-only)
PDF
Db2 developer ecosystem
PDF
How to Balance System Speed and Risk for Multi-Platform Innovation
IBM Cloud Private and IBM Power Systems: Overview and Real-World Scenarios
Deploying End-to-End Deep Learning Pipelines with ONNX
From Data to AI - Silicon Valley Open Source projects come to you - Madrid me...
Scaling up deep learning by scaling down
Ai pipelines powered by jupyter notebooks
IBM Connect AD206 IBM Domino XPages – Embrace, Extend, Integrate
Inteligencia artificial, open source e IBM Call for Code
Dev ops for z
IBM Z for the Digital Enterprise - DevOps for Z
Open Source AI - News and examples
End-to-End Deep Learning Deployment with ONNX
150330 verse - uffe sorensen posted
MQLight for WebSphere Integration user group June 2014
IBM MQ Light Service for Bluemix
IBM Enterprise Social Solutions on Bluemix (XPages and Connections)
Optimize your CI/CD with GitLab and AWS
App Modernization
Ibm business partner connect 2015 long fong yee v1 (read-only)
Db2 developer ecosystem
How to Balance System Speed and Risk for Multi-Platform Innovation
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
PPT
Data Lakehouse Symposium | Day 1 | Part 2
PPTX
Data Lakehouse Symposium | Day 2
PPTX
Data Lakehouse Symposium | Day 4
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
PDF
Democratizing Data Quality Through a Centralized Platform
PDF
Learn to Use Databricks for Data Science
PDF
Why APM Is Not the Same As ML Monitoring
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
PDF
Stage Level Scheduling Improving Big Data and AI Integration
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
PDF
Sawtooth Windows for Feature Aggregations
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
PDF
Re-imagine Data Monitoring with whylogs and Spark
PDF
Raven: End-to-end Optimization of ML Prediction Queries
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
PDF
Massive Data Processing in Adobe Using Delta Lake
DW Migration Webinar-March 2022.pptx
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 4
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Democratizing Data Quality Through a Centralized Platform
Learn to Use Databricks for Data Science
Why APM Is Not the Same As ML Monitoring
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Stage Level Scheduling Improving Big Data and AI Integration
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Sawtooth Windows for Feature Aggregations
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Re-imagine Data Monitoring with whylogs and Spark
Raven: End-to-end Optimization of ML Prediction Queries
Processing Large Datasets for ADAS Applications using Apache Spark
Massive Data Processing in Adobe Using Delta Lake

Recently uploaded (20)

PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Computer network topology notes for revision
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Global journeys: estimating international migration
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Fluorescence-microscope_Botany_detailed content
Supervised vs unsupervised machine learning algorithms
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
1_Introduction to advance data techniques.pptx
Introduction-to-Cloud-ComputingFinal.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Computer network topology notes for revision
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Global journeys: estimating international migration
Reliability_Chapter_ presentation 1221.5784
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Major-Components-ofNKJNNKNKNKNKronment.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Fluorescence-microscope_Botany_detailed content

Continuous Deployment for Deep Learning

  • 1. Continuous Deployment for Deep Learning — Nick Pentreath Principal Engineer @MLnick
  • 2. About IBM Developer / © 2019 IBM Corporation – @MLnick on Twitter & Github – Principal Engineer, IBM CODAIT (Center for Open-Source Data & AI Technologies) – Machine Learning & AI – Apache Spark committer & PMC – Author of Machine Learning with Spark – Various conferences & meetups 2
  • 3. CODAIT Improving the Enterprise AI Lifecycle in Open Source Center for Open Source Data & AI Technologies IBM Developer / © 2019 IBM Corporation 3 CODAIT aims to make AI solutions dramatically easier to create, deploy, and manage in the enterprise. We contribute to and advocate for the open-source technologies that are foundational to IBM’s AI offerings. 30+ open-source developers!
  • 4. Agenda IBM Developer / © 2019 IBM Corporation – Overview of Continuous Integration & Deployment – The Machine Learning Workflow – How is CI/CD for ML Different & Challenges – Model Asset Exchange – Conclusion 4
  • 5. Continuous Integration & Deployment IBM Developer / © 2019 IBM Corporation 5
  • 6. Testing IBM Developer / © 2019 IBM Corporation 6 App Data Store Public Front-end Services Back-end Services Service Remote Service Remote Service
  • 7. Testing IBM Developer / © 2019 IBM Corporation 7 App Data Store Public Front-end Services Back-end Services Service Remote Service Remote Service Unit testing
  • 8. Testing IBM Developer / © 2019 IBM Corporation 8 App Data Store Public Front-end Services Back-end Services Service Remote Service Remote Service Integration testing
  • 9. Testing IBM Developer / © 2019 IBM Corporation 9 App Data Store Public Front-end Services Back-end Services Service Remote Service Remote Service Post-deploy / Acceptance testing
  • 10. Development Process IBM Developer / © 2019 IBM Corporation 10 Build Test Merge Deploy Dependencies
  • 11. Starting Point IBM Developer / © 2019 IBM Corporation 11 Build Test Merge Dependencies Manual Automated Deploy
  • 12. Continuous Integration IBM Developer / © 2019 IBM Corporation 12 Build Test Merge Dependencies Manual Automated Deploy
  • 13. Continuous Deployment IBM Developer / © 2019 IBM Corporation 13 Build Test Merge Dependencies Manual Automated Deploy
  • 14. Source of changes IBM Developer / © 2019 IBM Corporation 14 Build Test Merge Deploy Dependencies Changes come from: • Our code • Internal dependencies • 3rd party dependencies
  • 15. The Machine Learning Workflow IBM Developer / © 2019 IBM Corporation 15
  • 16. Perception IBM Developer / © 2019 IBM Corporation 16
  • 17. In reality the workflow spans teams … IBM Developer / © 2019 IBM Corporation 17
  • 18. … and tools … IBM Developer / © 2019 IBM Corporation 18
  • 19. … and is a small (but critical!) piece of the puzzle *Source: Hidden Technical Debt in Machine Learning Systems IBM Developer / © 2019 IBM Corporation 19
  • 20. What is a “model”? IBM Developer / © 2019 IBM Corporation 20
  • 21. Deep learning pipeline IBM Developer / © 2019 IBM Corporation 21 beagle: 0.82 basset: 0.09 bluetick: 0.07 ... Input image Image pre-processing Prediction Decode image Resize Normalization Convert types / format Inference Post-processing [0.2, 0.3, … ] (label, prob) Sort Label map PIL, OpenCV, tf.image, … Custom Python * Logos trademarks of their respective projects
  • 22. Pipelines, not Models – Deploying (and testing) just the model part of the workflow is not enough – Entire pipeline must be taken into account • Data transforms • Feature extraction & pre-processing • DL / ML model • Prediction transformation – Even ETL is part of the pipeline! – Pipelines in frameworks • scikit-learn • Spark ML pipelines • TensorFlow Transform • pipeliner (R) IBM Developer / © 2019 IBM Corporation 22
  • 23. Continuous Integration for Machine Learning IBM Developer / © 2019 IBM Corporation 23
  • 24. Source of changes IBM Developer / © 2019 IBM Corporation 24 Build Test Merge Deploy Dependencies Changes come from: • Our code • Internal dependencies • 3rd party dependencies • Data • Model • Time Data Model
  • 25. Data IBM Developer / © 2019 IBM Corporation – Data types • Images, video, audio, raw text, unstructured – Size – Schemas * Logos trademarks of their respective projects 25
  • 26. Models IBM Developer / © 2019 IBM Corporation – Size of models – Resource requirements – Hardware • CPU, GPU, TPU, Mobile, Edge – Need to manage and bridge many different languages, frameworks – Formats – State of the art is changing very rapidly * Logos trademarks of their respective projects 26
  • 27. Monitoring & Feedback over Time IBM Developer / © 2019 IBM Corporation 27
  • 28. Monitoring Performance Business Monitoring Software Traditional software monitoring Latency, throughput, resource usage, etc Model performance metrics Traditional ML evaluation measures (accuracy, prediction error, AUC etc) Business metrics Impact of predictions on business outcomes • Additional revenue - e.g. uplift from recommender • Cost savings – e.g. value of fraud prevented • Metrics implicitly influencing these – e.g. user engagement IBM Developer / © 2019 IBM Corporation
  • 29. Feedback Data Transform TrainDeploy Feedback Adapt An intelligent system must automatically learn from & adapt to the world around it Continual learning Retraining, online learning, reinforcement learning Feedback loops Explicit: models create or directly influence their own training data Implicit: predictions influence behavior in longer-term or indirect ways Humans in the loop IBM Developer / © 2019 IBM Corporation
  • 30. AI Fairness, Transparency & Security 30IBM Developer / © 2019 IBM Corporation
  • 31. AI Fairness, Transparency & Security 31IBM Developer / © 2019 IBM Corporation https://github.com/IBM/AIX360/ https://github.com/IBM/AIF360 https://github.com/IBM/adversarial-robustness-toolbox
  • 33. What is MAX? 33 - One place for state-of-art open source deep learning models - Wide variety of domains - Tested code and IP - Free and open source - Both trainable and trained versions IBM Developer / © 2019 IBM Corporation
  • 34. Deployable asset on MAX IBM Developer / © 2019 IBM Corporation 34 Data ExpertiseComputeModel REST API specificationPre Trained ModelI/O processing Deployable Asset on Model Asset Exchange MetadataInferenceSwagger Specification
  • 35. Deployable asset on MAX IBM Developer / © 2019 IBM Corporation 35 MetadataInferenceSwagger Specification
  • 36. Deployable asset on MAX IBM Developer / © 2019 IBM Corporation 36
  • 37. Hosted resources IBM Developer / © 2019 IBM Corporation 37 Model Service Web App Kubernetes
  • 38. Trainable asset on MAX IBM Developer / © 2019 IBM Corporation 38 Data ExpertiseComputeModel REST API specificationPre Trained ModelI/O processing Deployable Asset on Model Asset Exchange
  • 39. Trainable asset on MAX IBM Developer / © 2019 IBM Corporation 39 Data ExpertiseComputeModel REST API specificationPre Trained ModelI/O processing Deployable Asset on Model Asset Exchange Standardized scripts & wrappers
  • 40. MAX and CI/CD IBM Developer / © 2019 IBM Corporation 40 – TravisCI for GitHub repositories – Jenkins for more complex workflows • Deploying long-running instances to Kubernetes • Periodic jobs • Trainable model testing (using Watson Machine Learning service)
  • 41. Conclusion – ML Pipelines, not just models – Need to take into account • Data • Models • Variation over time • Fairness, explainability, transparency – Containers / Kube? • KubeFlow, MLFlow • Tekton IBM Developer / © 2019 IBM Corporation 41
  • 42. Thank you IBM Developer / © 2019 IBM Corporation Sign up for IBM Cloud and try Watson Studio: https://ibm.biz/BdznGk codait.org twitter.com/MLnick github.com/MLnick developer.ibm.com 42 Model Asset Exchange: https://ibm.biz/model-exchange
  • 43. IBM Developer / © 2019 IBM Corporation 43