SlideShare a Scribd company logo
The Role of the
DevOps in the
Data Analytics
Teams
J ON THE BEACH
05/21/16
MORPHEDWITH
DEEP LEARNING™
TYPICAL OPSGUY
(source: Reddit)
TYPICAL YOUNGDATA SCIENTIST
(source: Common Sense)
My initial interests
Type Systems Automated Proving Abstract Program Interpretation Functional Programming Garbage Collection
and Vms
Graph Analytics Chess IA Natural Language Processing 80% Emacs /20% VIM
So to sum it up …
I (USED TO?)
TO BE A BIG NERD
Collaboration
CLICKERS CODERS
Software is a Human Problem
I ended up building
A collaborative software
For data science....
DEV OPS
&&
DATA
Let’s get back to the (brief) history of DevOps
Agile Conference, 2008
Scrum, and Agile
in an operational context
He	!	We	should	have	
our	own	velocity	in	
Belgium
10 deploysper day : Dev and Op
Operation at Flickr
O’Reilly Velocity, June 2009Patrick Dubois
2007
Dev
Ops
QA
DevOpsDays
Ghent, October 2009
DevOps
DevOps is the practice of
operations and development
engineers participating together
in the entire service lifecycle,
from design through the
development
process to production support.
DevOps is also characterized
by operations staff making
use many of the same
techniques as developers for
their systems work.
Invite Ops to the Dev Meeting
Oh. And let them SPEAK
Ops should know how to code
Let’s take an example: John devops from 2009
Learnt Python the Hard Way
Startedwith Puppet 1.0
Used EC2 before ELB and EBS !
Hegelian perspective
Conflict and Frustration
Concept
Combination
Catharsis
Create Culture
Share
Create Tools
Dev
+
Ops
There’s been op associated to data for a while ?
It’s called Business Intelligence !
History of Data Analytics (Oversimplified)
2013 2014 2015 2016 2017 2018
Moving to a world of automated decision making
DATA
FOR MORE INSIGHTS
DATA
FOR AUTOMATED DECISIONS
The Age Of Distributed Intelligence
Global,	Personalised	
and	Real	Time	Data	
Driven	Services
Data, Analytics and Data Science
Conflict and Frustration
Concept
Combination
Catharsis
Create Culture
Share
Create Tools
Data
+
Science
Welcome to Technoslavia !
Classic Business Intelligence Team Organization
Business Leader
Data Consumer
Line-of-business
Data Consumer Business Project
Sponsor
BI Solution Architect
Model Designer
ETL Developer
Dashboard / Report Designer
Specs
Dim
Big Boss
Data Science Team Organization
Business Leader
Data Consumer
Line-of-business
Data Consumer
Business Project
Sponsor
Data Engineer
Data Analyst
System Engineer /
Data Architect
Business
Needs
Data Scientist
IT
Constraints
I.T.
Is there room for a new role ?
Data
Plumberer
Data
Engineer
Data
Scientist
Data
Waiter
Data
Cleaner
Data
Analyst
REAL
JOB
DREAM
JOB
DevOps For Data?
Imagine
a company building
a new ”smart car” app: AutoFine™
”Revolutionary Collaborative network that check the quality of your driving and punish
You with virtual fines if you’re a bad driver”
Imagine
a company building
a new ”smart car” service AutoFine™
10 TB of Data
Every Month
Hive / Spark /
Python
10 Different
PredictiveModels
Real-Time API
/ Workflow
????
??
??
OPERATIONS : Whose is responsible for …
Check that the newly
trained model perform as
expected
Check that the product catalog
and the websitetags remain
consistent
Check that the Hadoop cluster scales
as expected and as enough
bandwidthto handlethe workload
Test the performance for
the real-time API
Monitor the performanceof
the model and decide to
rollback / maintain/ rollout
The Rise of the DataOps - Dataiku - J On the Beach 2016
DATA OPS
As a
Philosophy
X OPS PHILOSOPHY
Highly
consensual
Highly
controversial
Create an API culture
Do not share
o Random Piece of Code
o Flat File
o Email
Do share
ü Reproductible documentedworkflows
ü Clean, documentedAPIs
Defensive Data
Programming
•Software has errors.
•You are not your software, yet
you are are responsible for the
errors.
•You can never remove the
errors, only reduce their
probability.
Defensive Data Programming
•Handle the case when one of the input file is empty
•Handle the case when a new value appear
•Handle the case when two columns become completely
correlated
•Handle the case when a column is 16k long
•Etc.. Etc. etc…
Monitoring : the alerts for people who love it
• Performance ….
• Time Spent …
• Number of Errors …
Monitoring : Business Informal Monitoring
• % Opening
• Market Spent
• Exception User Events …
Resource Allocation
I’ve got this strange
Error ”OutOfMemory” . Do you know what it is
?
Why is the Hadoop Cluster going slower than
my laptop ?
The Philosophy of pre-allocating
more resources than necessary
Get to the latest package culture …
Data Scientist
I need the latest version of scikit
And networkX ….
And coud you repackage that
To enable TensorFlow optimizations ?
System Administrator
…..
The culture of containers
Developers’ Sandbox
DATA OPS
As a
Job Title
Job Title : a matter of name, $$ and social ladder
Data scientist Data Ops
Developer
Statistician
Full Stack Developer
Sys Admin
DevOps
Job Role : A matter of Do or Don’t
DO DON’T
Things you really want to do Things you really don’t want to get into
FIGHT THE
TOY PLATFORM ANTI-PATTERN
Test and Invest in Infrastructure == Skilled People
or
Go For Cloud / Packaged Infrastructure
Your	Brand	New	Hadoop	Cluster
is	perceived	as	slow,	not	so	used	
and	not	reliable
FIGHT THE
TECHNO MISMATCH ANTI-PATTERN
Assume Being Polyglot
or
Be a Dictator
VS
VS
The	Python
Clan
The	R
Tribe
The	Old	Elephant
Fraternity
The	New	Elephant
Club
GETTINGDATA POLITICS
> DATA NOT
AVAILABLE
GETTINGDATA POLITICS
THE	FOX
Hunt for Big Problem!
Convince the CEO that you can
Solve a Business Critical problem
And use it as an excuse to get all
The data you want !
THE	SPIDER
Create Network !
Create a set of trackers or
Addictive Data Collection
internally
To get Data on your side !
PREDICTIVE ANALYTICS DEPLOYMENT STRATEGY
Website	2000’	winners	
Companies	that	were	able	to	release fast	
"Artificial	Intelligence	with		Data	for	
Internet	of	Things"	2010’	winners	
Companies	able	to	put	intelligence	in	production
?
Design a way to put “PREDITICTIVE MODELS”
IN PRODUCTION
OWN ANONYMISATION / PRIVACY
/ DATA SECURITY WITH PARTNERS ISSUES
Technical Feasibility ? What can or cannot be done ?
Let’s Wrap IT Up !
A Company Building a GPS powered automated car fine system
10 TB of Data
Every Month
Hive / Spark /
Python
10 Different
PredictiveModels
Real-Time API
/ Workflow
Robust
Workflow
With
Data Quality
Checks
Functional
Monitoring
By Business
People
through
Slack and
Dashboards
Monitoring
for the API
Feature
Engineering
Pipeline in
Python
But you where do you stand ?
???? ???? ???? ?????
What's your roll-back strategy like?
What kind of multi-variatetesting or strategies do
you havein place for predictivemodels?
How do you manage the robustness of your data flow productionscripts?
How can businesspeople monitor the
performance of the application?
http://bit.ly/production-survey
Food		for	thoughts
www.dataiku.com/blog
THANK	YOU	!
http://bit.ly/production-survey http://bit.ly/production-survey

More Related Content

PPTX
data-analytics-strategy-ebook.pptx
PDF
DAS Slides: Master Data Management — Aligning Data, Process, and Governance
PDF
Dataiku - data driven nyc - april 2016 - the solitude of the data team m...
PPTX
Talend Data Quality
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
PDF
Improving Data Literacy Around Data Architecture
PDF
Future of Data Engineering
PDF
Enterprise Architecture vs. Data Architecture
data-analytics-strategy-ebook.pptx
DAS Slides: Master Data Management — Aligning Data, Process, and Governance
Dataiku - data driven nyc - april 2016 - the solitude of the data team m...
Talend Data Quality
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Improving Data Literacy Around Data Architecture
Future of Data Engineering
Enterprise Architecture vs. Data Architecture

What's hot (20)

PPTX
Demystifying data engineering
PPTX
Data as a Product by Wayne Eckerson
PDF
Modern Data architecture Design
PDF
AI: Built to Scale
PPTX
HDInsight for Architects
PDF
Collibra - Forrester Presentation : Data Governance 2.0
PDF
2022 Trends in Enterprise Analytics
PDF
Data Governance Best Practices
PPTX
Microservices Architecture - Bangkok 2018
PDF
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
PDF
PPTX
Databricks for Dummies
PDF
Data lineage and observability with Marquez - subsurface 2020
PDF
Data Catalogs Are the Answer – What is the Question?
PDF
Introducing Databricks Delta
PDF
RWDG Slides: Data Governance Roles and Responsibilities
PDF
Dagster - DataOps and MLOps for Machine Learning Engineers.pdf
PDF
Data Mesh Part 4 Monolith to Mesh
PDF
Webinar Data Mesh - Part 3
PDF
Taking Charge
Demystifying data engineering
Data as a Product by Wayne Eckerson
Modern Data architecture Design
AI: Built to Scale
HDInsight for Architects
Collibra - Forrester Presentation : Data Governance 2.0
2022 Trends in Enterprise Analytics
Data Governance Best Practices
Microservices Architecture - Bangkok 2018
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Databricks for Dummies
Data lineage and observability with Marquez - subsurface 2020
Data Catalogs Are the Answer – What is the Question?
Introducing Databricks Delta
RWDG Slides: Data Governance Roles and Responsibilities
Dagster - DataOps and MLOps for Machine Learning Engineers.pdf
Data Mesh Part 4 Monolith to Mesh
Webinar Data Mesh - Part 3
Taking Charge
Ad

Similar to The Rise of the DataOps - Dataiku - J On the Beach 2016 (20)

PDF
DevOps Spain 2019. Olivier Perard-Oracle
PDF
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
PDF
From Lab to Factory: Creating value with data
PDF
Introdution to Dataops and AIOps (or MLOps)
PPTX
Comcast Labs Connect - PHLAI Conference Philadelphia 2018
PPTX
Hiring for data roles - Adwait Bhave (ML Engineer and Data Scientist at Druva
PDF
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
PDF
How to become a data scientist
PDF
Challenges of Operationalising Data Science in Production
PDF
The 3 Key Barriers Keeping Companies from Deploying Data Products
PDF
Lessons Learned from Hiring and Retaining Data Practitioners
PDF
iTrain Malaysia: Data Science by Tarun Sukhani
PPTX
Big Data and the Art of Data Science
PDF
Decoding Data Science
PPTX
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
PDF
ORGANISING YOUR ADVANCED ANALYTICS PROJECTS FOR SUCCESS - Big Data Expo 2019
PPTX
Data summit connect fall 2020 - rise of data ops
PPTX
DataOps: Nine steps to transform your data science impact Strata London May 18
PDF
DevOps is to Infrastructure as Code, as DataOps is to...?
PDF
Data Science and Culture
DevOps Spain 2019. Olivier Perard-Oracle
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
From Lab to Factory: Creating value with data
Introdution to Dataops and AIOps (or MLOps)
Comcast Labs Connect - PHLAI Conference Philadelphia 2018
Hiring for data roles - Adwait Bhave (ML Engineer and Data Scientist at Druva
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
How to become a data scientist
Challenges of Operationalising Data Science in Production
The 3 Key Barriers Keeping Companies from Deploying Data Products
Lessons Learned from Hiring and Retaining Data Practitioners
iTrain Malaysia: Data Science by Tarun Sukhani
Big Data and the Art of Data Science
Decoding Data Science
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
ORGANISING YOUR ADVANCED ANALYTICS PROJECTS FOR SUCCESS - Big Data Expo 2019
Data summit connect fall 2020 - rise of data ops
DataOps: Nine steps to transform your data science impact Strata London May 18
DevOps is to Infrastructure as Code, as DataOps is to...?
Data Science and Culture
Ad

More from Dataiku (20)

PDF
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
PDF
Applied Data Science Course Part 2: the data science workflow and basic model...
PDF
Applied Data Science Course Part 1: Concepts & your first ML model
PPTX
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
PDF
The US Healthcare Industry
PDF
How to Build Successful Data Team - Dataiku ?
PDF
Before Kaggle : from a business goal to a Machine Learning problem
PPTX
04Juin2015_Symposium_Présentation_Coyote_Dataiku
PDF
Dataiku productive application to production - pap is may 2015
PDF
Coyote & Dataiku - Séminaire Dixit GFII du 13 04-2015
PDF
Dataiku - Big data paris 2015 - A Hybrid Platform, a Hybrid Team
PDF
The paradox of big data - dataiku / oxalide APEROTECH
PPTX
OWF 2014 - Take back control of your Web tracking - Dataiku
PDF
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
PPTX
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
PPTX
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
PPTX
Dataiku big data paris - the rise of the hadoop ecosystem
PPTX
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
PPTX
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
PDF
Dataiku - for Data Geek Paris@Criteo - Close the Data Circle
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
Applied Data Science Course Part 2: the data science workflow and basic model...
Applied Data Science Course Part 1: Concepts & your first ML model
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
The US Healthcare Industry
How to Build Successful Data Team - Dataiku ?
Before Kaggle : from a business goal to a Machine Learning problem
04Juin2015_Symposium_Présentation_Coyote_Dataiku
Dataiku productive application to production - pap is may 2015
Coyote & Dataiku - Séminaire Dixit GFII du 13 04-2015
Dataiku - Big data paris 2015 - A Hybrid Platform, a Hybrid Team
The paradox of big data - dataiku / oxalide APEROTECH
OWF 2014 - Take back control of your Web tracking - Dataiku
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku big data paris - the rise of the hadoop ecosystem
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
Dataiku - for Data Geek Paris@Criteo - Close the Data Circle

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Cloud computing and distributed systems.
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Approach and Philosophy of On baking technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Network Security Unit 5.pdf for BCA BBA.
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Review of recent advances in non-invasive hemoglobin estimation
Cloud computing and distributed systems.
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
CIFDAQ's Market Insight: SEC Turns Pro Crypto
“AI and Expert System Decision Support & Business Intelligence Systems”
20250228 LYD VKU AI Blended-Learning.pptx
NewMind AI Monthly Chronicles - July 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Approach and Philosophy of On baking technology
Understanding_Digital_Forensics_Presentation.pptx

The Rise of the DataOps - Dataiku - J On the Beach 2016

  • 1. The Role of the DevOps in the Data Analytics Teams J ON THE BEACH 05/21/16 MORPHEDWITH DEEP LEARNING™ TYPICAL OPSGUY (source: Reddit) TYPICAL YOUNGDATA SCIENTIST (source: Common Sense)
  • 2. My initial interests Type Systems Automated Proving Abstract Program Interpretation Functional Programming Garbage Collection and Vms Graph Analytics Chess IA Natural Language Processing 80% Emacs /20% VIM
  • 3. So to sum it up … I (USED TO?) TO BE A BIG NERD
  • 4. Collaboration CLICKERS CODERS Software is a Human Problem I ended up building A collaborative software For data science....
  • 6. Let’s get back to the (brief) history of DevOps Agile Conference, 2008 Scrum, and Agile in an operational context He ! We should have our own velocity in Belgium 10 deploysper day : Dev and Op Operation at Flickr O’Reilly Velocity, June 2009Patrick Dubois 2007 Dev Ops QA DevOpsDays Ghent, October 2009
  • 7. DevOps DevOps is the practice of operations and development engineers participating together in the entire service lifecycle, from design through the development process to production support. DevOps is also characterized by operations staff making use many of the same techniques as developers for their systems work. Invite Ops to the Dev Meeting Oh. And let them SPEAK Ops should know how to code
  • 8. Let’s take an example: John devops from 2009 Learnt Python the Hard Way Startedwith Puppet 1.0 Used EC2 before ELB and EBS !
  • 9. Hegelian perspective Conflict and Frustration Concept Combination Catharsis Create Culture Share Create Tools Dev + Ops
  • 10. There’s been op associated to data for a while ? It’s called Business Intelligence !
  • 11. History of Data Analytics (Oversimplified) 2013 2014 2015 2016 2017 2018 Moving to a world of automated decision making DATA FOR MORE INSIGHTS DATA FOR AUTOMATED DECISIONS
  • 12. The Age Of Distributed Intelligence Global, Personalised and Real Time Data Driven Services
  • 13. Data, Analytics and Data Science Conflict and Frustration Concept Combination Catharsis Create Culture Share Create Tools Data + Science
  • 15. Classic Business Intelligence Team Organization Business Leader Data Consumer Line-of-business Data Consumer Business Project Sponsor BI Solution Architect Model Designer ETL Developer Dashboard / Report Designer Specs Dim Big Boss
  • 16. Data Science Team Organization Business Leader Data Consumer Line-of-business Data Consumer Business Project Sponsor Data Engineer Data Analyst System Engineer / Data Architect Business Needs Data Scientist IT Constraints I.T.
  • 17. Is there room for a new role ? Data Plumberer Data Engineer Data Scientist Data Waiter Data Cleaner Data Analyst REAL JOB DREAM JOB DevOps For Data?
  • 18. Imagine a company building a new ”smart car” app: AutoFine™ ”Revolutionary Collaborative network that check the quality of your driving and punish You with virtual fines if you’re a bad driver”
  • 19. Imagine a company building a new ”smart car” service AutoFine™ 10 TB of Data Every Month Hive / Spark / Python 10 Different PredictiveModels Real-Time API / Workflow
  • 20. ???? ?? ?? OPERATIONS : Whose is responsible for … Check that the newly trained model perform as expected Check that the product catalog and the websitetags remain consistent Check that the Hadoop cluster scales as expected and as enough bandwidthto handlethe workload Test the performance for the real-time API Monitor the performanceof the model and decide to rollback / maintain/ rollout
  • 24. Create an API culture Do not share o Random Piece of Code o Flat File o Email Do share ü Reproductible documentedworkflows ü Clean, documentedAPIs
  • 25. Defensive Data Programming •Software has errors. •You are not your software, yet you are are responsible for the errors. •You can never remove the errors, only reduce their probability.
  • 26. Defensive Data Programming •Handle the case when one of the input file is empty •Handle the case when a new value appear •Handle the case when two columns become completely correlated •Handle the case when a column is 16k long •Etc.. Etc. etc…
  • 27. Monitoring : the alerts for people who love it • Performance …. • Time Spent … • Number of Errors …
  • 28. Monitoring : Business Informal Monitoring • % Opening • Market Spent • Exception User Events …
  • 29. Resource Allocation I’ve got this strange Error ”OutOfMemory” . Do you know what it is ? Why is the Hadoop Cluster going slower than my laptop ?
  • 30. The Philosophy of pre-allocating more resources than necessary
  • 31. Get to the latest package culture … Data Scientist I need the latest version of scikit And networkX …. And coud you repackage that To enable TensorFlow optimizations ? System Administrator …..
  • 32. The culture of containers Developers’ Sandbox
  • 34. Job Title : a matter of name, $$ and social ladder Data scientist Data Ops Developer Statistician Full Stack Developer Sys Admin DevOps
  • 35. Job Role : A matter of Do or Don’t DO DON’T Things you really want to do Things you really don’t want to get into
  • 36. FIGHT THE TOY PLATFORM ANTI-PATTERN Test and Invest in Infrastructure == Skilled People or Go For Cloud / Packaged Infrastructure Your Brand New Hadoop Cluster is perceived as slow, not so used and not reliable
  • 37. FIGHT THE TECHNO MISMATCH ANTI-PATTERN Assume Being Polyglot or Be a Dictator VS VS The Python Clan The R Tribe The Old Elephant Fraternity The New Elephant Club
  • 39. GETTINGDATA POLITICS THE FOX Hunt for Big Problem! Convince the CEO that you can Solve a Business Critical problem And use it as an excuse to get all The data you want ! THE SPIDER Create Network ! Create a set of trackers or Addictive Data Collection internally To get Data on your side !
  • 40. PREDICTIVE ANALYTICS DEPLOYMENT STRATEGY Website 2000’ winners Companies that were able to release fast "Artificial Intelligence with Data for Internet of Things" 2010’ winners Companies able to put intelligence in production ? Design a way to put “PREDITICTIVE MODELS” IN PRODUCTION
  • 41. OWN ANONYMISATION / PRIVACY / DATA SECURITY WITH PARTNERS ISSUES Technical Feasibility ? What can or cannot be done ?
  • 42. Let’s Wrap IT Up ! A Company Building a GPS powered automated car fine system 10 TB of Data Every Month Hive / Spark / Python 10 Different PredictiveModels Real-Time API / Workflow Robust Workflow With Data Quality Checks Functional Monitoring By Business People through Slack and Dashboards Monitoring for the API Feature Engineering Pipeline in Python
  • 43. But you where do you stand ? ???? ???? ???? ????? What's your roll-back strategy like? What kind of multi-variatetesting or strategies do you havein place for predictivemodels? How do you manage the robustness of your data flow productionscripts? How can businesspeople monitor the performance of the application?