Industrializing data science: a view from the trenches
Data science projects without a clear
industrialization path are just expensive math
The data science product life cycle
IDEA → EXPERIMENT → INDUSTRIALIZE → PRODUCT
How to overcome the barrier
Key roles for successful data science
DATA SCIENTIST, DATA ENGINEER, DATA ARCHITECT, DATA TRANSLATOR, BUSINESS
The anatomy of a data science product
[Diagram components: DATA, MODEL TRAINING, MODEL (X, y), MODEL SERVING, MONITORING]
[Diagram roles: DATA SCIENTIST, DATA TRANSLATOR, DATA ENGINEER, DATA ARCHITECT, BUSINESS]
INSIGHT, ACTION, VALUE (€)
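As a rough illustration of how the components named on this slide (model training, model serving, monitoring) can fit together, here is a minimal sketch. It is not taken from the talk: the toy `MeanModel`, the `Monitor` class and the numbers are all assumptions chosen to keep the example self-contained.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class MeanModel:
    """Toy stand-in for a real model: always predicts the training mean."""
    value: float = 0.0

    def predict(self, x: float) -> float:
        return self.value


def train(targets: list[float]) -> MeanModel:
    # MODEL TRAINING: fit the model on historical data.
    return MeanModel(value=mean(targets))


def serve(model: MeanModel, x: float) -> float:
    # MODEL SERVING: answer a single prediction request.
    return model.predict(x)


@dataclass
class Monitor:
    # MONITORING: store absolute errors once the true outcome is known.
    errors: list[float] = field(default_factory=list)

    def record(self, predicted: float, actual: float) -> None:
        self.errors.append(abs(predicted - actual))

    def mean_absolute_error(self) -> float:
        return mean(self.errors)


model = train([10.0, 12.0, 14.0])
monitor = Monitor()
prediction = serve(model, x=1.0)
monitor.record(prediction, actual=13.0)
```

In a real product each of these pieces would typically be owned and operated separately, which is why the slide attaches different roles to them.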
From proof-of-concept to product
IDEA → EXPERIMENT → INDUSTRIALIZE → PRODUCT
FLEXIBILITY, RELIABILITY
3 challenges: BUSINESS, TEAMS, INFRASTRUCTURE
There is a difference between
interest and commitment
When you’re interested in doing something,
you do it only when it’s convenient.
When you’re committed to something,
you accept no excuses, only results.
~ Ken Blanchard
Get business committed and not just interested
The challenge: PUSH
[Diagram: DATA SCIENCE TEAM pushes towards BUSINESS; resources?]
The solution: create business PULL
[Diagram: BUSINESS, PRODUCT MANAGER / OWNER, DATA SCIENCE TEAM]
BUSINESS CASE, PROJECT, TARGETS, BUDGET, EXPERTISE
The solution: asking the right questions
DATA SCIENCE TEAM
1. Who are our end users?
2. Are we solving their problem?
3. Do they want our solution?
BUSINESS
1. Is the solution aligned with our strategy?
2. Is there a clear business case?
3. Do we have budget?
You build it, you run it
Work in multidisciplinary product teams
The challenge: hand-overs
[Diagram: EXPERIMENT TEAM hands over to BUSINESS]
[Diagram: EXPERIMENT TEAM hands over to INDUSTRIALIZATION TEAM, which hands over to BUSINESS]
The solution: product teams
[Diagram: PRODUCT TEAM and BUSINESS, with CONTINUOUS DELIVERY and CONTINUOUS FEEDBACK]
The solution: product teams with an evolving composition
[Diagram: PRODUCT TEAM and BUSINESS, from IDEA to PRODUCT]
The solution: product teams with the ability to operate and improve
[Diagram: PRODUCT TEAM]
And they lived happily ever after
Separately
Aim for a modular architecture
The challenge: embedding the product
[Diagram: DATA, MODEL SERVING, DATA ARCHITECT, BUSINESS]
The solution: Model-as-a-Service
[Diagram: SOURCES, DATA INTEGRATION LAYER, MODEL SERVING, CONSUMERS, BUSINESS]
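One way to read Model-as-a-Service is that consumers only ever see a stable request/response contract, never the model internals. The sketch below illustrates that idea under stated assumptions: the JSON field names, the `model_version` value and the linear `predict` function are hypothetical placeholders, not something the talk specifies.

```python
import json


def predict(features: dict) -> float:
    # Placeholder model (assumption): a fixed linear score on one feature.
    return 2.0 * features["x"] + 1.0


def handle_request(body: str) -> str:
    """Service boundary: JSON in, JSON out. Consumers depend only on this."""
    try:
        features = json.loads(body)["features"]
        result = {"prediction": predict(features), "model_version": "0.1"}
    except (KeyError, TypeError, json.JSONDecodeError) as exc:
        # Malformed input yields an error payload instead of a crash.
        result = {"error": f"bad request: {exc!r}"}
    return json.dumps(result)


response = json.loads(handle_request('{"features": {"x": 3}}'))
bad_response = json.loads(handle_request("{}"))
```

Because the model sits behind `handle_request`, it can be retrained or replaced without the consumers changing, as long as the contract stays the same. In practice the function would be exposed through a web framework or message queue of the team's choice.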
A data science product pipeline
[Diagram: from IDEA to PRODUCT]
1. Agree on the life-cycle stages of a data science product
2. Install stage gates with measurable criteria
3. Establish and assign responsibilities at each stage
4. Align technological roadmap
5. Execute and evaluate
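The steps above can be sketched in code as a list of stage gates, each with an owner and measurable exit criteria. This is only an illustration: the stage names follow the life cycle shown earlier in the deck, but the metric names, thresholds and owners are hypothetical and would have to be agreed on by each organization.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Metrics = Dict[str, float]


@dataclass
class Gate:
    stage: str   # life-cycle stage whose exit this gate guards
    owner: str   # role responsible at this stage
    criteria: Dict[str, Callable[[float], bool]]  # metric name -> pass check

    def failures(self, metrics: Metrics) -> List[str]:
        # A missing metric counts as a failure: criteria must be measured.
        return [name for name, ok in self.criteria.items()
                if name not in metrics or not ok(metrics[name])]


# Hypothetical gates and thresholds, for illustration only.
GATES = [
    Gate("idea", "business", {"expected_value_eur": lambda v: v >= 100_000}),
    Gate("experiment", "data scientist", {"holdout_auc": lambda v: v >= 0.75}),
    Gate("industrialize", "product team", {"p95_latency_ms": lambda v: v <= 200}),
]


def current_stage(metrics: Metrics) -> str:
    """Return the first stage whose gate is not yet passed, else 'product'."""
    for gate in GATES:
        if gate.failures(metrics):
            return gate.stage
    return "product"


stage = current_stage({"expected_value_eur": 250_000, "holdout_auc": 0.81})
```

Here the project has passed the idea and experiment gates but has no latency measurement yet, so it is held at the industrialize stage until that criterion is met.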
Key take-aways
Get business committed and not just interested
Work in multidisciplinary product teams
Aim for a modular architecture
Instate a data science product pipeline
+31 (0) 168 479294
Coltbaan 4C, Nieuwegein
@bigdatarep
www.bigdatarepublic.nl
/company/bigdata-republic
info@bigdatarepublic.nl
DATA SCIENCE | BIG DATA ANALYTICS | BIG DATA ARCHITECTURES
