SlideShare a Scribd company logo
The 3 Key Barriers Keeping Companies
from Acting Upon the Possibilities
That Big Data has to Offer
A little bit about me…
•  Born & raised in Palo Alto, California

•  BA in European History From Columbia University

•  Masters in Marketing & Communication from Sciences
Po Paris

•  Director of Marketing at Dataiku

•  Currently living in Paris
Shift from
Uomo Universale
« A man can do all things if he will. »
-Leon Battista Alberti (1404-72)
Excel at all things:
•  Intellect
•  Mathematics
•  Science
•  Art
•  Social
•  Physical
Shift from
Uomo Universale To Expert
Required Assets:
•  Hacker mindset
•  Logic
•  Statistics
•  Polyglot Programmer
•  Mathematics
•  Algorithmics
•  Engineering
•  Databases
•  Machine Learning
•  Strong creativity
•  Strategical thinker
•  Business understanding
•  Strong communication skills
•  Project management
Data Science Superstar
The 3 Key Barriers Keeping Companies from Deploying Data Products
Are we in some sort
of Renaissance Era 
of Big Data?
If so, what’s next?
Investigation Part 1:
What is a Data Product?
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
Data Products
=

Data + Technology + Data Scientist? + End User
Investigation Part 2:
What goes on behind the scenes?
Building a Data Product
User	
  Interface	
  Stream / Real-time
 Query
 Data
Preprocessing
Building a (Predictive) Data Product
Predicted Data
Historical Data
 Machine Learning
 Model
Preprocessing
Building a (Predictive) Data Product
Predicted Data
Historical Data
 Machine Learning
 Model
Preprocessing
Pre-processing & cleaning data alone can take 
up to 80% of the time spent on a data project
Building a (Predictive) Data Product
Predicted Data
Historical Data
 Machine Learning
 Model
Preprocessing
Building a (Predictive) Data Product
Predicted Data
Historical Data
 Machine Learning
 Model
Preprocessing
Running a (Predictive) Data Product
Deployment
Real-time / Stream
Model
Preprocessing	
  
Predicted Data
Investigation Part 3:
Who does what?
Customer Data!
Machine Data!
System Data!
Graph Data!
Structured Data!
Unstructured Data!
Transactional Data!
Catalogue Data!
Web Log Data!
RAW DATA
System Architect /
IT Team / Data Engineer
RAWDATA
Data Product
= 
Business Incentive
Mathematics / statistics
Data Product
= 
Business Incentive
Data Product
= 
Business Incentive
Mathematics / statistics / Business
« Data Scientist »
Data Product
= 
Business Incentive
« Data Scientist »?
Data Engineers
« Data Scientist »
What I’ve Learned
Fact 1: The Skill Sets Exist
Business Statistics Math
 Data Engineering
Build
 Maintain
Fact 1: The Skill Sets Exist (& your company probably already has them)
Business Analyst
 Data Engineer
Build
 Maintain
Mathematician / Statistician
Fact 2: The Technologies Exist
(and some are free!)
Fact 3: The Data Exists
Why is Production & Industrialisation of
(Predictive) Data Products Important?
Those who win are those who deliver
new data products continuously
Those who win are those
who deliver data products
Data products are supposed to deliver
business value… if you don’t deploy them,
where’s the long term value?
No Industrialisation =
Limited ROI
Those who win are those
who deliver data products
It’s like building your dream house but
never moving in.
No Industrialisation =
Limited ROI
Those who win are those
who deliver data products
It’s like building your dream house but
never moving in. Absurd!
So Why Aren’t More Companies Deploying
(Predictive) Data Products?
A Data Product must be 
business focused (ROI) & mathematically accurate (RELIABLE)
1° Business Analytic & Algorithmic Minds Are Different…
1° Business Analytic & Algorithmic Minds Are Different…
The Business Analysts Brain
Patterns. Patterns. Patterns.
The Algorithmic Brain
Performance. Truth. Anomaly.
1° Business Analytic & Algorithmic Minds Are Different…
…But Your Data Product Needs Both
Patterns, patterns, patterns
Performance, Truth, Anomaly
BUSINESS
KNOWLEDGE
MATHEMATICAL
ACCURACY
1° Business Analytic & Algorithmic Minds Are Different…
MINDSET
•  Project alignment from conception to execution – install team mindset with
common goal – even if the paths to get there are different
FRAMEWORK
•  One common platform with enough flexibility for both mindsets to fully
exercise their individual skill and expertise on a common project
Resolving the Skill Gap
2° So Many Technologies, Languages, and Needs
2° So Many Technologies, Languages, and Needs
R	
  /	
  Python	
  
2° So Many Technologies, Languages, and Needs
Code-­‐free	
  
R	
  /	
  Python	
  
2° So Many Technologies, Languages, and Needs
Code-­‐free	
  
SQL	
  
R	
  /	
  Python	
  
2° So Many Technologies, Languages, and Needs
Code-­‐free	
  
R	
  /	
  Python	
  
Hadoop	
  
SQL	
  
…Or As My Boss Calls It: Technoslavia
Florian Douetteau
Dataiku CEO
…Or As My Boss Calls It: Technoslavia
Florian Douetteau
Dataiku CEO
…Or As My Boss Calls It: Technoslavia
Florian Douetteau
Dataiku CEO
…Or As My Boss Calls It: Technoslavia
Florian Douetteau
Dataiku CEO
OPTION 1:
Enterprise dictatorship
Living in Harmony with Technoslavia
OPTION 2 (my personal favorite):
Accept a polyglot approach
Living in Harmony with Technoslavia
3° Production and Industrialisation is Complex
Data reliability is hard to guarantee 


Technological Complexity
3° Production and Industrialisation is Complex
Technological Complexity
The assumption that production will identically reproduce the analysis phase 
is a hard promise to make 


3° Production and Industrialisation is Complex
Technological Complexity
Monitoring a predictive model’s life cycle is a tedious and continuous task


3° Production and Industrialisation is Complex
Human & Organisational Complexity
BUILDING
 MAINTAINING
3° Production and Industrialisation is Complex
Human & Organisational Complexity
BUILDING
 MAINTAINING
Business Analyst
•  patterns
3° Production and Industrialisation is Complex
Human & Organisational Complexity
BUILDING
 MAINTAINING
Business Analyst / Algorithmic 
•  patterns
•  performance 
•  truth	
  
3° Production and Industrialisation is Complex
Business Analyst / Algorithmic 
•  patterns
•  performance 
•  truth	
  
Data engineers 
•  stability 
•  reliability 
•  cost of ownership	
  
Human & Organisational Complexity
BUILDING
 MAINTAINING
3° Production and Industrialisation is Complex
Business Analyst / Algorithmic 
•  patterns
•  performance 
•  truth	
  
Data engineers 
•  stability 
•  reliability 
•  cost of ownership	
  
Human & Organisational Complexity
BUILDING
 MAINTAINING
3° Production and Industrialisation is Complex
TIP #1: Invest in a platform where development and production are the same

DEVELOPMENT
 TEST
 PRODUCTION
Making Complexity Work for You
TIP #2: Invest in monitoring capabilities & strategies
Making Complexity Work for You
Data Engineers must have visibility and understanding of the key business metrics
TIP #3: Name your Data Engineer(s) Wisely & Define Responsibilities
Making Complexity Work for You
Data Engineers must know if (and when) a model is diverging
TIP #3: Name your Data Engineer(s) Wisely & Define Responsibilities
Making Complexity Work for You
Data Engineer must be responsible for quality of service
TIP #3: Name your Data Engineer(s) Wisely & Define Responsibilities
Making Complexity Work for You
Differentiate builders of new data products 
from those that maintain them. 
TIP #4: It’s Not a One Man Show
Making Complexity Work for You
What To Expect?
From the Renaissance of Big Data…
Where the Data Science Superstar is one person that excels at all skill sets…
…and where actual data products are rarely deployed and maintained
To the Enlightenment of (Big) Data
Where the Data Science Superstar is a team of complimentary skill sets…
… and where data products are designed, built, tested, and deployed by a
team of skilled individuals that each have a distinct role.
TEAM	
  
SPOTLIGHT on the
Data Science Team Manager
The Rise of the
Data Science Team Manager
The Data Science Team Manager must understand the stakeholders’ needs,
translate them into a business need that can be answered with a data
product…
The Rise of the
Data Science Team Manager
The Data Science Team Manager must permit and enable collaboration
between business analysts, statisticians, & engineers… 

Collaboratively 
design, build, & deploy 
Data Products
The Rise of the
Collaborative Data Science Team
…all the while maintaining the distinction between each individual role 
and each individual skill set.
Business
 Mathematics
Data 
Engineering
The Secret to Building and Industrializing Data Products 
is Collaboration.

Today, collaboration between different 
skill sets, technologies, and data is finally possible.
Data Science Studio:
One Platform for Development and Industrialization
Thank You!
Pauline Brown
Dataiku, Director of Marketing
Pauline.brown@dataiku.com
@pauline8brown
www.dataiku.com
The 3 Key Barriers Keeping Companies from Deploying Data Products

More Related Content

KEY
Eat whatever you can with PyBabe
PDF
Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013
PDF
Dataiku - data driven nyc - april 2016 - the solitude of the data team m...
PDF
How to Build Successful Data Team - Dataiku ?
PPTX
Dataiku, Pitch Data Innovation Night, Boston, Septembre 16th
PPTX
Dataiku - From Big Data To Machine Learning
PDF
Dataiku productive application to production - pap is may 2015
PPTX
Online Games Analytics - Data Science for Fun
Eat whatever you can with PyBabe
Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013
Dataiku - data driven nyc - april 2016 - the solitude of the data team m...
How to Build Successful Data Team - Dataiku ?
Dataiku, Pitch Data Innovation Night, Boston, Septembre 16th
Dataiku - From Big Data To Machine Learning
Dataiku productive application to production - pap is may 2015
Online Games Analytics - Data Science for Fun

What's hot (20)

PPTX
Dataiku r users group v2
PDF
Dataiku - google cloud platform roadshow - october 2013
PPTX
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
PDF
The Rise of the DataOps - Dataiku - J On the Beach 2016
PDF
Applied Data Science Course Part 1: Concepts & your first ML model
PPTX
Dataiku data science studio
PPTX
Your Data Nerd Friends Need You!
PPTX
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
PPTX
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
PDF
Building Data Science Teams
 
PDF
Back to Square One: Building a Data Science Team from Scratch
PDF
Data Wrangling and the Art of Big Data Discovery
PDF
Agile, Automated, Aware: How to Model for Success
PPTX
Beyond the Data Lake - Matthias Korn, Technical Consultant at Data Virtuality
PDF
seven steps to dataops @ dataops.rocks conference Oct 2019
PDF
Dataiku Data Science Studio (datasheet)
PPTX
Washington DC DataOps Meetup -- Nov 2019
PDF
Dataiku - Big data paris 2015 - A Hybrid Platform, a Hybrid Team
PDF
Uwe Seiler, Data Architect and Trainer at codecentric AG - "Hadoop & Germany ...
PDF
The paradox of big data - dataiku / oxalide APEROTECH
Dataiku r users group v2
Dataiku - google cloud platform roadshow - october 2013
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
The Rise of the DataOps - Dataiku - J On the Beach 2016
Applied Data Science Course Part 1: Concepts & your first ML model
Dataiku data science studio
Your Data Nerd Friends Need You!
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
Building Data Science Teams
 
Back to Square One: Building a Data Science Team from Scratch
Data Wrangling and the Art of Big Data Discovery
Agile, Automated, Aware: How to Model for Success
Beyond the Data Lake - Matthias Korn, Technical Consultant at Data Virtuality
seven steps to dataops @ dataops.rocks conference Oct 2019
Dataiku Data Science Studio (datasheet)
Washington DC DataOps Meetup -- Nov 2019
Dataiku - Big data paris 2015 - A Hybrid Platform, a Hybrid Team
Uwe Seiler, Data Architect and Trainer at codecentric AG - "Hadoop & Germany ...
The paradox of big data - dataiku / oxalide APEROTECH
Ad

Viewers also liked (16)

PDF
The US Healthcare Industry
PPTX
Driving Real Change In US Healthcare - Webinar slides
PDF
Structure of us healthcare
PPTX
Us health care system final presentation.
PPT
US health care system overview 1
PDF
Health system functions and structure
PDF
NFC - Near Field Communication
PDF
日本企業がとるべき サイバーセキュリティ戦略
PPTX
LEPs in the US Healthcare System
PDF
Market trend nov.2016 cyber security
PDF
情報漏洩リスクに備えるサイバーセキュリティ
PPTX
Innovations in Healthcare - US Chamber of Commerce
PDF
Data Driven Innovation
PDF
"Machine Learning and Internet of Things, the future of medical prevention", ...
PDF
健康医療分野の海外サイバーセキュリティ最新動向
PDF
Dataiku pig - hive - cascading
The US Healthcare Industry
Driving Real Change In US Healthcare - Webinar slides
Structure of us healthcare
Us health care system final presentation.
US health care system overview 1
Health system functions and structure
NFC - Near Field Communication
日本企業がとるべき サイバーセキュリティ戦略
LEPs in the US Healthcare System
Market trend nov.2016 cyber security
情報漏洩リスクに備えるサイバーセキュリティ
Innovations in Healthcare - US Chamber of Commerce
Data Driven Innovation
"Machine Learning and Internet of Things, the future of medical prevention", ...
健康医療分野の海外サイバーセキュリティ最新動向
Dataiku pig - hive - cascading
Ad

Similar to The 3 Key Barriers Keeping Companies from Deploying Data Products (20)

PDF
Executive Briefing: Why managing machines is harder than you think
PDF
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
PDF
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
PPTX
Why Everything You Know About bigdata Is A Lie
PDF
Industrial Data Science
PPTX
Making advanced analytics work for you
PPTX
Data set The Future of Big Data
PDF
Big data Analytics
PPTX
Eureka Analytics Seminar Series - Product Management for Data Science Products
PDF
New Age Skills for Business Analysts
PDF
How to succeed at data without even trying!
PDF
Article Evaluation 4
PDF
ORGANISING YOUR ADVANCED ANALYTICS PROJECTS FOR SUCCESS - Big Data Expo 2019
PPTX
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
PPTX
Borys Pratsiuk "How to be NVidia partner"
PPTX
Machine intelligence data science methodology 060420
PPTX
Making Advanced Analytics Work for You
PDF
Predictive Analytics in Practice - Breakfast Club 11th May 2017
PDF
DS Life Cycle
PDF
DS Life Cycle
Executive Briefing: Why managing machines is harder than you think
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
How to Build a Successful Data Team - Florian Douetteau @ PAPIs Connect
Why Everything You Know About bigdata Is A Lie
Industrial Data Science
Making advanced analytics work for you
Data set The Future of Big Data
Big data Analytics
Eureka Analytics Seminar Series - Product Management for Data Science Products
New Age Skills for Business Analysts
How to succeed at data without even trying!
Article Evaluation 4
ORGANISING YOUR ADVANCED ANALYTICS PROJECTS FOR SUCCESS - Big Data Expo 2019
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
Borys Pratsiuk "How to be NVidia partner"
Machine intelligence data science methodology 060420
Making Advanced Analytics Work for You
Predictive Analytics in Practice - Breakfast Club 11th May 2017
DS Life Cycle
DS Life Cycle

More from Dataiku (14)

PDF
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
PDF
Applied Data Science Course Part 2: the data science workflow and basic model...
PDF
Before Kaggle : from a business goal to a Machine Learning problem
PPTX
04Juin2015_Symposium_Présentation_Coyote_Dataiku
PDF
Coyote & Dataiku - Séminaire Dixit GFII du 13 04-2015
PPTX
OWF 2014 - Take back control of your Web tracking - Dataiku
PDF
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
PPTX
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
PPTX
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
PPTX
Dataiku big data paris - the rise of the hadoop ecosystem
PDF
Dataiku - for Data Geek Paris@Criteo - Close the Data Circle
PPTX
Data Disruption for Insurance - Perspective from th
PPTX
Dataiku Flow and dctc - Berlin Buzzwords
PPTX
Dataiku - Paris JUG 2013 - Hadoop is a batch
Applied Data Science Part 3: Getting dirty; data preparation and feature crea...
Applied Data Science Course Part 2: the data science workflow and basic model...
Before Kaggle : from a business goal to a Machine Learning problem
04Juin2015_Symposium_Présentation_Coyote_Dataiku
Coyote & Dataiku - Séminaire Dixit GFII du 13 04-2015
OWF 2014 - Take back control of your Web tracking - Dataiku
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku big data paris - the rise of the hadoop ecosystem
Dataiku - for Data Geek Paris@Criteo - Close the Data Circle
Data Disruption for Insurance - Perspective from th
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku - Paris JUG 2013 - Hadoop is a batch

Recently uploaded (20)

PPTX
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
PPTX
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
PDF
Swiggy’s Playbook: UX, Logistics & Monetization
PPTX
The Effect of Human Resource Management Practice on Organizational Performanc...
PPTX
Tour Presentation Educational Activity.pptx
PDF
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
PPTX
nose tajweed for the arabic alphabets for the responsive
PPTX
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
PDF
Instagram's Product Secrets Unveiled with this PPT
PPTX
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
PPTX
Impressionism_PostImpressionism_Presentation.pptx
PPTX
Emphasizing It's Not The End 08 06 2025.pptx
PPTX
Primary and secondary sources, and history
DOC
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
PPTX
Understanding-Communication-Berlos-S-M-C-R-Model.pptx
PDF
oil_refinery_presentation_v1 sllfmfls.pdf
PPTX
An Unlikely Response 08 10 2025.pptx
DOCX
"Project Management: Ultimate Guide to Tools, Techniques, and Strategies (2025)"
PPTX
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
PPTX
Relationship Management Presentation In Banking.pptx
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
Swiggy’s Playbook: UX, Logistics & Monetization
The Effect of Human Resource Management Practice on Organizational Performanc...
Tour Presentation Educational Activity.pptx
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
nose tajweed for the arabic alphabets for the responsive
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
Instagram's Product Secrets Unveiled with this PPT
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
Impressionism_PostImpressionism_Presentation.pptx
Emphasizing It's Not The End 08 06 2025.pptx
Primary and secondary sources, and history
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
Understanding-Communication-Berlos-S-M-C-R-Model.pptx
oil_refinery_presentation_v1 sllfmfls.pdf
An Unlikely Response 08 10 2025.pptx
"Project Management: Ultimate Guide to Tools, Techniques, and Strategies (2025)"
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
Relationship Management Presentation In Banking.pptx

The 3 Key Barriers Keeping Companies from Deploying Data Products

  • 1. The 3 Key Barriers Keeping Companies from Acting Upon the Possibilities That Big Data has to Offer
  • 2. A little bit about me… •  Born & raised in Palo Alto, California •  BA in European History From Columbia University •  Masters in Marketing & Communication from Sciences Po Paris •  Director of Marketing at Dataiku •  Currently living in Paris
  • 3. Shift from Uomo Universale « A man can do all things if he will. » -Leon Battista Alberti (1404-72) Excel at all things: •  Intellect •  Mathematics •  Science •  Art •  Social •  Physical
  • 5. Required Assets: •  Hacker mindset •  Logic •  Statistics •  Polyglot Programmer •  Mathematics •  Algorithmics •  Engineering •  Databases •  Machine Learning •  Strong creativity •  Strategical thinker •  Business understanding •  Strong communication skills •  Project management Data Science Superstar
  • 7. Are we in some sort of Renaissance Era of Big Data?
  • 9. Investigation Part 1: What is a Data Product?
  • 21. Data Products = Data + Technology + Data Scientist? + End User
  • 22. Investigation Part 2: What goes on behind the scenes?
  • 23. Building a Data Product User  Interface  Stream / Real-time Query Data Preprocessing
  • 24. Building a (Predictive) Data Product Predicted Data Historical Data Machine Learning Model Preprocessing
  • 25. Building a (Predictive) Data Product Predicted Data Historical Data Machine Learning Model Preprocessing Pre-processing & cleaning data alone can take up to 80% of the time spent on a data project
  • 26. Building a (Predictive) Data Product Predicted Data Historical Data Machine Learning Model Preprocessing
  • 27. Building a (Predictive) Data Product Predicted Data Historical Data Machine Learning Model Preprocessing
  • 28. Running a (Predictive) Data Product Deployment Real-time / Stream Model Preprocessing   Predicted Data
  • 30. Customer Data! Machine Data! System Data! Graph Data! Structured Data! Unstructured Data! Transactional Data! Catalogue Data! Web Log Data! RAW DATA
  • 31. System Architect / IT Team / Data Engineer RAWDATA Data Product = Business Incentive
  • 32. Mathematics / statistics Data Product = Business Incentive
  • 33. Data Product = Business Incentive Mathematics / statistics / Business
  • 38. Fact 1: The Skill Sets Exist Business Statistics Math Data Engineering Build Maintain
  • 39. Fact 1: The Skill Sets Exist (& your company probably already has them) Business Analyst Data Engineer Build Maintain Mathematician / Statistician
  • 40. Fact 2: The Technologies Exist (and some are free!)
  • 41. Fact 3: The Data Exists
  • 42. Why is Production & Industrialisation of (Predictive) Data Products Important?
  • 43. Those who win are those who deliver new data products continuously
  • 44. Those who win are those who deliver data products Data products are supposed to deliver business value… if you don’t deploy them, where’s the long term value?
  • 45. No Industrialisation = Limited ROI Those who win are those who deliver data products It’s like building your dream house but never moving in.
  • 46. No Industrialisation = Limited ROI Those who win are those who deliver data products It’s like building your dream house but never moving in. Absurd!
  • 47. So Why Aren’t More Companies Deploying (Predictive) Data Products?
  • 48. A Data Product must be business focused (ROI) & mathematically accurate (RELIABLE)
  • 49. 1° Business Analytic & Algorithmic Minds Are Different…
  • 50. 1° Business Analytic & Algorithmic Minds Are Different… The Business Analysts Brain Patterns. Patterns. Patterns.
  • 51. The Algorithmic Brain Performance. Truth. Anomaly. 1° Business Analytic & Algorithmic Minds Are Different…
  • 52. …But Your Data Product Needs Both Patterns, patterns, patterns Performance, Truth, Anomaly BUSINESS KNOWLEDGE MATHEMATICAL ACCURACY 1° Business Analytic & Algorithmic Minds Are Different…
  • 53. MINDSET •  Project alignment from conception to execution – install team mindset with common goal – even if the paths to get there are different FRAMEWORK •  One common platform with enough flexibility for both mindsets to fully exercise their individual skill and expertise on a common project Resolving the Skill Gap
  • 54. 2° So Many Technologies, Languages, and Needs
  • 55. 2° So Many Technologies, Languages, and Needs R  /  Python  
  • 56. 2° So Many Technologies, Languages, and Needs Code-­‐free   R  /  Python  
  • 57. 2° So Many Technologies, Languages, and Needs Code-­‐free   SQL   R  /  Python  
  • 58. 2° So Many Technologies, Languages, and Needs Code-­‐free   R  /  Python   Hadoop   SQL  
  • 59. …Or As My Boss Calls It: Technoslavia Florian Douetteau Dataiku CEO
  • 60. …Or As My Boss Calls It: Technoslavia Florian Douetteau Dataiku CEO
  • 61. …Or As My Boss Calls It: Technoslavia Florian Douetteau Dataiku CEO
  • 62. …Or As My Boss Calls It: Technoslavia Florian Douetteau Dataiku CEO
  • 63. OPTION 1: Enterprise dictatorship Living in Harmony with Technoslavia
  • 64. OPTION 2 (my personal favorite): Accept a polyglot approach Living in Harmony with Technoslavia
  • 65. 3° Production and Industrialisation is Complex
  • 66. Data reliability is hard to guarantee Technological Complexity 3° Production and Industrialisation is Complex
  • 67. Technological Complexity The assumption that production will identically reproduce the analysis phase is a hard promise to make 3° Production and Industrialisation is Complex
  • 68. Technological Complexity Monitoring a predictive model’s life cycle is a tedious and continuous task 3° Production and Industrialisation is Complex
  • 69. Human & Organisational Complexity BUILDING MAINTAINING 3° Production and Industrialisation is Complex
  • 70. Human & Organisational Complexity BUILDING MAINTAINING Business Analyst •  patterns 3° Production and Industrialisation is Complex
  • 71. Human & Organisational Complexity BUILDING MAINTAINING Business Analyst / Algorithmic •  patterns •  performance •  truth   3° Production and Industrialisation is Complex
  • 72. Business Analyst / Algorithmic •  patterns •  performance •  truth   Data engineers •  stability •  reliability •  cost of ownership   Human & Organisational Complexity BUILDING MAINTAINING 3° Production and Industrialisation is Complex
  • 73. Business Analyst / Algorithmic •  patterns •  performance •  truth   Data engineers •  stability •  reliability •  cost of ownership   Human & Organisational Complexity BUILDING MAINTAINING 3° Production and Industrialisation is Complex
  • 74. TIP #1: Invest in a platform where development and production are the same DEVELOPMENT TEST PRODUCTION Making Complexity Work for You
  • 75. TIP #2: Invest in monitoring capabilities & strategies Making Complexity Work for You
  • 76. Data Engineers must have visibility and understanding of the key business metrics TIP #3: Name your Data Engineer(s) Wisely & Define Responsibilities Making Complexity Work for You
  • 77. Data Engineers must know if (and when) a model is diverging TIP #3: Name your Data Engineer(s) Wisely & Define Responsibilities Making Complexity Work for You
  • 78. Data Engineer must be responsible for quality of service TIP #3: Name your Data Engineer(s) Wisely & Define Responsibilities Making Complexity Work for You
  • 79. Differentiate builders of new data products from those that maintain them. TIP #4: It’s Not a One Man Show Making Complexity Work for You
  • 81. From the Renaissance of Big Data… Where the Data Science Superstar is one person that excels at all skill sets… …and where actual data products are rarely deployed and maintained
  • 82. To the Enlightenment of (Big) Data Where the Data Science Superstar is a team of complimentary skill sets… … and where data products are designed, built, tested, and deployed by a team of skilled individuals that each have a distinct role. TEAM  
  • 83. SPOTLIGHT on the Data Science Team Manager
  • 84. The Rise of the Data Science Team Manager The Data Science Team Manager must understand the stakeholders’ needs, translate them into a business need that can be answered with a data product…
  • 85. The Rise of the Data Science Team Manager The Data Science Team Manager must permit and enable collaboration between business analysts, statisticians, & engineers… Collaboratively design, build, & deploy Data Products
  • 86. The Rise of the Collaborative Data Science Team …all the while maintaining the distinction between each individual role and each individual skill set. Business Mathematics Data Engineering
  • 87. The Secret to Building and Industrializing Data Products is Collaboration. Today, collaboration between different skill sets, technologies, and data is finally possible.
  • 88. Data Science Studio: One Platform for Development and Industrialization
  • 89. Thank You! Pauline Brown Dataiku, Director of Marketing Pauline.brown@dataiku.com @pauline8brown www.dataiku.com