SlideShare a Scribd company logo
Data Science Team
A practice to set up
Omid Mogharian
V0.2.1 - 06.02.2017
… it’s a mistake to treat data science teams like any old product group ... To build
teams that create great data products, you have to find people with the skills and
the curiosity to ask the big questions. You have build cross-disciplinary groups with
people who are comfortable creating together…
DJ Patil, U.S. Chief Data Scientist at White House
Office of Science and Technology Policy
Roles
Machine Learning
Engineer
Data
Engineer/Architect
Data Analyst
Math &
Statistics
Interpretation/
Visualisation
Modeling &
ML
Math &
Statistics
Machine
Learning
Developing
Developing
Infrastructure
Design
Operation
Data Scientist
Core Team Skills
Relations
Data Science Team
Operations/
System
Administration
Sales
PO/
Customer
relation
BI
App
Development
To make it real
Method
CRISP-DM
The Data Science Process
Communications with Customer
How to? lambdaBig data and Fast data
Big Data Pipeline
Job repository
Scheduler/
Runner
Incremental
Runner
Message QueueData Pipe Agent
A practice for continuous analyse
Severing
layer
Application Environments
Source
Simulator
Stage
Production
Sample Data
Source
Connector
Big Data
Continuous Analyse Application*
Continuous Analyse Application*
To bring accuracy
* Whole software with
several components
which are explained in
previous slide
References
● https://dzone.com/articles/lambda-architecture-with-apache-spark
● https://en.wikipedia.org/wiki/Lambda_architecture
● https://www.mapr.com/developercentral/lambda-architecture
● http://www.kdnuggets.com/2015/11/different-data-science-roles-industry.html
● http://www.kdnuggets.com/2016/03/data-science-process-rediscovered.html
● http://www.datascienceassn.org/sites/default/files/Building%20Data%20Scienc
e%20Teams.pdf

More Related Content

PDF
Data science team (new version)
PDF
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
PPTX
Idiots guide to setting up a data science team
PDF
Building a Data Platform Strata SF 2019
PDF
Introduction to Data Science (Data Summit, 2017)
PDF
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
PDF
Building Data Science Teams
 
PDF
Data Science Salon: Building a Data Science Culture
Data science team (new version)
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Idiots guide to setting up a data science team
Building a Data Platform Strata SF 2019
Introduction to Data Science (Data Summit, 2017)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Building Data Science Teams
 
Data Science Salon: Building a Data Science Culture

What's hot (20)

PDF
How to build a data science team 20115.03.13v6
PPTX
Building Data Science Teams: A Moneyball Approach
PDF
The Big Data Dream Team
PPTX
Data scientist the sexiest job of the 21st century by thomas h davenport and ...
PDF
How I Learned to Stop Worrying and Love Linked Data
PPTX
Strata Data Conference 2019 : Scaling Visualization for Big Data in the Cloud
PPTX
New professional careers in data
PPTX
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
PDF
Walmart Big Data Expo
PPTX
Leveraging Data Science in the Automotive Industry
PDF
Evaluation of big data analysis
PPTX
Data Engineering and the Data Science Lifecycle
PPTX
The Five Data Questions
PDF
Back to Square One: Building a Data Science Team from Scratch
PPTX
Moving Data Science from an Event to A Program: Considerations in Creating Su...
PPTX
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
PDF
Data Architecture: OMG It’s Made of People
PDF
Pay no attention to the man behind the curtain - the unseen work behind data ...
PDF
Data science vs. Data scientist by Jothi Periasamy
PDF
How to Build Successful Data Team - Dataiku ?
How to build a data science team 20115.03.13v6
Building Data Science Teams: A Moneyball Approach
The Big Data Dream Team
Data scientist the sexiest job of the 21st century by thomas h davenport and ...
How I Learned to Stop Worrying and Love Linked Data
Strata Data Conference 2019 : Scaling Visualization for Big Data in the Cloud
New professional careers in data
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
Walmart Big Data Expo
Leveraging Data Science in the Automotive Industry
Evaluation of big data analysis
Data Engineering and the Data Science Lifecycle
The Five Data Questions
Back to Square One: Building a Data Science Team from Scratch
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Architecture: OMG It’s Made of People
Pay no attention to the man behind the curtain - the unseen work behind data ...
Data science vs. Data scientist by Jothi Periasamy
How to Build Successful Data Team - Dataiku ?
Ad

Viewers also liked (20)

ODP
From Config Management Sucks to #cfgmgmtlove
PDF
Rootconf
PDF
Mesoscon 2015
PPTX
Transform your Analytics Practice into Insights Practice
PDF
Building Product from ground up using Open Source Technologies
PPTX
Send that (damn) elevator down !
PDF
Introducing ELK
PDF
Experiences in ELK with D3.js for Large Log Analysis and Visualization
ODP
Monitoring with ElasticSearch
PPTX
Elastic Stackにハマった話
PPTX
Monitoring using Open source technologies
PDF
Launching A Management Consulting Practice (2009)
PPTX
The Rise of Real Time
PDF
Keystone - Leverage Big Data 2016
PDF
How Did BuzzFeed Harvest One Million Email Subscribers?
PPTX
Real-Time Log Analysis with Apache Mesos, Kafka and Cassandra
PPTX
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
PPTX
Scaling an ELK stack at bol.com
PDF
Building an IoT Kafka Pipeline in Under 5 Minutes
PPTX
ELK at LinkedIn - Kafka, scaling, lessons learned
From Config Management Sucks to #cfgmgmtlove
Rootconf
Mesoscon 2015
Transform your Analytics Practice into Insights Practice
Building Product from ground up using Open Source Technologies
Send that (damn) elevator down !
Introducing ELK
Experiences in ELK with D3.js for Large Log Analysis and Visualization
Monitoring with ElasticSearch
Elastic Stackにハマった話
Monitoring using Open source technologies
Launching A Management Consulting Practice (2009)
The Rise of Real Time
Keystone - Leverage Big Data 2016
How Did BuzzFeed Harvest One Million Email Subscribers?
Real-Time Log Analysis with Apache Mesos, Kafka and Cassandra
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
Scaling an ELK stack at bol.com
Building an IoT Kafka Pipeline in Under 5 Minutes
ELK at LinkedIn - Kafka, scaling, lessons learned
Ad

Similar to Data science team, a practice to setup (20)

PPTX
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
PPTX
Ch1IntroductiontoDataScience.pptx
PDF
Data engineering design patterns
PDF
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
PDF
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
PPTX
Just ask Watson Seminar
PPTX
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
PPTX
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
PDF
Gse uk-cedrinemadera-2018-shared
PPTX
Meetup Data-science OVH
PPTX
Data Engineering Proposal for Homerunner.pptx
PPTX
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
PDF
Introduction to Data Science: data science process
PPTX
BI & Analytics with Ms Power BI.pptx
PPTX
Data Science as a Service: Intersection of Cloud Computing and Data Science
PPTX
Data Science as a Service: Intersection of Cloud Computing and Data Science
PDF
Introduction to Data Science - Fundamentals
PDF
Visionet Business Intelligence Solutions - Is your Business Intelligence real...
PPT
Qiagram
PDF
Big Data Meetup: Analytical Systems Evolution
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
Ch1IntroductiontoDataScience.pptx
Data engineering design patterns
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Just ask Watson Seminar
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Gse uk-cedrinemadera-2018-shared
Meetup Data-science OVH
Data Engineering Proposal for Homerunner.pptx
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Introduction to Data Science: data science process
BI & Analytics with Ms Power BI.pptx
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
Introduction to Data Science - Fundamentals
Visionet Business Intelligence Solutions - Is your Business Intelligence real...
Qiagram
Big Data Meetup: Analytical Systems Evolution

More from Omid Mogharian (7)

PDF
Privacy in Computer Vision
PDF
The journey to Private AI, where Privacy-Preserving ML meets DLT
PDF
Blockchain, a disappointing tech talk
PDF
How big is big data?
PDF
Distributed File System and Why It Matters.
PDF
Hadoop essential setup
PDF
Python: The Dynamic!
Privacy in Computer Vision
The journey to Private AI, where Privacy-Preserving ML meets DLT
Blockchain, a disappointing tech talk
How big is big data?
Distributed File System and Why It Matters.
Hadoop essential setup
Python: The Dynamic!

Recently uploaded (20)

PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
annual-report-2024-2025 original latest.
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Lecture1 pattern recognition............
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Foundation of Data Science unit number two notes
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
Computer network topology notes for revision
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPT
Quality review (1)_presentation of this 21
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
climate analysis of Dhaka ,Banglades.pptx
Qualitative Qantitative and Mixed Methods.pptx
annual-report-2024-2025 original latest.
ISS -ESG Data flows What is ESG and HowHow
Lecture1 pattern recognition............
Business Ppt On Nestle.pptx huunnnhhgfvu
oil_refinery_comprehensive_20250804084928 (1).pptx
Business Analytics and business intelligence.pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Foundation of Data Science unit number two notes
Miokarditis (Inflamasi pada Otot Jantung)
Computer network topology notes for revision
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Quality review (1)_presentation of this 21
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg

Data science team, a practice to setup