Data Science Scrum: What Works and What Doesn’t
Sharing experience with Scrum in data science practice: is it relevant or not?
Topics
Why Scrum & Data Science?
What Works & What Doesn’t?
Data Science Demo
Adi Wijaya
Co-Founder & Data Science Lead
Poke me, and let’s talk about
Typical 2018 Data Science
Common data science project plan that has a high risk of failure
Phases: Requirement Proposal, Business Understanding, Data Preparation, Create Model, Evaluation, Deployment
Durations shown: 2 Weeks, 1 Month, 1 Month, 1 Month, 1 Month
Scrum: a framework for complex adaptive problems
Data Science: unified statistics, technology, data analysis, and business knowledge, used to understand and analyze actual phenomena with data
[Chart: Google Trends history of data science popularity. Google data, worldwide, 2014 - now; timeline ends at "Today 2018" and includes a 2005 marker. Series added one at a time: Big Data, Hadoop, Data Science, Software Development, Scrum.]
SCRUM
What Works and What Doesn’t in Data Science Activity
https://www.scrumguides.org/
3 Pillars of Scrum
Transparency, Inspection, Adaptation
Data Scientists
What my mom thinks I do
What my boss/client thinks I do
What I think I do
What I actually do
Scrum Framework
Roles, Rules, Events, Artifacts
https://www.scrumguides.org/
“A data scientist is that unique blend of skills that can both unlock the insights of data and tell a fantastic story via the data.”
-- DJ Patil --
DJ Patil, formerly of LinkedIn and a former White House data scientist. Together with Jeff H (formerly of Facebook), he coined the term "data scientist" in 2008.
What Is a Data Scientist?
Common 2018 Data Scientists =
Machine Learning Engineers
Adapt!
Roles: Scrum Data Science Team
Team roles: Data Engineer, Data Scientist, Business Analysts, Product Owner, Scrum Master
Groups: Development Team, Business Unit, Business Manager, External Party
Relationship labels in the diagram: Assist, Presentation, Assist
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
SCRUM EVENTS
Kanban Board, Standup Meeting, Sprint Review
Kanban Board
Stand Up Meeting
1. Only Development Team!
2. Less than 15 Minutes!
3. Every day
Sprint Review
1. Involve Business
2. Less than 4 hours
3. Once every 1-2 weeks
DataLabs AGILE ANALYTICS Service
We help companies to:
• Define the business problem
• Provide a data science team
• Do data exploration
• Deliver insights
Diagram labels: Project Goals; Data; Business Problem; Data Science Team (Graph Analytics, Text Analytics, Path Analytics, Machine Learning); Insights, Recommendations and Workflow
Contact for Engagement: adi@datalabs.id
Activity Timeline: One Agile Phase
Week 1: B Gath, Wrangling, Exploration, Presentation & Evaluation
Week 2: Exploration, Presentation & Evaluation
Week 3: Exploration, Presentation & Evaluation
Week 4: Exploration, Final Presentation
Next Agile
In the spirit of true Big Data, we explore your data according to the defined business use cases, with priorities that can be adjusted at each weekly evaluation. We deliver all results and findings when the agreed time is up. The results of one agile phase can be carried into the next agile phases.
© Copyright 2018 DataLabs. All rights reserved. Not to be reproduced or shared without the prior written consent of DataLabs.
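The weekly reprioritization described for one agile phase can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function `run_agile_phase`, the use case names, and the rule "lowest number is explored first" are hypothetical and do not come from the slides.

```python
def run_agile_phase(priorities, weekly_changes, weeks=4):
    """Pick the top-priority use case for each week of one agile phase.

    priorities: dict of use case name -> priority (lower = more important).
    weekly_changes: dict of week number -> priority updates agreed at the
        evaluation that closed the previous week.
    """
    current = dict(priorities)  # copy so the caller's dict is not modified
    schedule = []
    for week in range(1, weeks + 1):
        # Apply whatever the business reprioritized at the last evaluation.
        current.update(weekly_changes.get(week, {}))
        # Lowest number wins; ties are broken alphabetically for determinism.
        focus = min(current, key=lambda name: (current[name], name))
        schedule.append((week, focus))
    return schedule


# Hypothetical use cases; "fraud" is moved to the top at the start of week 3.
priorities = {"churn": 1, "fraud": 2, "segmentation": 3}
weekly_changes = {3: {"fraud": 0}}
schedule = run_agile_phase(priorities, weekly_changes)
for week, focus in schedule:
    print(f"Week {week}: explore {focus}")
```

The point of the sketch is that the plan is recomputed every week from the latest priorities instead of being fixed at the start of the phase.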
Data Science Cycle and Environment
<= 2018
Data Science Team 2018 Life Cycle
Data Engineer (Big Data Environment): create ETL job to extract sample data to CSV
Use FTP or even USB to transfer the data
Data Scientist 1: Jupyter Notebook
Data Scientist 2: RStudio
Jupyter Notebook goes back to the Data Engineer (again, sometimes using USB)
Data Engineer:
• Rewrite notebook to scripts
• Create API with another language
• Deploy
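The first step of this 2018 life cycle, extracting a sample to CSV, might look roughly like the sketch below. It uses only the Python standard library; the function name `sample_rows_to_csv`, the column names, and the sampling rule are assumptions for illustration, not the team's actual ETL job.

```python
import csv
import io
import random


def sample_rows_to_csv(rows, fieldnames, fraction, out, seed=0):
    """Write a random sample of rows to a CSV stream and return the row count.

    rows: iterable of dicts, e.g. records read from the big data environment.
    fraction: probability of keeping each row (0.0 keeps none, 1.0 keeps all).
    out: any writable text stream (a file opened with newline="", or StringIO).
    """
    rng = random.Random(seed)  # seeded so the same sample can be reproduced
    writer = csv.DictWriter(out, fieldnames=fieldnames)
    writer.writeheader()
    kept = 0
    for row in rows:
        if rng.random() < fraction:
            writer.writerow(row)
            kept += 1
    return kept


# Hypothetical records standing in for data from the big data environment.
rows = [{"id": 1, "value": "a"}, {"id": 2, "value": "b"}, {"id": 3, "value": "c"}]
buffer = io.StringIO()
kept = sample_rows_to_csv(rows, ["id", "value"], fraction=1.0, out=buffer)
print(kept, "rows written")
```

Every hand-off after this step works on a copy of the sample rather than the full data, which is the weakness the ideal life cycle addresses with one shared environment.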
Ideal Data Science Life Cycle
Data Engineer (Big Data Environment): Maintain DataLake, Maintain Production Model, Optimize Performance
Data Scientist 1 and Data Scientist 2 (Analytics Environment): Experimentation on Big Data, Create Model, Evaluate & Deploy
✓ One Environment
✓ Self Organizing
✓ Cross-functional
Ideal Data Science Life Cycle, example environment: Cloudera Hadoop as the big data environment and Data Science Workbench as the analytics environment.
I Want to Predict Your Gender
Thanks for attending

