SlideShare a Scribd company logo
Data + Data Scientists ≠ Money
Dr. David Hoyle
My background
PRODUCING DATA SCIENCE
• 20 yrs in academia
• dunnhumby
• dunnhumby
APPLYING DATA SCIENCE
• Lloyds Banking Group
• AutoTrader UK
• InfinityWorks
The challenges in applying Data Science are very different
Data
Science
Doesn’t
Work
How did we get here?
Your Company Inc.
Cultural and
organizational
challenges
are always
harder than
technical
challenges
• Which parts of the business?
• How should we organize?
• How should we work?
• How should we communicate?
• What support do we need?
• What data should we use?
• SVM vs Logistic Regression?
Where do
companies need
Data Science?
Data touchpoints
Bad guys Account
Manager
Marketing
Developers OEMs
Private seller
£, €, $
Finance
Car Dealer
Consumer
External Data
Internal Data
£ Trade£ Retail
‘Why?’ is a
powerful Data
Science tool How will you consume the outputs?
‘We need a neural network’
‘We want to predict if users will
click this link’
‘Not clicking indicates low user
engagement’
Why?
Why?
‘We can alter the content in
session if engagement is low’
Can you respond to the neural
network output fast enough?
‘Hmmm… No.’
Build cross-functional teams
Data Scientist ≠ Data Engineer
Data
Science Data Engineering
Product
10
Get close to the
business Analytics
Team
Product
Area C
Analyst
Product
Area A
Analyst
Product
Area B
Analyst
Business
Area 2
Analyst
Business
Area 1
Analyst
Data Scientists
+
Data Analysts
Does Agile
always
work for
Data
Science?
All parts of Data Science have
outputs
INFORMATION
PRESENTATION
ALGORITHM
DEVELOPMENT
Easier to communicate outputs
Easier to communicate progress
Harder to communicate outputs
Harder to communicate progress
Always
communicate what
the outputs will be
Understand
business problem
Map to
appropriate
abstraction
Mathematical
statement of
abstraction
Identify type of
mathematical
model required
Identify & explore
potential data
sources
Build, validate, &
test model,
e.g. CRISP-DM
Productionize
model
Deploy
production model
artefacts
Consume model
outputs
Monitor
production model
Re-build
production model Improve model
Data
Science
Data
Engineering
Data
Science
Data
Engineering
Data
Engineering
Data
Engineering
Data
Science
Data
Science
Understand &
conceptualize
the problem
Understand
resources
available & build
model
Incorporate
model into
business
process
Monitor &
improve
The Data Science innovation lifecycle is longer than you think
Data & Compute should
be close together
Operational
Operational
+
Data Warehouse
SQL, BI
Not all data is valuable
Either
Your data is valuable to
you – e.g. helps improve
business processes
Or
Your data is valuable to
someone else – e.g.
gives a market wide view
To make Data Science pay you need to
1. Work on the projects with direct P&L impact
2. …..by asking the right business questions up-front
3. …..using teams that have the technical right skills
4. …..and understand the business challenges
5. …..using Agile methodologies where appropriate
6. …..always communicating what you are doing and why
7. …..with the right tools and on the right data
Data and data scientists are not equal to money   david hoyle

More Related Content

PDF
Playing Nice in the Product Playground #StrataHadoop
PPTX
Playing Nice in the Product Playground
PDF
BA and Beyond 19 Andrej Guštin - Mirror mirror on the wall Who's the wisest o...
PDF
BA and Beyond 20 - Antonio Gonzalez Sanchis - Add some RICE to your organisation
PDF
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
PDF
Making Big Data Projects Successful - Data Science Pop-up Seattle
PDF
Run the good race with Collaborative innovation
PDF
Sketching out a cognitive masterpiece
Playing Nice in the Product Playground #StrataHadoop
Playing Nice in the Product Playground
BA and Beyond 19 Andrej Guštin - Mirror mirror on the wall Who's the wisest o...
BA and Beyond 20 - Antonio Gonzalez Sanchis - Add some RICE to your organisation
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Making Big Data Projects Successful - Data Science Pop-up Seattle
Run the good race with Collaborative innovation
Sketching out a cognitive masterpiece

What's hot (19)

PPTX
Idiots guide to setting up a data science team
PPTX
AI as a platform
PDF
BA and Beyond 19 - Adrian Reed - Don't bring me solutions Bring me problems
PPTX
Using Data To Tranform Your Business - Marketing Business
PPTX
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
PDF
7 Dimensions of Agile Analytics by Ken Collier
PDF
1000 track 1 groves_using our laptop
PPTX
CASE STUDY SEW WHAT? Inc
PDF
BA and Beyond 19 - Lynda Girvan - User story workshop
PDF
Data Strategy - Enabling the Data-Guided Enterprise
PDF
Dave Elliman - Applying Continuous Intelligence ThoughtWorks Live UK 2018
PDF
BA and Beyond 19 Sponsor spotlight - The Business Analysts - Why is agile mak...
PPTX
Agile Analytics
PPT
Chetan Karkhanis Profile
PDF
Artificial Intelligence - 3 Weeks to Success
PPSX
Teknasoft IT Services & Consulting Presentation
PDF
Going Beyond 'What Success Looks Like' - Using Data to Achieve Successful Pro...
PPTX
My mistakes as a ba pankaj kanchankar
PPTX
My mistakes as a Business Analyst
Idiots guide to setting up a data science team
AI as a platform
BA and Beyond 19 - Adrian Reed - Don't bring me solutions Bring me problems
Using Data To Tranform Your Business - Marketing Business
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
7 Dimensions of Agile Analytics by Ken Collier
1000 track 1 groves_using our laptop
CASE STUDY SEW WHAT? Inc
BA and Beyond 19 - Lynda Girvan - User story workshop
Data Strategy - Enabling the Data-Guided Enterprise
Dave Elliman - Applying Continuous Intelligence ThoughtWorks Live UK 2018
BA and Beyond 19 Sponsor spotlight - The Business Analysts - Why is agile mak...
Agile Analytics
Chetan Karkhanis Profile
Artificial Intelligence - 3 Weeks to Success
Teknasoft IT Services & Consulting Presentation
Going Beyond 'What Success Looks Like' - Using Data to Achieve Successful Pro...
My mistakes as a ba pankaj kanchankar
My mistakes as a Business Analyst
Ad

Similar to Data and data scientists are not equal to money david hoyle (20)

PPTX
intro to data science Clustering and visualization of data science subfields ...
PDF
Making an impact with data science
PPTX
introduction to data science
PPTX
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
PDF
Lean Analytics: How to get more out of your data science team
PDF
Why Data Science Is Important for the Future of Work | IABAC
PPTX
The Power of Data Science by DICS INNOVATIVE.pptx
PDF
Introduction to Data Science.pdf
PDF
Embracing data science
PPTX
Data Science PPT _basics of data science.pptx
DOCX
What is Data Science?
PPTX
Impact of Data Science
PPTX
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
PDF
5_Data Analytics, Data Science and Machine Learning
PDF
iTrain Malaysia: Data Science by Tarun Sukhani
PPTX
Data science in business Administration Nagarajan.pptx
PDF
Understanding Data Science: Concepts, Techniques, and Applications | IABAC
PDF
How Data Science Can Transform Your Business. | IABAC
PPTX
introduction TO DS 1.pptxvbvcbvcbvcbvcbvcb
PPTX
Data Science Training in Chandigarh h
intro to data science Clustering and visualization of data science subfields ...
Making an impact with data science
introduction to data science
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
Lean Analytics: How to get more out of your data science team
Why Data Science Is Important for the Future of Work | IABAC
The Power of Data Science by DICS INNOVATIVE.pptx
Introduction to Data Science.pdf
Embracing data science
Data Science PPT _basics of data science.pptx
What is Data Science?
Impact of Data Science
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
5_Data Analytics, Data Science and Machine Learning
iTrain Malaysia: Data Science by Tarun Sukhani
Data science in business Administration Nagarajan.pptx
Understanding Data Science: Concepts, Techniques, and Applications | IABAC
How Data Science Can Transform Your Business. | IABAC
introduction TO DS 1.pptxvbvcbvcbvcbvcbvcb
Data Science Training in Chandigarh h
Ad

More from Institute of Contemporary Sciences (20)

PDF
First 5 years of PSI:ML - Filip Panjevic
PPTX
Building valuable (online and offline) Data Science communities - Experience ...
PPT
Data Science Master 4.0 on Belgrade University - Drazen Draskovic
PPTX
Deep learning fast and slow, a responsible and explainable AI framework - Ahm...
PPTX
Solving churn challenge in Big Data environment - Jelena Pekez
PDF
Application of Business Intelligence in bank risk management - Dimitar Dilov
PPTX
Trends and practical applications of AI/ML in Fin Tech industry - Milos Kosan...
PPTX
Recommender systems for personalized financial advice from concept to product...
PDF
Advanced tools in real time analytics and AI in customer support - Milan Sima...
PPTX
Complex AI forecasting methods for investments portfolio optimization - Pawel...
PPTX
From Zero to ML Hero for Underdogs - Amir Tabakovic
PPSX
The price is right - Tomislav Krizan
PPTX
When it's raining gold, bring a bucket - Andjela Culibrk
PPTX
Reality and traps of real time data engineering - Milos Solujic
PPTX
Sensor networks for personalized health monitoring - Vladimir Brusic
PDF
Improving Data Quality with Product Similarity Search
PPTX
Prediction of good patterns for future sales using image recognition
PPTX
Using data to fight corruption: full budget transparency in local government
PPTX
Geospatial Analysis and Open Data - Forest and Climate
PPTX
Machine Learning-Driven Injury Prediction for a Professional Sports Team
First 5 years of PSI:ML - Filip Panjevic
Building valuable (online and offline) Data Science communities - Experience ...
Data Science Master 4.0 on Belgrade University - Drazen Draskovic
Deep learning fast and slow, a responsible and explainable AI framework - Ahm...
Solving churn challenge in Big Data environment - Jelena Pekez
Application of Business Intelligence in bank risk management - Dimitar Dilov
Trends and practical applications of AI/ML in Fin Tech industry - Milos Kosan...
Recommender systems for personalized financial advice from concept to product...
Advanced tools in real time analytics and AI in customer support - Milan Sima...
Complex AI forecasting methods for investments portfolio optimization - Pawel...
From Zero to ML Hero for Underdogs - Amir Tabakovic
The price is right - Tomislav Krizan
When it's raining gold, bring a bucket - Andjela Culibrk
Reality and traps of real time data engineering - Milos Solujic
Sensor networks for personalized health monitoring - Vladimir Brusic
Improving Data Quality with Product Similarity Search
Prediction of good patterns for future sales using image recognition
Using data to fight corruption: full budget transparency in local government
Geospatial Analysis and Open Data - Forest and Climate
Machine Learning-Driven Injury Prediction for a Professional Sports Team

Recently uploaded (20)

PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
1_Introduction to advance data techniques.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Lecture1 pattern recognition............
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Miokarditis (Inflamasi pada Otot Jantung)
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
IB Computer Science - Internal Assessment.pptx
Moving the Public Sector (Government) to a Digital Adoption
Clinical guidelines as a resource for EBP(1).pdf
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
1_Introduction to advance data techniques.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Quality review (1)_presentation of this 21
Data_Analytics_and_PowerBI_Presentation.pptx
Lecture1 pattern recognition............

Data and data scientists are not equal to money david hoyle

  • 1. Data + Data Scientists ≠ Money Dr. David Hoyle
  • 2. My background PRODUCING DATA SCIENCE • 20 yrs in academia • dunnhumby • dunnhumby APPLYING DATA SCIENCE • Lloyds Banking Group • AutoTrader UK • InfinityWorks The challenges in applying Data Science are very different
  • 4. How did we get here? Your Company Inc.
  • 5. Cultural and organizational challenges are always harder than technical challenges • Which parts of the business? • How should we organize? • How should we work? • How should we communicate? • What support do we need? • What data should we use? • SVM vs Logistic Regression?
  • 7. Data touchpoints Bad guys Account Manager Marketing Developers OEMs Private seller £, €, $ Finance Car Dealer Consumer External Data Internal Data £ Trade£ Retail
  • 8. ‘Why?’ is a powerful Data Science tool How will you consume the outputs? ‘We need a neural network’ ‘We want to predict if users will click this link’ ‘Not clicking indicates low user engagement’ Why? Why? ‘We can alter the content in session if engagement is low’ Can you respond to the neural network output fast enough? ‘Hmmm… No.’
  • 9. Build cross-functional teams Data Scientist ≠ Data Engineer Data Science Data Engineering Product
  • 10. 10 Get close to the business Analytics Team Product Area C Analyst Product Area A Analyst Product Area B Analyst Business Area 2 Analyst Business Area 1 Analyst Data Scientists + Data Analysts
  • 12. All parts of Data Science have outputs INFORMATION PRESENTATION ALGORITHM DEVELOPMENT Easier to communicate outputs Easier to communicate progress Harder to communicate outputs Harder to communicate progress Always communicate what the outputs will be
  • 13. Understand business problem Map to appropriate abstraction Mathematical statement of abstraction Identify type of mathematical model required Identify & explore potential data sources Build, validate, & test model, e.g. CRISP-DM Productionize model Deploy production model artefacts Consume model outputs Monitor production model Re-build production model Improve model Data Science Data Engineering Data Science Data Engineering Data Engineering Data Engineering Data Science Data Science Understand & conceptualize the problem Understand resources available & build model Incorporate model into business process Monitor & improve The Data Science innovation lifecycle is longer than you think
  • 14. Data & Compute should be close together Operational Operational + Data Warehouse SQL, BI
  • 15. Not all data is valuable Either Your data is valuable to you – e.g. helps improve business processes Or Your data is valuable to someone else – e.g. gives a market wide view
  • 16. To make Data Science pay you need to 1. Work on the projects with direct P&L impact 2. …..by asking the right business questions up-front 3. …..using teams that have the technical right skills 4. …..and understand the business challenges 5. …..using Agile methodologies where appropriate 6. …..always communicating what you are doing and why 7. …..with the right tools and on the right data