SlideShare a Scribd company logo
#datapopupseattle
What Makes Healthcare Data
Science so Hard & Interesting
David Talby
SVP Engineering, Atigeo
davidtalby antigeo
#datapopupseattle
UNSTRUCTURED
Data Science POP-UP in Seattle
www.dominodatalab.com
D
Produced by Domino Data Lab
Domino’s enterprise data science platform is used
by leading analytical organizations to increase
productivity, enable collaboration, and publish
models into production faster.
David&Talby&
SVP&Engineering,&Atigeo&
@davidtalby
WHAT&MAKES&HEALTHCARE&DATA&SCIENCE

SO&HARD&&&INTERESTING
“TECHNOLOGY&WILL&REPLACE&80%&OF&WHAT&DOCTORS&DO”&



VINOD&KHOSLA,&2012
2 2
“as&many&as&40,500&patients&die&each&year&in&an&ICU&in&

the&U.S.&due&to&misdiagnosis”&Winters&et&al.,&2012,&John&Hopkins&&
“Combining&estimates&from&3&studies&yielded&a&rate&of&
outpatient&diagnostic&errors&of&5.08%,&or&12&million&US&
adults&every&year.”&Singhet&al.,&2014,&VA&Medical&Center
3
Root$cause:$being$human
Premature(closure Fatigue
Overconfidence Team&dynamics
Snap(judgment Prejudice
Real&time&monitoring
Always&up&to&date&with&science
Large&sample&size
Works&24x7&
No&trip,&no&waiting
Cheaper
More&accurate
More&objective
THE&PROMISE
4
5
Big Challenge #1
“The&algorithm&was&able&
to&identify&the&fake&smiles&
92%&of&the&time.&

Humans,&on&the&other&
hand,&performed&no&better&
than&chance.”&
MIT,&2012
HUMAN&NUANCES
6
Can&you&distinguish&between&real&smiles&of&happiness&and&fake&smiles&
trying&to&mask&frustration?
“Algorithms&correctly&
predicted&which&atcrisk&
youth&would&go&on&to&
develop&psychosis&over&a&
2.5cyear&period&with&

100%&accuracy.”&
Bedi&et&al.,&Nature'Schizophrenia,'2015
MENTAL&HEALTH
7
SAMPLE&HYBRID&ANALYTICS&PIPELINE
8
Freectext&
clinical&notes
Relationships&
&&ontologies&
Sensors&&&
wearables
Graph&Features
Time&Series&
Features
NLP&Features
Direct&&&ambient&Feedback
Train&&&test&Classifiers
Imagery,&

drugs,&labs,&…
Train&&&test&ensembles
THE&OPEN&PROBLEM:&EXPLAINABILITY
9
@DavidJBianco,&http://www2.mlsecproject.org/blog/oncexplainabilitycincmachineclearning
1 0
Big Challenge #2
Never&Changing Always&Changing
Online$Social$
Networking$Models/
Rules$
Banking$&$

eCommerce$fraud&
Cyber$Security
Automated$trading&
RealAtime$ad$bidding
Natural$Language,$
Social$Behavior$
Models
Political$&$
Economic$Models
Physical$models:&
Face$recognition&
Voice$recognition$
Climate$models
Google/Amazon&
Search$models
THE&MOMENT&YOU&PUT&A&MODEL&IN&PRODUCTION,&

IT&STARTS&DEGRADING
[Gunjan&Gupta,&Atigeo,&2014]
100%&Offcline 100%&Online
Automated$ensemble,$
boosting$&$feature$
selection$techniques
Automated$
‘challenger’$online$
evaluation$&$
deployment
RealAtime$online$
learning$via$
passive$feedback
HandAcrafted$
machine$
learned$
models
Active$learning$via&
Active$feedback
Traditional

Scientific$Method:

Test$a$Hypothesis
Hard$Crafted$Rules
Daily/weekly$
batch$retraining
SO&PUT&THE&RIGHT&MACHINERY&IN&PLACE
100%&Offcline 100%&Online
Automated$ensemble,$
boosting$&$feature$
selection$techniques
Automated$
‘challenger’$online$
evaluation$&$
deployment
RealAtime$online$
learning$via$
passive$feedback
HandAcrafted$
machine$
learned$
models
Active$learning$via&
Active$feedback
Traditional

Scientific$Method:

Test$a$Hypothesis
Hard$Crafted$Rules
Daily/weekly$
batch$retraining
STATE&OF&THE&PRACTICE&IN&HEALTHCARE
THE&OPEN&PROBLEM:&MODEL&EVALUATION
1 4
Evaluate&models&that&are:&
• Personalized&
• Localized&
• Evolve&over&time&
• Regulatory&acceptable&
?,'?
1 5
Big Challenge #3
#datapopupseattle
@datapopup
#datapopupseattle
#datapopupseattle
Thank You To Our Sponsors
‹ # › 16
@Atigeo&
@davidtalby
©&2015&Atigeo,&Corporation.&All&rights&reserved.&&Atigeo&and&the&xPatterns&logo&are&trademarks&of&Atigeo.&The&information&herein&is&for&informational&purposes&only&and&represents&the&current&view&of&Atigeo&as&of&the&date&of&this&presentation.&&Because&Atigeo&must&
respond&to&changing&market&conditions,&it&should&not&be&interpreted&to&be&a&commitment&on&the&part&of&Atigeo,&and&Atigeo&cannot&guarantee&the&accuracy&of&any&information&provided&after&the&date&of&this&presentation.&&ATIGEO&MAKES&NO&WARRANTIES,&EXPRESS,&
IMPLIED&OR&STATUTORY,&AS&TO&THE&INFORMATION&IN&THIS&PRESENTATION.

More Related Content

PDF
What is Your Data Worth? - Data Science Pop-up Seattle
PDF
Making Big Data Projects Successful - Data Science Pop-up Seattle
PDF
Keys to understanding when you are looking for a Data Scientist vs. Engineer,...
PDF
Teradata presentation at the Chief Analytics Officer Forum East Coast USA (#C...
PDF
How Data Science Builds Better Products - Data Science Pop-up Seattle
PPTX
Notilyze SAS
PPTX
Giovanni Lanzani GoDataDriven
PPTX
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)
What is Your Data Worth? - Data Science Pop-up Seattle
Making Big Data Projects Successful - Data Science Pop-up Seattle
Keys to understanding when you are looking for a Data Scientist vs. Engineer,...
Teradata presentation at the Chief Analytics Officer Forum East Coast USA (#C...
How Data Science Builds Better Products - Data Science Pop-up Seattle
Notilyze SAS
Giovanni Lanzani GoDataDriven
How to Build a Successful Data Team - Florian Douetteau (@Dataiku)

What's hot (19)

PDF
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
PPTX
Agile Analytics
PPTX
Managing Data Science | Lessons from the Field
PPTX
Moving Data Science from an Event to A Program: Considerations in Creating Su...
PPTX
London Jaspersoft Community User Group Event 2 KETL presentation
PDF
Adoption is the only option hadoop is changing our world and changing yours f...
PPTX
Week3 day6slide
PPTX
Why Data Science Projects Fail
PDF
Creating a Data-Driven Organization -- thisismetis meetup
PDF
Anchormen corne versloot
PDF
Setting up Data Science for Success: The Data Layer
PPTX
Why Data Science Projects Fail?
PPTX
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
PDF
Becoming a data driven organization
PPTX
How can campus big data be captured and incorporated Craig napier_NSW Learnin...
PPTX
Marketing Network presentation: Why marketers need to be concerned with data ...
PPTX
Social Media World presentation
PDF
State Farm presentation at the Chief Analytics Officer Forum East Coast USA (...
PPTX
Talend community user group Bristol & SW UK event
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
Agile Analytics
Managing Data Science | Lessons from the Field
Moving Data Science from an Event to A Program: Considerations in Creating Su...
London Jaspersoft Community User Group Event 2 KETL presentation
Adoption is the only option hadoop is changing our world and changing yours f...
Week3 day6slide
Why Data Science Projects Fail
Creating a Data-Driven Organization -- thisismetis meetup
Anchormen corne versloot
Setting up Data Science for Success: The Data Layer
Why Data Science Projects Fail?
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
Becoming a data driven organization
How can campus big data be captured and incorporated Craig napier_NSW Learnin...
Marketing Network presentation: Why marketers need to be concerned with data ...
Social Media World presentation
State Farm presentation at the Chief Analytics Officer Forum East Coast USA (...
Talend community user group Bristol & SW UK event
Ad

Viewers also liked (20)

PPTX
Hunting criminals with hybrid analytics -- October 2015
PPTX
Building an intelligent big data application in 30 minutes
PPT
Public internet access
PPTX
Semantic Natural Language Understanding with Spark, UIMA & Machine Learned On...
PDF
Online Predictive Modeling of Fraud Schemes from Mulitple Live Streams by Cla...
PPTX
10 R Packages to Win Kaggle Competitions
PDF
Myths and Mathemagical Superpowers of Data Scientists
PDF
How to Become a Data Scientist
PPTX
Artificial neural network
PPTX
Artificial Intelligence Presentation
PDF
Tips for data science competitions
PPTX
Tutorial on Deep learning and Applications
PPTX
Hadoop and Machine Learning
PPTX
Deep Learning for Natural Language Processing
PDF
Data By The People, For The People
PDF
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
PDF
How to Interview a Data Scientist
PDF
A Statistician's View on Big Data and Data Science (Version 1)
PDF
A tutorial on deep learning at icml 2013
PDF
Hands-on Deep Learning in Python
Hunting criminals with hybrid analytics -- October 2015
Building an intelligent big data application in 30 minutes
Public internet access
Semantic Natural Language Understanding with Spark, UIMA & Machine Learned On...
Online Predictive Modeling of Fraud Schemes from Mulitple Live Streams by Cla...
10 R Packages to Win Kaggle Competitions
Myths and Mathemagical Superpowers of Data Scientists
How to Become a Data Scientist
Artificial neural network
Artificial Intelligence Presentation
Tips for data science competitions
Tutorial on Deep learning and Applications
Hadoop and Machine Learning
Deep Learning for Natural Language Processing
Data By The People, For The People
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
How to Interview a Data Scientist
A Statistician's View on Big Data and Data Science (Version 1)
A tutorial on deep learning at icml 2013
Hands-on Deep Learning in Python
Ad

Similar to What Makes Healthcare Data Science so Hard & Interesting - Data Science Pop-up Seattle (20)

PPTX
ppt for data science slideshare.pptx
PDF
Data Science Application in Health Care System
PDF
Data Science Deep Roots in Healthcare Industry
PPTX
Roll no.40 Class Presentation.pptx
PDF
Hiring for Data Scientists - Data Science Pop-up Seattle
PDF
Fair by design
PPTX
Data Science in Healthcare.pptx sdflkhsdalk jdsj
PPTX
The Application of Data science in Healthcare
PPTX
Data science in healthcare-Assignment 2.pptx
PDF
Application of data science ppt.pdf
PPTX
Data Science and AI in Biomedicine: The World has Changed
PPTX
Data science in healthcare.pptx
PPTX
Biomedicine from Stethoscope to Computer
PPTX
HEALTH SECTOR AND DATA SCIENCE.pptx
PPTX
Data Science and AI in Biomedicine: The World has Changed
PPTX
Atul Butte's presentation to the Association of Medical School Pediatric Depa...
PPTX
Atul Butte NIPS 2017 ML4H
PDF
Data Science Transforming Healthcare| IABAC
PPTX
Diabetes Data Science
PDF
The Role of Data Science in the Healthcare Industry | IABAC
ppt for data science slideshare.pptx
Data Science Application in Health Care System
Data Science Deep Roots in Healthcare Industry
Roll no.40 Class Presentation.pptx
Hiring for Data Scientists - Data Science Pop-up Seattle
Fair by design
Data Science in Healthcare.pptx sdflkhsdalk jdsj
The Application of Data science in Healthcare
Data science in healthcare-Assignment 2.pptx
Application of data science ppt.pdf
Data Science and AI in Biomedicine: The World has Changed
Data science in healthcare.pptx
Biomedicine from Stethoscope to Computer
HEALTH SECTOR AND DATA SCIENCE.pptx
Data Science and AI in Biomedicine: The World has Changed
Atul Butte's presentation to the Association of Medical School Pediatric Depa...
Atul Butte NIPS 2017 ML4H
Data Science Transforming Healthcare| IABAC
Diabetes Data Science
The Role of Data Science in the Healthcare Industry | IABAC

More from Domino Data Lab (20)

PDF
What's in your workflow? Bringing data science workflows to business analysis...
PDF
The Proliferation of New Database Technologies and Implications for Data Scie...
PDF
Racial Bias in Policing: an analysis of Illinois traffic stops data
PPTX
Data Quality Analytics: Understanding what is in your data, before using it
PPTX
Supporting innovation in insurance with randomized experimentation
PPTX
Leveraging Data Science in the Automotive Industry
PDF
Summertime Analytics: Predicting E. coli and West Nile Virus
PPTX
Reproducible Dashboards and other great things to do with Jupyter
PDF
GeoViz: A Canvas for Data Science
PDF
Doing your first Kaggle (Python for Big Data sets)
PDF
Leveraged Analytics at Scale
PDF
How I Learned to Stop Worrying and Love Linked Data
PDF
Software Engineering for Data Scientists
PDF
Making Big Data Smart
PPTX
Building Data Analytics pipelines in the cloud using serverless technology
PPTX
Leveraging Open Source Automated Data Science Tools
PPTX
Domino and AWS: collaborative analytics and model governance at financial ser...
PDF
The Role and Importance of Curiosity in Data Science
PDF
Fuzzy Matching to the Rescue
PDF
How to Effectively Combine Numerical Features and Categorical Features
What's in your workflow? Bringing data science workflows to business analysis...
The Proliferation of New Database Technologies and Implications for Data Scie...
Racial Bias in Policing: an analysis of Illinois traffic stops data
Data Quality Analytics: Understanding what is in your data, before using it
Supporting innovation in insurance with randomized experimentation
Leveraging Data Science in the Automotive Industry
Summertime Analytics: Predicting E. coli and West Nile Virus
Reproducible Dashboards and other great things to do with Jupyter
GeoViz: A Canvas for Data Science
Doing your first Kaggle (Python for Big Data sets)
Leveraged Analytics at Scale
How I Learned to Stop Worrying and Love Linked Data
Software Engineering for Data Scientists
Making Big Data Smart
Building Data Analytics pipelines in the cloud using serverless technology
Leveraging Open Source Automated Data Science Tools
Domino and AWS: collaborative analytics and model governance at financial ser...
The Role and Importance of Curiosity in Data Science
Fuzzy Matching to the Rescue
How to Effectively Combine Numerical Features and Categorical Features

Recently uploaded (20)

PPT
Predictive modeling basics in data cleaning process
PDF
Business Analytics and business intelligence.pdf
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
Introduction to Inferential Statistics.pptx
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
Leprosy and NLEP programme community medicine
PDF
Global Data and Analytics Market Outlook Report
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PPTX
New ISO 27001_2022 standard and the changes
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
annual-report-2024-2025 original latest.
PDF
Navigating the Thai Supplements Landscape.pdf
PPTX
modul_python (1).pptx for professional and student
PDF
Microsoft 365 products and services descrption
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
Predictive modeling basics in data cleaning process
Business Analytics and business intelligence.pdf
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Introduction to Inferential Statistics.pptx
A Complete Guide to Streamlining Business Processes
SAP 2 completion done . PRESENTATION.pptx
Leprosy and NLEP programme community medicine
Global Data and Analytics Market Outlook Report
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
New ISO 27001_2022 standard and the changes
retention in jsjsksksksnbsndjddjdnFPD.pptx
[EN] Industrial Machine Downtime Prediction
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
ISS -ESG Data flows What is ESG and HowHow
annual-report-2024-2025 original latest.
Navigating the Thai Supplements Landscape.pdf
modul_python (1).pptx for professional and student
Microsoft 365 products and services descrption
Optimise Shopper Experiences with a Strong Data Estate.pdf

What Makes Healthcare Data Science so Hard & Interesting - Data Science Pop-up Seattle