SlideShare a Scribd company logo
JADS Workshop on
Responsible Data Science
“Data science without guesswork –
How to answer questions with a
guaranteed level of accuracy?”
Moderated by prof.dr. Mykola Pechenizkiy
RDS:
• Fairness
• Accuracy
• Confidentiality
• Transparency
www.jads.nl
Data science with(out) guesswork
• Why?
• What?
• How?
www.jads.nl
Open questions
• Accuracy  Guarantees, accountability and responsibility
• Accuracy  What are the right measures?
• Accuracy  Do we understand the trade-offs well?
www.jads.nl
When should you trust your model’s prediction?
• What are the limits of an algorithm / model and when is it known to
(have high chances to) fail?
• Beyond “simple” error bounds: errors generated by uncertain
data, noisy labels, biased data, concept drift, …
• Guarantees on average vs. per case performance
• cf. worst case analysis
• What are we really trying to guarantee?
• what are the right accuracy measures
• in connection to accountability
www.jads.nl
What are (not so) well understood tradeoffs?
Models are hard to make 100% accurate  many trade-offs:
• Well formulated and well studied:
• precision-recall; bias-variance; robustness-adaptivity;
• Well formulated, but not so well studied:
• accuracy-computational resources; accuracy-human resources
(inducing, applying and maintaining does not come for free)
• Not so well formulated and not so well studied:
• Accuracy-fairness, accuracy-confidentiality, accuracy-privacy,
accuracy-transparency, …

More Related Content

PDF
5. Workshop Responsible Data Science - Discussion on Transparency in data sci...
PPTX
Make good products great with data and analytics
PPTX
Giovanni Lanzani GoDataDriven
PDF
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
PPTX
Notilyze SAS
PPTX
Andreas weigend
PDF
How to succeed at data without even trying!
PPTX
Data Quality Analytics: Understanding what is in your data, before using it
5. Workshop Responsible Data Science - Discussion on Transparency in data sci...
Make good products great with data and analytics
Giovanni Lanzani GoDataDriven
Pdf analytics-and-witch-doctoring -why-executives-succumb-to-the-black-box-me...
Notilyze SAS
Andreas weigend
How to succeed at data without even trying!
Data Quality Analytics: Understanding what is in your data, before using it

What's hot (20)

PPTX
Introduction to data science
PDF
AI Hierarchy of Needs
PDF
Wtf is data science?
PPTX
Big data meetup
PPTX
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
PPTX
How to Become a Data Science Company instead of a company with Data Scientist...
PDF
10 Tips From A Young Data Scientist
PPTX
Security Administration Vii 2 Statistical Analysis
PPTX
How To Make The Most Out of Enterprise Data
PDF
How to start thinking like a data scientist
PDF
Community-Assisted Software Engineering Decision Making
PPTX
Week2 day2slide
PDF
Design Thinking for Data Science #StrataHadoop
PPTX
20180314 piloting advanced data applications
PPTX
How to think like a data scientist sandeep
PDF
Managing Data Science by David Martínez Rego
PDF
AI and ML for Everyone
PPT
Data Science Day New York: Data Scientist - The New Data Analyst
PDF
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
PPT
Many Roads Lead to Rome. Mapping Users’ Problem Solving Strategies.
Introduction to data science
AI Hierarchy of Needs
Wtf is data science?
Big data meetup
Investigating Performance: Design & Outcomes with xAPI | LSCon 2017
How to Become a Data Science Company instead of a company with Data Scientist...
10 Tips From A Young Data Scientist
Security Administration Vii 2 Statistical Analysis
How To Make The Most Out of Enterprise Data
How to start thinking like a data scientist
Community-Assisted Software Engineering Decision Making
Week2 day2slide
Design Thinking for Data Science #StrataHadoop
20180314 piloting advanced data applications
How to think like a data scientist sandeep
Managing Data Science by David Martínez Rego
AI and ML for Everyone
Data Science Day New York: Data Scientist - The New Data Analyst
Agile Analytics: The Secret to Test, Improve, Fail & Succeed Quickly.
Many Roads Lead to Rome. Mapping Users’ Problem Solving Strategies.
Ad

Similar to 3. Workshop Responsible Data Science - Discussion on Accuracy in data science by Mykola Pechenizkiy (20)

PPTX
Soft on People, Hard on Code: interpersonal approaches that promote high qual...
PDF
Jsm big-data
PPTX
Bit by Bit: Effective Use of People, Processes and Computer Technology in the...
PDF
Rekard Edgren - Curing Our Binary Disease - EuroSTAR 2012
PPTX
Systemic Learning Analytics Symposium, October 10th 2013
PDF
AI in the Real World: Challenges, and Risks and how to handle them?
PPTX
Trustworthiness of AI based predictions Aachen 2024
PPTX
Testing in modern times a story about quality and value - agile testing dev ...
PPTX
"Using Data Science to Design Effective Precision Preventative Behavioral Med...
PPTX
Supporting innovation in insurance with randomized experimentation
PPTX
Responsible AI in Industry: Practical Challenges and Lessons Learned
PPTX
Big Data & ML for Clinical Data
PDF
Lightning talk on the future of analytics - CloudCamp London, 2016
PDF
DS-38data sciencehandbooknotescompiled-46.pdf
PPTX
Testing for cognitive bias in ai systems
PDF
Understanding Cognitive Applications: A Framework - Sue Feldman
PPTX
WTI Framework Preso.pptx
PPTX
New Developments in Machine Learning - Prof. Dr. Max Welling
PDF
Machine learning, health data & the limits of knowledge
PDF
Max Shron, Thinking with Data at the NYC Data Science Meetup
Soft on People, Hard on Code: interpersonal approaches that promote high qual...
Jsm big-data
Bit by Bit: Effective Use of People, Processes and Computer Technology in the...
Rekard Edgren - Curing Our Binary Disease - EuroSTAR 2012
Systemic Learning Analytics Symposium, October 10th 2013
AI in the Real World: Challenges, and Risks and how to handle them?
Trustworthiness of AI based predictions Aachen 2024
Testing in modern times a story about quality and value - agile testing dev ...
"Using Data Science to Design Effective Precision Preventative Behavioral Med...
Supporting innovation in insurance with randomized experimentation
Responsible AI in Industry: Practical Challenges and Lessons Learned
Big Data & ML for Clinical Data
Lightning talk on the future of analytics - CloudCamp London, 2016
DS-38data sciencehandbooknotescompiled-46.pdf
Testing for cognitive bias in ai systems
Understanding Cognitive Applications: A Framework - Sue Feldman
WTI Framework Preso.pptx
New Developments in Machine Learning - Prof. Dr. Max Welling
Machine learning, health data & the limits of knowledge
Max Shron, Thinking with Data at the NYC Data Science Meetup
Ad

Recently uploaded (20)

PPTX
A Complete Guide to Streamlining Business Processes
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Transcultural that can help you someday.
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
IMPACT OF LANDSLIDE.....................
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
Leprosy and NLEP programme community medicine
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Introduction to the R Programming Language
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PDF
Microsoft Core Cloud Services powerpoint
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
A Complete Guide to Streamlining Business Processes
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Transcultural that can help you someday.
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
SAP 2 completion done . PRESENTATION.pptx
Topic 5 Presentation 5 Lesson 5 Corporate Fin
IMPACT OF LANDSLIDE.....................
importance of Data-Visualization-in-Data-Science. for mba studnts
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Leprosy and NLEP programme community medicine
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to the R Programming Language
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
Microsoft Core Cloud Services powerpoint
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
STERILIZATION AND DISINFECTION-1.ppthhhbx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx

3. Workshop Responsible Data Science - Discussion on Accuracy in data science by Mykola Pechenizkiy

  • 1. JADS Workshop on Responsible Data Science “Data science without guesswork – How to answer questions with a guaranteed level of accuracy?” Moderated by prof.dr. Mykola Pechenizkiy RDS: • Fairness • Accuracy • Confidentiality • Transparency
  • 2. www.jads.nl Data science with(out) guesswork • Why? • What? • How?
  • 3. www.jads.nl Open questions • Accuracy  Guarantees, accountability and responsibility • Accuracy  What are the right measures? • Accuracy  Do we understand the trade-offs well?
  • 4. www.jads.nl When should you trust your model’s prediction? • What are the limits of an algorithm / model and when is it known to (have high chances to) fail? • Beyond “simple” error bounds: errors generated by uncertain data, noisy labels, biased data, concept drift, … • Guarantees on average vs. per case performance • cf. worst case analysis • What are we really trying to guarantee? • what are the right accuracy measures • in connection to accountability
  • 5. www.jads.nl What are (not so) well understood tradeoffs? Models are hard to make 100% accurate  many trade-offs: • Well formulated and well studied: • precision-recall; bias-variance; robustness-adaptivity; • Well formulated, but not so well studied: • accuracy-computational resources; accuracy-human resources (inducing, applying and maintaining does not come for free) • Not so well formulated and not so well studied: • Accuracy-fairness, accuracy-confidentiality, accuracy-privacy, accuracy-transparency, …