SlideShare a Scribd company logo
DECIDING IN THE DARK:A Survey of The Current
Ecosystem of Explainability
Techniques
By Coco Sack 1
WHO AM I?
 AI2 Incubator Intern
 Yale freshman (Computer Science & Psychology major)
 Child of AI Boom
 As of recent, an “Explainability Expert”
2
3
WHO CARES ABOUT EXPLAINABILITY?
DATA-SCIENTISTS
To debug...
POLITICIANS
GDPR:
“a right to
explanation”
To protect the public...
EXECUTIVES
76%
said lack of
transparency
was seriously
impeding
adoption
To sell...
CONSUMERS
To trust...
4
EXPLAINABILITY: A HOT TOPIC
5
???
6
77
88
DATA ANALYSIS
◂ Data must be complete, unique, credible, accurate, consistent, and
unbiased
◂ Errors or duplicates can undermine the model’s performance
◂ Outdated bias in the training data can re-entrench discrimination.
You are saying, 'Here's the data, figure out what the behavior is.'
That is an inherently fuzzy and statistical approach. The real
challenge of deep learning is that it's not modeling, necessarily, the
world around it. It's modeling the data it's getting. And that modeling
often includes bias and problematic correlations.”
-- Sheldon Fernandez, CEO of DarwinAI
9
INTRINSICALLY INTERPRETABLE MODELS
1. Regression 2. Additive 3. Tree Graphs 4. Decision Rules
10
OUTPUT ANALYSIS TECHNIQUES
11
WHAT KIND OF MODELS DO THEY WORK ON?
12
WHAT KIND OF MODELS DO THEY WORK ON?
13
WHAT DOES IT OUTPUT?
14
WHAT DOES IT OUTPUT?
15
LOCAL VS. GLOBAL?
16
OTHER EXPLAINABILITY RESOURCES
General Surveys of Techniques:
A Survey of Methods For Explaining
Black Box Models (2018)
Visual Analytics in Deep Learning
(2018)
Explaining Explanations (2019)
Peeking Inside the Black Box (2018)
Books and Presentations:
Interpretable Machine Learning (by
Christopher Molnar, 2019)
XAI (by Dave Gunning, 2017)
Explaining Explanations (2019)
Unique Technical Papers:
Generative Synthesis (Wong et al.)
Golden Eye++ (Hendricks et al.)
Grad-CAM (Selvaraju et al.)
DeepLift (Shrikumar et al.)
T-CAV (Kim et al.)
Ethical/Political Reports:
“Computer Says No” (by Ian Sample)
“Why We Need to Open the Black Box”
(by AJ Abdallat)
“The Importance of Interpretable
Machine learning” (by DJ Sakar)
17

More Related Content

PPTX
Scary numbers
PDF
Cognitive Biases in Data Interpretation-2
PPTX
NATURE - OF - QUANTITATIVE - RESEARCH.pptx
PDF
The best thing in Data Science? Collaboration
PDF
Laws and limits of data science 11 10-14
PDF
Accretive Health - Quality Management in Health Care
PPTX
IE_expressyourself_EssayH
PDF
Big Data for Recruiting | SourceIn New York
Scary numbers
Cognitive Biases in Data Interpretation-2
NATURE - OF - QUANTITATIVE - RESEARCH.pptx
The best thing in Data Science? Collaboration
Laws and limits of data science 11 10-14
Accretive Health - Quality Management in Health Care
IE_expressyourself_EssayH
Big Data for Recruiting | SourceIn New York

Similar to Rsqrd AI: A Survey of The Current Ecosystem of Explainability Techniques (20)

DOCX
Mastering Data Science A Comprehensive Introduction.docx
PPTX
Systemic Learning Analytics Symposium, October 10th 2013
PPTX
“Don’t shoot the PM!” or data literacy from a product management point of view
PPTX
ETHICS Fraud and Internal Control wfa.pptx
PDF
W03_HEFCS_Ethics Fundamentals-3-30.pdf for people
PDF
Digital Citizenship Summit 2014
PPTX
Data Ethics in the Workplace: Beyond AI, Privacy and Security
PDF
eli2012-learning-analytics
PPTX
AoA Presentation.v.6Feb2024.pptx
PDF
2012 pip futureof internetyoungbrains
PDF
Unveiling the Power of Data Science.pdf
PDF
DLBDSIDS01_E_Session 1 dATA sCIENCES pRÄSO
PPTX
Open Data and the Social Sciences - OpenCon Community Webcast
PPTX
Demystifying Gamification in Learning
PDF
Curiosity: the blessing and the curse of the PhD entrepreneur
PPTX
Talking Tech - the art and science of communicating complex ideas (Bristech2...
PPTX
EARLI SIG14 keynote Littlejohn FINAL-2008242.pptx
PDF
M2 l10 fairness, accountability, and transparency
PPTX
Educational Regimes of Truth: Blockchain, Badgechain, and the Ethics of the R...
PDF
Introduction to the ethics of machine learning
Mastering Data Science A Comprehensive Introduction.docx
Systemic Learning Analytics Symposium, October 10th 2013
“Don’t shoot the PM!” or data literacy from a product management point of view
ETHICS Fraud and Internal Control wfa.pptx
W03_HEFCS_Ethics Fundamentals-3-30.pdf for people
Digital Citizenship Summit 2014
Data Ethics in the Workplace: Beyond AI, Privacy and Security
eli2012-learning-analytics
AoA Presentation.v.6Feb2024.pptx
2012 pip futureof internetyoungbrains
Unveiling the Power of Data Science.pdf
DLBDSIDS01_E_Session 1 dATA sCIENCES pRÄSO
Open Data and the Social Sciences - OpenCon Community Webcast
Demystifying Gamification in Learning
Curiosity: the blessing and the curse of the PhD entrepreneur
Talking Tech - the art and science of communicating complex ideas (Bristech2...
EARLI SIG14 keynote Littlejohn FINAL-2008242.pptx
M2 l10 fairness, accountability, and transparency
Educational Regimes of Truth: Blockchain, Badgechain, and the Ethics of the R...
Introduction to the ethics of machine learning
Ad

More from Sanjana Chowdhury (12)

PDF
Rsqrd AI: Making Conversational AI Work for Everybody
PDF
Rsqrd AI: Application of Explanation Model in Healthcare
PDF
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
PDF
Rsqrd AI: Incorporating Priors with Feature Attribution on Text Classification
PDF
Rsqrd AI: Discovering Natural Bugs Using Adversarial Perturbations
PPTX
Rsqrd AI: Explaining ML Models w/ Geometric Intuition
PDF
Rsqrd AI: Errudite- Scalable, Reproducible, and Testable Error Analysis
PDF
Rsqrd AI: Exploring Machine Learning Model Predictions
PDF
Rsqrd AI: Zestimates and Zillow AI Platform
PDF
Rsqrd AI: ML Tooling at an AI-first Startup
PDF
Rsqrd AI: From R&D to ROI of AI
PDF
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Rsqrd AI: Making Conversational AI Work for Everybody
Rsqrd AI: Application of Explanation Model in Healthcare
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
Rsqrd AI: Incorporating Priors with Feature Attribution on Text Classification
Rsqrd AI: Discovering Natural Bugs Using Adversarial Perturbations
Rsqrd AI: Explaining ML Models w/ Geometric Intuition
Rsqrd AI: Errudite- Scalable, Reproducible, and Testable Error Analysis
Rsqrd AI: Exploring Machine Learning Model Predictions
Rsqrd AI: Zestimates and Zillow AI Platform
Rsqrd AI: ML Tooling at an AI-first Startup
Rsqrd AI: From R&D to ROI of AI
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Ad

Recently uploaded (20)

PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Cloud computing and distributed systems.
PPTX
Spectroscopy.pptx food analysis technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Encapsulation theory and applications.pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
Big Data Technologies - Introduction.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Unlocking AI with Model Context Protocol (MCP)
Assigned Numbers - 2025 - Bluetooth® Document
A comparative analysis of optical character recognition models for extracting...
Per capita expenditure prediction using model stacking based on satellite ima...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
MIND Revenue Release Quarter 2 2025 Press Release
Network Security Unit 5.pdf for BCA BBA.
Programs and apps: productivity, graphics, security and other tools
Chapter 3 Spatial Domain Image Processing.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Machine learning based COVID-19 study performance prediction
Cloud computing and distributed systems.
Spectroscopy.pptx food analysis technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Encapsulation theory and applications.pdf
cuic standard and advanced reporting.pdf
Big Data Technologies - Introduction.pptx

Rsqrd AI: A Survey of The Current Ecosystem of Explainability Techniques

  • 1. DECIDING IN THE DARK:A Survey of The Current Ecosystem of Explainability Techniques By Coco Sack 1
  • 2. WHO AM I?  AI2 Incubator Intern  Yale freshman (Computer Science & Psychology major)  Child of AI Boom  As of recent, an “Explainability Expert” 2
  • 3. 3
  • 4. WHO CARES ABOUT EXPLAINABILITY? DATA-SCIENTISTS To debug... POLITICIANS GDPR: “a right to explanation” To protect the public... EXECUTIVES 76% said lack of transparency was seriously impeding adoption To sell... CONSUMERS To trust... 4
  • 7. 77
  • 8. 88
  • 9. DATA ANALYSIS ◂ Data must be complete, unique, credible, accurate, consistent, and unbiased ◂ Errors or duplicates can undermine the model’s performance ◂ Outdated bias in the training data can re-entrench discrimination. You are saying, 'Here's the data, figure out what the behavior is.' That is an inherently fuzzy and statistical approach. The real challenge of deep learning is that it's not modeling, necessarily, the world around it. It's modeling the data it's getting. And that modeling often includes bias and problematic correlations.” -- Sheldon Fernandez, CEO of DarwinAI 9
  • 10. INTRINSICALLY INTERPRETABLE MODELS 1. Regression 2. Additive 3. Tree Graphs 4. Decision Rules 10
  • 12. WHAT KIND OF MODELS DO THEY WORK ON? 12
  • 13. WHAT KIND OF MODELS DO THEY WORK ON? 13
  • 14. WHAT DOES IT OUTPUT? 14
  • 15. WHAT DOES IT OUTPUT? 15
  • 17. OTHER EXPLAINABILITY RESOURCES General Surveys of Techniques: A Survey of Methods For Explaining Black Box Models (2018) Visual Analytics in Deep Learning (2018) Explaining Explanations (2019) Peeking Inside the Black Box (2018) Books and Presentations: Interpretable Machine Learning (by Christopher Molnar, 2019) XAI (by Dave Gunning, 2017) Explaining Explanations (2019) Unique Technical Papers: Generative Synthesis (Wong et al.) Golden Eye++ (Hendricks et al.) Grad-CAM (Selvaraju et al.) DeepLift (Shrikumar et al.) T-CAV (Kim et al.) Ethical/Political Reports: “Computer Says No” (by Ian Sample) “Why We Need to Open the Black Box” (by AJ Abdallat) “The Importance of Interpretable Machine learning” (by DJ Sakar) 17

Editor's Notes

  • #8: INTRINSICALLY INTERPRETABLE: the most straightforward way to achieve interpretability is to design an algorithm or model that is intrinsically interpretable meaning it is naturally human interpretable due to its simple and intuitive structures. DEEP-EXPLANATION: altering deep learning models to create secondary systems that are trained to generate explanations of the first. OUTPUT ANALYSIS: Finally output-focused methods can analyze an opaque machine learning model after it is trained based on its outputs. These techniques are “post-hoc,” meaning they are applied after training a black-box model making them more general, flexible, and applicable. Some of these techniques are specifically designed for certain kinds of models while others are truly agnostic and can be applied to any machine learning model.