SlideShare a Scribd company logo
Proprietary + Confidential
Frederick Liu
7/17/19 @ Robust AI
Incorporating priors with feature
attribution on text classification
Proprietary + ConfidentialProprietary + Confidential
Machine learning
.6 .2
.1
.3
.7
.1
.8
.7
.5.3
.1
.4
.9 .2
.0
.6.8 .2.1.6Toxic … … … … … … …
Neutral … … … … … …
Toxic … … … … … … …
Toxic … … … … … …
Neutral … … … … … …
Training
Inference
Gay pride is in June.
.6 .2
.1
.3
.7
.1
.8
.7
.5.3
.1
.4
.9 .2
.0
.6.8 .2.1.6
95%
Toxic
Proprietary + ConfidentialProprietary + Confidential
Machine learning + Explainability
.6 .2
.1
.3
.7
.1
.8
.7
.5.3
.1
.4
.9 .2
.0
.6.8 .2.1.6Toxic … … … … … … …
Neutral … … … … … …
Toxic … … … … … … …
Toxic … … … … … …
Neutral … … … … … …
Training
Inference
Gay pride is in June.
.6 .2
.1
.3
.7
.1
.8
.7
.5.3
.1
.4
.9 .2
.0
.6.8 .2.1.6
95%
Toxic
Gay
Pride
is
in
June
90%
1%
1%
1%
2%
Proprietary + ConfidentialProprietary + Confidential
Machine learning + Regularization
Toxic … … … … … … …
Neutral … … … … … …
Toxic … … … … … … …
Toxic … … … … … …
Neutral … … … … … …
Training
Inference
Gay pride is in June.
.5 .2
.1
.3
.5
.1
.5
.5
.5.3
.1
.4
.5 .2
.0
.5.5 .2.1.5
85%
Toxic
.5 .2
.1
.3
.5
.1
.5
.5
.5.3
.1
.4
.5 .2
.0
.5.5 .2.1.5
Proprietary + ConfidentialProprietary + Confidential
Machine learning + Regularization + Explainability
Toxic … … … … … … …
Neutral … … … … … …
Toxic … … … … … … …
Toxic … … … … … …
Neutral … … … … … …
Training
Inference
Gay pride is in June.
15%
Toxic
Gay
Pride
is
in
June
He
is
an
impolite
gay 0%
.7 .2
.1
.3
.7
.1
.7
.5.3.4
.9 .2
.1
.6.8 .20.
1
.1
.2
.5
.7 .2
.1
.3
.7
.1
.7
.5.3.4
.9 .2
.1
.6.8 .20.
1
.1
.2
.5
+
person
Proprietary + ConfidentialProprietary + Confidential
Regularizing + Explainability → Controllability
.6
.2
.1
.3
.7
.1
.8
.7
.5.3
.1
.4
.9
.2
.0
.6.8 .2 .1.6
Explanation
Proprietary + ConfidentialProprietary + Confidential
Regularizing + Explainability → Controllability
.6
.2
.1
.3
.7
.1
.3
.7
.5.3
.9
.4
.9
.2
.0
.6.8 .4 .1.7
Explanation
More Red!
Less Green!
Proprietary + ConfidentialProprietary + Confidential
Explainability - Integrated Gradients
Link to paper - https://arxiv.org/pdf/1703.01365.pdf
Proprietary + ConfidentialProprietary + Confidential
Explainability + Regularization
Proprietary + ConfidentialProprietary + Confidential
Results - Classification Metric
Proprietary + ConfidentialProprietary + Confidential
Results - Fairness Metric
Proprietary + ConfidentialProprietary + Confidential
Results - Shift in embedding
Proprietary + Confidential
Thank You
Link to paper - https://arxiv.org/pdf/1906.08286.pdf
Sign up if you want to know more: bit.ly/model-interpret-interest

More Related Content

PPTX
SearchLove San Diego 2017 | Annie Cushing | Avoid Panic Attacks With a First ...
PDF
How to Raise a Robot Army #dddperth
PDF
FlyerBlastinBubbles
PDF
Code is so much more...
PPTX
Machine learning ppt unit one syllabuspptx
PDF
Introduction to Machine Learning
PDF
GenerativeModelsMaskedSelf-Attention.pdf
SearchLove San Diego 2017 | Annie Cushing | Avoid Panic Attacks With a First ...
How to Raise a Robot Army #dddperth
FlyerBlastinBubbles
Code is so much more...
Machine learning ppt unit one syllabuspptx
Introduction to Machine Learning
GenerativeModelsMaskedSelf-Attention.pdf

More from Sanjana Chowdhury (12)

PDF
Rsqrd AI: Making Conversational AI Work for Everybody
PDF
Rsqrd AI: Application of Explanation Model in Healthcare
PDF
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
PDF
Rsqrd AI: Discovering Natural Bugs Using Adversarial Perturbations
PPTX
Rsqrd AI: A Survey of The Current Ecosystem of Explainability Techniques
PPTX
Rsqrd AI: Explaining ML Models w/ Geometric Intuition
PDF
Rsqrd AI: Errudite- Scalable, Reproducible, and Testable Error Analysis
PDF
Rsqrd AI: Exploring Machine Learning Model Predictions
PDF
Rsqrd AI: Zestimates and Zillow AI Platform
PDF
Rsqrd AI: ML Tooling at an AI-first Startup
PDF
Rsqrd AI: From R&D to ROI of AI
PDF
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Rsqrd AI: Making Conversational AI Work for Everybody
Rsqrd AI: Application of Explanation Model in Healthcare
Rsqrd AI: Recent Advances in Explainable Machine Learning Research
Rsqrd AI: Discovering Natural Bugs Using Adversarial Perturbations
Rsqrd AI: A Survey of The Current Ecosystem of Explainability Techniques
Rsqrd AI: Explaining ML Models w/ Geometric Intuition
Rsqrd AI: Errudite- Scalable, Reproducible, and Testable Error Analysis
Rsqrd AI: Exploring Machine Learning Model Predictions
Rsqrd AI: Zestimates and Zillow AI Platform
Rsqrd AI: ML Tooling at an AI-first Startup
Rsqrd AI: From R&D to ROI of AI
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Ad

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Empathic Computing: Creating Shared Understanding
PDF
Encapsulation theory and applications.pdf
PDF
Electronic commerce courselecture one. Pdf
PPTX
Machine Learning_overview_presentation.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Cloud computing and distributed systems.
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Approach and Philosophy of On baking technology
Review of recent advances in non-invasive hemoglobin estimation
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Empathic Computing: Creating Shared Understanding
Encapsulation theory and applications.pdf
Electronic commerce courselecture one. Pdf
Machine Learning_overview_presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Unlocking AI with Model Context Protocol (MCP)
Chapter 3 Spatial Domain Image Processing.pdf
MYSQL Presentation for SQL database connectivity
gpt5_lecture_notes_comprehensive_20250812015547.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Cloud computing and distributed systems.
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Ad

Rsqrd AI: Incorporating Priors with Feature Attribution on Text Classification

  • 1. Proprietary + Confidential Frederick Liu 7/17/19 @ Robust AI Incorporating priors with feature attribution on text classification
  • 2. Proprietary + ConfidentialProprietary + Confidential Machine learning .6 .2 .1 .3 .7 .1 .8 .7 .5.3 .1 .4 .9 .2 .0 .6.8 .2.1.6Toxic … … … … … … … Neutral … … … … … … Toxic … … … … … … … Toxic … … … … … … Neutral … … … … … … Training Inference Gay pride is in June. .6 .2 .1 .3 .7 .1 .8 .7 .5.3 .1 .4 .9 .2 .0 .6.8 .2.1.6 95% Toxic
  • 3. Proprietary + ConfidentialProprietary + Confidential Machine learning + Explainability .6 .2 .1 .3 .7 .1 .8 .7 .5.3 .1 .4 .9 .2 .0 .6.8 .2.1.6Toxic … … … … … … … Neutral … … … … … … Toxic … … … … … … … Toxic … … … … … … Neutral … … … … … … Training Inference Gay pride is in June. .6 .2 .1 .3 .7 .1 .8 .7 .5.3 .1 .4 .9 .2 .0 .6.8 .2.1.6 95% Toxic Gay Pride is in June 90% 1% 1% 1% 2%
  • 4. Proprietary + ConfidentialProprietary + Confidential Machine learning + Regularization Toxic … … … … … … … Neutral … … … … … … Toxic … … … … … … … Toxic … … … … … … Neutral … … … … … … Training Inference Gay pride is in June. .5 .2 .1 .3 .5 .1 .5 .5 .5.3 .1 .4 .5 .2 .0 .5.5 .2.1.5 85% Toxic .5 .2 .1 .3 .5 .1 .5 .5 .5.3 .1 .4 .5 .2 .0 .5.5 .2.1.5
  • 5. Proprietary + ConfidentialProprietary + Confidential Machine learning + Regularization + Explainability Toxic … … … … … … … Neutral … … … … … … Toxic … … … … … … … Toxic … … … … … … Neutral … … … … … … Training Inference Gay pride is in June. 15% Toxic Gay Pride is in June He is an impolite gay 0% .7 .2 .1 .3 .7 .1 .7 .5.3.4 .9 .2 .1 .6.8 .20. 1 .1 .2 .5 .7 .2 .1 .3 .7 .1 .7 .5.3.4 .9 .2 .1 .6.8 .20. 1 .1 .2 .5 + person
  • 6. Proprietary + ConfidentialProprietary + Confidential Regularizing + Explainability → Controllability .6 .2 .1 .3 .7 .1 .8 .7 .5.3 .1 .4 .9 .2 .0 .6.8 .2 .1.6 Explanation
  • 7. Proprietary + ConfidentialProprietary + Confidential Regularizing + Explainability → Controllability .6 .2 .1 .3 .7 .1 .3 .7 .5.3 .9 .4 .9 .2 .0 .6.8 .4 .1.7 Explanation More Red! Less Green!
  • 8. Proprietary + ConfidentialProprietary + Confidential Explainability - Integrated Gradients Link to paper - https://arxiv.org/pdf/1703.01365.pdf
  • 9. Proprietary + ConfidentialProprietary + Confidential Explainability + Regularization
  • 10. Proprietary + ConfidentialProprietary + Confidential Results - Classification Metric
  • 11. Proprietary + ConfidentialProprietary + Confidential Results - Fairness Metric
  • 12. Proprietary + ConfidentialProprietary + Confidential Results - Shift in embedding
  • 13. Proprietary + Confidential Thank You Link to paper - https://arxiv.org/pdf/1906.08286.pdf Sign up if you want to know more: bit.ly/model-interpret-interest