21 June, 2018
Demystifying AI – Part 2
An Introduction to AI in Life Sciences
Pistoia Alliance Centre of Excellence for AI in Life Sciences and Health
Prashant Natarajan (@BigDataCXO)
Moderator: Nick Lynch
This webinar is being recorded
Poll Question 1:
How much is your organisation planning to
increase investment into AI/ML* in the next
2-3 years? (tools/platform, people etc)
A. 0-25%
B. >25% - 50%
C. >50% - 75%
D. >75% - 100%
E. Not sure
(AI* = including machine learning/deep learning/chat bots)
©PistoiaAlliance
Webinars: AI in Life Sciences – Q2/Q3 2018
Pistoia Alliance Membership Introduction
• Webinar 1 (23 May 2018) Prashant Natarajan
– A Brief History
– Big Data/ML/DL/AI - fundamentals and concepts
– Data Fidelity & NFR Framework
– Best Practices from the Trenches
– Q&A
• Webinar 2: 21 June 2018 Prashant Natarajan
– Big Data Analytics & AI - 2 sides of the same coin
– A guided tour of learning algorithms for Healthcare
– Real-life use cases in health & life sciences from the book
– AI Solutions - Going Beyond Algorithms
– Q & A
• Webinar 3: July 2018 – (panel)
– Real World Evidence, the Big Data Connection
– The 3 P’s of RWE: Persons, Providers, and Pharma
• Webinar 4
– State of the Art in AI with working examples
• Etc – monthly
Like to give a talk or panel?
Boston Community Workshop, Oct 2018
Poll Question 2:
Are you/is your organisation currently
looking to hire additional AI/ML* experts
or retrain existing staff?
A. Yes, now or soon
B. Yes in the next 12 months
C. Yes but later than 12 months
D. No
E. Don’t Know
(AI* = including machine learning/deep learning/chat bots etc.)
Prashant Natarajan
• Senior Director of AI Applications at H2O.ai, Mountain View, CA, USA (www.h2o.ai)
• Undergraduate degree in Chemical Engineering; Master’s in Technical
Communications & Linguistics; PhD courses in Logic & Cognitive Psychology;
AT&T-Yahoo Chancellor’s Fellow
• 18+ years in health sciences industry – providers, pharma, payers, patients
• H2O.ai; Oracle Health Sciences; McKesson; Healthways; Siemens
• Lead author or contributor to books on big data analytics, business intelligence,
cancer, machine learning, AI (best-sellers in 2012, 2017, 2018)
• Co-Faculty Instructor, Stanford University School of Medicine, Palo Alto, CA
• Industry Advisor, CA Initiative to Advance Precision Medicine/San Francisco VA
@BigDataCXO | prashant.natarajan@gmail.com | www.BigDataCXO.com
Agenda
• Considerations for Life Sciences
• ML 102
• TIE – Interpretability & Explainability
• Conversational AI: Bot Basics
• Q & A
Considerations for Life Sciences
• Regulations and policy
• Innovation in a regulated environment
• TIE it up
• Organization and structural challenges in Life Sciences
• Resourcing
• Data fidelity and labeling
• MDM is critical as is data governance
• Ethics and privacy – human and machine morality are
not the same. Does a machine have morals?
• Clear demarcation or sharing of human & machine-learning/CIA
responsibilities when failure happens
Machine Learning 102
Mastering the Basics
Sources:
www.H2O.ai Driverless AI overview
“Demystifying Big Data and Machine Learning for Healthcare” (Taylor & Francis, 2017), Natarajan et al.
“Principles of Data Wrangling” (O’Reilly, 2017), Rattenbury et al.
AWS Sagemaker Developer Guide
Prashant Natarajan
Typical Enterprise Machine Learning Workflow
[Figure: workflow diagram with the labels Data Integration, Data Quality & Transformation, Modeling Table, Target, Features, Model Building, Model, and "+ Driverless AI"]
Copyright 2018 H2O.ai Inc. All rights
ML Workflows: from Data to Deployment
Data Preparation & Wrangling
• Ingest Data from RDBMS, files, distributed DBs, etc –
describe data - assess data utility
• Create & manage metadata
• Profile data – grain, structure, data fidelity, temporality,
scope
• Pre-visualization and outlier analysis
• Refine data – mastering, structuring (changing form or
schema), enriching (adding new info via joins, unions,
derived data), transforming (cleansing, addressing
missing/invalid values)
• Create production data for training and use/build
automated ML systems to process all the way to the
scoring pipeline (or) visualization
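The profiling, outlier analysis, and refinement steps above can be sketched in a few lines. This is a minimal illustration only, not the presenter's tooling: the records, the `glucose` column, and the helper names `profile`, `flag_outliers`, and `refine` are all hypothetical, and the outlier rule (a modified z-score on the median absolute deviation with a 3.5 cutoff) is one common choice among many.

```python
import statistics

# Hypothetical lab records; None marks a missing value.
records = [
    {"patient_id": "p1", "glucose": 5.1},
    {"patient_id": "p2", "glucose": 5.4},
    {"patient_id": "p3", "glucose": None},
    {"patient_id": "p4", "glucose": 4.9},
    {"patient_id": "p5", "glucose": 5.2},
    {"patient_id": "p6", "glucose": 55.0},  # plausibly a unit or entry error
]


def profile(rows, column):
    """Profile one numeric column: row count, missing count, median."""
    values = [r[column] for r in rows if r[column] is not None]
    return {
        "rows": len(rows),
        "missing": len(rows) - len(values),
        "median": statistics.median(values),
    }


def flag_outliers(rows, column, cutoff=3.5):
    """Flag rows whose modified z-score (based on the MAD) exceeds cutoff."""
    values = [r[column] for r in rows if r[column] is not None]
    median = statistics.median(values)
    mad = statistics.median(abs(v - median) for v in values)
    if mad == 0:
        return []  # no spread to measure against
    return [
        r for r in rows
        if r[column] is not None
        and 0.6745 * abs(r[column] - median) / mad > cutoff
    ]


def refine(rows, column):
    """Drop flagged outliers, then fill missing values with the median."""
    flagged = {r["patient_id"] for r in flag_outliers(rows, column)}
    kept = [dict(r) for r in rows if r["patient_id"] not in flagged]
    median = statistics.median(
        r[column] for r in kept if r[column] is not None
    )
    for r in kept:
        if r[column] is None:
            r[column] = median
    return kept


summary = profile(records, "glucose")
outliers = flag_outliers(records, "glucose")
training_rows = refine(records, "glucose")
```

Whether an outlier should be dropped, corrected, or kept is a domain decision; the sketch drops it only to keep the example short.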
Training & Scoring in H2O’s Driverless AI
"Confidential and property of H2O.ai. All rights reserved"
[Figure: pipeline diagram with the stages Data Processing, Model Tuning, Feature Engineering, Final Model Training, Scoring Pipeline]
Deployment & Tracking
• Monitor Ongoing Performance - How will you monitor
the performance of your algorithm on an ongoing
basis? Data drifts and systems evolve.
• Look for ability to connect to your existing visualization
– verify interpretability – make it easy for data
scientists/IT/business to collaborate via results and
code
• Keep Track Of Your Model Changes - Always track the
revision of your model and report it with your results.
As you improve different parts of your data analytics
pipeline, you will want to go back and re-analyze data.
Recording which model was used at which time helps
you understand what to recalculate.
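As a rough sketch of the two tracking ideas above, the example below checks for data drift with a Population Stability Index (PSI) and stores the model revision alongside every score so that stale results can be found later. Everything here is an assumption for illustration: the function and class names, the equal-width binning, and the revision strings are hypothetical, and PSI is only one of several drift measures.

```python
import math
from dataclasses import dataclass
from datetime import datetime, timezone


def psi(expected, actual, bins=4):
    """Population Stability Index of `actual` against baseline `expected`."""
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins

    def shares(values):
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width) if width else 0
            # Clamp values outside the baseline range into the edge bins.
            counts[min(max(idx, 0), bins - 1)] += 1
        # A small floor avoids log(0) when a bin is empty.
        return [max(c / len(values), 1e-6) for c in counts]

    e, a = shares(expected), shares(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


@dataclass(frozen=True)
class ScoredResult:
    """One prediction, stored together with the model revision that made it."""
    record_id: str
    score: float
    model_revision: str
    scored_at: str


def score_with_revision(record_id, score, model_revision):
    """Attach the model revision and a UTC timestamp to a score."""
    stamp = datetime.now(timezone.utc).isoformat()
    return ScoredResult(record_id, score, model_revision, stamp)


def needs_rescoring(results, current_revision):
    """List record ids whose score came from an older model revision."""
    return [r.record_id for r in results if r.model_revision != current_revision]


baseline = [i / 10 for i in range(100)]      # feature values at training time
recent = [v + 5 for v in baseline]           # same feature, shifted upward
drift = psi(baseline, recent)

history = [
    score_with_revision("rec-1", 0.82, "rev-1"),
    score_with_revision("rec-2", 0.40, "rev-2"),
]
stale = needs_rescoring(history, current_revision="rev-2")
```

A frequently quoted rule of thumb treats a PSI above roughly 0.25 as a significant shift, but the right threshold depends on the feature and the use case.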
Interpretability
Why/Why not?
Prashant Natarajan
Interpretability
*Source: https://christophm.github.io/interpretable-ml-book/interpretability-importance.html
TIE: Interpretability is the degree to which a human can
understand the cause of a decision (Miller 2017)*
• If the ML model performs well, can’t we just trust it?
• “The problem is that a single metric, such as classification
accuracy, is an incomplete description of most real-world
tasks” (Doshi-Velez & Kim)
• What v why/how of predictions: knowing the “why” can
help you understand more about the problem, data,
biases, leaks, debug/audit, and why a model might fail
• Facilitate learning and satisfy human curiosity
• The model becomes the source of insights and
knowledge – not just the raw data. Hence,
interpretability becomes important
• Interpretability is not the same as explainability
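To make the "what" versus "why" distinction concrete, here is a minimal sketch using a linear scoring model, where the reason for a prediction can be read off as per-feature contributions. The weights, feature names, and the patient record are invented for illustration and do not come from the webinar or from any real model.

```python
# Hypothetical weights of an already-trained linear risk model.
WEIGHTS = {"age": 0.03, "prior_admissions": 0.40, "on_anticoagulant": -0.25}
BIAS = -1.0


def predict(features):
    """The 'what': a single risk score."""
    return BIAS + sum(WEIGHTS[name] * value for name, value in features.items())


def explain(features):
    """The 'why': each feature's additive contribution, largest effect first."""
    contributions = {
        name: WEIGHTS[name] * value for name, value in features.items()
    }
    return sorted(contributions.items(), key=lambda kv: abs(kv[1]), reverse=True)


patient = {"age": 70, "prior_admissions": 2, "on_anticoagulant": 1}
score = predict(patient)
reasons = explain(patient)
```

For a linear model the contributions plus the bias add up to the score exactly, which is what makes it easy to audit. More complex models generally need dedicated explanation techniques to get a comparable breakdown.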
Interpretability
Source: https://christophm.github.io/interpretable-ml-book/interpretability-importance.html
If the ML model is interpretable and explainable, we can
check for the following traits:
• Fairness: why was “x” denied a credit limit upgrade? Is
there a racial bias in the data?
• Privacy: ensuring sensitive data in the information is
tracked and protected
• Robustness: testing that small changes in inputs don’t
lead to big changes in prediction
• Trust: humans trust a system that explains its decisions
more than a black box
When don’t we need interpretability?
• The problem is already well studied
• The model has no significant impact
• Interpretability would enable “gaming” of the ML system
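The robustness trait listed above can be checked with a simple perturbation test: nudge the inputs slightly and record the largest change in the prediction. The sketch below is an assumption-laden illustration; `max_prediction_shift` and the two toy models are hypothetical, and random sampling only probes robustness rather than proving it.

```python
import random


def max_prediction_shift(model, x, epsilon=0.01, trials=200, seed=0):
    """Largest prediction change seen when each input moves by at most epsilon."""
    rng = random.Random(seed)  # fixed seed keeps the check repeatable
    base = model(x)
    worst = 0.0
    for _ in range(trials):
        nudged = [v + rng.uniform(-epsilon, epsilon) for v in x]
        worst = max(worst, abs(model(nudged) - base))
    return worst


def smooth_model(x):
    """Toy model whose output changes gradually with its inputs."""
    return 0.5 * x[0] + 0.2 * x[1]


def brittle_model(x):
    """Toy model with a hard threshold exactly at the test point."""
    return 1.0 if x[0] >= 1.0 else 0.0


point = [1.0, 3.0]
smooth_shift = max_prediction_shift(smooth_model, point)
brittle_shift = max_prediction_shift(brittle_model, point)
```

The smooth model can move by at most 0.007 here (0.5 × 0.01 + 0.2 × 0.01), while the thresholded model flips its output completely for an equally small nudge.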
Conversational AI
Examining Bot Basics
Sources:
“Demystifying Big Data and Machine Learning for Healthcare” (Natarajan et al., CRC Press, 2017)
“Designing Bots: Creating Conversational Experiences” (Amir Shevat, O’Reilly, 2017)
Prashant Natarajan
AI & Bots: the Connections
Conversational AI & Bots
• Most bots are powered by ML/AI – though not all of
them
• Designing a great conversation is orthogonal, in most
cases, to the decision to use AI or another technology
• What can AI do for bots today?
– Natural Language Understanding (extracting & converting free
text to entities)
– Conversation management and context switching
– Computer vision and image recognition
– Prediction – finding patterns and predicting outcomes based on
past data
– Sentiment analysis – understanding emotional state
• Bot types: personal v team, super v domain-specific,
business v consumer, text v voice, Net New Service v
New Interfaces
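As a toy illustration of the natural language understanding and sentiment items above, the sketch below turns free text into an intent, a few entities, and a crude sentiment score. It is deliberately rule-based: production bots typically use trained models for these steps, and the intents, keyword sets, and entity patterns here are all hypothetical.

```python
import re

# Hypothetical intents and keyword sets, standing in for a trained NLU model.
INTENT_KEYWORDS = {
    "book_appointment": {"book", "schedule", "appointment"},
    "refill_prescription": {"refill", "prescription"},
}
POSITIVE = {"thanks", "great", "good"}
NEGATIVE = {"frustrated", "bad", "angry"}


def understand(text):
    """Convert free text into an intent, extracted entities, and a sentiment score."""
    lowered = text.lower()
    tokens = set(re.findall(r"[a-z']+", lowered))

    # Intent: the keyword set with the most overlap, or "unknown" if none match.
    scores = {name: len(tokens & words) for name, words in INTENT_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    intent = best if scores[best] > 0 else "unknown"

    # Entities: ISO dates and doses in milligrams.
    entities = {
        "dates": re.findall(r"\b\d{4}-\d{2}-\d{2}\b", text),
        "doses_mg": [int(n) for n in re.findall(r"\b(\d+)\s*mg\b", lowered)],
    }

    # Sentiment: positive word count minus negative word count.
    sentiment = len(tokens & POSITIVE) - len(tokens & NEGATIVE)
    return {"intent": intent, "entities": entities, "sentiment": sentiment}


parsed = understand(
    "I'm frustrated, please refill my 20 mg prescription by 2018-07-01"
)
```

The structured result is what the rest of the bot acts on; swapping the keyword rules for a trained classifier would not change that interface.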
Anatomy of a Bot
• Bot anatomies are important given that the primary
purpose of a bot is to recognize and help accomplish
human intent
• Anatomical features of a bot include
– Branding, personality and human involvement
– AI
– Conversation management: onboarding, flows, feedback/error
handling, help and support
– Rich interactions via files, audio, images, buttons, helpful links,
emojis, typing indicators, Web views
– Context and memory
– Engagement methods: notifications, user-led, subscriptions
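Several of the anatomical features above (onboarding, a conversation flow, error handling with human involvement, and per-user context and memory) can be shown in one small state machine. This is a hedged sketch, not a reference design: `AppointmentBot`, its states, and its replies are invented for illustration.

```python
DAYS = {"monday", "tuesday", "wednesday", "thursday", "friday"}


class AppointmentBot:
    """Toy bot with onboarding, one booking flow, fallbacks, and memory."""

    def __init__(self):
        self.memory = {}  # user_id -> context that persists between messages

    def reply(self, user_id, text):
        ctx = self.memory.get(user_id)
        if ctx is None:
            # Onboarding: the first contact creates the user's context.
            self.memory[user_id] = {"state": "idle", "misses": 0}
            return "Hi! I can book appointments. Say 'book' to start or 'help'."

        text = text.strip().lower()
        if text == "help":
            return "Say 'book' to schedule a visit."

        if ctx["state"] == "idle" and "book" in text:
            ctx["state"] = "awaiting_day"
            ctx["misses"] = 0
            return "Which day works for you?"

        if ctx["state"] == "awaiting_day":
            if text in DAYS:
                ctx["state"] = "idle"
                ctx["last_booking"] = text  # remembered for later turns
                return f"Booked for {text}."
            ctx["misses"] += 1
            if ctx["misses"] >= 2:
                # Error handling: hand over to a human after repeated misses.
                ctx["state"] = "idle"
                return "Let me connect you with a person."
            return "Sorry, I need a weekday such as 'monday'."

        return "Sorry, I didn't get that. Say 'help' for options."


bot = AppointmentBot()
transcript = [
    bot.reply("u1", "hello"),
    bot.reply("u1", "I want to book"),
    bot.reply("u1", "someday"),
    bot.reply("u1", "Friday"),
]
```

Keeping the context keyed by user is what lets the bot interpret "Friday" as an answer to its own earlier question rather than as a new request.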
Some Use Cases
• Conversational commerce – FB, Alexa, etc
• Bots for business – Slackbot, GitHub ChatOps
• Productivity and coaching – Lark, AHA, etc
• Alerts and notifications
• Router between humans (Uber, Lyft, scheduling bots)
• Customer service and FAQs
• 3rd party integration bots (Slack and CRM)
• Games and entertainment
• Brand bots
Poll Question 3:
How important do you feel FAIR* data
principles are to ensuring successful
outputs from AI projects?
A. Very important
B. Important
C. Neutral
D. Not important
E. Not very important
*FAIR: Findable, Accessible, Interoperable & Reusable
Audience Q&A
Please use the Question function in GoToWebinar
RWD and AI – how can they work together?
The next Pistoia Alliance CoE AI Webinar:
Date: TBD July 2018
check http://www.pistoiaalliance.org/events/ for the latest information
info@pistoiaalliance.org @pistoiaalliance www.pistoiaalliance.org
Thanks for your engagement
