SlideShare a Scribd company logo
1
Perfect Partnership – Machine Learning & CDISC
Kevin Lee
Director of Data Science
2
The views and opinions expressed in the following PowerPoint
slides are those of the individual presenter and should not be
attributed to Drug Information Association, Inc. (“DIA”), its
directors, officers, employees, volunteers, members, chapters,
councils, Communities or affiliates, or any organization with
which the presenter is employed or affiliated.
These PowerPoint slides are the intellectual property of the
individual presenter and are protected under the copyright laws
of the United States of America and other countries. Used by
permission. All rights reserved. Drug Information Association,
Drug Information Association Inc., DIA and DIA logo are
registered trademarks. All other trademarks are the property of
their respective owners.
Disclaimer – Content Slide
3
Q1 : World’s Largest Transportation Company?
In January 2018, there is about
7,000,000 “drivers”.
The uber is operating in 600
cities in 78 countries.
There has been 5 billion rides.
Uber – about 85% of US
hailing market.
4
Q2 : World’s Largest Accommodation Provider?
There are more than 4.5 million Airbnb listings in 81,000 cities.
Airbnb hosts have earned $41 billion in 10 years.
As of June’17, Airbnb’s total valuation is about $31 billion.
5
Common Characteristics of Exponential
Organization
Data
Algorithm (Machine Learning/AI)
Exponential &
Scalable Growth
6
How Data and Algorithms help the exponential
growth
More Data
Better
Algorithms
(ML/AI)
Better
Products
More
Users
7
What is Machine Learning?
An application of artificial
intelligence (AI) that
provides systems the
ability to automatically
learn and improve from
experience without being
explicitly programmed.
8
Explicit Programming vs Machine Learning
Explicit Programming Machine Learning
9
How does Human Learn? - Experience
10
How does Machine Learn?
Algorithm
Test Data (cats)
Real Data
cat
11
More data,
better model
X
Y
X
Y Different data,
different model
12
Data Quality in Machine Learning
Garbage in Garbage out
13
The economist – The world’s most valuable
resource is no longer oil, but DATA.
14
Can Pharmaceutical be the exponential organization?
Pharm =
exponential organization
????
15
Gold mines - Clinical trial data in
Pharmaceutical industries
Clean – Pharma companies spent a
lot of hours to clean the data.
Unbiased - Prospective study,
randomized
Blinded – double-blinded
Structured
Metadata
Standards – CDISC
Already pharmaceutical companies
already owned the data.
Data in Pharmaceutical industry
16
Main purpose - for the submission
No or limited analysis after submission
Do not know exactly where clinical trial data is
Limited/No access
No CDR (Central Data Repository)
What is the reality of clinical trial data
17
Clinical Trial Data with ML/AI for Patients/Doctor
Machine Learning/ AI
CDISC
Clinical
Trial
Data
Patients/
Doctors/
Healthcare
Providers/
Drug
Companies
18
CDISC with Machine Learning
Algorithm
CDISC
(Test
Data)
Real
World
Data
Prediction
19
CDISC + Machine
Learning
More than drug
Drugs &
Experience
Conclusion
What Apple gives us?
20
Kevin Lee
Director of Data Science
kevin.kyosun.lee@gmail.com
Twitter @HelloKevinLee
LinkedIn https://www.linkedin.com/in/HelloKevinLee/
Join the conversation #DIA2018
Thank You

More Related Content

PDF
Big data in pharmaceutical industry
PDF
Artificial Intelligence in Pharmaceutical Industry
PDF
AI in Healthcare 2017
PDF
Trends and issues of artificial intelligence in medical application tutors i...
PDF
TestIstanbul 2018 Openning Speech: Artificial Intelligence vs Human Mind
PDF
Artificial Intelligence In mobile Application Industry
PDF
Application of artificial intelligence medical robots in healthcare types and...
PPTX
Artificial intelligence
Big data in pharmaceutical industry
Artificial Intelligence in Pharmaceutical Industry
AI in Healthcare 2017
Trends and issues of artificial intelligence in medical application tutors i...
TestIstanbul 2018 Openning Speech: Artificial Intelligence vs Human Mind
Artificial Intelligence In mobile Application Industry
Application of artificial intelligence medical robots in healthcare types and...
Artificial intelligence

What's hot (20)

PDF
AI and The Future of the Workplace
PPTX
Digital healthcare show - How will Artificial Intelligence in healthcare will...
PDF
State of AI Report 2019
PDF
AI for Good Global Summit - 2017 Report
 
PDF
Artificial intelligence a bane or boon-pdf
PDF
PPTX
Ai in healthcare
PPTX
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
PPTX
10/28 Top 5 Deep Learning Stories
PPT
Data Mining and Knowledge Discovery in Business Databases
PPTX
Artificial Intelligence (AI): Applications in agriculture
PPTX
11/4 Top 5 Deep Learning Stories
PPTX
Artificial intelligence - Digital Readiness.
PDF
Teacher Education: Artificial Intelligence
PDF
A primer on Artificial Intelligence (AI) and Machine Learning (ML)
PPTX
Artificial intelligence in Health
PPTX
Top 5 Deep Learning Stories 2/24
PPTX
Artificial Intelligence.
PPTX
14 Startups Leading the Artificial Intelligence (AI) Revolution
PPTX
Artificial Intelligence and Current State of It
AI and The Future of the Workplace
Digital healthcare show - How will Artificial Intelligence in healthcare will...
State of AI Report 2019
AI for Good Global Summit - 2017 Report
 
Artificial intelligence a bane or boon-pdf
Ai in healthcare
APPLICATION OF ARTIFICIAL INTELLIGENCE TO TRACK PLANT DISEASES
10/28 Top 5 Deep Learning Stories
Data Mining and Knowledge Discovery in Business Databases
Artificial Intelligence (AI): Applications in agriculture
11/4 Top 5 Deep Learning Stories
Artificial intelligence - Digital Readiness.
Teacher Education: Artificial Intelligence
A primer on Artificial Intelligence (AI) and Machine Learning (ML)
Artificial intelligence in Health
Top 5 Deep Learning Stories 2/24
Artificial Intelligence.
14 Startups Leading the Artificial Intelligence (AI) Revolution
Artificial Intelligence and Current State of It
Ad

Similar to Perfect partnership - machine learning and CDISC standard data (20)

PDF
Data Science Ai And Machine Learning In Drug Development Harry Yang
PPTX
How Big Data is Transforming Medical Information Insights - DIA 2014
PPTX
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
PPTX
Data Science in Pharmaceutical Industry.pptx
PDF
Business Drivers PowerPoint Presentation Slides
PDF
Data Science in Action
PDF
Project Management Careers in Data Science
PPTX
Machine Learning - A Trending Tech Skill in 2020
PDF
Scope Of Data Science
PPTX
The Use of Artificial Intelligence and Machine Learning in Clinical Data Mana...
DOCX
How Data Providers Companies Are Powering Growth in the Digital Era.docx
PDF
ML & AI in Drug development: the hidden part of the iceberg
PPTX
Machine learning 060517
PDF
How to succeed at data without even trying!
PDF
The 3 Key Barriers Keeping Companies from Deploying Data Products
PDF
Product management for data science - Product Tank Meetup
PDF
Full download Artificial Intelligence in Data Mining: Theories and Applicatio...
PDF
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
PDF
Math in data
PDF
AI Pharma Summit Keynote Boston 7-26-17
Data Science Ai And Machine Learning In Drug Development Harry Yang
How Big Data is Transforming Medical Information Insights - DIA 2014
Maximize Your Understanding of Operational Realities in Manufacturing with Pr...
Data Science in Pharmaceutical Industry.pptx
Business Drivers PowerPoint Presentation Slides
Data Science in Action
Project Management Careers in Data Science
Machine Learning - A Trending Tech Skill in 2020
Scope Of Data Science
The Use of Artificial Intelligence and Machine Learning in Clinical Data Mana...
How Data Providers Companies Are Powering Growth in the Digital Era.docx
ML & AI in Drug development: the hidden part of the iceberg
Machine learning 060517
How to succeed at data without even trying!
The 3 Key Barriers Keeping Companies from Deploying Data Products
Product management for data science - Product Tank Meetup
Full download Artificial Intelligence in Data Mining: Theories and Applicatio...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Math in data
AI Pharma Summit Keynote Boston 7-26-17
Ad

More from Kevin Lee (20)

PDF
Patient’s Journey using Real World Data and its Advanced Analytics
PDF
Introduction of AWS Cloud Computing and its future for Biometric Department
PDF
A fear of missing out and a fear of messing up : A Strategic Roadmap for Chat...
PDF
Prompt it, not Google it - Prompt Engineering for Data Scientists
PPTX
Leading into the Unknown? Yes, we need Change Management Leadership
PDF
How to create SDTM DM.xpt using Python v1.1
PDF
Enterprise-level Transition from SAS to Open-source Programming for the whole...
PDF
How I became ML Engineer
PDF
Tell stories with jupyter notebook
PDF
Machine Learning : why we should know and how it works
PDF
Big data for SAS programmers
PDF
How FDA will reject non compliant electronic submission
PDF
End to end standards driven oncology study (solid tumor, Immunotherapy, Leuke...
PDF
Are you ready for Dec 17, 2016 - CDISC compliant data?
PDF
SAS integration with NoSQL data
PDF
Introduction of semantic technology for SAS programmers
PPTX
Standards Metadata Management (system)
PDF
Data centric SDLC for automated clinical data development
PPTX
Beyond regulatory submission - standards metadata management
PDF
Two different use cases to obtain best response using recist 11 sdtm and a ...
Patient’s Journey using Real World Data and its Advanced Analytics
Introduction of AWS Cloud Computing and its future for Biometric Department
A fear of missing out and a fear of messing up : A Strategic Roadmap for Chat...
Prompt it, not Google it - Prompt Engineering for Data Scientists
Leading into the Unknown? Yes, we need Change Management Leadership
How to create SDTM DM.xpt using Python v1.1
Enterprise-level Transition from SAS to Open-source Programming for the whole...
How I became ML Engineer
Tell stories with jupyter notebook
Machine Learning : why we should know and how it works
Big data for SAS programmers
How FDA will reject non compliant electronic submission
End to end standards driven oncology study (solid tumor, Immunotherapy, Leuke...
Are you ready for Dec 17, 2016 - CDISC compliant data?
SAS integration with NoSQL data
Introduction of semantic technology for SAS programmers
Standards Metadata Management (system)
Data centric SDLC for automated clinical data development
Beyond regulatory submission - standards metadata management
Two different use cases to obtain best response using recist 11 sdtm and a ...

Recently uploaded (20)

PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Introduction to Business Data Analytics.
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
IB Computer Science - Internal Assessment.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Database Infoormation System (DBIS).pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Mega Projects Data Mega Projects Data
PPT
Quality review (1)_presentation of this 21
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Fluorescence-microscope_Botany_detailed content
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Miokarditis (Inflamasi pada Otot Jantung)
Introduction to Business Data Analytics.
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
IB Computer Science - Internal Assessment.pptx
Reliability_Chapter_ presentation 1221.5784
Major-Components-ofNKJNNKNKNKNKronment.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Database Infoormation System (DBIS).pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
1_Introduction to advance data techniques.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Mega Projects Data Mega Projects Data
Quality review (1)_presentation of this 21
Acceptance and paychological effects of mandatory extra coach I classes.pptx

Perfect partnership - machine learning and CDISC standard data

  • 1. 1 Perfect Partnership – Machine Learning & CDISC Kevin Lee Director of Data Science
  • 2. 2 The views and opinions expressed in the following PowerPoint slides are those of the individual presenter and should not be attributed to Drug Information Association, Inc. (“DIA”), its directors, officers, employees, volunteers, members, chapters, councils, Communities or affiliates, or any organization with which the presenter is employed or affiliated. These PowerPoint slides are the intellectual property of the individual presenter and are protected under the copyright laws of the United States of America and other countries. Used by permission. All rights reserved. Drug Information Association, Drug Information Association Inc., DIA and DIA logo are registered trademarks. All other trademarks are the property of their respective owners. Disclaimer – Content Slide
  • 3. 3 Q1 : World’s Largest Transportation Company? In January 2018, there is about 7,000,000 “drivers”. The uber is operating in 600 cities in 78 countries. There has been 5 billion rides. Uber – about 85% of US hailing market.
  • 4. 4 Q2 : World’s Largest Accommodation Provider? There are more than 4.5 million Airbnb listings in 81,000 cities. Airbnb hosts have earned $41 billion in 10 years. As of June’17, Airbnb’s total valuation is about $31 billion.
  • 5. 5 Common Characteristics of Exponential Organization Data Algorithm (Machine Learning/AI) Exponential & Scalable Growth
  • 6. 6 How Data and Algorithms help the exponential growth More Data Better Algorithms (ML/AI) Better Products More Users
  • 7. 7 What is Machine Learning? An application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed.
  • 8. 8 Explicit Programming vs Machine Learning Explicit Programming Machine Learning
  • 9. 9 How does Human Learn? - Experience
  • 10. 10 How does Machine Learn? Algorithm Test Data (cats) Real Data cat
  • 11. 11 More data, better model X Y X Y Different data, different model
  • 12. 12 Data Quality in Machine Learning Garbage in Garbage out
  • 13. 13 The economist – The world’s most valuable resource is no longer oil, but DATA.
  • 14. 14 Can Pharmaceutical be the exponential organization? Pharm = exponential organization ????
  • 15. 15 Gold mines - Clinical trial data in Pharmaceutical industries Clean – Pharma companies spent a lot of hours to clean the data. Unbiased - Prospective study, randomized Blinded – double-blinded Structured Metadata Standards – CDISC Already pharmaceutical companies already owned the data. Data in Pharmaceutical industry
  • 16. 16 Main purpose - for the submission No or limited analysis after submission Do not know exactly where clinical trial data is Limited/No access No CDR (Central Data Repository) What is the reality of clinical trial data
  • 17. 17 Clinical Trial Data with ML/AI for Patients/Doctor Machine Learning/ AI CDISC Clinical Trial Data Patients/ Doctors/ Healthcare Providers/ Drug Companies
  • 18. 18 CDISC with Machine Learning Algorithm CDISC (Test Data) Real World Data Prediction
  • 19. 19 CDISC + Machine Learning More than drug Drugs & Experience Conclusion What Apple gives us?
  • 20. 20 Kevin Lee Director of Data Science kevin.kyosun.lee@gmail.com Twitter @HelloKevinLee LinkedIn https://www.linkedin.com/in/HelloKevinLee/ Join the conversation #DIA2018 Thank You