Moving from Big Data to
Better Models of Disease
and Drug Response
Joel Dudley, PhD
Director of Biomedical Informatics &
Assistant Professor of Genetics and Genomic Sciences,
Mount Sinai School of Medicine

Icahn School of Medicine at
Mount Sinai

@IcahnInstitute
Mount Sinai Health System Facts

7 member hospital campuses
>3,500 hospital beds
>3,100,000 patient visits
>6,000 physicians
Mount Sinai is attracting key talent to
thrive in a Big Data world

Demeter
There are rarely smoking guns in
human disease biology
We must embrace complexity to fully
understand human physiology and disease
That promise to enable the construction of molecular networks that define
the biological processes that comprise living systems
Figure labels: networks (non-coding RNA, protein, metabolite, transcriptional); organs and systems (heart, GI tract, kidney, immune system, vasculature, brain); environment
We must embrace complexity to fully
understand human physiology and disease
“A complex adaptive system has three characteristics. The
first is that the system consists of a number of
heterogeneous agents, and each of those agents makes
decisions about how to behave. The most important
dimension here is that those decisions will evolve over
time. The second characteristic is that the agents interact
with one another. That interaction leads to the third—
something that scientists call emergence: In a very real
way, the whole becomes greater than the sum of the parts.
The key issue is that you can’t really understand the whole
system by simply looking at its individual parts.”
- Michael J. Mauboussin (investment banker)
Although our ability to embrace complexity
will bump up against our desire to tell stories
Zeus, the sky god; when he is angry he throws
lightning bolts out of the sky

Ptolemaic astronomy: the earth is
the center of the universe

The earth is flat

Biological processes are driven by simple linearly
ordered pathways (e.g. TGF-beta signaling)
Integrating and modeling the digital universe of information
We need to be able to leverage the digital universe
of information to solve complex problems

1.8 zettabytes (1.8 trillion gigabytes) of information will be created and replicated in 2011, and growing fast (it has grown by a factor of 9 in just five years)

Last year, we cracked 1 zettabyte.
Source: 2011 IDC Digital Universe Study, sponsored by EMC
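The figures on this slide can be checked with a few lines of arithmetic. The sketch below assumes decimal (SI) units, which the slide does not state explicitly, and assumes the ninefold growth was spread evenly over the five years; the variable names are illustrative only.

```python
# Assumption: SI units (1 ZB = 10**21 bytes, 1 GB = 10**9 bytes).
ZB_IN_BYTES = 10**21
GB_IN_BYTES = 10**9

total_zb = 1.8
# Convert zettabytes to gigabytes: 1.8 ZB is 1.8e12 GB, i.e. 1.8 trillion GB.
total_gb = total_zb * ZB_IN_BYTES / GB_IN_BYTES

# Assumption: steady compound growth. A factor of 9 over 5 years
# corresponds to an annual multiplier of 9 ** (1/5).
growth_factor, years = 9, 5
annual_growth = growth_factor ** (1 / years)

print(f"{total_gb:.3g} GB, about {annual_growth:.2f}x per year")
```

Under these assumptions the yearly multiplier comes out a little above 1.5, so the volume of data grows by roughly half again each year.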
Being masters of really big data is now critical
for biomedical research (TB→PB→EB→ZB)

Organisms

Tissues

Single cells

Single cell, real-time, continuous?
Real time observation systems add complex
but powerful new dimensions to NGS

Inter Pulse Distance (IPD)
We measure more than we know
Exploring the transcriptional landscape of human disease
20k+ genes
~300 diseases and conditions

Blue: gene goes down in disease
Yellow: gene goes up in disease
Building molecular taxonomies of human
disease

Figure 2. Significant disease-disease similarities. (A) Hierarchical clustering of the disease correlations. The distance between two diseases was defined to be (1-correlation coefficient) of the two diseases. The tree was constructed using the average method of hierarchical clustering. The red line corresponds to a p-value of 0.01 and FDR of 10.37% and, disease correlations below this line are considered significant. The different colors represent the various categories of significant disease correlations. (B) The network of all the 138 significant disease correlations. The colo...

Suthram S, Dudley J et al. Network-based elucidation of human disease similarities reveals common functional modules enriched for pluripotent drug targets. PLoS Computational Biology (2010)
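The clustering described in the figure caption (distance defined as 1 minus the correlation coefficient, tree built with average linkage) can be sketched as follows. This is a minimal pure-Python illustration, not the authors' code: the function names, the choice of Pearson correlation, and the toy profiles are all assumptions made for the example, and the numbers are invented.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_linkage(profiles):
    """Agglomerative clustering with distance = 1 - correlation.

    profiles maps a disease name to its expression-change vector.
    Returns the merge history as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a tuple of disease names.
    """
    # Pairwise distances between individual diseases.
    dist = {
        frozenset((a, b)): 1.0 - pearson(profiles[a], profiles[b])
        for a, b in combinations(profiles, 2)
    }
    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        # Average linkage: mean of all member-to-member distances.
        def linkage(pair):
            c1, c2 = pair
            total = sum(dist[frozenset((a, b))] for a in c1 for b in c2)
            return total / (len(c1) * len(c2))

        c1, c2 = min(combinations(clusters, 2), key=linkage)
        merges.append((c1, c2, linkage((c1, c2))))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


# Hypothetical toy profiles (made-up numbers, not data from the paper).
toy = {
    "disease_A": [1.0, 2.0, 3.0, 4.0],
    "disease_B": [2.0, 4.0, 6.0, 8.5],
    "disease_C": [4.0, 3.0, 2.0, 1.0],
}
merges = average_linkage(toy)
for left, right, d in merges:
    print(left, right, round(d, 3))
```

In the toy input, the two profiles that rise together merge first at a distance near 0, while the anti-correlated profile joins last at a distance near 2, the maximum possible under this definition.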
Data Driven Approach to Connect Drugs
and Disease Using Molecular Profiles

Sirota, M., Dudley, J. T., et al. (2011). Discovery and Preclinical Validation of Drug Indications Using
Compendia of Public Gene Expression Data. Science Translational Medicine, 3(96).
Topiramate Reduces IBD Severity in a TNBS Rodent
Model of IBD
• TNBS chemically
induced rat model of
IBD
• Animals treated with
80 mg/kg oral topiramate
after sensitization
• Prednisolone positive
control (approved for
IBD in humans)

Dudley, J. T., Sirota, M., et al. (2011). Computational Repositioning of the Anticonvulsant Topiramate
for Inflammatory Bowel Disease. Science Translational Medicine, 3(96).
Approved compound for non-cancer indication prevents
formation of SCLC tumors in a genetic model of SCLC
p53/Rb/p130 triple knockout model of SCLC
Mice dosed after tumor formation
Figure panels: fold change of tumor volume vs. days of treatment; survival (MTT); RNA expression levels. Treatments: imipramine, promethazine, bepridil. Controls: saline, vehicle (water). Tumor types: SCLC, neuroblastoma tumors, PNETs.
Molecular networks act as sensors and mediators
of complex and adaptive cellular physiology
What we are about: Integrating big data across many
domains to build predictive models that improve how we
diagnose and treat disease

Population

Predictive Network Model

Sample
acquisition

Slide courtesy of Eric Schadt
Causal network models generate testable
predictions from in silico experiments
Ultimately want to drive decision making in drug discovery
Novel phosphatase
under development at
Merck for T2D
Network nodes: Grit, Sh3gl2, Prr7, PPM1L, C6, Insulin, Fat Mass, Irx3, Glra2, Atp1a3, Slc38a1, Glucose, Tcf7l2

Predictions derived from the predictive models
Slide courtesy of Eric Schadt

Increases fat mass
Negatively impacts
Hypertension genes

Lowers glucose

BAD

GOOD

Raises insulin
Predictions are great, but only meaningful if they are validated
Glucose lowered: GOOD
Fat mass increased: BAD
Blood pressure increased: BAD

Network nodes: Grit, Sh3gl2, Prr7, PPM1L, C6, Insulin, Fat Mass, Irx3, Glra2, Atp1a3, Slc38a1, Glucose, Tcf7l2

Slide courtesy of Eric Schadt
But wait, the network also shows PPM1L and PPARG
(target of Avandia) in a causal relationship

Network predicts:
- Avandia will lower glucose
- Avandia will make you fat
- Avandia will increase cardiovascular risk

Validation 2 years later:
Validation of network model prediction in a patient population
Leveraging NGS and Predictive Network
Models to Drive Personalized Cancer Therapy
Human cell system screening
Patient-specific mutant fly models
Patient-specific xenograft models
Somatic variation
Clinical
CD8 epitope prediction
Tumor RNA
Network integration
Tumor DNA
Germline DNA
Chemogenomic
Public data integration

Cancer Patient Profiling
Patient-Specific Analyses
Interpretation & Screens Informed by Patient-Specific Tumor Network
Personalized Report & Treatment Options Delivered to Clinician
Personalized multiscale tumor
networks to diagnose and treat cancers
Tumor biopsy + normal

Genomics Core Facility
(Illumina, PacBio, Ion)

RNA + DNA

= key driver

Key driver targeted therapy

Patient-specific subnetwork

Predictive network model of cancer
Personalized multiscale tumor
networks to diagnose and treat cancers
Tumor biopsy + normal

Genomics Core Facility
(Illumina, PacBio, Ion)

RNA + DNA

= key driver

Patient network targeted therapy

Patient-specific subnetwork

Predictive network model of cancer
Personalized multiscale networks to
model dynamics of complex disease

DNA
Cell-specific RNA
Cytokines
Clinical labs
Physiometrics

Th1
Th17

0:00 min
0:05 min
0:10 min
How to capture all of the clinical data exhaust?

CPOE
EMR
Billing

Telemetry
Data driven translational medicine
pipeline at Mount Sinai
BioBank
Research and Clinical Queries; Experiment Creation; etc.
Patient Traffic
Sequencing Facility
Clinical Labs
Clinical Data
Actionable Feedback
EMR (EPIC)
Data Warehouse
Disease Model Construction and Prediction Generation
Primary Data
High-Performance Computing
Multiscale analysis of patient networks
enables precision medicine

Genomic
Environment
Clinical
Multiscale measures of patients becoming
available through the Mount Sinai Biobank
Diagnoses

DNA

RNA

Drugs

Microbiome

Immune

Labs

Procedures
Image credit:
Li Li (ISMMS)
Many possible topological analyses can be driven using Mt.
Sinai genotype/phenotype data
Topological network generated using SNP
data separates patients by race
Low enrich. diabetes
High enrich. diabetes
DMSEA
DMSAA
DMSHA
DMSHA, diabetes enriched
The personal biosensor wave is forming
Printable tattoo biosensor


  • 43. Key challenge: incorporate data-driven models into clinical decision support at the point-of-care

    CLIPMERGE platform (figure labels): CLIPMERGE PGx saliva sample from consented BioMe participant; Mount Sinai Genetic Testing Laboratory; clinical genotype data; longitudinal clinical data; electronic health record; CLIPMERGE database; CRAE (rules for actionable gene/drug pairs); genome-informed CDS; reference material; drug information.

    Mockup alert: "This patient has been prescribed clopidogrel (Plavix®) and is a CYP2C19-poor metabolizer (*2/*2) according to genomic testing. Poor metabolizer status is associated with significantly diminished antiplatelet response to clopidogrel and increased risk for adverse cardiovascular events following percutaneous coronary intervention (PCI). If no contraindication, consider alternative medication from order set below. Click here to learn more."

    Mockup order set: "If no contraindication, consider prescribing an alternative medication. Click the medication name for further information including indications, dosage and contraindications." PRASUGREL (Effient®), TICAGRELOR (Brilinta®)

    Figure 1. A platform for the implementation of genome-informed clinical decision support (CDS). Saliva samples from BioMe patients sent to the Mount Sinai Genetic Testing Laboratory are subjected to clinical pharmacogenomic testing. Valid genotypes are released to the CLIPMERGE database, which also contains longitudinal clinical data extracted from the electronic health record (EHR). These data are assessed by the clinical risk assessment engine (CRAE), which contains prespecified rules relating actionable genotype–drug pairs to genome-informed advice messages. If a rule is fulfilled, decision support is delivered in real time via the EHR. A mockup of CDS for a clopidogrel (Plavix) poor metabolizer is shown, consisting of a text segment, a reference link, and an order set with suggested alternative medications.

    Erwin Bottinger, Omri Gottesman
  • 44. New from Oxford University Press: Exploring Personal Genomics, by Joel T. Dudley and Konrad J. Karczewski, foreword by George M. Church. Topics: disease risk modeling, visualization, pharmacogenomics, DNA-to-physiology, gene-by-environment, and more! http://exploringpersonalgenomics.org
  • 45. Thank you for your attention. Email: joel.dudley@mssm.edu; Twitter: @jdudley; Web: research.mssm.edu/dudley/. Icahn School of Medicine at Mount Sinai
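The rule check described on slide 43 (prespecified rules relating actionable genotype-drug pairs to advice messages, evaluated against a patient's prescription and genotype) might look roughly like the sketch below. It is illustrative only and is not the CLIPMERGE or CRAE implementation: the `Rule` schema, the `fired_rules` function, and the phenotype labels are hypothetical, and only the clopidogrel/CYP2C19 example and the two alternative drugs come from the slide's mockup.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One actionable genotype-drug pair and its advice (hypothetical schema)."""
    drug: str
    gene: str
    phenotype: str
    advice: str
    alternatives: tuple


# Rule content follows the clopidogrel mockup; the data layout is assumed.
RULES = [
    Rule(
        drug="clopidogrel",
        gene="CYP2C19",
        phenotype="poor metabolizer",
        advice="If no contraindication, consider alternative medication.",
        alternatives=("prasugrel", "ticagrelor"),
    ),
]


def fired_rules(prescribed_drug, phenotypes, rules=RULES):
    """Return every rule matched by the prescription and the patient's phenotypes."""
    drug = prescribed_drug.lower()
    return [
        r for r in rules
        if r.drug == drug and phenotypes.get(r.gene) == r.phenotype
    ]


# Hypothetical patient record, e.g. derived from a CYP2C19 *2/*2 genotype.
patient = {"CYP2C19": "poor metabolizer"}
for rule in fired_rules("Clopidogrel", patient):
    print(rule.advice, "Alternatives:", ", ".join(rule.alternatives))
```

Keeping the rules as plain data, separate from the matching function, is one way to let clinical content be reviewed and updated without touching the evaluation logic; whether the real system is organized this way is not stated in the slides.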