SlideShare a Scribd company logo
Big Data in Biomedicine:
Discovering new drugs and diagnostics
from 300 trillion points of data
Atul Butte, MD, PhD
Chief, Division of Systems Medicine,
Departments of Pediatrics, Genetics,
and, by courtesy, Computer Science,
Pathology, and Medicine
Center for Pediatric Bioinformatics, LPCH
Stanford University
abutte@stanford.edu
@atulbutte
@ImmPortDB
Disclosures
• Scientific founder and
advisory board membership
– Genstruct
– NuMedii
– Personalis
– Carmenta
• Honoraria for talks
– Lilly
– Pfizer
– Siemens
– Bristol Myers Squibb
– AstraZeneca
– Roche
– Genentech
• Past or present consultancy
– Lilly
– Johnson and Johnson
– Roche
– NuMedii
– Genstruct
– Tercica
– Ecoeos
– Ansh Labs
– Prevendia
– Samsung
– Assay Depot
– Regeneron
– Verinata
– Geisinger
– Covance
• Corporate Relationships
– Northrop Grumman
– Aptalis
– Thomson Reuters
• Speakers’ bureau
– None
• Companies started by students
– Carmenta
– Serendipity
– NuMedii
– Stimulomics
– NunaHealth
– Praedicat
– MyTime
– Flipora
Kilo
Mega
Giga
Tera
Peta
Exa
Zetta
2014 simr presentation
Big Data in
Biomedicine
2014 simr presentation
Perou CM. Nature Genetics 2001, 29:373.
2014 simr presentation
Nearly 1.4 million microarrays available
Doubles every 2-3 years
Butte AJ. Translational Bioinformatics:
coming of age. JAMIA, 2008.
2014 simr presentation
Public big data = retroactive crowd-sourcing
2014 simr presentation
2014 simr presentation
14
2014 simr presentation
2014 simr presentation
Protein
2014 simr presentation
2014 simr presentation
2014 simr presentation
2014 simr presentation
2014 simr presentation
Preeclampsia: large cause of maternal and
fetal death
• Incidence
• 5-8% of all pregnancies in the U.S. and worldwide
• 4.1 million births in the U.S. in 2009
• Up to 300K cases of preeclampsia annually in the U.S.
• Mortality
• Responsible for 18% of all maternal deaths in the U.S.
• Maternal death in 56 out of every 100,000 live births in US
• Neonatal death in 71 out of every 100,000 live births in US
• Cost
• $20 billion in direct costs in the U.S annually
• Average hospital stay of 3.5 days
Linda Liu
Matt Cooper
Bruce Ling
2014 simr presentation
New markers for preeclampsia
p value 3.49 X 10-41.79 X 10-5
ng/ml
p value = 1.92 X 10-8
Control
N=16
Preeclampsia
N=15
Control
N=16
Preeclampsia
N=17
GA 23-34 weeks GA > 34 weeks
ng/ml
Gestational age (weeks)
Linda Liu
Bruce Ling
Need a
diagnostic for
preeclampsia
Public big data
available
March of Dimes
Center for
Prematurity
Research
(Gift/Grant)
Data analyzed,
diagnostic
designed
SPARK grant
($50k)
Life Science
Angels, other
seed investors
($2 million)
27
2014 simr presentation
2014 simr presentation
2014 simr presentation
Lamb J, ..., Golub TR. Science, 2006.
Sirota M, Dudley JT, ..., Sweet-Cordero A, Sage J, Butte AJ.
Science Translational Medicine, 2011.
2014 simr presentation
2014 simr presentation
2014 simr presentation
Validation methods are increasingly
commoditized
2014 simr presentation
2014 simr presentation
Anti-seizure drug works against a rat model of
inflammatory bowel disease
Dudley JT, Sirota M, ..., Pasricha J, Butte AJ. Science Translational Medicine, 2011.
Marina Sirota
Joel Dudley
Mohan M Shenoy
Jay Pasricha
Rat colonoscopy Rat with
Inflammatory
Bowel Disease
Inflammatory
Bowel Disease
After
Anti-seizure Drug
Dudley JT, Sirota M, ..., Pasricha J, Butte AJ. Science Translational Medicine, 2011.
Anti-seizure drug works against a rat model of
inflammatory bowel disease
Anti-depressant Imipramine Shows Significant Activity
Against Small Cell Lung Cancer
Vehicle control Imipramine
p53/Rb/p130
triple knockout
model of SCLC
Mice dosed after
tumor formation
Joel Dudley
Nadine Jahchan
Julien Sage
Joel Neal
NuMedii
Cancer Discovery,
2013.
Need more
drugs for more
diseases
Public big data
available
NIH funding
LPFCH/CHI gift
Funds
Data analyzed,
method
designed
Company
launched,
ARRA, Stanford
license,
first deal
Claremont
Creek,
Lightspeed
($3.5 million)
42
Credit: Whitehead Institute and MIT
Sequencing Excitement
• Original genome: $3 bil, 13 yrs
• Helicos: $30k genome
• Pacific Biosystems: sequence
human genome in 15 minutes
• Run times in minutes
at a cost of hundreds of dollars
• 20 TB in 15 minutes
• Complete Genomics:
80 genomes/day
• Ion Torrent and
Illumina: ~$1500 per
genome
2014 simr presentation
Credit: Oxford Nanopore Technologies and Wired
2014 simr presentation
We are used to kids starting computer,
mobile, and internet companies in
garages and dorm rooms...
We are used to kids starting computer,
mobile, and internet companies in
garages and dorm rooms...
Maybe kids today need to start
“garage biotechs”?
2014 simr presentation
Collaborators
• Jeff Wiser, Patrick Dunn, Mike Atassi / Northrop Grumman
• Ashley Xia and Quan Chen / NIAID
• Takashi Kadowaki, Momoko Horikoshi, Kazuo Hara, Hiroshi Ohtsu / U Tokyo
• Kyoko Toda, Satoru Yamada, Junichiro Irie / Kitasato Univ and Hospital
• Shiro Maeda / RIKEN
• Alejandro Sweet-Cordero, Julien Sage / Pediatric Oncology
• Mark Davis, C. Garrison Fathman / Immunology
• Russ Altman, Steve Quake / Bioengineering
• Euan Ashley, Joseph Wu, Tom Quertermous / Cardiology
• Mike Snyder, Carlos Bustamante, Anne Brunet / Genetics
• Jay Pasricha / Gastroenterology
• Rob Tibshirani, Brad Efron / Statistics
• Hannah Valantine, Kiran Khush/ Cardiology
• Ken Weinberg / Pediatric Stem Cell Therapeutics
• Mark Musen, Nigam Shah / National Center for Biomedical Ontology
• Minnie Sarwal / Nephrology
• David Miklos / Oncology
Support
• Lucile Packard Foundation for Children's Health
• National Institutes of Health
• March of Dimes
• Hewlett Packard
• Howard Hughes Medical Institute
• California Institute for Regenerative Medicine
• Luke Evnin and Deann Wright (Scleroderma Research Foundation)
• Clayville Research Fund
• PhRMA Foundation
• Stanford Cancer Center, Bio-X, SPARK
• Tarangini Deshpande
• Kimayani Butte
• Hugh O’Brodovich
• Isaac Kohane
Admin and Tech Staff
• Susan Aptekar
• Jen Cory
• Boris Oskotsky

More Related Content

PPTX
Atul Butte's presentation at ASHG 2014
PPTX
Presentation given at UCSF Precision Medicine meeting 4/11/2015
PPTX
Atul Butte's presentation at JGI March 2015
PPTX
Atul Butte's presentation at the 2015 AMIA Fall Symposium
PDF
2014 07 ismb personalized medicine
PDF
Atul Butte's presentation at LINCS 2013
PPTX
2014 farr institute presentation
PPTX
Precision Medicine World Conference 2017
Atul Butte's presentation at ASHG 2014
Presentation given at UCSF Precision Medicine meeting 4/11/2015
Atul Butte's presentation at JGI March 2015
Atul Butte's presentation at the 2015 AMIA Fall Symposium
2014 07 ismb personalized medicine
Atul Butte's presentation at LINCS 2013
2014 farr institute presentation
Precision Medicine World Conference 2017

What's hot (20)

PPTX
Atul Butte's AAPS keynote presentation 6/2015
PPTX
2015-11 Atul Butte's Presentation at Exponential Medicine
PDF
2013 05 society for clinical trials
PPTX
Presentation on Research Reproducibility at Friends of the National Library o...
PPTX
Atul Butte's presentation at the Milken Institute Public Health Summit
PPTX
Atul Butte's presentation to the Association of Medical School Pediatric Depa...
PDF
2013 09 atul butte mahajani symposium
PPTX
Atul Butte's AAPS big data workshop presentation 6/2015
PPTX
Atul Butte's presentation for the FDA 5th Annual Scientific Computing Days
PPTX
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
PPTX
Atul Butte NIPS 2017 ML4H
PPTX
The Uneven Future of Evidence-Based Medicine
PPTX
Intro: California Initiative to Advance Precision Medicine Workshop
PDF
2013 01 pmwc atul butte scrubbed
PPTX
Atul Butte presentation on 2019-02-05 for Accelerating biology 2019: Towards ...
PPTX
Atul Butte's presentation at the From Data to Discovery symposium at Westat
PDF
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
PPTX
Presentation at ISMB NIH Office of Data Science Strategy Panel
PDF
Presentation for the CSIR Fourth Paradigm Institute Silver Jubilee (Bangalore...
PPTX
Atul Butte presentation at the Morris Collen 100 birthday celebration
Atul Butte's AAPS keynote presentation 6/2015
2015-11 Atul Butte's Presentation at Exponential Medicine
2013 05 society for clinical trials
Presentation on Research Reproducibility at Friends of the National Library o...
Atul Butte's presentation at the Milken Institute Public Health Summit
Atul Butte's presentation to the Association of Medical School Pediatric Depa...
2013 09 atul butte mahajani symposium
Atul Butte's AAPS big data workshop presentation 6/2015
Atul Butte's presentation for the FDA 5th Annual Scientific Computing Days
2015-04-28 Atul Butte's presentation to the NIH Precision Medicine Initiative...
Atul Butte NIPS 2017 ML4H
The Uneven Future of Evidence-Based Medicine
Intro: California Initiative to Advance Precision Medicine Workshop
2013 01 pmwc atul butte scrubbed
Atul Butte presentation on 2019-02-05 for Accelerating biology 2019: Towards ...
Atul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at #AMIA2021 for the Knowledge Discovery and Data M...
Presentation at ISMB NIH Office of Data Science Strategy Panel
Presentation for the CSIR Fourth Paradigm Institute Silver Jubilee (Bangalore...
Atul Butte presentation at the Morris Collen 100 birthday celebration
Ad

Similar to 2014 simr presentation (13)

PPTX
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
PPTX
Atul Butte's presentation at CTIC 2020
PDF
Stellate Cells in Health and Disease 1st Edition Chandrashekhar Gandhi Phd
PPTX
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
PDF
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
PPTX
2013 03 genomic medicine slides
PPTX
The Learning Health System: Thinking and Acting Across Scales
PDF
BIG DATA paper
PDF
Health Science Research A Handbook of Quantitative Methods 1st Edition Jennif...
PDF
Pgd discussion challengesconcerns
PPT
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
PDF
BioData West 2017 Brochure.PDF
PDF
Academic aspect of Animal Research and its Application
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Atul Butte's presentation at CTIC 2020
Stellate Cells in Health and Disease 1st Edition Chandrashekhar Gandhi Phd
Presentation by Atul Butte at the NSTC Interagency Working Group on Biologica...
2020-03-08 Atul Butte's keynote for the AMIA Virtual Informatics Summit
2013 03 genomic medicine slides
The Learning Health System: Thinking and Acting Across Scales
BIG DATA paper
Health Science Research A Handbook of Quantitative Methods 1st Edition Jennif...
Pgd discussion challengesconcerns
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
BioData West 2017 Brochure.PDF
Academic aspect of Animal Research and its Application
Ad

Recently uploaded (20)

PDF
The Digestive System Science Educational Presentation in Dark Orange, Blue, a...
PDF
The_EHRA_Book_of_Interventional Electrophysiology.pdf
PPTX
Human Reproduction: Anatomy, Physiology & Clinical Insights.pptx
PPTX
09. Diabetes in Pregnancy/ gestational.pptx
PPTX
NRP and care of Newborn.pptx- APPT presentation about neonatal resuscitation ...
PDF
focused on the development and application of glycoHILIC, pepHILIC, and comm...
PPTX
Post Op complications in general surgery
PPTX
Manage HIV exposed child and a child with HIV infection.pptx
PPT
Rheumatology Member of Royal College of Physicians.ppt
PPTX
thio and propofol mechanism and uses.pptx
PPTX
Vaccines and immunization including cold chain , Open vial policy.pptx
PDF
B C German Homoeopathy Medicineby Dr Brij Mohan Prasad
PDF
MNEMONICS MNEMONICS MNEMONICS MNEMONICS s
PPTX
Effects of lipid metabolism 22 asfelagi.pptx
PPTX
NUCLEAR-MEDICINE-Copy.pptxbabaabahahahaahha
PPTX
HYPERSENSITIVITY REACTIONS - Pathophysiology Notes for Second Year Pharm D St...
PPTX
Reading between the Rings: Imaging in Brain Infections
PDF
Plant-Based Antimicrobials: A New Hope for Treating Diarrhea in HIV Patients...
PPTX
Hearthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
PDF
Comparison of Swim-Up and Microfluidic Sperm Sorting.pdf
The Digestive System Science Educational Presentation in Dark Orange, Blue, a...
The_EHRA_Book_of_Interventional Electrophysiology.pdf
Human Reproduction: Anatomy, Physiology & Clinical Insights.pptx
09. Diabetes in Pregnancy/ gestational.pptx
NRP and care of Newborn.pptx- APPT presentation about neonatal resuscitation ...
focused on the development and application of glycoHILIC, pepHILIC, and comm...
Post Op complications in general surgery
Manage HIV exposed child and a child with HIV infection.pptx
Rheumatology Member of Royal College of Physicians.ppt
thio and propofol mechanism and uses.pptx
Vaccines and immunization including cold chain , Open vial policy.pptx
B C German Homoeopathy Medicineby Dr Brij Mohan Prasad
MNEMONICS MNEMONICS MNEMONICS MNEMONICS s
Effects of lipid metabolism 22 asfelagi.pptx
NUCLEAR-MEDICINE-Copy.pptxbabaabahahahaahha
HYPERSENSITIVITY REACTIONS - Pathophysiology Notes for Second Year Pharm D St...
Reading between the Rings: Imaging in Brain Infections
Plant-Based Antimicrobials: A New Hope for Treating Diarrhea in HIV Patients...
Hearthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
Comparison of Swim-Up and Microfluidic Sperm Sorting.pdf

2014 simr presentation

  • 1. Big Data in Biomedicine: Discovering new drugs and diagnostics from 300 trillion points of data Atul Butte, MD, PhD Chief, Division of Systems Medicine, Departments of Pediatrics, Genetics, and, by courtesy, Computer Science, Pathology, and Medicine Center for Pediatric Bioinformatics, LPCH Stanford University abutte@stanford.edu @atulbutte @ImmPortDB
  • 2. Disclosures • Scientific founder and advisory board membership – Genstruct – NuMedii – Personalis – Carmenta • Honoraria for talks – Lilly – Pfizer – Siemens – Bristol Myers Squibb – AstraZeneca – Roche – Genentech • Past or present consultancy – Lilly – Johnson and Johnson – Roche – NuMedii – Genstruct – Tercica – Ecoeos – Ansh Labs – Prevendia – Samsung – Assay Depot – Regeneron – Verinata – Geisinger – Covance • Corporate Relationships – Northrop Grumman – Aptalis – Thomson Reuters • Speakers’ bureau – None • Companies started by students – Carmenta – Serendipity – NuMedii – Stimulomics – NunaHealth – Praedicat – MyTime – Flipora
  • 7. Perou CM. Nature Genetics 2001, 29:373.
  • 9. Nearly 1.4 million microarrays available Doubles every 2-3 years Butte AJ. Translational Bioinformatics: coming of age. JAMIA, 2008.
  • 11. Public big data = retroactive crowd-sourcing
  • 14. 14
  • 23. Preeclampsia: large cause of maternal and fetal death • Incidence • 5-8% of all pregnancies in the U.S. and worldwide • 4.1 million births in the U.S. in 2009 • Up to 300K cases of preeclampsia annually in the U.S. • Mortality • Responsible for 18% of all maternal deaths in the U.S. • Maternal death in 56 out of every 100,000 live births in US • Neonatal death in 71 out of every 100,000 live births in US • Cost • $20 billion in direct costs in the U.S annually • Average hospital stay of 3.5 days Linda Liu Matt Cooper Bruce Ling
  • 25. New markers for preeclampsia p value 3.49 X 10-41.79 X 10-5 ng/ml p value = 1.92 X 10-8 Control N=16 Preeclampsia N=15 Control N=16 Preeclampsia N=17 GA 23-34 weeks GA > 34 weeks ng/ml Gestational age (weeks) Linda Liu Bruce Ling
  • 26. Need a diagnostic for preeclampsia Public big data available March of Dimes Center for Prematurity Research (Gift/Grant) Data analyzed, diagnostic designed SPARK grant ($50k) Life Science Angels, other seed investors ($2 million)
  • 27. 27
  • 31. Lamb J, ..., Golub TR. Science, 2006. Sirota M, Dudley JT, ..., Sweet-Cordero A, Sage J, Butte AJ. Science Translational Medicine, 2011.
  • 35. Validation methods are increasingly commoditized
  • 38. Anti-seizure drug works against a rat model of inflammatory bowel disease Dudley JT, Sirota M, ..., Pasricha J, Butte AJ. Science Translational Medicine, 2011. Marina Sirota Joel Dudley Mohan M Shenoy Jay Pasricha
  • 39. Rat colonoscopy Rat with Inflammatory Bowel Disease Inflammatory Bowel Disease After Anti-seizure Drug Dudley JT, Sirota M, ..., Pasricha J, Butte AJ. Science Translational Medicine, 2011. Anti-seizure drug works against a rat model of inflammatory bowel disease
  • 40. Anti-depressant Imipramine Shows Significant Activity Against Small Cell Lung Cancer Vehicle control Imipramine p53/Rb/p130 triple knockout model of SCLC Mice dosed after tumor formation Joel Dudley Nadine Jahchan Julien Sage Joel Neal NuMedii Cancer Discovery, 2013.
  • 41. Need more drugs for more diseases Public big data available NIH funding LPFCH/CHI gift Funds Data analyzed, method designed Company launched, ARRA, Stanford license, first deal Claremont Creek, Lightspeed ($3.5 million)
  • 42. 42
  • 44. Sequencing Excitement • Original genome: $3 bil, 13 yrs • Helicos: $30k genome • Pacific Biosystems: sequence human genome in 15 minutes • Run times in minutes at a cost of hundreds of dollars • 20 TB in 15 minutes • Complete Genomics: 80 genomes/day • Ion Torrent and Illumina: ~$1500 per genome
  • 46. Credit: Oxford Nanopore Technologies and Wired
  • 48. We are used to kids starting computer, mobile, and internet companies in garages and dorm rooms...
  • 49. We are used to kids starting computer, mobile, and internet companies in garages and dorm rooms... Maybe kids today need to start “garage biotechs”?
  • 51. Collaborators • Jeff Wiser, Patrick Dunn, Mike Atassi / Northrop Grumman • Ashley Xia and Quan Chen / NIAID • Takashi Kadowaki, Momoko Horikoshi, Kazuo Hara, Hiroshi Ohtsu / U Tokyo • Kyoko Toda, Satoru Yamada, Junichiro Irie / Kitasato Univ and Hospital • Shiro Maeda / RIKEN • Alejandro Sweet-Cordero, Julien Sage / Pediatric Oncology • Mark Davis, C. Garrison Fathman / Immunology • Russ Altman, Steve Quake / Bioengineering • Euan Ashley, Joseph Wu, Tom Quertermous / Cardiology • Mike Snyder, Carlos Bustamante, Anne Brunet / Genetics • Jay Pasricha / Gastroenterology • Rob Tibshirani, Brad Efron / Statistics • Hannah Valantine, Kiran Khush/ Cardiology • Ken Weinberg / Pediatric Stem Cell Therapeutics • Mark Musen, Nigam Shah / National Center for Biomedical Ontology • Minnie Sarwal / Nephrology • David Miklos / Oncology
  • 52. Support • Lucile Packard Foundation for Children's Health • National Institutes of Health • March of Dimes • Hewlett Packard • Howard Hughes Medical Institute • California Institute for Regenerative Medicine • Luke Evnin and Deann Wright (Scleroderma Research Foundation) • Clayville Research Fund • PhRMA Foundation • Stanford Cancer Center, Bio-X, SPARK • Tarangini Deshpande • Kimayani Butte • Hugh O’Brodovich • Isaac Kohane Admin and Tech Staff • Susan Aptekar • Jen Cory • Boris Oskotsky