SlideShare a Scribd company logo
Big Data in Biomedicine – An NIH
Perspective
Philip E. Bourne Ph.D., FACMI
Associate Director for Data Science
National Institutes of Health
philip.bourne@nih.gov
IEEE BIBM Nov 10 2015, Washington DC
http://www.slideshare.net/pebourne
Perspective
 Structural bioinformatics researcher
 Former custodian of the PDB
 Obsessive about open science e.g., PLOS
 NIH-wide responsibility for developments in
data science – responding to the disruption
Bioinformatics 2015 31(1):146-50
Big Data in Biomedicine…
This speaks to something more
fundamental that more data …
It speaks to new methodologies, new
skills, new emphasis, new cultures,
new modes of discovery …
Consider this change from my own
career experience ….
The History of Computational
Biomedicine According to Bourne
1980s 1990s 2000s 2010s 2020
Discipline:
Unknown Expt. Driven Emergent Over-sold A Service A Partner A Driver
The Raw Material:
Non-existent Limited /Poor More/Ontologies Big Data/Siloed Open/Integrated
The People:
No name Technicians Industry recognition data scientists Academics
Searls (ed) The Roots in Bioinformatics Series PLOS Comp Biol
Premise:
We are entering a period of disruption
in biomedical research and we should
all be thinking about what this means
to bioinformatics & biomedicine
http://i1.wp.com/chisconsult.com/wp-
content/uploads/2013/05/disruption-is-a-
process.jpg
http://cdn2.hubspot.net/hubfs/418817/disruption1.jpg
We are at a Point of Deception …
 Evidence:
– Google car
– 3D printers
– Waze
– Robotics
– Sensors
From: The Second Machine Age: Work, Progress,
and Prosperity in a Time of Brilliant Technologies
by Erik Brynjolfsson & Andrew McAfee
Disruption: Example - Photography
Digitization
Deception
Disruption
Demonetization
Dematerialization
Democratization
Time
Volume,Velocity,Variety
Digital camera invented by
Kodak but shelved
Megapixels & quality improve slowly;
Kodak slow to react
Film market collapses;
Kodak goes bankrupt
Phones replace
cameras
Instagram,
Flickr become the
value proposition
Digital media becomes bona fide
form of communication
Disruption: Biomedical Research
Digitization of Basic &
Clinical Research & EHR’s
Deception
We Are Here
Disruption
Demonetization
Dematerialization
Democratization
Open science
Patient centered health care
Disruptive Features: Sustainability
Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
Disruptive Features:
Reproducibility
Changing Value of Scholarship (?)
“And that’s why we’re here today. Because something
called precision medicine … gives us one of the greatest
opportunities for new medical breakthroughs that we
have ever seen.”
President Barack Obama
January 30, 2015
Disruptive Features – New Science
An Example of That Promise:
Comorbidity Network for 6.2M Danes
Over 14.9 Years
Jensen et al 2014 Nat Comm 5:4022
What Are Some General Implications
of Such a Future?
 Open collaborative science becomes of increasing
importance
 The value of data and associated analytics becomes
of increasing value to scholarship
 Opportunities exist to improve the efficiency of the
research enterprise and hence fund more research
 Cooperation between funders will be needed to
sustain the emergent digital enterprise
 Current training content and modalities will not match
supply to demand
 Balancing accessibility vs security becomes more
important yet more complex
How Should We Respond?
 Funders: Encourage change and facilitate an orderly
transition
 Academic Leaders: Respond and facilitate a cultural
shift
 Developers: Develop working environments that are
more adaptive and capable of answers questions in a
more efficient and hopefully accurate way
 Users: Use the above environments
 Publishers: Move beyond papers
Take an Example That is Central to
What We Do
Molecular Graphics
Is It Optimal for Today’s Science?
http://upload.wikimedia.org/wikipedia/commons/2/2e/M
olecular-Graphics-GRIP-75-Console.jpg
Good News/Bad News
 Good News:
– It is harder to think of a
more powerful way to
comprehend complex
data
– It has excited
generations to the
promise of science
– It has adapted to
changing technologies
 Bad News:
– It is not an
adaptive/extensible
environment
– It is not a collaborative
environment
– It is not an integrative
environment
– It is the curse of the
ribbon
BMC Bioinformatics 2005, 6:21
1. A link brings up figures
from the paper
0. Full text of PLoS papers stored
in a database
2. Clicking the paper figure retrieves
data from the PDB which is
analyzed
3. A composite view of
journal and database
content results
Is a database
really different
than a
biological
journal?
PloS Comp Biol
2005 1(3) e34
4. The composite view has
links to pertinent blocks
of literature text and back to the PDB
1.
2.
3.
4.
The Knowledge and Data Cycle
Take Another Example:
The Raw Material of Structural
Bioinformatics
Is this the optimal starting point anymore?
Do data resources including the PDB
best serve the needs of the user at
this point?
Good News/Bad News for the PDB in
this Changing Landscape
 Bad News:
– Interface complex and
uni-data oriented
– Data accessible;
methods accessible (sort
of); but not together
– Significant redundancy in
services offered
 Good News:
– Annotation!
– Demand is increasing
– Integrated with other
data types
– Restful services
General Problem Statement:
How to insure a high quality
annotated data source that provides
the optimal environment for
accessibility, integration and analysis
by a broad community of diverse
users?
 The Commons is a shared virtual space which is
FAIR:
– Find
– Access (use effectively)
– Interoperate
– Reuse
 An environment to find and catalyze the use of
shared digital research objects
The Commons
Concept
The Developer or User Defines the
Environment from the Appropriate
Building Blocks
Public Beacons
Host Content
AMPLab 1000 Genomes Project
Broad Institute ExAC
Curoverse PGP, GA4GH Example Data
EBI
1000 Genomes Project, UK10K, GoNL, EVS,
GEUVADIS, UMCG Cardio GenePanel
Google
1000 Genomes Project, Phase III, Illumina Platinum
Genomes
ISB Known VARiants
NCBI NHLBI Exome Sequence Project
OICR 55 cancer datasets
SolveBio 56 public datasets
UCSC ClinVar, LOVD, UniProt
University of Leicester Cafe CardioKit, Cafe Variome Central
WTSI IBD, Native American, Egyptian, UK10K
Over ?? public datasets beaconized across 21 institutions
10s thousands of individuals
Big Data in Biomedicine – An NIH Perspective
The Commons
Components
 Computing environment
– cloud or HPC (High Performance Computing)
– supports access, utilization, sharing and storage of
digital objects.
 Methods for Interoperability
– enables connectivity, shareability and interoperability
between digital objects.
 Digital object compliance model
– describes the properties of digital objects that
enables them to be discoverable and shareable.
The Commons
Components
BD2K
Center
BD2K
Center
BD2K
Center
BD2K
Center
BD2K
Center
BD2K
Center
DDICC
Software
Standard
s
Infrastructure - The
Commons
Labs
Labs
Labs
Labs
The ability to store and share and
compute on digital research objects
 Especially useful for large data sets that
are not easily computed locally
Scalable and Elastic
Pay per use - Cost effective
An environment that fosters
collaboration
The Commons
Computing Environment: Cloud
Commons - Pilots
 The Cloud Credits - business model
 BD2K Centers
 MODs (Model Organism Databases)
 HMP Data and tools available in the cloud
 NCI Cloud Pilots & Genomic Data
Commons
The PDB in the Commons
 Components:
– Annotated collection of data files
– API’s to access these data files
– Example methods using these APIs
 Potential outcomes
– Nothing happens?
– A new breed of developer starts to use PDB data in new
ways ?
– The casual user has a broader set of services that
previously?
– Quality declines/increases?
I not only use all the brains
I have, but all I can borrow.
– Woodrow Wilson
ADDS Team
BD2K Representatives
NIHNIH……
Turning Discovery Into HealthTurning Discovery Into Health
philip.bourne@nih.gov
https://datascience.nih.gov/
http://www.ncbi.nlm.nih.gov/research/staff/bourne/

More Related Content

PPT
Open Data in a Global Ecosystem
PPT
Data Analytics
PPTX
A SWOT Analysis of Data Science @ NIH
PPT
BD2K Update
PPTX
The Commons: Leveraging the Power of the Cloud for Big Data
PPTX
Big Data as a Catalyst for Collaboration & Innovation
PPTX
Towards the Digital Research Enterprise
PPTX
Understanding the Big Data Enterprise
Open Data in a Global Ecosystem
Data Analytics
A SWOT Analysis of Data Science @ NIH
BD2K Update
The Commons: Leveraging the Power of the Cloud for Big Data
Big Data as a Catalyst for Collaboration & Innovation
Towards the Digital Research Enterprise
Understanding the Big Data Enterprise

What's hot (20)

PPT
There is No Intelligent Life Down Here
PPT
Data Science BD2K Update for NIH
PPTX
From Where Have We Come & Where Are We Going
PPT
The Vision for Data @ the NIH
PPT
The NIH as a Digital Enterprise: Implications for PAG
PPTX
SWOT Analysis - What Does it Tell Us?
PPT
Meeting the Computational Challenges Associated with Human Health
PPTX
Highlights from NIH Data Science
PPT
Big Data in Biomedicine: Where is the NIH Headed
PPT
Data Science in Biomedicine - Where Are We Headed?
PPTX
The Analytics and Data Science Landscape
PPT
A Successful Academic Medical Center Must be a Truly Digital Enterprise
PPT
Health Policy and Management as it Relates to Big Data
PPT
Yale Day of Data
PPT
AMIA 2014
PPTX
Data Harmonization for a Molecularly Driven Health System
PPTX
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
PPT
The Thinking Behind Big Data at the NIH
PPTX
Methods for measuring citizen-science impact
PPTX
Moving Forward with Open Data Science - SWOT Analysis
There is No Intelligent Life Down Here
Data Science BD2K Update for NIH
From Where Have We Come & Where Are We Going
The Vision for Data @ the NIH
The NIH as a Digital Enterprise: Implications for PAG
SWOT Analysis - What Does it Tell Us?
Meeting the Computational Challenges Associated with Human Health
Highlights from NIH Data Science
Big Data in Biomedicine: Where is the NIH Headed
Data Science in Biomedicine - Where Are We Headed?
The Analytics and Data Science Landscape
A Successful Academic Medical Center Must be a Truly Digital Enterprise
Health Policy and Management as it Relates to Big Data
Yale Day of Data
AMIA 2014
Data Harmonization for a Molecularly Driven Health System
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
The Thinking Behind Big Data at the NIH
Methods for measuring citizen-science impact
Moving Forward with Open Data Science - SWOT Analysis
Ad

Viewers also liked (8)

PDF
Big Data Meets Biomedicine: Opportunities & Challenges
PPTX
Big data processing using hadoop poster presentation
PPTX
Using Big Data for Improved Healthcare Operations and Analytics
PPTX
Driving to Market - V2! - How to "Drive" Competitive Advantage in your Go To ...
PPTX
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
PDF
Big Data Solutions for Healthcare
PPTX
2016 Future of Cloud Computing Study
PPTX
Big data ppt
Big Data Meets Biomedicine: Opportunities & Challenges
Big data processing using hadoop poster presentation
Using Big Data for Improved Healthcare Operations and Analytics
Driving to Market - V2! - How to "Drive" Competitive Advantage in your Go To ...
Big Data in Healthcare Made Simple: Where It Stands Today and Where It’s Going
Big Data Solutions for Healthcare
2016 Future of Cloud Computing Study
Big data ppt
Ad

Similar to Big Data in Biomedicine – An NIH Perspective (20)

PPT
The PDB An Exemplar for Data Science To Date, But What About the Future?
PPT
Human Genome and Big Data Challenges
PPT
Data at the NIH
PPTX
Will Biomedical Research Fundamentally Change in the Era of Big Data?
PPT
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
PPT
PhRMA Some Early Thoughts
PPTX
Biomedical Data Sciences - New Name and New Opportunities for Change?
PPT
Some Early Thoughts
PPTX
Data Science Meets Biomedicine, Does Anything Change
PPT
PPT
Biomedical Research as Part of the Digital Enterprise
PPT
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
PPTX
Bioinformatics in the Era of Open Science and Big Data
PDF
On community-standards, data curation and scholarly communication - BITS, Ita...
PPTX
Komatsoulis internet2 global forum 2015
PPT
The Future of Research (Science and Technology)
PDF
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
PPT
Data at the NIH: Some Early Thoughts
PPTX
Data commons bonazzi bd2 k fundamentals of science feb 2017
PPTX
Data Science and AI in Biomedicine: The World has Changed
The PDB An Exemplar for Data Science To Date, But What About the Future?
Human Genome and Big Data Challenges
Data at the NIH
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
PhRMA Some Early Thoughts
Biomedical Data Sciences - New Name and New Opportunities for Change?
Some Early Thoughts
Data Science Meets Biomedicine, Does Anything Change
Biomedical Research as Part of the Digital Enterprise
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
Bioinformatics in the Era of Open Science and Big Data
On community-standards, data curation and scholarly communication - BITS, Ita...
Komatsoulis internet2 global forum 2015
The Future of Research (Science and Technology)
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
Data at the NIH: Some Early Thoughts
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data Science and AI in Biomedicine: The World has Changed

More from Philip Bourne (20)

PPTX
Your Science Needs You - More Than Ever Before
PPTX
The Biological Data Sustainability Paradox: A Time to Think Differently
PPTX
Data Science and AI in Biomedicine: The World has Changed
PPTX
AI in Medical Education A Meta View to Start a Conversation
PPTX
AI+ Now and Then How Did We Get Here And Where Are We Going
PPTX
Thoughts on Biological Data Sustainability
PPTX
What is FAIR Data and Who Needs It?
PPTX
Data Science Meets Drug Discovery
PPTX
Biomedical Data Science: We Are Not Alone
PPTX
BIMS7100-2023. Social Responsibility in Research
PPTX
AI from the Perspective of a School of Data Science
PPTX
What Data Science Will Mean to You - One Person's View
PPTX
Novo Nordisk 080522.pptx
PPTX
Towards a US Open research Commons (ORC)
PPTX
COVID and Precision Education
PPTX
One View of Data Science
PPTX
Cancer Research Meets Data Science — What Can We Do Together?
PPTX
Data Science Meets Open Scholarship – What Comes Next?
PPTX
Data to Advance Sustainability
PPTX
Frontiers of Computing at the Cellular and Molecular Scales
Your Science Needs You - More Than Ever Before
The Biological Data Sustainability Paradox: A Time to Think Differently
Data Science and AI in Biomedicine: The World has Changed
AI in Medical Education A Meta View to Start a Conversation
AI+ Now and Then How Did We Get Here And Where Are We Going
Thoughts on Biological Data Sustainability
What is FAIR Data and Who Needs It?
Data Science Meets Drug Discovery
Biomedical Data Science: We Are Not Alone
BIMS7100-2023. Social Responsibility in Research
AI from the Perspective of a School of Data Science
What Data Science Will Mean to You - One Person's View
Novo Nordisk 080522.pptx
Towards a US Open research Commons (ORC)
COVID and Precision Education
One View of Data Science
Cancer Research Meets Data Science — What Can We Do Together?
Data Science Meets Open Scholarship – What Comes Next?
Data to Advance Sustainability
Frontiers of Computing at the Cellular and Molecular Scales

Recently uploaded (20)

PDF
Basic Mud Logging Guide for educational purpose
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Classroom Observation Tools for Teachers
PDF
01-Introduction-to-Information-Management.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PPTX
Institutional Correction lecture only . . .
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Cell Types and Its function , kingdom of life
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Business Ethics Teaching Materials for college
Basic Mud Logging Guide for educational purpose
Microbial diseases, their pathogenesis and prophylaxis
Module 4: Burden of Disease Tutorial Slides S2 2025
Classroom Observation Tools for Teachers
01-Introduction-to-Information-Management.pdf
VCE English Exam - Section C Student Revision Booklet
Week 4 Term 3 Study Techniques revisited.pptx
Institutional Correction lecture only . . .
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
O7-L3 Supply Chain Operations - ICLT Program
Cell Types and Its function , kingdom of life
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
TR - Agricultural Crops Production NC III.pdf
PPH.pptx obstetrics and gynecology in nursing
Renaissance Architecture: A Journey from Faith to Humanism
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Business Ethics Teaching Materials for college

Big Data in Biomedicine – An NIH Perspective

  • 1. Big Data in Biomedicine – An NIH Perspective Philip E. Bourne Ph.D., FACMI Associate Director for Data Science National Institutes of Health philip.bourne@nih.gov IEEE BIBM Nov 10 2015, Washington DC http://www.slideshare.net/pebourne
  • 2. Perspective  Structural bioinformatics researcher  Former custodian of the PDB  Obsessive about open science e.g., PLOS  NIH-wide responsibility for developments in data science – responding to the disruption
  • 4. Big Data in Biomedicine… This speaks to something more fundamental that more data … It speaks to new methodologies, new skills, new emphasis, new cultures, new modes of discovery …
  • 5. Consider this change from my own career experience ….
  • 6. The History of Computational Biomedicine According to Bourne 1980s 1990s 2000s 2010s 2020 Discipline: Unknown Expt. Driven Emergent Over-sold A Service A Partner A Driver The Raw Material: Non-existent Limited /Poor More/Ontologies Big Data/Siloed Open/Integrated The People: No name Technicians Industry recognition data scientists Academics Searls (ed) The Roots in Bioinformatics Series PLOS Comp Biol
  • 7. Premise: We are entering a period of disruption in biomedical research and we should all be thinking about what this means to bioinformatics & biomedicine http://i1.wp.com/chisconsult.com/wp- content/uploads/2013/05/disruption-is-a- process.jpg http://cdn2.hubspot.net/hubfs/418817/disruption1.jpg
  • 8. We are at a Point of Deception …  Evidence: – Google car – 3D printers – Waze – Robotics – Sensors From: The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies by Erik Brynjolfsson & Andrew McAfee
  • 9. Disruption: Example - Photography Digitization Deception Disruption Demonetization Dematerialization Democratization Time Volume,Velocity,Variety Digital camera invented by Kodak but shelved Megapixels & quality improve slowly; Kodak slow to react Film market collapses; Kodak goes bankrupt Phones replace cameras Instagram, Flickr become the value proposition Digital media becomes bona fide form of communication
  • 10. Disruption: Biomedical Research Digitization of Basic & Clinical Research & EHR’s Deception We Are Here Disruption Demonetization Dematerialization Democratization Open science Patient centered health care
  • 11. Disruptive Features: Sustainability Source Michael Bell http://homepages.cs.ncl.ac.uk/m.j.bell1/blog/?p=830
  • 13. “And that’s why we’re here today. Because something called precision medicine … gives us one of the greatest opportunities for new medical breakthroughs that we have ever seen.” President Barack Obama January 30, 2015 Disruptive Features – New Science
  • 14. An Example of That Promise: Comorbidity Network for 6.2M Danes Over 14.9 Years Jensen et al 2014 Nat Comm 5:4022
  • 15. What Are Some General Implications of Such a Future?  Open collaborative science becomes of increasing importance  The value of data and associated analytics becomes of increasing value to scholarship  Opportunities exist to improve the efficiency of the research enterprise and hence fund more research  Cooperation between funders will be needed to sustain the emergent digital enterprise  Current training content and modalities will not match supply to demand  Balancing accessibility vs security becomes more important yet more complex
  • 16. How Should We Respond?  Funders: Encourage change and facilitate an orderly transition  Academic Leaders: Respond and facilitate a cultural shift  Developers: Develop working environments that are more adaptive and capable of answers questions in a more efficient and hopefully accurate way  Users: Use the above environments  Publishers: Move beyond papers
  • 17. Take an Example That is Central to What We Do Molecular Graphics Is It Optimal for Today’s Science? http://upload.wikimedia.org/wikipedia/commons/2/2e/M olecular-Graphics-GRIP-75-Console.jpg
  • 18. Good News/Bad News  Good News: – It is harder to think of a more powerful way to comprehend complex data – It has excited generations to the promise of science – It has adapted to changing technologies  Bad News: – It is not an adaptive/extensible environment – It is not a collaborative environment – It is not an integrative environment – It is the curse of the ribbon BMC Bioinformatics 2005, 6:21
  • 19. 1. A link brings up figures from the paper 0. Full text of PLoS papers stored in a database 2. Clicking the paper figure retrieves data from the PDB which is analyzed 3. A composite view of journal and database content results Is a database really different than a biological journal? PloS Comp Biol 2005 1(3) e34 4. The composite view has links to pertinent blocks of literature text and back to the PDB 1. 2. 3. 4. The Knowledge and Data Cycle
  • 20. Take Another Example: The Raw Material of Structural Bioinformatics Is this the optimal starting point anymore?
  • 21. Do data resources including the PDB best serve the needs of the user at this point?
  • 22. Good News/Bad News for the PDB in this Changing Landscape  Bad News: – Interface complex and uni-data oriented – Data accessible; methods accessible (sort of); but not together – Significant redundancy in services offered  Good News: – Annotation! – Demand is increasing – Integrated with other data types – Restful services
  • 23. General Problem Statement: How to insure a high quality annotated data source that provides the optimal environment for accessibility, integration and analysis by a broad community of diverse users?
  • 24.  The Commons is a shared virtual space which is FAIR: – Find – Access (use effectively) – Interoperate – Reuse  An environment to find and catalyze the use of shared digital research objects The Commons Concept
  • 25. The Developer or User Defines the Environment from the Appropriate Building Blocks
  • 26. Public Beacons Host Content AMPLab 1000 Genomes Project Broad Institute ExAC Curoverse PGP, GA4GH Example Data EBI 1000 Genomes Project, UK10K, GoNL, EVS, GEUVADIS, UMCG Cardio GenePanel Google 1000 Genomes Project, Phase III, Illumina Platinum Genomes ISB Known VARiants NCBI NHLBI Exome Sequence Project OICR 55 cancer datasets SolveBio 56 public datasets UCSC ClinVar, LOVD, UniProt University of Leicester Cafe CardioKit, Cafe Variome Central WTSI IBD, Native American, Egyptian, UK10K Over ?? public datasets beaconized across 21 institutions 10s thousands of individuals
  • 28. The Commons Components  Computing environment – cloud or HPC (High Performance Computing) – supports access, utilization, sharing and storage of digital objects.  Methods for Interoperability – enables connectivity, shareability and interoperability between digital objects.  Digital object compliance model – describes the properties of digital objects that enables them to be discoverable and shareable.
  • 31. The ability to store and share and compute on digital research objects  Especially useful for large data sets that are not easily computed locally Scalable and Elastic Pay per use - Cost effective An environment that fosters collaboration The Commons Computing Environment: Cloud
  • 32. Commons - Pilots  The Cloud Credits - business model  BD2K Centers  MODs (Model Organism Databases)  HMP Data and tools available in the cloud  NCI Cloud Pilots & Genomic Data Commons
  • 33. The PDB in the Commons  Components: – Annotated collection of data files – API’s to access these data files – Example methods using these APIs  Potential outcomes – Nothing happens? – A new breed of developer starts to use PDB data in new ways ? – The casual user has a broader set of services that previously? – Quality declines/increases?
  • 34. I not only use all the brains I have, but all I can borrow. – Woodrow Wilson
  • 36. NIHNIH…… Turning Discovery Into HealthTurning Discovery Into Health philip.bourne@nih.gov https://datascience.nih.gov/ http://www.ncbi.nlm.nih.gov/research/staff/bourne/

Editor's Notes

  • #14: Photos: FC tweet; RK screen grab
  • #15: 16 million hospital inpatient events (24.5% of total), 35 million outpatient clinic events (53.6% of total) and 14 million emergency department events (21.9% of total
  • #27: on this slide we have a list of Beacon providers and the content that they're serving. so to date we have over 120 public datasets that have been made available via Beacons at 12 different institutions. So this represents data from 10s of thousands of individuals and theses metrics, the numbers of datasets and individuals that they represent
  • #29: Digital object = data or analytics software