SlideShare a Scribd company logo
Why we do itJonathan IzantVP, Sage BionetworksOpen Science Summit 31 July 2010www.sagebase.org
denial
Genomics does not yet teach us muchPharma drug development is brokenStandards of care are inadequateAcademics limit open access
Genetics Timeline180019002000
Gene Regulation circa 1990
Gene Regulation circa 1996
Gene Regulation circa 2002
How is genomic data used to understand biology?RNA amplificationMicroarray hybirdizationDNAVariationTumorsProfiling Approaches“Standard” GWAS ApproachesIdentifies Causative DNA Variation but provides NO mechanismGenome scale profiling provide correlates of disease Many examples BUT what is cause and effect?Tumors Provide unbiased view of molecular physiology  as it relates to disease phenotypes
Insights on mechanism
 Provide causal relationships and allows predictionsComplex TraitVariationGene IndextraitDNAVariation8Molecular TraitVariation“Integrated” Genetics Approaches
The “Rosetta Integrative Genomics Experiment”: Generation, assembly,  and integration of data to build models that predict clinical outcomeMerck Inc. Co.5 Year ProgramBased at RosettaTotal Resources >$150MGenerate data needed to build bionetworks
Assemble other available data useful for building networks
Integrate and build models
Test predictions
Develop treatments
Design Predictive MarkersConstructing Bayesian Networks
Extensive Publications now Substantiating Scientific ApproachProbabilistic Causal Bionetwork Models>60 Publications from Rosetta Genetics Group (~30 scientists) over 5 years including high profile papers in PLoS Nature and Nature Genetics Metabolic Disease"Genetics of gene expression surveyed in maize, mouse and man." Nature. (2003)"Variations in DNA elucidate molecular networks that cause disease." Nature. (2008)"Genetics of gene expression and its effect on disease." Nature. (2008) "Validation of candidate causal genes for obesity that affect..." Nat Genet. (2009)….. Plus  10 additional papers in Genome Research, PLoS Genetics, PLoS Comp.Biology,  etcCVD"Identification of pathways for atherosclerosis." Circ Res. (2007) "Mapping the genetic architecture of gene expression in human liver." PLoS Biol. (2008) …… Plus  5 additional papers in Genome Res., Genomics, Mamm.Genome"Integrating genotypic and expression data …for bone traits…" Nat Genet. (2005) “..approach to identify candidate genes regulating BMD…" J Bone Miner Res. (2009) dBoneMethods"An integrative genomics approach to infer causal associations ...” Nat Genet. (2005)"Increasing the power to detect causal associations… “PLoS Comput Biol. (2007)"Integrating large-scale functional genomic data ..." Nat Genet. (2008) …… Plus 3 additional papers in PLoS Genet., BMC Genet.
OpportunityThe stunning technologies coming will generate heaps of genomic dataBionetworks using integrative genomic approaches can highlight the non-redundant components- can find drivers of the disease and of therapiesNeed to develop ways to host massive amounts of data, evolving representations of disease as represented by these probabilistic causal disease models
DriversRecognition that the benefits of bionetwork based molecular models of diseases are powerful but that they require significant resourcesAppreciation that it will require decades of evolving representations as real complexity emerges and needs to be integrated with therapeutic interventionsRealizing the donation by Merck might seed a “commons”  allowing a potential long term gain to the whole community provided by evolving models  of disease built via a contributor network
Mission14Sage Bionetworks is a non-profit organization with a vision to create a “Commons” where integrative bionetworks are evolved by contributor scientists with a shared vision to accelerate the elimination of human disease
Sage Bionetworks:a busy first year$5m LSDF GrantPartnership with Merck14 Staff move into Sage Offices at FHCRC1st Sage Commons Congress in SF$8m NCI grant for new CCSBPartnership with Pfizer2009                    2010First Board of Directors Meeting501(c)(3)determinationCatalyst Funding from Listwin, CHDI and QuintilesNIH New Institution ReviewFirst NIH grant payment
Sage Bionetworks PartnersTrainingResearchPlatform
Izant openscience
Global Coherent Data SetsA data set containing genome-wide DNA variation and intermediate trait, as well as physiological phenotype data across a population of individuals large enough to power association or linkage studies, typically 50 or more individuals. To be coherent, the data needs to be matched with consistent identifiers. Intermediate traits are typically gene expression, but may also include proteomic, metabolomic, and other molecular data. GCDs are current state of knowledge and subject to change as more information becomes available to Sage
http://www.sagebase.org/research/tools.html
Sage Commons ChallengesStandards (data, annotation)Tools (combining, analyzing)Citation (recognition)InternationalizationPublic Engagement
Barriers:Designing a simple-to-use model for uploading and processing dataData interoperabilityData standartization, Data Qualityconsistent data format and metadataTools and standards: allow the reosuce to gown and evolve, capture metadata in a standardized way and quality measures and quality controlThe Commons will need to resolve issues surrounding protection of human subjects data if the information is to be widely shared.IRB and protection of human subjectsplatform independenceVisualization toolsbuilding the critical mass of contributorslegal/licensing frameworkenormous curation effort needed to correct for incompatible study designs, incomplete data gatheringAbility to capture structured content
Problem: ‘Accessible’ data often isn’t
Izant openscience
Izant openscience
Collaborators

More Related Content

PDF
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
PPT
Personal Genomes: what can I do with my data?
PDF
RDD Conf Day 1: Diseases: Models and Mechanisms, Phil Hieter
PPTX
Data Commons & Data Science Workshop
PDF
Big Data in Cancer Control
PPTX
NCI Support for Cancer Data Sharing
PDF
DNA & Personalized Medicine
PPTX
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
Personal Genomes: what can I do with my data?
RDD Conf Day 1: Diseases: Models and Mechanisms, Phil Hieter
Data Commons & Data Science Workshop
Big Data in Cancer Control
NCI Support for Cancer Data Sharing
DNA & Personalized Medicine
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...

What's hot (20)

PPTX
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
PDF
Cancer and Work Design v1.01
PPTX
A Vision for a Cancer Research Knowledge System
PDF
biomedical research in an increasingly digital world
PPTX
Cancer Moonshot, Data sharing and the Genomic Data Commons
PPTX
Systems Genetics of Cancer - big data and all that
PDF
Personal Genomics: Business Model for 23andMe
PDF
Seattle-Denver VA Center for Innovation
PDF
Participant-centered research design and “equal access” data sharing practice...
PDF
20160811 Big Data for Health and Medicine
PDF
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
PDF
Cancer genome repository_berkeley
PPTX
Federal Research & Development for the Florida system Sept 2014
PDF
decentralization: a trend in biomedical research
PDF
PPTX
Bioinformatics
PPT
Big Data in Biomedicine: Where is the NIH Headed
PDF
research participation as a social contract
PPTX
NCBI Database.pptx
PPT
The NIH as a Digital Enterprise: Implications for PAG
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
Cancer and Work Design v1.01
A Vision for a Cancer Research Knowledge System
biomedical research in an increasingly digital world
Cancer Moonshot, Data sharing and the Genomic Data Commons
Systems Genetics of Cancer - big data and all that
Personal Genomics: Business Model for 23andMe
Seattle-Denver VA Center for Innovation
Participant-centered research design and “equal access” data sharing practice...
20160811 Big Data for Health and Medicine
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
Cancer genome repository_berkeley
Federal Research & Development for the Florida system Sept 2014
decentralization: a trend in biomedical research
Bioinformatics
Big Data in Biomedicine: Where is the NIH Headed
research participation as a social contract
NCBI Database.pptx
The NIH as a Digital Enterprise: Implications for PAG
Ad

Viewers also liked (15)

PDF
New laws in russia 902
PPT
PDF
[oGIP] oGIP Standards
PDF
Современные модели менеджмента качества в судебно-экспертной деятельности.
PDF
ISO 14001:2015 Revision Update Webinar
PPT
Total quality management report
PPTX
types of production system
PPTX
Operational KPIs - Definitions November 2016
PPTX
Total quality management tools and techniques
PPTX
types of production system
PPTX
How to implement lean - Executive overview
PPTX
5 s workplace organization
PPT
Total Quality Management (TQM)
PPT
Operations Management: Production System
PPT
Stanford E245 Lean LaunchPad winter 10 session 01 course overview rev 4
New laws in russia 902
[oGIP] oGIP Standards
Современные модели менеджмента качества в судебно-экспертной деятельности.
ISO 14001:2015 Revision Update Webinar
Total quality management report
types of production system
Operational KPIs - Definitions November 2016
Total quality management tools and techniques
types of production system
How to implement lean - Executive overview
5 s workplace organization
Total Quality Management (TQM)
Operations Management: Production System
Stanford E245 Lean LaunchPad winter 10 session 01 course overview rev 4
Ad

Similar to Izant openscience (20)

PDF
Stephen Friend Dana Farber Cancer Institute 2011-10-24
PPTX
Khoury ashg2014
PPT
Latest Project to Cure PKD
PDF
Reaching out to collaborators and crowdsourcing for pharmaceutical research
PPT
Bioinformatics
PDF
When pharmaceutical companies publish large datasets an abundance of riches o...
PDF
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
PPTX
Branch: An interactive, web-based tool for building decision tree classifiers
PDF
White_matter_Ouellette_2022-06-07.pdf
PPT
Semantic Web for Health Care and Biomedical Informatics
PPTX
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
PPT
A Successful Academic Medical Center Must be a Truly Digital Enterprise
PPT
Slides for burroughs wellcome foundation ajw100611 sefinal
PPT
Open Notebook Science and One Future for Scientific Research
PPTX
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
PDF
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
PDF
Expert Panel on Data Challenges in Translational Research
PPTX
The Learning Health System: Thinking and Acting Across Scales
PPT
Secure Data Sharing and Related Matters – An NIH View
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Khoury ashg2014
Latest Project to Cure PKD
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Bioinformatics
When pharmaceutical companies publish large datasets an abundance of riches o...
Stephen Friend Institute of Development, Aging and Cancer 2011-11-29
Branch: An interactive, web-based tool for building decision tree classifiers
White_matter_Ouellette_2022-06-07.pdf
Semantic Web for Health Care and Biomedical Informatics
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
A Successful Academic Medical Center Must be a Truly Digital Enterprise
Slides for burroughs wellcome foundation ajw100611 sefinal
Open Notebook Science and One Future for Scientific Research
Advancing Innovation and Convergence in Cancer Research: US Federal Cancer Mo...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Expert Panel on Data Challenges in Translational Research
The Learning Health System: Thinking and Acting Across Scales
Secure Data Sharing and Related Matters – An NIH View

More from Open Science Summit (20)

PPTX
David ewing duncan open science 7-30-10
PPT
Pink army july 31
PPT
Barry bunin open science summit at the berkeley intl house 2010
PPTX
Batten oss 727 -bw changes
PPT
Myelin repair open science summit 07.31.10 v2
PPT
Beth baber oen science summit
PPT
Aiden hollis hif presentation berkeley
PDF
Delinkage oss2010 jameslove_kei
PPT
Cordeiro education2030berkeleyopensciencesummit2010
PPT
Rebecca goulding alt ip
PPTX
Lisa green oss deck
PPTX
Carol mimura open society july 31, 2010, copy
PPT
Keith oss 2010 - ip-based platforms
PPT
Bennett open access_7-31-10
PDF
4 cowell oss-07302010
PPT
1 reinhoff berkeley july 2010
PPTX
Vitrant fund science opensciencesummit2010presentation
PPT
Oss2010 sci flies
PPT
Fbi open science summit presentation (29 july 2010)
PDF
2010 opensciencepeterson
David ewing duncan open science 7-30-10
Pink army july 31
Barry bunin open science summit at the berkeley intl house 2010
Batten oss 727 -bw changes
Myelin repair open science summit 07.31.10 v2
Beth baber oen science summit
Aiden hollis hif presentation berkeley
Delinkage oss2010 jameslove_kei
Cordeiro education2030berkeleyopensciencesummit2010
Rebecca goulding alt ip
Lisa green oss deck
Carol mimura open society july 31, 2010, copy
Keith oss 2010 - ip-based platforms
Bennett open access_7-31-10
4 cowell oss-07302010
1 reinhoff berkeley july 2010
Vitrant fund science opensciencesummit2010presentation
Oss2010 sci flies
Fbi open science summit presentation (29 july 2010)
2010 opensciencepeterson

Izant openscience

  • 1. Why we do itJonathan IzantVP, Sage BionetworksOpen Science Summit 31 July 2010www.sagebase.org
  • 3. Genomics does not yet teach us muchPharma drug development is brokenStandards of care are inadequateAcademics limit open access
  • 8. How is genomic data used to understand biology?RNA amplificationMicroarray hybirdizationDNAVariationTumorsProfiling Approaches“Standard” GWAS ApproachesIdentifies Causative DNA Variation but provides NO mechanismGenome scale profiling provide correlates of disease Many examples BUT what is cause and effect?Tumors Provide unbiased view of molecular physiology as it relates to disease phenotypes
  • 10. Provide causal relationships and allows predictionsComplex TraitVariationGene IndextraitDNAVariation8Molecular TraitVariation“Integrated” Genetics Approaches
  • 11. The “Rosetta Integrative Genomics Experiment”: Generation, assembly, and integration of data to build models that predict clinical outcomeMerck Inc. Co.5 Year ProgramBased at RosettaTotal Resources >$150MGenerate data needed to build bionetworks
  • 12. Assemble other available data useful for building networks
  • 17. Extensive Publications now Substantiating Scientific ApproachProbabilistic Causal Bionetwork Models>60 Publications from Rosetta Genetics Group (~30 scientists) over 5 years including high profile papers in PLoS Nature and Nature Genetics Metabolic Disease"Genetics of gene expression surveyed in maize, mouse and man." Nature. (2003)"Variations in DNA elucidate molecular networks that cause disease." Nature. (2008)"Genetics of gene expression and its effect on disease." Nature. (2008) "Validation of candidate causal genes for obesity that affect..." Nat Genet. (2009)….. Plus 10 additional papers in Genome Research, PLoS Genetics, PLoS Comp.Biology, etcCVD"Identification of pathways for atherosclerosis." Circ Res. (2007) "Mapping the genetic architecture of gene expression in human liver." PLoS Biol. (2008) …… Plus 5 additional papers in Genome Res., Genomics, Mamm.Genome"Integrating genotypic and expression data …for bone traits…" Nat Genet. (2005) “..approach to identify candidate genes regulating BMD…" J Bone Miner Res. (2009) dBoneMethods"An integrative genomics approach to infer causal associations ...” Nat Genet. (2005)"Increasing the power to detect causal associations… “PLoS Comput Biol. (2007)"Integrating large-scale functional genomic data ..." Nat Genet. (2008) …… Plus 3 additional papers in PLoS Genet., BMC Genet.
  • 18. OpportunityThe stunning technologies coming will generate heaps of genomic dataBionetworks using integrative genomic approaches can highlight the non-redundant components- can find drivers of the disease and of therapiesNeed to develop ways to host massive amounts of data, evolving representations of disease as represented by these probabilistic causal disease models
  • 19. DriversRecognition that the benefits of bionetwork based molecular models of diseases are powerful but that they require significant resourcesAppreciation that it will require decades of evolving representations as real complexity emerges and needs to be integrated with therapeutic interventionsRealizing the donation by Merck might seed a “commons” allowing a potential long term gain to the whole community provided by evolving models of disease built via a contributor network
  • 20. Mission14Sage Bionetworks is a non-profit organization with a vision to create a “Commons” where integrative bionetworks are evolved by contributor scientists with a shared vision to accelerate the elimination of human disease
  • 21. Sage Bionetworks:a busy first year$5m LSDF GrantPartnership with Merck14 Staff move into Sage Offices at FHCRC1st Sage Commons Congress in SF$8m NCI grant for new CCSBPartnership with Pfizer2009 2010First Board of Directors Meeting501(c)(3)determinationCatalyst Funding from Listwin, CHDI and QuintilesNIH New Institution ReviewFirst NIH grant payment
  • 24. Global Coherent Data SetsA data set containing genome-wide DNA variation and intermediate trait, as well as physiological phenotype data across a population of individuals large enough to power association or linkage studies, typically 50 or more individuals. To be coherent, the data needs to be matched with consistent identifiers. Intermediate traits are typically gene expression, but may also include proteomic, metabolomic, and other molecular data. GCDs are current state of knowledge and subject to change as more information becomes available to Sage
  • 26. Sage Commons ChallengesStandards (data, annotation)Tools (combining, analyzing)Citation (recognition)InternationalizationPublic Engagement
  • 27. Barriers:Designing a simple-to-use model for uploading and processing dataData interoperabilityData standartization, Data Qualityconsistent data format and metadataTools and standards: allow the reosuce to gown and evolve, capture metadata in a standardized way and quality measures and quality controlThe Commons will need to resolve issues surrounding protection of human subjects data if the information is to be widely shared.IRB and protection of human subjectsplatform independenceVisualization toolsbuilding the critical mass of contributorslegal/licensing frameworkenormous curation effort needed to correct for incompatible study designs, incomplete data gatheringAbility to capture structured content
  • 32. Biomedical research developed as a Cottage Industry
  • 34. Need for multi-layer mega datasets and the vanishing ‘price’ for genes provides incentive for pre-competitive space for genomics
  • 35. Incentives:Sociology and policy.  Getting people to share and building trust.IMHO, the central challenge will be community adoption.This is a social (political) experiment/ entreprise as much as a scientific challenge. How to motivate individuals not community inclined might be key.Researcher "Turf" /lack of experience sharingpolitic: competitive funding versus communal goalWillingness by the community to share data and key ancillary information (e.g. pathology/clinical data for profiled samples)We need a team that will take the time to make sure we create a set of tools that can interoperate , rather than a set of tools that perform discrete independent tasks.Changing culture of individual recognition, publication, rewards, incentivesBusiness case for contributing and sharing resources and information is unclear to many, while business case for hoarding them is well articulated and obvious.Buy-in from tool developers, data producers and data usersThe theory is great, the practice needs commitment from a wide variety of players
  • 36. The Federation ExperimentStephen FriendSage BionetworksAndrea CalifanoColumbia U.Eric SchadtPacBio - UCSFAtul ButteStanford MedTrey IdekerUCSD
  • 37. Sage BionetworksFocused on improving treatment of diseaseWorking through extensive partnerships to enable research and drug developmentCultural challenges may eclipse technical and operational hurtleswww.sagebase.org

Editor's Notes

  • #25: This is us (data on 230 attendees from ~100 survey respondents) [[with black background]]
  • #26: Funny thing about collaborations… “IT Experts” claim to collaborate with everyone, but only 3 other groups acknowledge collaborating with “IT Experts”[[No return arrows from “Funders”, “Media” and “Policy”]]