SlideShare a Scribd company logo
You are free to:
Copy, share, adapt, or re-mix;
Photograph, film, or broadcast;

Blog, live-blog, or post video of;

This presentation. Provided that:
You attribute the work to its author and respect the rights
and licenses associated with its components.

Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero.
Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at;
http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
You are free to:
Copy, share, adapt, or re-mix;
Photograph, film, or broadcast;

Blog, live-blog, or post video of;

This presentation. Provided that:
You attribute the work to its author and respect the rights
and licenses associated with its components.

Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero.
Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at;
http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
Disclaimer
• I do not (and will not) profit in any way, shape
or form, from any of the brands, products or
companies I may mention in this
presentation.
Data availability and re‐usability in the
transition from microarray to next‐generation
sequencing: can we do better?
B.F. Francis Ouellette
• Senior Scientist & Associate Director, Informatics and
Biocomputing, Ontario Institute for Cancer Research,
Toronto, ON
• Associate Professor, Department of Cell and Systems
Biology, University of Toronto, Toronto, ON.

@bffo on
•

Gabriella Rustici, Eleanor Williams, B.F. Francis Ouellette,
Alvis Brazma and the Functional Genomics Data Society
http://fged.org

•
•
•
•
•
•
•
•
•
•
•
•
•
•

Alvis Brazma - EBI
Roger Bumgarner - U of Washington
Cesare Furlanello - FBK – MPBA
Michael Miller - ISB
Francis Ouellette - OICR
John Quackenbush – Dana-Farber
Michael Reich - Broad
Gabriella Rustici - EBI
Chris Stoeckert – U Penn
Ronald Taylor - PNNL
Steve Chervitz Trutane - Personalis
Jennifer Weller - UNC
Brian Wilhelm - IRIC
Neil Winegarden - UHN
FGED’s mission:

To be a positive agent of
change in the effective
sharing and reproducibility
of functional genomic data
Poster # 142 (Friday)
fged.org
I come here wearing many hats!
• Officer of FGED
• Data submitter to a large international cancer
genomics initiative
• Receiving and curating data from that same
initiative from 67 cancer genome projects.
• Editor in an #openaccess journal where we are just
now rewriting the data submission policy to ensure
reproducibility
• Associate Editor of an #OA DATABASE journal
• Also on the SAB of Galaxy and Genomespace
What do we do with this?
FGED
(Functional Genomics Data Society)
was
MGED
(Microarray Gene Expression
Data Society)
we evaluated the replication of data analyses in 18 articles on
microarray-based gene expression profiling. (…) We reproduced
two analyses in principle and six partially or with some
discrepancies; ten could not be reproduced. The main reason
for failure to reproduce was data unavailability, and discrepancies
were mostly due to incomplete data annotation or specification of
data processing and analysis. Repeatability of published
microarray studies is apparently limited. More strict publication
rules enforcing public data availability and explicit description of
data processing and analysis should be considered.
Does it matter?
• In Ioannidis et al (2009), they were not saying that
the papers were wrong.

• But there were problems
– missing data (38%)
– missing software, hardware details (50%)
– missing method, processing details (66%)
… forensic bioinformatics [was needed] to infer what
was done to obtain the results
- Keith Baggerly
Does it matter?
• In both cases the supporting data WERE deposited
in GEO or ArrayExpress
• Forensic bioinformatics was needed and more
often than not failed
• May be just depositing is not quite enough?
Cshl minseqe 2013_ouellette
What was in MIAME?
1. The raw data
2. The final processed (normalised) data
3. The essential sample annotation and experimental
variables
4. Sample data relationships
5. Array annotation (e.g., probe oligonucleotide
sequences)
6. The laboratory and data processing protocols
Did it work? The glass half empty…
• Where were the hiccups? MIAME was asking too
much!
• However, some now say that MIAME is much too
little to ask! (e.g., publishing fully documented code
with instructions how to run it)
• What does it mean ‘sufficient data processing
protocols’?
• Even when data and protocols were deposited,
would the reviewers check these? Probably not
• So does it help at all?
Did it work? The glass half full …
• ArrayExpress and GEO have data from well
over 6 million high throughput assays from
some 30,000 functional genomics studies
• The MIAME compliance has been increasing
over time
• Many studies have shown the reusability of
these data
• We can have an informed discussion about the
reproducibility rather than forensics
Standards for content vs
standards for format
• Developing a usable format is challenging
– If it’s too ‘flexible’, too much free text, it’s no longer a
standard, no software can reasonably parse it
– If it’s too rigid, too granular, it can’t handle new type of
data, and people end up putting things in fields that don’t
work

• Human readable formats is useful, but machine
readability is essential!
A simple human readable format for Functional
genomics experiment metadata
• Sample-Data Relationship File (SDRF)
Lessons learned
• Keep it simple, keep it simple, keep it simple!
• Perils of designing standards by a committee vs
advantages of community agreement
• Successful formats are mostly defined by
successful software, e.g., GFF in UCSC GB or
Bioconductors gene_set
• The attraction and perils of perfection – the last few
steps of full automation cost most effort
– A human person may be a cheep broker between two
pieces of software (again – Bioconductor example)
What does it mean for HTS?
• (RNASeq – ChIPSeq)
• The metadata for functional genomics HTS
experiments are not so different from microarray
experiments – replace cel files with BAM files
MINSEQE - Minimum Information about a highthroughput Nucleotide SeQuencing Experiment
1. A general description of the aim of the experiment;
2. The submitter contact details;
3. Essential sample annotation and the experimental
factors;
4. An ‘experiment’ or ‘run’ date, which may be
important for identifying batch effects;
5. Sufficient information to correctly identify bio &
tech reps;
6. Experimental and data processing protocols
7. Raw sequencing reads location; and processed
data.
Percentage of publications from 2012
containing new gene expression data
Data type

Number of
PMID with new
data

% of data in
SRA/Arrayexpr
ess/GEO

Microarray

347

49

RNA-SEQ

334

61
Percentage of RNA-Seq studies
providing metadata (1/2)
Original
Database

ArrayExpress GEO

SRA

Experimental
description

95

100

100

Contact

100

100

0

Sample &
Factor info

100

100

60

Experimental
Or Run date

0

0

60
Percentage of RNA-Seq studies
providing metadata (2/2)
Original
Database

ArrayExpress GEO

SRA

Biological
and Tech
replicates

Yes

Sometimes

Yes

Exp and data
processing
protocol

60

100

0

Raw reads

100

100

100

Processed
data

35

90

0
Things we still need to do:
• Involves folks from NCBI
• Compare methods and metrics over time (20092012)
• Compare methods with ENCODE, ICGC, EGA and
the databases we presented here.
• Look for shared meta data and seek to mate what
is best and core to all.
• Make sure it aligns with large funder’s current
requirements.
• Share and publish this information
Take home messages
• Archiving just something is not the same as
making data available and useful – metadata,
analysis code, usable format, …
– Storing metadata doesn’t cost too much, extracting them
from data generators does!

• Minimising the human mediation in moving data
between the LIMS, archives and analysis tools is
more realistic goal than eliminating it – the need for
brokerage
• The main source of variability in RNSseq
interpretation seems to be the alignments – we
don’t know how to do this well yet. Getting the
short reads for RNASeq is a beginning.
• FGED: The Functional Genomics Data Society is a
very open society, and we welcome feedback and
input!

– http://fged.org
– Twitter: @fged
Acknowledgements:
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

Gabriella Rustici, Eleanor Williams, Alvis Brazma and
the Functional Genomics Data Society http://fged.org
Alvis Brazma - EBI
Roger Bumgarner - U of Washington
Cesare Furlanello - FBK – MPBA
Michael Miller - ISB
Francis Ouellette - OICR
John Quackenbush – Dana-Farber
Michael Reich - Broad
Gabriella Rustici - EBI
Chris Stoeckert – U Penn
Ronald Taylor - PNNL
Steve Chervitz Trutane - Personalis
Jennifer Weller - UNC
Brian Wilhelm - IRIC
Neil Winegarden - UHN

More Related Content

PDF
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
PDF
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
PPTX
Phylogenetics: Making publication-quality tree figures
PDF
Overview of Next Gen Sequencing Data Analysis
PPT
Folker Meyer: Metagenomic Data Annotation
PPTX
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
PDF
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Phylogenetics: Making publication-quality tree figures
Overview of Next Gen Sequencing Data Analysis
Folker Meyer: Metagenomic Data Annotation
Aug2015 Ali Bashir and Jason Chin Pac bio giab_assembly_summary_ali3
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...

What's hot (20)

PPT
Gene Ontology Enrichment Network Analysis -Tutorial
PDF
Introduction to Bioinformatics
PPTX
2016 bioinformatics i_wim_vancriekinge_vupload
PDF
OpenTox Europe 2013
PPTX
NetBioSIG2012 ugurdogrusoz-cbio
PDF
NetBioSIG2013-Talk Robin Haw
PPT
Intro bioinformatics
PDF
Canadian health census to lod
PPTX
2016 davis-plantbio
PPT
Bioinformatics - Discovering the Bio Logic Of Nature
DOCX
Bioinformatics Final Report
PDF
Advanced Bioinformatics for Genomics and BioData Driven Research
PPT
Kishor Presentation
PDF
NetBioSIG2012 anyatsalenko-en-viz
PPTX
Introduction to Bayesian phylogenetics and BEAST
PDF
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
PPTX
Bioinformatic, and tools by kk sahu
Gene Ontology Enrichment Network Analysis -Tutorial
Introduction to Bioinformatics
2016 bioinformatics i_wim_vancriekinge_vupload
OpenTox Europe 2013
NetBioSIG2012 ugurdogrusoz-cbio
NetBioSIG2013-Talk Robin Haw
Intro bioinformatics
Canadian health census to lod
2016 davis-plantbio
Bioinformatics - Discovering the Bio Logic Of Nature
Bioinformatics Final Report
Advanced Bioinformatics for Genomics and BioData Driven Research
Kishor Presentation
NetBioSIG2012 anyatsalenko-en-viz
Introduction to Bayesian phylogenetics and BEAST
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Bioinformatic, and tools by kk sahu
Ad

Viewers also liked (9)

PDF
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
PDF
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
PDF
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
PDF
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
PPT
Information, Science, and Society
PDF
miRNA Breast Cancer Prognosis -- Ingenuity Systems
PDF
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
PPTX
Big Data and Advanced Analytics
PPTX
DATA WAREHOUSING
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
Information, Science, and Society
miRNA Breast Cancer Prognosis -- Ingenuity Systems
Big Data and the Future of Journalism (Futurist Keynote Speaker Gerd Leonhard...
Big Data and Advanced Analytics
DATA WAREHOUSING
Ad

Similar to Cshl minseqe 2013_ouellette (20)

PDF
Managing the analysis of high-throughput data
PPTX
2013 nas-ehs-data-integration-dc
PDF
IPK - Reproducible research - To infinity
PPTX
Cool Informatics Tools and Services for Biomedical Research
PDF
Reproducible research - to infinity
PPTX
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
PPTX
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
PPTX
Functional genomics, and tools
PPTX
Finding and Accessing Human Genomics Datasets
PDF
Containerized attribute indexing and graph genomes for federated data access
PDF
eScience-School-Oct2012-Campinas-Brazil
PDF
Microarrays for an Integrative Genomics 3rd edition Computational Molecular B...
PDF
Forsharing cshl2011 sequencing
PDF
Addressing privacy concerns_in_the_age_of_federated_data_access
PPTX
Data analysis & integration challenges in genomics
PPTX
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
PPTX
Workshop finding and accessing data - fiona - lunteren april 18 2016
PPTX
2015 genome-center
PDF
Amia tb-review-08
Managing the analysis of high-throughput data
2013 nas-ehs-data-integration-dc
IPK - Reproducible research - To infinity
Cool Informatics Tools and Services for Biomedical Research
Reproducible research - to infinity
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Functional genomics, and tools
Finding and Accessing Human Genomics Datasets
Containerized attribute indexing and graph genomes for federated data access
eScience-School-Oct2012-Campinas-Brazil
Microarrays for an Integrative Genomics 3rd edition Computational Molecular B...
Forsharing cshl2011 sequencing
Addressing privacy concerns_in_the_age_of_federated_data_access
Data analysis & integration challenges in genomics
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop finding and accessing data - fiona - lunteren april 18 2016
2015 genome-center
Amia tb-review-08

Recently uploaded (20)

PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Mushroom cultivation and it's methods.pdf
PPT
Teaching material agriculture food technology
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
August Patch Tuesday
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Machine learning based COVID-19 study performance prediction
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Encapsulation theory and applications.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Encapsulation_ Review paper, used for researhc scholars
Mushroom cultivation and it's methods.pdf
Teaching material agriculture food technology
Spectral efficient network and resource selection model in 5G networks
A comparative analysis of optical character recognition models for extracting...
Univ-Connecticut-ChatGPT-Presentaion.pdf
August Patch Tuesday
NewMind AI Weekly Chronicles - August'25-Week II
Machine learning based COVID-19 study performance prediction
A comparative study of natural language inference in Swahili using monolingua...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Encapsulation theory and applications.pdf
Approach and Philosophy of On baking technology
Diabetes mellitus diagnosis method based random forest with bat algorithm
Programs and apps: productivity, graphics, security and other tools
Building Integrated photovoltaic BIPV_UPV.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
SOPHOS-XG Firewall Administrator PPT.pptx
Network Security Unit 5.pdf for BCA BBA.

Cshl minseqe 2013_ouellette

  • 1. You are free to: Copy, share, adapt, or re-mix; Photograph, film, or broadcast; Blog, live-blog, or post video of; This presentation. Provided that: You attribute the work to its author and respect the rights and licenses associated with its components. Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero. Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at; http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
  • 2. You are free to: Copy, share, adapt, or re-mix; Photograph, film, or broadcast; Blog, live-blog, or post video of; This presentation. Provided that: You attribute the work to its author and respect the rights and licenses associated with its components. Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero. Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at; http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
  • 3. Disclaimer • I do not (and will not) profit in any way, shape or form, from any of the brands, products or companies I may mention in this presentation.
  • 4. Data availability and re‐usability in the transition from microarray to next‐generation sequencing: can we do better? B.F. Francis Ouellette • Senior Scientist & Associate Director, Informatics and Biocomputing, Ontario Institute for Cancer Research, Toronto, ON • Associate Professor, Department of Cell and Systems Biology, University of Toronto, Toronto, ON. @bffo on
  • 5. • Gabriella Rustici, Eleanor Williams, B.F. Francis Ouellette, Alvis Brazma and the Functional Genomics Data Society http://fged.org • • • • • • • • • • • • • • Alvis Brazma - EBI Roger Bumgarner - U of Washington Cesare Furlanello - FBK – MPBA Michael Miller - ISB Francis Ouellette - OICR John Quackenbush – Dana-Farber Michael Reich - Broad Gabriella Rustici - EBI Chris Stoeckert – U Penn Ronald Taylor - PNNL Steve Chervitz Trutane - Personalis Jennifer Weller - UNC Brian Wilhelm - IRIC Neil Winegarden - UHN
  • 6. FGED’s mission: To be a positive agent of change in the effective sharing and reproducibility of functional genomic data Poster # 142 (Friday) fged.org
  • 7. I come here wearing many hats! • Officer of FGED • Data submitter to a large international cancer genomics initiative • Receiving and curating data from that same initiative from 67 cancer genome projects. • Editor in an #openaccess journal where we are just now rewriting the data submission policy to ensure reproducibility • Associate Editor of an #OA DATABASE journal • Also on the SAB of Galaxy and Genomespace
  • 8. What do we do with this? FGED (Functional Genomics Data Society) was MGED (Microarray Gene Expression Data Society)
  • 9. we evaluated the replication of data analyses in 18 articles on microarray-based gene expression profiling. (…) We reproduced two analyses in principle and six partially or with some discrepancies; ten could not be reproduced. The main reason for failure to reproduce was data unavailability, and discrepancies were mostly due to incomplete data annotation or specification of data processing and analysis. Repeatability of published microarray studies is apparently limited. More strict publication rules enforcing public data availability and explicit description of data processing and analysis should be considered.
  • 10. Does it matter? • In Ioannidis et al (2009), they were not saying that the papers were wrong. • But there were problems – missing data (38%) – missing software, hardware details (50%) – missing method, processing details (66%)
  • 11. … forensic bioinformatics [was needed] to infer what was done to obtain the results - Keith Baggerly
  • 12. Does it matter? • In both cases the supporting data WERE deposited in GEO or ArrayExpress • Forensic bioinformatics was needed and more often than not failed • May be just depositing is not quite enough?
  • 14. What was in MIAME? 1. The raw data 2. The final processed (normalised) data 3. The essential sample annotation and experimental variables 4. Sample data relationships 5. Array annotation (e.g., probe oligonucleotide sequences) 6. The laboratory and data processing protocols
  • 15. Did it work? The glass half empty… • Where were the hiccups? MIAME was asking too much! • However, some now say that MIAME is much too little to ask! (e.g., publishing fully documented code with instructions how to run it) • What does it mean ‘sufficient data processing protocols’? • Even when data and protocols were deposited, would the reviewers check these? Probably not • So does it help at all?
  • 16. Did it work? The glass half full … • ArrayExpress and GEO have data from well over 6 million high throughput assays from some 30,000 functional genomics studies • The MIAME compliance has been increasing over time • Many studies have shown the reusability of these data • We can have an informed discussion about the reproducibility rather than forensics
  • 17. Standards for content vs standards for format • Developing a usable format is challenging – If it’s too ‘flexible’, too much free text, it’s no longer a standard, no software can reasonably parse it – If it’s too rigid, too granular, it can’t handle new type of data, and people end up putting things in fields that don’t work • Human readable formats is useful, but machine readability is essential!
  • 18. A simple human readable format for Functional genomics experiment metadata • Sample-Data Relationship File (SDRF)
  • 19. Lessons learned • Keep it simple, keep it simple, keep it simple! • Perils of designing standards by a committee vs advantages of community agreement • Successful formats are mostly defined by successful software, e.g., GFF in UCSC GB or Bioconductors gene_set • The attraction and perils of perfection – the last few steps of full automation cost most effort – A human person may be a cheep broker between two pieces of software (again – Bioconductor example)
  • 20. What does it mean for HTS? • (RNASeq – ChIPSeq) • The metadata for functional genomics HTS experiments are not so different from microarray experiments – replace cel files with BAM files
  • 21. MINSEQE - Minimum Information about a highthroughput Nucleotide SeQuencing Experiment 1. A general description of the aim of the experiment; 2. The submitter contact details; 3. Essential sample annotation and the experimental factors; 4. An ‘experiment’ or ‘run’ date, which may be important for identifying batch effects; 5. Sufficient information to correctly identify bio & tech reps; 6. Experimental and data processing protocols 7. Raw sequencing reads location; and processed data.
  • 22. Percentage of publications from 2012 containing new gene expression data Data type Number of PMID with new data % of data in SRA/Arrayexpr ess/GEO Microarray 347 49 RNA-SEQ 334 61
  • 23. Percentage of RNA-Seq studies providing metadata (1/2) Original Database ArrayExpress GEO SRA Experimental description 95 100 100 Contact 100 100 0 Sample & Factor info 100 100 60 Experimental Or Run date 0 0 60
  • 24. Percentage of RNA-Seq studies providing metadata (2/2) Original Database ArrayExpress GEO SRA Biological and Tech replicates Yes Sometimes Yes Exp and data processing protocol 60 100 0 Raw reads 100 100 100 Processed data 35 90 0
  • 25. Things we still need to do: • Involves folks from NCBI • Compare methods and metrics over time (20092012) • Compare methods with ENCODE, ICGC, EGA and the databases we presented here. • Look for shared meta data and seek to mate what is best and core to all. • Make sure it aligns with large funder’s current requirements. • Share and publish this information
  • 26. Take home messages • Archiving just something is not the same as making data available and useful – metadata, analysis code, usable format, … – Storing metadata doesn’t cost too much, extracting them from data generators does! • Minimising the human mediation in moving data between the LIMS, archives and analysis tools is more realistic goal than eliminating it – the need for brokerage • The main source of variability in RNSseq interpretation seems to be the alignments – we don’t know how to do this well yet. Getting the short reads for RNASeq is a beginning.
  • 27. • FGED: The Functional Genomics Data Society is a very open society, and we welcome feedback and input! – http://fged.org – Twitter: @fged
  • 28. Acknowledgements: • • • • • • • • • • • • • • • Gabriella Rustici, Eleanor Williams, Alvis Brazma and the Functional Genomics Data Society http://fged.org Alvis Brazma - EBI Roger Bumgarner - U of Washington Cesare Furlanello - FBK – MPBA Michael Miller - ISB Francis Ouellette - OICR John Quackenbush – Dana-Farber Michael Reich - Broad Gabriella Rustici - EBI Chris Stoeckert – U Penn Ronald Taylor - PNNL Steve Chervitz Trutane - Personalis Jennifer Weller - UNC Brian Wilhelm - IRIC Neil Winegarden - UHN