Nils Gehlenborg, PhD
Department of Biomedical Informatics
Harvard Medical School
http://gehlenborglab.org @ngehlenborg
A Unified Approach to Exploration,
Authoring, and Communication
with Reproducible Visualizations
Data
Machine
Human
Machine
Human
Data
Methods for Data Visualization and Exploration
Tools for Reproducible Research
What are applications
of data visualization?
PUBLICATION
experiment
DATA
INSIGHT HYPOTHESIS
interpretation
hypothesis
generation
EXPLANATION
“Storytelling”
PUBLICATION
experiment
DATA
INSIGHT HYPOTHESIS
interpretation
hypothesis
generation
EXPLORATION
EXPLANATION
“Storytelling”
“Pattern Discovery”
PUBLICATION
experiment
DATA
INSIGHT HYPOTHESIS
interpretation
HYPOTHESIS
hypothesis
generation
EXPLORATION
EXPLANATION
“Storytelling”
“Pattern Discovery”
HYPOTHESIS-DRIVEN DISCOVERY
PUBLICATION
experiment
DATA
INSIGHT HYPOTHESIS
interpretation
DATA
hypothesis
generation
EXPLORATION
EXPLANATION
“Storytelling”
“Pattern Discovery”
DATA-DRIVEN DISCOVERY
Insight
EXPLORATION EXPLANATION
Image Editing Tool, Cytoscape
Insight
EXPLORATION EXPLANATION
Image Editing Tool, Cytoscape
DATA PIXELS
Image Editing Tool, Cytoscape
DATA PIXELS
DATA PIXELS
Dead wood?
Hard to move
Keeps on giving!
Best part
Easy to share
Low-hanging fruit?
Why should we care?
Nature asked 1,576 researchers if there
is a reproducibility crisis in science.
M Baker, Nature 533, 452-454, 2016
0% 100%
No crisis (3%)
Don’t know (7%)
Slight crisis (38%)
M Baker, Nature 533, 452-454, 2016
Significant crisis (52%)
Nature asked 1,576 researchers if there
is a reproducibility crisis in science.
M Baker, Nature 533, 452-454, 2016
Intentional?
Inability to capture everything?
Inability to communicate everything?
M Baker, Nature 533, 452-454, 2016
Intentional?
Inability to capture everything?
Inability to communicate everything?
SOCIAL ISSUE
TECHNICAL ISSUES
M Baker, Nature 533, 452-454, 2016
Why does this matter
for visualization?
Discovery of Tumor Subtypes
PROBLEM 1
Visualize overlap of patient sets across two or more stratifications.
PROBLEM 2
Visualize characteristics of patient sets within a stratification of interest.
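The overlap in Problem 1 can be made concrete with a small sketch. This is an illustration only, not StratomeX code: it assumes each stratification is a mapping from patient ID to group label, and the patient IDs, group labels, and the function name `overlap_table` are all hypothetical.

```python
from collections import Counter


def overlap_table(strat_a, strat_b):
    """Count patients shared by each pair of groups from two stratifications.

    Each stratification maps a patient ID to a group label. Only patients
    that appear in both stratifications are counted.
    """
    shared = strat_a.keys() & strat_b.keys()
    return Counter((strat_a[p], strat_b[p]) for p in shared)


# Hypothetical toy data: IDs and labels are made up for illustration.
mrna_clusters = {"P1": "cluster 1", "P2": "cluster 1",
                 "P3": "cluster 2", "P4": "cluster 2"}
mutation_status = {"P1": "mutated", "P2": "wild type", "P3": "mutated",
                   "P4": "mutated", "P5": "wild type"}

table = overlap_table(mrna_clusters, mutation_status)
for (group_a, group_b), count in sorted(table.items()):
    print(f"{group_a} / {group_b}: {count}")
```

Each count corresponds to the size of one overlap between a group in the first stratification and a group in the second, which is the quantity a visualization of Problem 1 has to encode.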
StratomeX: Exploratory Data Visualization
M Streit, A Lex, S Gratzl, C Partl, D Schmalstieg, H Pfister, P Park, N Gehlenborg, Nature Methods (2014)
Discovery of Tumor Subtypes
PROBLEM 3
Identify relevant stratifications, pathways, and clinical variables.
Discovery of Tumor Subtypes
PROBLEM 1
Visualize overlap of patient sets across two or more stratifications.
PROBLEM 2
Visualize characteristics of patient sets within a stratification of interest.
StratomeX: Exploratory Data Visualization
M Streit, A Lex, S Gratzl, C Partl, D Schmalstieg, H Pfister, P Park, N Gehlenborg, Nature Methods (2014)
Is there a mutation that overlaps with this mRNA cluster?
Is there a CNV that affects survival?
Is there a pathway that is enriched in this cluster?
Query
Stratifications
Clinical Params
Pathways
Guided
Exploration
M Streit, A Lex, S Gratzl, C Partl, D Schmalstieg, H Pfister, P Park, N Gehlenborg, Nature Methods (2014)
Query
Rank
Visualize
Stratifications
Clinical Params
Pathways
Guided
Exploration
M Streit, A Lex, S Gratzl, C Partl, D Schmalstieg, H Pfister, P Park, N Gehlenborg, Nature Methods (2014)
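The Query and Rank steps can be sketched as follows. The slides do not state which score is used for ranking, so the Jaccard index below is an assumption chosen for illustration; the patient sets, candidate names, and the helpers `jaccard` and `rank_groups` are hypothetical.

```python
def jaccard(a, b):
    """Jaccard index of two patient sets: |intersection| / |union|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_groups(query, candidates):
    """Score every candidate patient set against the query, best first."""
    scored = [(name, jaccard(query, members))
              for name, members in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Query: "Is there a mutation that overlaps with this mRNA cluster?"
# Hypothetical toy data for illustration only.
query_cluster = {"P1", "P2", "P3"}
candidates = {
    "gene A mutated": {"P1", "P2", "P3", "P4"},
    "gene B mutated": {"P3", "P5"},
    "gene C amplified": {"P6"},
}

ranking = rank_groups(query_cluster, candidates)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

The top-ranked candidates would then be handed to the Visualize step, so the analyst inspects only the stratifications most likely to answer the query.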
?
And now what?
DATA-DRIVEN DISCOVERY
PUBLICATION
experiment
DATA
INSIGHT HYPOTHESIS
interpretation
DATA
hypothesis
generation
EXPLORATION
EXPLANATION
“Storytelling”
“Pattern Discovery”
DATA-DRIVEN DISCOVERY
PUBLICATION
experiment
DATA
INSIGHT HYPOTHESIS
interpretation
DATA
hypothesis
generation
EXPLORATION
EXPLANATION
“Storytelling”
“Pattern Discovery”
DATA-DRIVEN COMMUNICATION
DATA-DRIVEN DISCOVERY
finding, figure/video, Authoring, Exploration, Presentation
DATA-DRIVEN COMMUNICATION
finding, figure/video, Authoring, Exploration, Presentation
Current Model
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
finding, figure/video, Authoring, Exploration, Presentation
What we show.
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
finding, figure/video, Authoring, Exploration, Presentation
What we show.
What we tell.
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
finding, figure/video, Authoring, Exploration, Presentation
What we show.
What we did.
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
finding, figure/video, Authoring, Exploration, Presentation
What we show.
What we did.
DATA-DRIVEN DISCOVERY
CLUE
vistories
Authoring
Exploration Presentation
Vistories
DATA-DRIVEN COMMUNICATION
DATA-DRIVEN DISCOVERY
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
Exploration
Authoring
Presentation
VISTORY = visualization + story + history
VISTORY = visualization + story + history
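One way to read this equation is as a data structure. The sketch below is a hypothetical illustration, not the CLUE implementation: it assumes the history is an ordered log of visualization states, and the story is a selection of those states with annotations. The names `State`, `Vistory`, `record`, `annotate`, and `present` are invented for this example.

```python
from dataclasses import dataclass, field


@dataclass
class State:
    action: str  # what the analyst did
    view: dict   # visualization settings after the action


@dataclass
class Vistory:
    history: list = field(default_factory=list)  # every recorded State
    story: list = field(default_factory=list)    # (state index, annotation)

    def record(self, action, **changes):
        """Exploration: append a new state derived from the previous one."""
        previous = self.history[-1].view if self.history else {}
        self.history.append(State(action, {**previous, **changes}))
        return len(self.history) - 1

    def annotate(self, index, text):
        """Authoring: pick a recorded state and attach an explanation."""
        self.story.append((index, text))

    def present(self):
        """Presentation: replay the annotated states in story order."""
        return [(self.history[i].view, text) for i, text in self.story]


# Hypothetical session; dataset and settings are made up.
v = Vistory()
v.record("load data", dataset="toy cohort")
step = v.record("sort columns", sort_by="survival")
v.annotate(step, "Sorting by survival separates the two clusters.")
slides = v.present()
print(slides)
```

Because every presented state points back into the recorded history, a reader of the story can in principle step from "what we show" to "what we did" and continue exploring from there.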
Do collaborative data analysis.
Use during peer-review.
Publish with a paper.
Embed in a presentation.
H Stitz et al., to appear in Proceedings of VAST 2018
Marc Streit Lab: Vistories in Action
Retrieval and Analysis of Visualization Provenance
H Stitz et al., work in progress
Marc Streit Lab: Vistories in Action
Vistories integration with Jupyter Notebooks
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
Reproducible
Interactive
Visualizations
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
http://vistories.org
DATA-DRIVEN DISCOVERY
DATA-DRIVEN COMMUNICATION
http://vistories.org
SAMUEL GRATZL
ALEXANDER LEX
MARC STREIT
Vistory “Visualization”
