Information Visualization for
Large-Scale Data Workflows
Michael Conover
Senior Data Scientist, LinkedIn
@vagabondjack
reasonengine.wordpress.com
Wednesday, October 9, 2013
Emergent Structure
Elegant Complexity
Pedro Cruz, University of Coimbra
David Crandall, Indiana University
John Nelson, IDV Solutions
Credit
Intellectual Dividends
Realistic Mental Models
Verification of Assumptions
Shortened Iteration Cycles
Improved Predictive Performance
Product Insights
Clarity of Communication
Hypothesis Generation
Color Commentary
@whitehouse #RSVP
Flock Together
Political Polarization On Twitter
Basic Workflow Structure
aes_string()
Basic Visualization Battery
Feature Development
Anscombe’s Quartet
http://en.wikipedia.org/wiki/Anscombe's_quartet
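A minimal sketch of why Anscombe's quartet motivates plotting before modeling: it computes the means and the Pearson correlation for the four published datasets. The helper name `pearson` and the `quartet` and `summary` variables are illustrative choices, not part of the original slides.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Datasets I-III share the same x values; dataset IV has its own.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

# Per-dataset summary: (mean of x, mean of y, correlation).
summary = {
    name: (sum(xs) / len(xs), sum(ys) / len(ys), pearson(xs, ys))
    for name, (xs, ys) in quartet.items()
}
for name, (mean_x, mean_y, r) in summary.items():
    print(f"{name}: mean x = {mean_x:.2f}, mean y = {mean_y:.2f}, r = {r:.3f}")
```

All four datasets report nearly identical summary statistics (mean x of 9, mean y of about 7.5, r of about 0.816), yet their scatterplots look nothing alike, which is the case for always looking at the data.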
Standard Normal
[Figure: two density plots of a Standard Normal sample (y-axis: Density, 0.0 to 0.4), labeled 100,000 and 1,000,000]
A Lens On The Joint Distribution
log(Connections)
log(EndorsementPagerank)
geom_point()
A Lens On The Joint Distribution
log(Connections)
log(EndorsementPagerank)
geom_point(alpha=1/5)
A Lens On The Joint Distribution
log(Connections)
log(EndorsementPagerank)
Legend: count (25, 50, 75, 100)
geom_bin2d(bins=35)
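The idea behind two-dimensional binning can be sketched in a few lines: divide the plotting area into a grid and count the points in each cell, so overplotted regions show up as high counts instead of a solid blob. This is an illustration of the concept only, not ggplot2's implementation; the function name `bin2d` and the sample data are hypothetical.

```python
from collections import Counter

def bin2d(xs, ys, bins=35):
    """Count points per cell of a bins x bins grid spanning the data range."""
    x_lo, x_hi = min(xs), max(xs)
    y_lo, y_hi = min(ys), max(ys)

    def index(value, lo, hi):
        if hi == lo:
            return 0  # Degenerate axis: everything falls in a single bin.
        # Clamp so the maximum value lands in the last bin, not one past it.
        return min(int((value - lo) / (hi - lo) * bins), bins - 1)

    return Counter(
        (index(x, x_lo, x_hi), index(y, y_lo, y_hi)) for x, y in zip(xs, ys)
    )

# Hypothetical data: three coincident points that a plain scatterplot
# would draw as a single dot.
xs = [0.0, 0.0, 0.0, 1.0, 2.0, 4.0]
ys = [0.0, 0.0, 0.0, 1.0, 2.0, 4.0]
counts = bin2d(xs, ys, bins=4)
for cell, n in sorted(counts.items()):
    print(cell, n)
```

Mapping each cell's count to a fill color yields the kind of heatmap that a binned plot displays, with the count legend giving the scale.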
A Lens On The Joint Distribution
log(Connections)
log(EndorsementPagerank)
Class: Negative, Positive
geom_point(alpha=1/5, aes(color=label))
A Lens On The Joint Distribution
log(Connections)
log(EndorsementPagerank)
Class: Negative, Positive
geom_density2d(aes(color=label), bins=10)
A Lens On The Joint Distribution
Marginal Histograms
A Lens On The Joint Distribution
[Figure: scatterplot matrix of Sepal.Length, Sepal.Width, Petal.Length, and Petal.Width, colored by Species (setosa, versicolor, virginica)]
Correlations shown in the upper panels:
Pair                          Cor      setosa   versicolor   virginica
Sepal.Length, Sepal.Width     −0.118   0.743    0.526        0.457
Sepal.Length, Petal.Length    0.872    0.267    0.754        0.864
Sepal.Length, Petal.Width     0.818    0.278    0.546        0.281
Sepal.Width, Petal.Length     −0.428   0.178    0.561        0.401
Sepal.Width, Petal.Width      −0.366   0.233    0.664        0.538
Petal.Length, Petal.Width     0.963    0.332    0.787        0.322
GGally (ggpairs)
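In the scatterplot matrix, the overall Sepal.Length and Sepal.Width correlation is negative (−0.118) while the correlation within every species is positive. A small sketch of how pooling groups can flip the sign of a correlation follows. The two groups and their values are hypothetical, chosen only to make the effect obvious; they are not the iris data.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical groups: within each one, y rises with x, but group "b"
# sits at higher x and lower y than group "a".
groups = {
    "a": ([1.0, 2.0, 3.0], [5.0, 6.0, 7.0]),
    "b": ([4.0, 5.0, 6.0], [1.0, 2.0, 3.0]),
}

# Correlation computed separately for each group.
within = {name: pearson(xs, ys) for name, (xs, ys) in groups.items()}

# Correlation computed after pooling all groups together.
all_x = [x for xs, _ in groups.values() for x in xs]
all_y = [y for _, ys in groups.values() for y in ys]
pooled = pearson(all_x, all_y)

print("within-group:", within)
print("pooled:", round(pooled, 3))
```

The within-group correlations are strongly positive while the pooled correlation is negative, which is why coloring and summarizing by class is worth including in a routine battery of plots.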
Model Fitting & Evaluation
Model Selection
                    Model A    Model B
Training Data I     Battery    Battery
Training Data II    Battery    Battery
stanford.edu/~jhuang11/
Homework At Scale
Topic Modeling
vis.stanford.edu/papers/termite
Layercake
Workflow Principles
Latent, Pervasive
Modular
Consistent Visual Language
Workflow Management
Azkaban
data.linkedin.com/opensource/azkaban
White Elephant
data.linkedin.com/opensource/white-elephant
Netflix’s Lipstick
github.com/Netflix/Lipstick
Extended Toolbox
tableausoftware.com/public
Tableau
rstudio.com/shiny/
RStudio Shiny
code.google.com/p/google-motion-charts-with-r
GoogleVis
rweb.stat.ucla.edu/ggplot2/
kuler.adobe.com
Adobe Kuler
colorbrewer2.org
Color Brewer
d3js.org
D3.js
bl.ocks.org/mbostock
Bostock’s Blocks
maps.stamen.com
Stamen OpenStreetMap Tiles
zipfianacademy.com/maps/h3/
SF Health Inspections