FROM DATA TO VISUALIZATION
Krist Wongsuphasawat (@kristw)
Senior Data Visualization Scientist, Twitter
Twitter Analytics / Visual Insights
Internal
Dashboarding system
Exploratory data visualization tools
External
Public facing visualizations
#interactive
http://twitter.github.io/interactive
Examples
What are visualizations?
pretty graphics
POWER OF THE EYES
pretty
MEANINGFUL
Anscombe’s Quartet
      #1             #2             #3             #4
   X      Y       X      Y       X      Y       X      Y
 10.0    8.04   10.0    9.14   10.0    7.46    8.0    6.58
  8.0    6.95    8.0    8.14    8.0    6.77    8.0    5.76
 13.0    7.58   13.0    8.74   13.0   12.74    8.0    7.71
  9.0    8.81    9.0    8.77    9.0    7.11    8.0    8.84
 11.0    8.33   11.0    9.26   11.0    7.81    8.0    8.47
 14.0    9.96   14.0    8.10   14.0    8.84    8.0    7.04
  6.0    7.24    6.0    6.13    6.0    6.08    8.0    5.25
  4.0    4.26    4.0    3.10    4.0    5.39   19.0   12.50
 12.0   10.84   12.0    9.13   12.0    8.15    8.0    5.56
  7.0    4.82    7.0    7.26    7.0    6.42    8.0    7.91
  5.0    5.68    5.0    4.74    5.0    5.73    8.0    6.89
Anscombe’s Quartet
Property                      Value
Mean of X                     9.0
Variance of X                 10.0 (population variance)
Mean of Y                     7.5
Variance of Y                 3.75 (population variance)
Correlation between X and Y   0.816
Linear regression             y = 3.0 + 0.5x
(the same values for #1, #2, #3 and #4)
Identical statistics!
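A minimal sketch that recomputes the summary statistics from the values listed in the tables above. The helper name `summarize` is hypothetical, and the variances are population variances (dividing by n), which is an assumption about how the slide's figures were derived.

```python
from math import sqrt

# The four datasets, copied from the tables above (#1 to #3 share the same X).
X123 = [10.0, 8.0, 13.0, 9.0, 11.0, 14.0, 6.0, 4.0, 12.0, 7.0, 5.0]
QUARTET = {
    "#1": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "#2": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "#3": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "#4": ([8.0] * 7 + [19.0] + [8.0] * 3,
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(xs, ys):
    """Return mean, population variance, correlation and least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "var_x": sxx / n,   # population variance (assumption, see lead-in)
        "mean_y": mean_y,
        "var_y": syy / n,
        "corr": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }


for label, (xs, ys) in QUARTET.items():
    stats = summarize(xs, ys)
    print(label, {name: round(value, 2) for name, value in stats.items()})
```

Printing the four rows side by side makes the point of the slide: the numbers agree to about two decimal places even though the point clouds look nothing alike.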
Anscombe’s Quartet
[Figure: four scatterplots, one per dataset, #1 to #4]
but very different
Napoleon’s March
geography
time
course (attack/retreat)
quantity of troops
temperature
direction
London Cholera Outbreak
Visualization
• Power
• Understand data quickly
• Discover hidden facts
• Usage
• Storytelling / Reporting
• Exploratory data analysis
“Visualization”
• Information Visualization (academia)
• InfoVis
• Data Visualization (commonly used)
• DataVis
• infographics (...)
How to start?
• What tool should I use?
1. What type of data do I have?
DATA
DATA
1) What type of data?
[Diagram: DATA fans out to seven candidate visualizations, vis1 through vis7]
Many options...
Which visualization technique should I use?
1) What type of data?
• Visualizations are categorized by data types:
• 2- and 3-dimensional
• Multi-dimensional
• Temporal
• Tree
• Network
• etc.
Let’s take a tour.
2D, 3D data
(real world objects)
a.k.a. Scientific Visualization (SciVis)
2D: Maps
3D: Brain
Multi-dimensional data
abstract dimensions
(+ real world dimensions)
Flowers
species sepalLength sepalWidth petalLength petalWidth
setosa 5.1 3.5 1.4 0.2
setosa 4.9 3.0 1.4 0.2
setosa 4.7 3.2 1.3 0.2
virginica 4.6 3.1 1.5 0.2
virginica 5.0 3.6 1.4 0.2
virginica 5.4 3.9 1.7 0.4
DATA
Scatterplot
http://bl.ocks.org/mbostock/3887118
Axes: Sepal Length, Sepal Width
Scatterplot Matrix
http://bl.ocks.org/mbostock/4063663
Dimensions: Sepal Length, Sepal Width, Petal Length, Petal Width
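A scatterplot matrix draws one small scatterplot for every pair of dimensions. The sketch below only prepares the point lists for each panel and leaves the drawing out. The rows are copied from the Flowers table above; the function name `scatterplot_matrix_panels` is hypothetical.

```python
from itertools import product

DIMENSIONS = ["sepalLength", "sepalWidth", "petalLength", "petalWidth"]

# Rows copied from the Flowers table above.
FLOWERS = [
    {"species": "setosa", "sepalLength": 5.1, "sepalWidth": 3.5, "petalLength": 1.4, "petalWidth": 0.2},
    {"species": "setosa", "sepalLength": 4.9, "sepalWidth": 3.0, "petalLength": 1.4, "petalWidth": 0.2},
    {"species": "setosa", "sepalLength": 4.7, "sepalWidth": 3.2, "petalLength": 1.3, "petalWidth": 0.2},
    {"species": "virginica", "sepalLength": 4.6, "sepalWidth": 3.1, "petalLength": 1.5, "petalWidth": 0.2},
    {"species": "virginica", "sepalLength": 5.0, "sepalWidth": 3.6, "petalLength": 1.4, "petalWidth": 0.2},
    {"species": "virginica", "sepalLength": 5.4, "sepalWidth": 3.9, "petalLength": 1.7, "petalWidth": 0.4},
]


def scatterplot_matrix_panels(rows, dimensions):
    """Return one list of (x, y) points per ordered pair of dimensions."""
    panels = {}
    for x_dim, y_dim in product(dimensions, repeat=2):
        panels[(x_dim, y_dim)] = [(row[x_dim], row[y_dim]) for row in rows]
    return panels


panels = scatterplot_matrix_panels(FLOWERS, DIMENSIONS)
print(len(panels), "panels")
print(panels[("sepalLength", "sepalWidth")])
```

With four dimensions this gives a 4 x 4 grid of panels; the single scatterplot shown earlier is just one cell of that grid.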
Cars
Name          economy (mpg)  cylinders  power (hp)  weight (lb)  0-60 mph (s)
Ford Mustang  18             6          88          3139         14.5
Honda Accord  31.5           4          68          2045         18.5
Honda Civic   24             4          97          2489         15
Mazda RX-7    23.7           3          100         2420         12.5
DATA
Parallel Coordinates
http://bl.ocks.org/jasondavies/1341281
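In parallel coordinates each dimension gets its own vertical axis and each row becomes a polyline across those axes. A minimal sketch of the data step, assuming every axis is scaled linearly to the range 0 to 1; the rows come from the Cars table above, while the key names and the function `to_polylines` are hypothetical.

```python
# Rows copied from the Cars table above; key names are chosen for this sketch.
CARS = [
    {"name": "Ford Mustang", "economy": 18.0, "cylinders": 6, "power": 88, "weight": 3139, "zero_to_sixty": 14.5},
    {"name": "Honda Accord", "economy": 31.5, "cylinders": 4, "power": 68, "weight": 2045, "zero_to_sixty": 18.5},
    {"name": "Honda Civic", "economy": 24.0, "cylinders": 4, "power": 97, "weight": 2489, "zero_to_sixty": 15.0},
    {"name": "Mazda RX-7", "economy": 23.7, "cylinders": 3, "power": 100, "weight": 2420, "zero_to_sixty": 12.5},
]
AXES = ["economy", "cylinders", "power", "weight", "zero_to_sixty"]


def to_polylines(rows, axes):
    """Scale every axis to [0, 1] and return one list of positions per row."""
    extents = {
        axis: (min(row[axis] for row in rows), max(row[axis] for row in rows))
        for axis in axes
    }
    polylines = {}
    for row in rows:
        positions = []
        for axis in axes:
            low, high = extents[axis]
            # A constant column has no spread; place it mid-axis.
            positions.append(0.5 if high == low else (row[axis] - low) / (high - low))
        polylines[row["name"]] = positions
    return polylines


polylines = to_polylines(CARS, AXES)
for name, positions in polylines.items():
    print(name, [round(p, 2) for p in positions])
```

Scaling each axis independently is what lets miles per gallon and pounds share one picture; the cost is that positions on different axes are not directly comparable.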
The Geography of Tweets
@miguelrios
tweet counts  latitude   longitude
20,000        27.174526  78.042153
9,000         49.124093  52.201304
1,000         12.2995    31.59592
...           ...        ...
DATA
tweet counts: abstract dimension
latitude, longitude: real world dimensions
Temporal Data
value changes over time
events
Line charts
http://bl.ocks.org/mbostock/3884955
Calendar chart
Events on timeline
http://evolutionofweb.appspot.com/#/evolution/day
Trees
hierarchy
Tree
http://bl.ocks.org/mbostock/4339083
Stock Market
Level 1: All stocks
Level 2: Financial, Healthcare, Technology, ...
Level 3: Apple, Google, Canon, ...
DATA
TreeMaps
http://www.marketwatch.com/tools/stockresearch/marketmap
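A treemap gives every leaf of the hierarchy a rectangle whose area is proportional to its value. The sketch below uses the simple slice-and-dice layout (alternate the split direction at each level); this is not necessarily the layout the linked market map uses. The sizes are made-up numbers, and placing Apple, Google and Canon under Technology is an assumption, since the slide only shows the three levels.

```python
# Hypothetical sizes; only the names come from the Stock Market slide.
TREE = {
    "name": "All stocks",
    "children": [
        {"name": "Financial", "size": 30.0},
        {"name": "Healthcare", "size": 20.0},
        {"name": "Technology", "children": [  # parentage assumed
            {"name": "Apple", "size": 25.0},
            {"name": "Google", "size": 15.0},
            {"name": "Canon", "size": 10.0},
        ]},
    ],
}


def total(node):
    """Sum of leaf sizes below a node (or the node's own size for a leaf)."""
    if "children" in node:
        return sum(total(child) for child in node["children"])
    return node["size"]


def slice_and_dice(node, x, y, width, height, depth=0, out=None):
    """Assign each leaf a rectangle (x, y, width, height) inside the given box."""
    if out is None:
        out = {}
    children = node.get("children")
    if not children:
        out[node["name"]] = (x, y, width, height)
        return out
    node_total = total(node)
    offset = 0.0
    for child in children:
        share = total(child) / node_total
        if depth % 2 == 0:
            # Even depth: cut the box into vertical strips, left to right.
            slice_and_dice(child, x + offset * width, y, share * width, height, depth + 1, out)
        else:
            # Odd depth: cut the box into horizontal strips, top to bottom.
            slice_and_dice(child, x, y + offset * height, width, share * height, depth + 1, out)
        offset += share
    return out


rects = slice_and_dice(TREE, 0.0, 0.0, 100.0, 100.0)
for name, rect in rects.items():
    print(name, tuple(round(v, 1) for v in rect))
```

Slice-and-dice is easy to follow but tends to produce thin strips; layouts that aim for squarer cells trade that simplicity for readability.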
Icicle
http://bl.ocks.org/mbostock/1005873
Sunburst
http://bl.ocks.org/mbostock/4348373
Networks
nodes and edges
Character Co-occurrences
{
  nodes: [
    'valjean',
    'fantine',
    'cosette',
    ...
  ],
  edges: [
    // count: number of co-occurrences (key name assumed; the slide shows a bare number)
    {character1: 'valjean', character2: 'fantine', count: 10},
    {character1: 'valjean', character2: 'cosette', count: 5},
    ...
  ]
}
DATA
Node-link diagram
http://bl.ocks.org/mbostock/4062045
Matrix
http://bost.ocks.org/mike/miserables/
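The matrix view is the same nodes-and-edges data laid out as a grid: one row and one column per character, one cell per pair. A minimal sketch using the three characters and two edges shown in the data slide; the `count` key and the function name `adjacency_matrix` are assumptions made for illustration.

```python
NODES = ["valjean", "fantine", "cosette"]

# "count" is an assumed key name for the number of co-occurrences.
EDGES = [
    {"character1": "valjean", "character2": "fantine", "count": 10},
    {"character1": "valjean", "character2": "cosette", "count": 5},
]


def adjacency_matrix(nodes, edges):
    """Build a square matrix of co-occurrence counts, indexed by node order."""
    index = {name: position for position, name in enumerate(nodes)}
    matrix = [[0] * len(nodes) for _ in nodes]
    for edge in edges:
        i = index[edge["character1"]]
        j = index[edge["character2"]]
        # Co-occurrence has no direction, so fill both halves of the matrix.
        matrix[i][j] = edge["count"]
        matrix[j][i] = edge["count"]
    return matrix


matrix = adjacency_matrix(NODES, EDGES)
for name, row in zip(NODES, matrix):
    print(f"{name:8s}", row)
```

A node-link diagram would draw the same two edges as lines; the matrix avoids line crossings, which matters more as the network gets denser.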
Combination
Multi-D + Temporal
Multi-D + Tree
Multi-D + Network
Temporal + Tree
Temporal + Network
...
Life Expectancy
(Multi-D + Temporal)
http://www.gapminder.org/videos/the-river-of-myths/
VISUALIZATION = visual encodings + interactions
• visual encodings (by data type): bar chart, line chart, matrix, node-link, treemaps, etc., or multiple views
• interactions: tooltips, animation, highlight, filter, etc.
DATA
1) What type of data?
[Diagram: DATA fans out to seven candidate visualizations, vis1 through vis7]
Many options...
Which visualization technique should I use?
DATA
1) What type of data?
[Diagram: the candidates narrow to vis3, vis4 and vis7]
Fewer options...
Still, which one should I use?
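Knowing the data type narrows the field, and this narrowing step can be sketched as a lookup. The table below only restates the techniques named in the tour above; the dictionary layout and the function name `candidates` are hypothetical, not part of the talk.

```python
# Techniques named in the tour above, grouped by data type.
TECHNIQUES_BY_DATA_TYPE = {
    "2D/3D": ["map"],
    "multi-dimensional": ["scatterplot", "scatterplot matrix", "parallel coordinates"],
    "temporal": ["line chart", "calendar chart", "timeline"],
    "tree": ["node-link tree", "treemap", "icicle", "sunburst"],
    "network": ["node-link diagram", "matrix"],
}


def candidates(data_types):
    """Return the techniques for one or more data types, without duplicates."""
    found = []
    for data_type in data_types:
        for technique in TECHNIQUES_BY_DATA_TYPE[data_type]:
            if technique not in found:
                found.append(technique)
    return found


print(candidates(["network"]))
print(candidates(["multi-dimensional", "temporal"]))
```

The lookup returns several candidates for any data type, so it cannot settle the choice on its own.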
How to start?
• What tool should I use?
1. What type of data do I have?
2. What do I want from the data?
DATA
2) What do I want from the data?
• Many ways to visualize one type of data.
• Things to consider:
• audience (data scientist, execs, etc.)
• goal (storytelling, exploratory analysis)
• tasks
Storytelling
Exploratory
Four more years
https://www.youtube.com/watch?v=01un0ORjQps
Photogrid (Treemap + photo)
http://twitter.github.io/interactive/sochi
Soccer Tournament
https://uclfinal.twitter.com/
State of the Union
http://twitter.github.io/interactive/sotu2014/#p1
Ok, now tools.
1. What type of data do I have?
2. What do I want from the data?
Tools
Option 1: Programming library. You have to write code.
Option 2: Packaged software. (Mostly) no coding involved.
Programming libraries
• d3.js, Processing, R, etc.
• Copy and modify from examples.
• Can do custom stuff (if you can figure out how)
• More overhead for common tasks
Packaged software
• Tableau (multi-dimensional)
• Gephi (graph)
• NodeXL (graph)
• Research projects (contact authors)
• Just use the software. No hassle of coding and debugging
• Functionality is limited to what the tool can do
• Custom designs are more difficult
Ideal workflow
1. What type of data do I have?
2. What do I want from the data?
3. Pick appropriate techniques/tools
4. Done!
Not that easy!
Real-life workflow
1. What type of data do I have?
2. Pre-process data (data are dirty: transform)
3. What do I want from the data?
4. Pick appropriate techniques/tools
5. See results
If unsatisfied: change goal or change perspective, then repeat.
New year 2014
http://twitter.github.io/interactive/newyear2014/
Behind the scenes
From Data to Visualization, what happens in between?
FROM DATA TO VISUALIZATION
@kristw
DATA first, not tools.
choose: visual encodings (by data types) + interactions
twitter.github.io/interactive
Thank you