Data Visualization
• http://www.ted.com/talks/hans_rosling_shows_the_best_stats_you_ve_ever_seen.html
Data Ambiguity
Failure to precisely define just what the data represent
[Figure: example chart with a single series labeled "Y-Value 1"]
Data Distortion
Exaggerating or understating the values of some of the data points
Data Distraction
Extraneous lines, graphics, etc.
[Figure: "Sales" pie chart: 1st Qtr 58%, 2nd Qtr 10%, 3rd Qtr 23%, 4th Qtr 9%]
How to make graphs that work
(advice from Seth Godin)
1. Don't let popular spreadsheets be in charge
of the way you look.
2. Tell a story.
3. Follow some simple rules.*
4. Break some other rules.
Classics – The Table
• While it might be possible to display data
better graphically, a table often does the job
quite nicely.
*Godin’s Rules
• Time goes from left to right.
Sales data in units
1st Quarter   2nd Quarter   3rd Quarter   4th Quarter
8.2           1.4           3.2           1.2
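A plain-text table of this kind takes only a few lines of code to produce. The sketch below is illustrative: the function name `format_table` and the spacing are assumptions, not something from the slides. The quarters stay in time order, left to right.

```python
def format_table(title, headers, values):
    """Return a small plain-text table with one row of values."""
    # Size every column to its widest cell so the columns line up.
    cells = [str(v) for v in values]
    widths = [max(len(h), len(c)) for h, c in zip(headers, cells)]
    header_row = "  ".join(h.ljust(w) for h, w in zip(headers, widths))
    value_row = "  ".join(c.rjust(w) for c, w in zip(cells, widths))
    return "\n".join([title, header_row, value_row])


# Quarterly unit sales from the slide, listed in time order (left to right).
quarters = ["1st Quarter", "2nd Quarter", "3rd Quarter", "4th Quarter"]
units = [8.2, 1.4, 3.2, 1.2]

table = format_table("Sales data in units", quarters, units)
print(table)
```

Right-aligning the numbers under each header makes values of different magnitudes easier to compare at a glance.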
Classics – Pie Charts
• Pie charts have a mixed reputation.
• They are popular in business and the media but
many information designers have criticized the
technique.
• Some claim that the pie slice shape
communicates numbers less exactly than other
possibilities such as line length.
• At least one study indicates that use of a pie chart
for analyzing a problem as opposed to a bar chart
changes the way people think about the problem.
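To make the comparison with line length concrete, the sketch below computes each quarter's share of total sales (the quantity a pie chart encodes as a slice) and draws it as a bar whose length is proportional to that share. The unit sales are the deck's own example figures; the helper names `share_of_total` and `text_bar` and the 40-character scale are illustrative assumptions.

```python
def share_of_total(values):
    """Return each value as a fraction of the sum of all values."""
    total = sum(values)
    return [v / total for v in values]


def text_bar(fraction, width=40):
    """Draw a fraction (0..1) as a run of '#' characters."""
    return "#" * round(fraction * width)


quarters = ["1st Qtr", "2nd Qtr", "3rd Qtr", "4th Qtr"]
units = [8.2, 1.4, 3.2, 1.2]  # unit sales from the deck's example

shares = share_of_total(units)
for label, share in zip(quarters, shares):
    print(f"{label}  {share:6.1%}  {text_bar(share)}")
```

Because all bars start at the same left edge, the reader compares lengths rather than angles or areas.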
*Godin’s Rules
• Pie charts are spectacularly overrated. If you
want to show me that four out of five
dentists prefer Trident and that we need to
target the fifth one, show me a picture of 5
dentists, but make one of them stand out. I'll
remember that.
[Figures: four variations of a "Sales" pie chart by quarter; the labeled versions show "Sales (% of total units)": 1st Qtr 58%, 2nd Qtr 10%, 3rd Qtr 23%, 4th Qtr 9%]
Your Options
(according to Yoda)
Do.
Do not.
Try.
Classics – Line Graphs
• Line graphs are classic diagrams that usually
give a good picture of the data.
• Line graphs should only be used when the
positions on the x-axis have a natural
ordering. If your labels are "2000, 2001,
2002" that's fine. If your labels are "US,
England, Germany" you should consider a bar
graph instead.
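The rule of thumb above can be expressed as a tiny helper. This is a deliberately crude sketch: `has_natural_order` and `suggest_chart` are hypothetical names, and treating "all labels parse as numbers" as a stand-in for "natural ordering" is an assumption. Ordered labels such as "1st Qtr" or month names would not be detected by this check.

```python
def has_natural_order(labels):
    """Crude check: treat labels as ordered only if all parse as numbers."""
    try:
        [float(label) for label in labels]
    except ValueError:
        return False
    return True


def suggest_chart(labels):
    """Suggest 'line' for ordered x-axis labels and 'bar' otherwise."""
    return "line" if has_natural_order(labels) else "bar"


print(suggest_chart(["2000", "2001", "2002"]))      # years: ordered
print(suggest_chart(["US", "England", "Germany"]))  # countries: unordered
```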
*Godin’s Rules
• Good results should go up on the Y axis. This
means that if you're charting weight loss,
don't chart "how much I weigh" because
good results would go down. Instead, chart
"percentage of goal" or "how much I lost."
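As a small illustration of this rule, the sketch below turns a series of weigh-ins into the two quantities Godin suggests charting. The weigh-in values, the goal weight, and the function names are all hypothetical.

```python
def amount_lost(weights):
    """Convert a series of weigh-ins into cumulative weight lost."""
    start = weights[0]
    return [start - w for w in weights]


def percent_of_goal(weights, goal_weight):
    """Express progress as a percentage of the planned total loss."""
    planned = weights[0] - goal_weight
    return [100 * (weights[0] - w) / planned for w in weights]


# Hypothetical weekly weigh-ins (kg) and a hypothetical goal weight.
weigh_ins = [90, 89, 87, 86, 84]
lost = amount_lost(weigh_ins)
progress = percent_of_goal(weigh_ins, goal_weight=80)
print(lost)
print(progress)
```

Plotted over time, the raw weigh-ins slope downward while both derived series slope upward, so good results go up.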
[Figure: line graph, "Sales (total units)": 1st Qtr 8.2, 2nd Qtr 1.4, 3rd Qtr 3.2, 4th Qtr 1.2; y-axis 0 to 9]
*Godin’s Rules
• Don't connect unrelated events. For
example, a graph of IQs of everyone in your
kindergarten class should be a series of
unrelated points, not a line graph. On the
other hand, your weight loss is in fact a
continuous function, so each piece of data
should be attached.
Classics – Bar Charts
• Bar charts are classic diagrams that usually
give a good picture of the data.
• Their main problem is that when there are
many bars, labeling becomes problematic.
• They also imply that the data is discrete; if
your data is something that is plausibly
continuously changing over time, for
instance, you might consider a line graph
instead.
[Figure: bar chart, "Sales (total units)": 1st Qtr 8.2, 2nd Qtr 1.4, 3rd Qtr 3.2, 4th Qtr 1.2; y-axis 0 to 9]
New Classics – Network Diagram
• Real-world information often comes in the
form of relationships between entities or
items, such as people who know each other
(social networks), or Web pages that are
connected to each other.
• In a network diagram, entities are drawn as
nodes and the relationships between them as
links.
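The data behind a node-and-link diagram can be held in a simple adjacency map. The sketch below is a minimal illustration with made-up names; `build_network` is a hypothetical helper, and it assumes undirected links (as with people who know each other). Links between Web pages would normally be directed.

```python
from collections import defaultdict


def build_network(links):
    """Build an undirected adjacency map from (node, node) link pairs."""
    neighbors = defaultdict(set)
    for a, b in links:
        neighbors[a].add(b)
        neighbors[b].add(a)
    return dict(neighbors)


# Hypothetical "knows" relationships between four people.
links = [("Ann", "Bob"), ("Ann", "Cy"), ("Bob", "Cy"), ("Cy", "Dee")]
network = build_network(links)

for person in sorted(network):
    print(person, "->", ", ".join(sorted(network[person])))
```

A drawing tool would place one node per key and one link per pair; the number of neighbors per node is a common choice for node size.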
New Classics – Word Cloud
• A "Word Cloud" enables you to see how
frequently words appear in a given text, or
see the relationship between a column of
words and a column of numbers.
• You can tweak your word "clouds" with
different fonts, layouts, and color schemes.
• Wordle.net
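The counting step behind a word cloud is straightforward. The sketch below is an assumption-laden illustration, not how Wordle works internally: `word_frequencies` is a hypothetical helper, and the sample text is adapted from an earlier slide.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count how often each word appears, ignoring case and punctuation."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


sample = "Tell a story. Follow some rules. Break some other rules."
counts = word_frequencies(sample)

# A word cloud would scale each word's font size by its count.
for word, count in counts.most_common(3):
    print(word, count)
```

Real tools usually also drop very common words ("a", "the") before sizing, which this sketch does not do.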
New Classics – Infographics
• Information graphics or infographics are
graphic visual representations of
information, data or knowledge.
The future of visualization
• One word: DATA
Example: NYT Cascade
• Cascade allows for precise analysis of the
structures that underlie sharing activity on the
web.
• It links browsing behavior on a site to sharing
activity to construct a detailed picture of how
information propagates through the social media
space.
• The tool and its underlying logic may be applied
to any publisher or brand interested in
understanding how its messages are shared.
• http://nytlabs.com/projects/cascade.html
