SlideShare a Scribd company logo
Data Visualization Nikhil Srivastava, 2015
Nikhil Srivastava
Moringa School
Summer 2015
Data Visualization Nikhil Srivastava, 2015
Data Visualization Nikhil Srivastava, 2015
Data Visualization
Data Visualization Nikhil Srivastava, 2015
Effective Data Visualization
Data Visualization Nikhil Srivastava, 2015
Principles and Techniques
Effective Data Visualization
of
Data Visualization Nikhil Srivastava, 2015
Learn and Apply
Principles and Techniques
Effective Data Visualization
the
of
Data Visualization Nikhil Srivastava, 2015
Nikhil Srivastava
nsrivast@gmail.com
0713 987 262
I build products & businesses in the fields of finance & technology.
I organize & visualize information for teaching & understanding.
nikhilsrivastava.com
Data Visualization Nikhil Srivastava, 2015
About You: Homework #1
Savio Abuga
Mosaab Baba
Victor Chweya
Ron Gichuhi
Kevin Kavai
Mutisya Maingi
Andrew Makachia
Andrew Molo
Leon Muchoki
Anthony Mwangi
Waiyaki Njomo
Data Visualization Nikhil Srivastava, 2015
Course Details
• Class format
• Course website [see email]
– Slides, demos, extra material
– Code samples and libraries
– Final projects
Data Visualization Nikhil Srivastava, 2015
Course Overview
• What is Data Visualization?
• Thinking and Seeing
• From Data to Graphics
• Highcharts & Javascript
• Class Project
• [Next Steps]
introduction
foundation & theory
building blocks
construction
Data Visualization Nikhil Srivastava, 2015
• What is Data Visualization?
• Thinking and Seeing
• From Data to Graphics
• Highcharts & Javascript
• Class Project
• [Next Steps]
introduction
foundation & theory
building blocks
construction
Data Visualization Nikhil Srivastava, 2015
Data Visualization (DV)
Information Visualization
Scientific Visualization
Infographics
Statistical Graphics
Informative Art
Art
Science
Statistics
JournalismDesign
Visual Analytics
Data Visualization Nikhil Srivastava, 2015
City/Town County Population
Ahero Kisumu 76,828
Athi River Machakos 139,380
Awasi Kisumu 93,369
Kangundo-Tala Machakos 218,557
Karuri Kiambu 129,934
Kiambu Kiambu 88,869
Kikuyu Kiambu 233,231
Kisumu Kisumu 409,928
Kitale Trans-Nzoia 106,187
Kitui Kitui 155,896
Limuru Kiambu 104,282
Machakos Machakos 150,041
Molo Nakuru 107,806
Mwingi Kitui 83,803
Naivasha Nakuru 181,966
Nakuru Nakuru 307,990
Nandi Hills Trans-Nzoia 73,626
Ruiru Kiambu 238,858
Thika Kiambu 139,853
Data Visualization Nikhil Srivastava, 2015
City/Town County Population
Ahero Kisumu 76,828
Athi River Machakos 139,380
Awasi Kisumu 93,369
Kangundo-Tala Machakos 218,557
Karuri Kiambu 129,934
Kiambu Kiambu 88,869
Kikuyu Kiambu 233,231
Kisumu Kisumu 409,928
Kitale Trans-Nzoia 106,187
Kitui Kitui 155,896
Limuru Kiambu 104,282
Machakos Machakos 150,041
Molo Nakuru 107,806
Mwingi Kitui 83,803
Naivasha Nakuru 181,966
Nakuru Nakuru 307,990
Nandi Hills Trans-Nzoia 73,626
Ruiru Kiambu 238,858
Thika Kiambu 139,853
• Which is the most populous
city in the list?
• Which county in the list has
the most cities?
• Which county in the list has
the largest average city?
Data Visualization Nikhil Srivastava, 2015
Data Visualization Nikhil Srivastava, 2015
• Which is the most populous
city in the list?
• Which county in the list has
the most cities?
• Which county in the list has
the largest average city?
Data Visualization Nikhil Srivastava, 2015
• Which is the most populous
city in the list?
• Which county in the list has
the most cities?
• Which county in the list has
the largest average city?
• What is the population of
Limuru?
Data Visualization Nikhil Srivastava, 2015
Data Visualization is:
• Useful
– Answers user questions
– Reduces user workload
(by design, not by default.)
Data Visualization Nikhil Srivastava, 2015
Anscombe’s quartet (1973)
Data Visualization Nikhil Srivastava, 2015
Anscombe’s quartet (1973)
Data Visualization Nikhil Srivastava, 2015
Data Visualization is:
• Important
– Resolve ambiguity
– Locate outliers
– Understand structure and patterns
Data Visualization Nikhil Srivastava, 2015
How much bigger is B than A?
A
B
Data Visualization Nikhil Srivastava, 2015
A B
How much bigger is B than A?
Data Visualization Nikhil Srivastava, 2015
A B
How much bigger is B than A?
Data Visualization Nikhil Srivastava, 2015
Data Visualization Nikhil Srivastava, 2015
Data Visualization is:
• Important
– Design decisions affect interpretation
Data Visualization Nikhil Srivastava, 2015
Data Visualization Nikhil Srivastava, 2015
Data Visualization Nikhil Srivastava, 2015
Data Visualization Nikhil Srivastava, 2015
Data Visualization is:
• Powerful
– Communicate, teach, inspire
Data Visualization Nikhil Srivastava, 2015
Why use DV?
• Explore, Analyze, Communicate
• Answer user questions, facilitate tasks
• Visual = efficient
Data Visualization Nikhil Srivastava, 2015
Definitions
• “the process that transforms (abstract) data into interactive
graphical representations for the purpose of exploration,
confirmation, or presentation” 1
• “finding the artificial memory that best supports our natural
means of perception” 2
• “the use of computer-generated, interactive, visual
representations of data to amplify cognition” 3
• “giving information a visual representation that is useful for
analysis and presentation” 4
Data Visualization Nikhil Srivastava, 2015
Planetary Movement Line Chart
Unknown, ~1000 AD
A (Brief) History
Data Visualization Nikhil Srivastava, 2015
A New Chart of History
Joseph Priestley, 1769
Data Visualization Nikhil Srivastava, 2015
“Linear Chronology” of Economy Metrics
William Playfair, 1824
Data Visualization Nikhil Srivastava, 2015
Crimean War Deaths
Florence Nightingale, 1858 (re-colorized)
Data Visualization Nikhil Srivastava, 2015
Mother of All Demos
Douglas Engelbart, 1968
Data Visualization Nikhil Srivastava, 2015
Why is DV relevant?
“The ability to take data—to be able to
understand it, to process it, to extract value from
it, to visualize it, to communicate it— that’s going
to be a hugely important skill in the next
decades, ... because now we really do have
essentially free and ubiquitous data.”
- Hal Varian, 2009
Data Visualization Nikhil Srivastava, 2015
Why is DV relevant?
• In one second …
• Open data
• Open technologies
• Growing use in business, science, media,
advertising
Data Visualization Nikhil Srivastava, 2015
Focus Extra
purpose communicate explore, analyze
data numerical,
categorical
text, maps,
graphs, networks
feature representation animation,
Interactivity
Course Scope
Data Visualization Nikhil Srivastava, 2015
For next time: Homework #2
• Find a data visualization
Data Visualization Nikhil Srivastava, 2015
Questions?
Nikhil Srivastava
nsrivast@gmail.com
0713 987 262

More Related Content

PPTX
SeeingAndThinking
PPTX
Data Visualization Summary iHub
PPTX
Advanced Visualizations
PPTX
From Data To Graphics
PDF
WDQI Spring Conference Agenda
PDF
Data visualization, FMIIT, STD
PDF
Predictive Analytics vs Prescriptive Analytics [Infographic]
PPT
Introduction to Data Visualizations
SeeingAndThinking
Data Visualization Summary iHub
Advanced Visualizations
From Data To Graphics
WDQI Spring Conference Agenda
Data visualization, FMIIT, STD
Predictive Analytics vs Prescriptive Analytics [Infographic]
Introduction to Data Visualizations

Viewers also liked (9)

PPSX
The art technique of data visualization
PPTX
Business analytics and data visualisation
PPTX
Prescriptive Analytics
PPTX
Benefits of data visualization
PDF
The Importance of Data Visualization
PPTX
The 8 Hats of Data Visualisation
PDF
Data Visualization Techniques
PDF
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
PPT
Data mining slides
 
The art technique of data visualization
Business analytics and data visualisation
Prescriptive Analytics
Benefits of data visualization
The Importance of Data Visualization
The 8 Hats of Data Visualisation
Data Visualization Techniques
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
Data mining slides
 
Ad

Similar to What is Data Visualization (20)

PPTX
Data & Analytics Club - Data Visualization Workshop
PPTX
Storytelling through data
PDF
Data visualization for development
PPTX
Data visualisation in data analytics with python
PPTX
Unit III.pptx
PPTX
Data visualization is the representation of data through use of common graphi...
PDF
Webinar - Introduction to Data Visualisations and Analytics.
PDF
Class 3 visual representation of data
KEY
Data Visualization Strategies & Open Source Tools
PPTX
HYDSPIN Dec14 visual story telling
PPTX
Introduction to Data Visualization Slides
PPTX
Datamining data visualization
PDF
Data Viz CE 2014 Intro and Overview
PDF
Data Visualization for Non-Programmers
PDF
What is data visualization
PPTX
Kigali | Nov-14 | Data driven journalism
PPTX
Data visualization of Big Data analytics
PPTX
Data visualization representation of Analytics data
PPTX
Data visualization
PPTX
Introduction to Data science and understanding the basics
Data & Analytics Club - Data Visualization Workshop
Storytelling through data
Data visualization for development
Data visualisation in data analytics with python
Unit III.pptx
Data visualization is the representation of data through use of common graphi...
Webinar - Introduction to Data Visualisations and Analytics.
Class 3 visual representation of data
Data Visualization Strategies & Open Source Tools
HYDSPIN Dec14 visual story telling
Introduction to Data Visualization Slides
Datamining data visualization
Data Viz CE 2014 Intro and Overview
Data Visualization for Non-Programmers
What is data visualization
Kigali | Nov-14 | Data driven journalism
Data visualization of Big Data analytics
Data visualization representation of Analytics data
Data visualization
Introduction to Data science and understanding the basics
Ad

Recently uploaded (20)

PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
Lesson notes of climatology university.
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Presentation on HIE in infants and its manifestations
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Pharma ospi slides which help in ospi learning
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Classroom Observation Tools for Teachers
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
master seminar digital applications in india
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Lesson notes of climatology university.
102 student loan defaulters named and shamed – Is someone you know on the list?
Pharmacology of Heart Failure /Pharmacotherapy of CHF
STATICS OF THE RIGID BODIES Hibbelers.pdf
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Presentation on HIE in infants and its manifestations
Complications of Minimal Access Surgery at WLH
Pharma ospi slides which help in ospi learning
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Classroom Observation Tools for Teachers
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
VCE English Exam - Section C Student Revision Booklet
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
master seminar digital applications in india

What is Data Visualization

  • 1. Data Visualization Nikhil Srivastava, 2015 Nikhil Srivastava Moringa School Summer 2015
  • 2. Data Visualization Nikhil Srivastava, 2015
  • 3. Data Visualization Nikhil Srivastava, 2015 Data Visualization
  • 4. Data Visualization Nikhil Srivastava, 2015 Effective Data Visualization
  • 5. Data Visualization Nikhil Srivastava, 2015 Principles and Techniques Effective Data Visualization of
  • 6. Data Visualization Nikhil Srivastava, 2015 Learn and Apply Principles and Techniques Effective Data Visualization the of
  • 7. Data Visualization Nikhil Srivastava, 2015 Nikhil Srivastava nsrivast@gmail.com 0713 987 262 I build products & businesses in the fields of finance & technology. I organize & visualize information for teaching & understanding. nikhilsrivastava.com
  • 8. Data Visualization Nikhil Srivastava, 2015 About You: Homework #1 Savio Abuga Mosaab Baba Victor Chweya Ron Gichuhi Kevin Kavai Mutisya Maingi Andrew Makachia Andrew Molo Leon Muchoki Anthony Mwangi Waiyaki Njomo
  • 9. Data Visualization Nikhil Srivastava, 2015 Course Details • Class format • Course website [see email] – Slides, demos, extra material – Code samples and libraries – Final projects
  • 10. Data Visualization Nikhil Srivastava, 2015 Course Overview • What is Data Visualization? • Thinking and Seeing • From Data to Graphics • Highcharts & Javascript • Class Project • [Next Steps] introduction foundation & theory building blocks construction
  • 11. Data Visualization Nikhil Srivastava, 2015 • What is Data Visualization? • Thinking and Seeing • From Data to Graphics • Highcharts & Javascript • Class Project • [Next Steps] introduction foundation & theory building blocks construction
  • 12. Data Visualization Nikhil Srivastava, 2015 Data Visualization (DV) Information Visualization Scientific Visualization Infographics Statistical Graphics Informative Art Art Science Statistics JournalismDesign Visual Analytics
  • 13. Data Visualization Nikhil Srivastava, 2015 City/Town County Population Ahero Kisumu 76,828 Athi River Machakos 139,380 Awasi Kisumu 93,369 Kangundo-Tala Machakos 218,557 Karuri Kiambu 129,934 Kiambu Kiambu 88,869 Kikuyu Kiambu 233,231 Kisumu Kisumu 409,928 Kitale Trans-Nzoia 106,187 Kitui Kitui 155,896 Limuru Kiambu 104,282 Machakos Machakos 150,041 Molo Nakuru 107,806 Mwingi Kitui 83,803 Naivasha Nakuru 181,966 Nakuru Nakuru 307,990 Nandi Hills Trans-Nzoia 73,626 Ruiru Kiambu 238,858 Thika Kiambu 139,853
  • 14. Data Visualization Nikhil Srivastava, 2015 City/Town County Population Ahero Kisumu 76,828 Athi River Machakos 139,380 Awasi Kisumu 93,369 Kangundo-Tala Machakos 218,557 Karuri Kiambu 129,934 Kiambu Kiambu 88,869 Kikuyu Kiambu 233,231 Kisumu Kisumu 409,928 Kitale Trans-Nzoia 106,187 Kitui Kitui 155,896 Limuru Kiambu 104,282 Machakos Machakos 150,041 Molo Nakuru 107,806 Mwingi Kitui 83,803 Naivasha Nakuru 181,966 Nakuru Nakuru 307,990 Nandi Hills Trans-Nzoia 73,626 Ruiru Kiambu 238,858 Thika Kiambu 139,853 • Which is the most populous city in the list? • Which county in the list has the most cities? • Which county in the list has the largest average city?
  • 15. Data Visualization Nikhil Srivastava, 2015
  • 16. Data Visualization Nikhil Srivastava, 2015 • Which is the most populous city in the list? • Which county in the list has the most cities? • Which county in the list has the largest average city?
  • 17. Data Visualization Nikhil Srivastava, 2015 • Which is the most populous city in the list? • Which county in the list has the most cities? • Which county in the list has the largest average city? • What is the population of Limuru?
  • 18. Data Visualization Nikhil Srivastava, 2015 Data Visualization is: • Useful – Answers user questions – Reduces user workload (by design, not by default.)
  • 19. Data Visualization Nikhil Srivastava, 2015 Anscombe’s quartet (1973)
  • 20. Data Visualization Nikhil Srivastava, 2015 Anscombe’s quartet (1973)
  • 21. Data Visualization Nikhil Srivastava, 2015 Data Visualization is: • Important – Resolve ambiguity – Locate outliers – Understand structure and patterns
  • 22. Data Visualization Nikhil Srivastava, 2015 How much bigger is B than A? A B
  • 23. Data Visualization Nikhil Srivastava, 2015 A B How much bigger is B than A?
  • 24. Data Visualization Nikhil Srivastava, 2015 A B How much bigger is B than A?
  • 25. Data Visualization Nikhil Srivastava, 2015
  • 26. Data Visualization Nikhil Srivastava, 2015 Data Visualization is: • Important – Design decisions affect interpretation
  • 27. Data Visualization Nikhil Srivastava, 2015
  • 28. Data Visualization Nikhil Srivastava, 2015
  • 29. Data Visualization Nikhil Srivastava, 2015
  • 30. Data Visualization Nikhil Srivastava, 2015 Data Visualization is: • Powerful – Communicate, teach, inspire
  • 31. Data Visualization Nikhil Srivastava, 2015 Why use DV? • Explore, Analyze, Communicate • Answer user questions, facilitate tasks • Visual = efficient
  • 32. Data Visualization Nikhil Srivastava, 2015 Definitions • “the process that transforms (abstract) data into interactive graphical representations for the purpose of exploration, confirmation, or presentation” 1 • “finding the artificial memory that best supports our natural means of perception” 2 • “the use of computer-generated, interactive, visual representations of data to amplify cognition” 3 • “giving information a visual representation that is useful for analysis and presentation” 4
  • 33. Data Visualization Nikhil Srivastava, 2015 Planetary Movement Line Chart Unknown, ~1000 AD A (Brief) History
  • 34. Data Visualization Nikhil Srivastava, 2015 A New Chart of History Joseph Priestley, 1769
  • 35. Data Visualization Nikhil Srivastava, 2015 “Linear Chronology” of Economy Metrics William Playfair, 1824
  • 36. Data Visualization Nikhil Srivastava, 2015 Crimean War Deaths Florence Nightingale, 1858 (re-colorized)
  • 37. Data Visualization Nikhil Srivastava, 2015 Mother of All Demos Douglas Engelbart, 1968
  • 38. Data Visualization Nikhil Srivastava, 2015 Why is DV relevant? “The ability to take data—to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it— that’s going to be a hugely important skill in the next decades, ... because now we really do have essentially free and ubiquitous data.” - Hal Varian, 2009
  • 39. Data Visualization Nikhil Srivastava, 2015 Why is DV relevant? • In one second … • Open data • Open technologies • Growing use in business, science, media, advertising
  • 40. Data Visualization Nikhil Srivastava, 2015 Focus Extra purpose communicate explore, analyze data numerical, categorical text, maps, graphs, networks feature representation animation, Interactivity Course Scope
  • 41. Data Visualization Nikhil Srivastava, 2015 For next time: Homework #2 • Find a data visualization
  • 42. Data Visualization Nikhil Srivastava, 2015 Questions? Nikhil Srivastava nsrivast@gmail.com 0713 987 262

Editor's Notes

  • #3: Discussion questions: What exactly are we looking at? What are the different components of this image: lines, areas, colors? What does this image tell you? Why was it designed this way? This class is intended to help you answer these questions. We’re going to learn the how and why of data visualization.
  • #4: How are we going to do this? We’ll start by what DV is: where and why it is used.
  • #5: We’ll learn about what it means for DV to be effective - to be purposefully designed, to use appropriate techniques, and to achieve a specific purpose.
  • #6: We’ll learn the principles of effective DV: some obvious and some subtle, some universal and some contentious. These principles will be grounded in how humans perceive, process, and draw insight from visual information. We’ll also learn the techniques for producing effective DV: how different types of charts and graphics work, how to make design choices, and guidelines for good design.
  • #7: And once we’ve mastered these concepts, we’ll apply them in creating visualizations of our own. It’s a lot to cover! But before we get started, let’s get to know each other and go through some course details.
  • #8: Who I am, where I’m from, what I do, what I’m about.
  • #9: Let’s introduce ourselves. Share with us your answer to the first homework question: a piece of numerical data that describes something about your background, your life, or your interests.
  • #10: Lastly, before we get started – a few notes on the course. We’re going to use a variety of formats: lectures, discussion, demos, homework, coding, presentations. (The first class will be more lecture than usual.) Also, please jump in with questions and discussion anytime. Please continue to monitor the course website. It will have links to everything we cover in class as well as extra material. It will also have links to code samples and libraries, and we’ll put all the projects up at the end of the class. The website will be available after the class ends for your reference.
  • #11: So the way this class is organized is as follows. The first half will be information-based. We’ll start by learning what DV is, what it is used for, how and when it is commonly used (properly and improperly), and why it is relevant. Then we’ll learn how humans process visual information, and why that is relevant to visualization design. Next, we’ll study the building blocks of visualizations – the points, lines, shapes, and colors that represent data – and we’ll organize in different ways to produce different types of charts. We’ll learn about a range of commonly-used charts for visualization, their use cases, advantages, and disadvantages. After learning what makes a chart, we will consider what makes a *good* chart. We’ll learn principles and guidelines for good visualization design and be able to critique existing visualizations. The second half will be project-based. We will do an introduction to Javascript and the charting library Highcharts, and we’ll explore and tinker with examples of basic charts. Then we’ll get to work on our project of building our own data visualizations – working individually or in pairs, we’ll select our data, determine the visualization objectives, make design choices, sketch, prototype, and finally implement the visualizations. We’ll have presentations with class discussion and feedback. If there’s time, we’ll have a discussion on advanced topics in visualization.
  • #12: Alright, let’s get started – what is data visualization?
  • #13: It’s difficult to define precisely: as a field, DV has many related and overlapping goals and descriptions. It is often used interchangeably with different terms, and it falls under many different disciplines.
  • #14: Better than a definition is an example. Let’s take a look at this table of Kenyan cities showing city name, county name, and city population. Take a moment to understand the structure of this data, because I’m about to ask you a few questions on it.
  • #15: Better than a definition is an example. Let’s take a look at this table of Kenyan cities showing city name, county name, and city population. Take a moment to understand the structure of this data, because I’m about to ask you a few questions on it.
  • #16: Now let’s look at the data in a different way. We’re looking at a visualization of the same data known as a bar chart. Each city is represented by a bar whose length is proportional to the city’s population. Cities within the same county are colored the same and grouped together. Within each county, cities are ordered in decreasing population.
  • #17: Now, let’s answer the same questions by using the visualization. What are the cognitive steps required? How easy or difficult is the process?
  • #18: Now let’s ask an additional question we didn’t ask before.
  • #19: We’ve learned that data visualization can be useful in telling us things about a set of data, making it easier to find information and answer questions. We’ve also learned that this usefulness depends both on the design of the visualization and the specific information we are looking for.
  • #20: Let’s take a look at another example. This is a data set called Anscombe’s Quartet, named after the statistician who devised it. It consists of four separate sets of data, each of which is a list of ten pairs of numbers. So there are ten different X and Y values that are paired. To make this a bit more concrete, you can imagine that each data set describes ten people, X represents their height and Y represents their weight. The interesting thing is that all four of these data sets have exactly the same relationship between the X and Y numbers. All X values have the same average and standard deviation, and so do all Y values. Furthermore, the correlation between X and Y is the same for all sets. And except for the last one (which has a bunch of 8s), there’s not much we can do to distinguish them or describe them meaningfully by just looking at the numbers in the table. Now let’s see what happens when we plot them.
  • #21: Here we’ve visualized the data in what’s known as a scatter plot. Each dot represents one of the ten pairs, located on the horizontal axis by X value and on the vertical axis by Y value. By visualizing the data, we see patterns, outliers, and relationships that were impossible to detect in the chart.
  • #22: So we’ve learned that DV is important. It can help us resolve ambiguous data, locate outliers, and generally understand the structure and pattern of a data set.
  • #23: Another example. How much bigger do you think line B is compared with line A? 1 times, 2 times, 5 times?
  • #24: Now, what about these two squares?
  • #25: And these two cubes? This exercise shows us that our perception of the relative sizes of objects can be skewed in different dimensions. This is important if we are trying to communicate the relationship between two numbers, because our design choices will effect the interpretation. Later in the course, we’ll learn about ways of avoiding confusion in these situations.
  • #28: Locations of geocoded tweets in Nairobi before the 2013 presidential elections, a collaborative between Ushahidi and Hivos.
  • #29: Infographic of twitter activity in Africa in late 2013 produced by Portland Communications.
  • #30: Interactive tool from the Gapminder Foundation animating the health and wealth of world countries over time. This screenshot shows the historical path of Kenya from 1800 to 2013. Note the number of data types (life expectancy, GDP, population per country and year) and variety of visual encodings (x- and y- position, size, color, time).
  • #32: Locate data (What is current temp?) Analyze trend (Is X improving?) Understand relationship (between X and Y, as in Anscombe quartet example) Identify points/areas of interest Spot outliers
  • #33: [1] Alexander Lex, Harvard CS171 (2015) [2] Bertin (1967) [3] Sneiderman et al. (1999) [4] John Stasko, CS7450 (2013)
  • #34: Monastery text apparently plotting the latitude of celestial bodies over time. According to Tufte “a mysterious and isolated wonder in the history of data graphics”, predates common line charts by 800 years. Source: http://commons.wikimedia.org/wiki/File:Planetary_Movements.gif
  • #35: Joseph Priestley produced this timeline of historical empires and rulers to map out the world’s history. Regions are organized vertically, and time proceeds horizontally. Areas represent different rules or regimes, and the largest areas (empires) are color-colored. Priestley’s instructions for viewing: “If a person carry his eye horizontally, he sees, in a very short time, all the revolutions that have taken place in any particular country […] and this is done with more exactness, and in much less time, than it could have been done by reading […] If the reader carries his eye vertically, he will see the contemporary state of all the empires subsisting in the world, at any particular time.” Priestley intended this to be read alongside “A Chart of Biography” that he built four years earlier. Source: https://en.wikipedia.org/wiki/A_New_Chart_of_History Source: A Description of a New Chart of History: Containing a View of the Principal Revolutions of Empire that Have Taken Place in the World, Joseph Priestley, 1786
  • #36: Colored line chart showing various economic metrics: price of bread, exports, national debt. Notable for use of common y-scale (different units), color, linear time axis. Source: http://libweb5.princeton.edu/visual_materials/maps/websites/thematic-maps/quantitative/sociology-economics/playfair-chronology-1824.jpg
  • #37: After treating British soldiers wounded in the Crimean War and taking detailed notes on causes of death, Florence Nightingale produced this graphic in a report to Parliament to promote sanitary improvements in hospitals and barracks. The visualization is a variant of a pie chart known as a polar-area diagram, rose diagram, or coxcomb - wedges have equal angle but varying radius (instead of varying angle and equal radius in a pie chart). It breaks down the total number of war deaths in the Crimean war by cause: blue for preventable disease, red for battle injuries, and black for other causes. Source: https://en.wikipedia.org/wiki/Pie_chart#/media/File:Nightingale-mortality.jpg Source: http://itelligencegroup.com/uk/localinsights/reworking-florence-nightingales-diagram-of-the-causes-of-mortality-in-the-army-in-the-east-with-sap-lumira/?isnews=
  • #38: Doug Engelbart gave a presentation of his research team’s work at the Augmentation Research Center to expand the interface for computers for information management and retrieval. The 1950s and 60s saw rapid advancements in modern computing and computer graphics that led to the advancement of scientific and information visualizations. Source: http://www.wired.com/2013/12/tech-time-warp-engelbart/ Full video: https://www.youtube.com/watch?v=yJDv-zdhzMY
  • #39: Hal Varian chief economist at Google.