SlideShare a Scribd company logo
Introduction to Information
Visualisation
Dr Mia Ridge, @mia_out
Digital Curator, British Library
CHASE Arts and Humanities in the Digital Age, February 2017
While we're getting started...
• Check that you can get online with the browsers Firefox or Chrome
• The Exercises page contains all the links you need during the day
• Check you can view it now: http://bit.ly/2kYtGx4
• Check you can log in to Viewshare with your new account
http://viewshare.org/
• Timetable
• 11am Tea and coffee
• 1 - 1:45pm Lunch
• 3 - 3:15pm Tea and coffee
• 4:30pm Finish; free working time until 5pm
Overview
• What is information visualisation and why use
it?
• The building blocks of visualisations
• Exploring and critiquing interactive
visualisations
• Getting from the data you have to the
visualisation you want
What is visualisation?
Visualisation is the graphical display of
quantitative or qualitative information to create
insights by highlighting patterns, trends,
variations and anomalies.
From this...
...to this
...or this
Data visualisation can help you...
Explore your data
Explain your results
Why visualise information?
For 'sense-making (also called data analysis) and
communication' (Stephen Few)
'…showing quantitative and qualitative information
so that a viewer can see patterns, trends, or
anomalies, constancy or variation' (Michael
Friendly)
'…interactive, visual representations of abstract
data to amplify cognition' (Card et al)
'Distant reading' (Moretti) - focus on the shape
rather than detail of a collection
Introductions
• In a sentence or two, what's your interest in
data visualisation?
– What kinds of data do you work with?
– What's the goal of any visualisations you're
interested in creating?
– Do you have any potential users in mind?
The building blocks of visualisation
Joseph Priestley, 1769
Florence Nightingale's petal charts, 1857
Charts
https://cloud.highcharts.com/show/azujym
John Snow's cholera map, 1854
Charles Minard's figurative map, 1869
'Figurative Map of the successive losses in men of the French Army in the Russian campaign
1812-1813'. Drawn up by M. Minard, Inspector General of Bridges and Roads in retirement.
Paris, November 20, 1869.
Web 2.0 and the mashup, 2006
http://www.bombsight.org
Small multiples
The old tube map
Harry Beck, 1931
Exercise: compare n-gram tools
http://bit.ly/2kYtGx4
• Think of two words or phrases you'd like to
compare over time (e.g. Burma, Burmah).
• Open two browser windows
• In one, go to http://books.google.com/ngrams
• In the other, go to http://benschmidt.org/OL/
• Enter your words or phrases in each and compare
the results
• Discuss with your neighbour: what differences
did you find, and why?
Exploring words
http://www.codeitmagazine.com/images/text.png
Exploring words
http://www.jasondavies.com/wordtree/
Networks
http://networks.viraltexts.org/1836to1899/
Networks
Every point on this diagram represents a male film producer. The pink dots represent men who worked exclusively with other men in the period
surveyed, and the green dots represent those who worked with women.
https://theconversation.com/women-arent-the-problem-in-the-film-industry-men-are-68740 Deb Verhoeven and Stuart Palmer
Visualising images and video
http://www.flickr.com/photos/culturevis/5883371358/
'Mondrian vs. Rothko', Lev Manovich, 2010. Image preparation: Xiaoda Wang
Sonification
http://www.caseyrule.com/projects/sounds-of-sorting/
http://notes.husk.org/post/509063519/infographics
Data types
• Quantitative
• Qualitative
• Geographic
• Temporal
• Media
• Entities (people, places, events, concepts,
things)
How do you get data to visualise?
• Make it
– Type it into a spreadsheet or database
• Automate it
– Extract it from text, images, audio or video
• Find it
– Lots of freely available data to practice with
Topic modelling
http://discontents.com.au/mining-for-meanings/
Other forms of text analysis
Entity
recognition:
turning text into
things
Entity recognition examples
Extracting information from video
http://emotions.periscopic.com/inauguration/
Extracting information from images
https://www.clarifai.com/demo
Exercise: try entity recognition
Go to http://bit.ly/2kYtGx4 and follow the steps
for text or images
Exploring scholarly visualisations
Scholarly data visualisations
• Visualisations as 'distant reading' where
distance is 'a specific form of knowledge:
fewer elements, hence a sharper sense of
their overall interconnection' (Moretti, 2005)
• Inspiring curiosity and research questions
• But - which questions do they privilege and
what do they leave out?
Exercise: critiquing scholarly visualisations
Go to http://bit.ly/2kYtGx4 and follow the steps
for Exercise 3
Pair up and discuss together before reporting
back.
America's Public Bible
http://americaspublicbible.org/
http://on-broadway.nyc/
http://www.sixdegreesoffrancisbacon.com/
http://maps.bristol.gov.uk/knowyourplace/
https://www.historypin.org/
Visualizing Emancipation
http://www.americanpast.org/emancipation/
New York Society Library’s City Readers
http://cityreaders.nysoclib.org/About/visualizations
Mapping the Republic of Letters
http://www.stanford.edu/group/toolingup/rplviz/rplviz.swf
https://www.locatinglondon.org/
Digital Harlem
http://digitalharlem.org
Digital Public Library of America
http://dp.la/
Orbis
http://orbis.stanford.edu
Lost Change
http://tracemedia.co.uk/lostchange/
State of the Union
http://benschmidt.org/poli/2015-SOTU
http://viraltexts.northeastern.edu/
Comments or questions?
From the data you have to the
visualisation you want
Dealing with humanities data
Considerations for humanities data
Commercial tools often assume complete, born-
digital datasets – no missing fields or changes in
data entry over time
• Historical records often contain uncertainty
and fuzziness (e.g. date ranges, multiple
values, uncertain or unavailable information)
• Includes metadata, data, digital surrogates
Messiness in historical data
• 'Begun in Kiryu, Japan, finished in France'
• 'Bali? Java? Mexico?'
• Variations on USA:
– U.S.
– U.S.A
– U.S.A.
– USA
– United States of America
– USA ?
– United States (case)
• Inconsistency in uncertainty
– U.S.A. or England
– U.S.A./England ?
– England & U.S.A.
When were objects collected?
http://ibm.co/OS3HBa
Computers don't cope
Preparing data for visualisations
Historical data often needs manual cleaning to:
 remove rows where vital information is missing
 tidy inconsistencies in term lists or spelling
 convert words to numbers (e.g. dates)
 remove hard returns and non-ASCII characters (or
change data format)
 split multiple values in one field into other
columns (e.g. author name, date in single field)
 expand coded values (e.g. countries, language)
Open Refine
…but be careful
What do you want to visualise?
Structure
Purpose
Data
Audience
What do you want to do?
• See relationships among data points
• Compare a set of values
• Track change over time
• See the parts of a whole
See relationships among data points
• Scatterplot
• Matrix
• Network diagram
Compare a set of values
• Bar chart
• Bubble chart
• Histogram
Track change over time
• Line graph
• Stack graph
See the parts of a whole
• Pie chart
• Treemap
Key format decisions
• Static or interactive?
• Print or digital?
• Narrative or 'factual'?
• Shape (distant view) or detail (close view)?
Purpose, data, audience, structure
• Intersections of format and purpose
• Data types: quantitative, qualitative,
geographic, time series, media, entities
(people, places, events, concepts, things)
• Static, interactive; print, digital; product,
process
• Exploratory, explanatory: find new insights, or
tell a story? Pragmatic, emotive?
Dealing with complex data
• Find a visualisation type that can harbour the
data in a meaningful way or reduce the data in
a meaningful way.
– e.g. go from individual values to distribution of
values
– e.g. introduce interaction: overview, zoom and
filter, details on demand (Ben Shneiderman)
Exercise: 10 minute Viewshare tutorial
Instructions http://bit.ly/2kYtGx4
Discuss: what did you learn about preparing
data and using visualisation software?
Choosing a structure
http://extremepresentation.com/design/7-charts/
Introduction to information visualisation for humanities PhDs
http://extremepresentation.com/design/7-charts/
Giorgia Lupi and Stefanie Posavec http://www.dear-data.com/all
Preparing data
Data Preparation
• Generally needs to be in tables, one row per
item, one column per value
• Aggregate or individual values - might need to
calculate totals in advance
• Data should be made as consistent as possible
with tools like Excel, OpenRefine
Document data preparation!
Sample advice
From viewshare, on spreadsheets:
• Remove any data that is not in a solid rectangular area.
This includes white space, page titles, scattered cells,
and additional worksheets.
• Check that your formatting is consistent throughout
each column (e.g. column is all in date format, currency
format, etc. as appropriate).
• Make sure that data of the same type but in different
columns is formatted consistently (e.g. dates in
different columns are in the same date format).
If all else fails...
• Sketch out your visualisation on paper to test
it
• Iteration is key, and...
• Stubbornness is a virtue!
Exercise: try views and widgets in
Viewshare
Instructions http://bit.ly/2kYtGx4
Views
• Lists, maps, pie charts, bar charts, scatter plots, tables,
timelines or galleries
Widgets
• Search boxes, lists, tag clouds, sliders, ranges, logos or
text
How might you apply these with your own data?
Design matters
Worst practice in data visualisations
Source: http://www.forbes.com/sites/naomirobbins/2013/01/03/deceptive-donut-chart/
Worst practice in data visualisations
Source: https://twitter.com/altonncf/status/293392615225823232
Visualisations and 'truthiness'
A sample of publication printing locations 1534-1831 (British Library data)
http://bit.ly/W9VM7D
Visualising uncertainty
Matt Lincoln http://blogs.getty.edu/iris/metadata-specialists-share-their-challenges-defeats-and-triumphs/#matt
Visualising uncertainty
Publishing visualisations
• How can you contextualise, explain any
limitations of your visualisations? e.g.
– provenance and qualities of original dataset;
– what you needed to do to it to get it into software
(how transformed, how cleaned);
– what's left out of the visualisation, and why?
Best practice for design
• How effectively does the visualisation support
cognitive tasks?
• The most important and frequent visual
queries/pattern finding should be supported
with the most visually distinct objects
• Question: which examples did this well?
Do you really need a visualisation?
• Use tables when:
– doc will be used to look up individual values
– to compare individual values
– precise values are required
– the quantitative info to be communicated involves
more than one unit of measure
• Use graphs when:
– the message is contained in the shape of the values
– the document will be used to reveal relationships
among values
Don’t Do try this at home
Tools that don't require programming
• Excel
• Google Fusion Tables, Google Drive
• Viewshare
• Tableau Public
NB: be careful about sensitive data on cloud
platforms
Thank you!
http://bit.ly/2kYtGx4
Mia Ridge @mia_out
Digital Curator, British Library
CHASE Arts and Humanities in the Digital Age, February 2017
Introduction to information visualisation for humanities PhDs

More Related Content

PPTX
Beyond the Black Box: Data Visualisation
PPTX
Planning for big data (lessons from cultural heritage)
PPTX
Crowdsourcing and Cultural Heritage workshop
PPTX
Data visualisations as a gateway to programming
PDF
How do you know what you are looking for?
PPT
Visualization notes
PDF
New Forms of Collaboration in Humanities Research
PPTX
Choosy crowds and the machine age: challenges for the future of humanities cr...
Beyond the Black Box: Data Visualisation
Planning for big data (lessons from cultural heritage)
Crowdsourcing and Cultural Heritage workshop
Data visualisations as a gateway to programming
How do you know what you are looking for?
Visualization notes
New Forms of Collaboration in Humanities Research
Choosy crowds and the machine age: challenges for the future of humanities cr...

What's hot (20)

PPTX
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
PDF
Butterfly Hunt: On Collecting #mla14 Tweets (#mla15 #s398)
PDF
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
PDF
Intro to Data Vis for the Humanities nov 2013
PPTX
Forms of Innovation: Collaboration, Attribution, Access
PDF
Situation Dänemark
PDF
Nilges Making The Metadata Work NISO Virtual Conference Ebooks
PPTX
Gold rushwriterspresentation 2013
PDF
UKSG 2015 Mechanical curator and British Library labs
PDF
Paying for it
PPT
Research in the digital age - circa 2005
PDF
Privacy and libraries
PDF
Introduction to Semantic Web
PDF
Getting Intimate with Your Data - Working Our Way out of the Lab
PPT
Interactive Data Visualization
PDF
Museums and Digital Technologies
PPT
DPLA - an introduction for historians
PPT
Building Data-centric Media Organizations
PPT
Generous Interfaces - rich websites for digital collections
Chaos&Order: Using visualization as a means to
 explore large heritage collec...
Butterfly Hunt: On Collecting #mla14 Tweets (#mla15 #s398)
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
Intro to Data Vis for the Humanities nov 2013
Forms of Innovation: Collaboration, Attribution, Access
Situation Dänemark
Nilges Making The Metadata Work NISO Virtual Conference Ebooks
Gold rushwriterspresentation 2013
UKSG 2015 Mechanical curator and British Library labs
Paying for it
Research in the digital age - circa 2005
Privacy and libraries
Introduction to Semantic Web
Getting Intimate with Your Data - Working Our Way out of the Lab
Interactive Data Visualization
Museums and Digital Technologies
DPLA - an introduction for historians
Building Data-centric Media Organizations
Generous Interfaces - rich websites for digital collections
Ad

Similar to Introduction to information visualisation for humanities PhDs (20)

PDF
datavisualization-5thUnit.pdf
PPTX
Startupfest 2016: NOAH ILIINSKY (Amazon Web Services) - How to
PPTX
4 pillars of visualization & communication by Noah Iliinsky
PPTX
Measurecamp 7 Workshop: Data Visualisation
PPTX
Visual and interactive storytelling slides cmg 2015-final
PDF
Introduction to the FP7 CODE project @ BDBC
PDF
MPhil Lecture on Data Vis for Analysis
PPTX
Reference at the Metcalf 2018: Digging into data visualisation
PDF
Principles of data visualisation 2021
PDF
principlesofdatavisualisation2021-210407141546.pdf
PPTX
Idm unit i ppt (deleted 38112ace3a82cbb8fba22044606fd8dc)
PPTX
Principles of data visualisation 2020
PPTX
PDF
Introduction to Data Visualization
PDF
Cincinnati Tableau User Group Event #8 (Mapping)
PDF
Lecture 1
PPTX
DMDS Winter Workshop 2 Slides
PDF
Week_2_Lecture.pdf
PPT
Wikidata Introductory Workshop
PPTX
Accessible Next Level Visualizations
datavisualization-5thUnit.pdf
Startupfest 2016: NOAH ILIINSKY (Amazon Web Services) - How to
4 pillars of visualization & communication by Noah Iliinsky
Measurecamp 7 Workshop: Data Visualisation
Visual and interactive storytelling slides cmg 2015-final
Introduction to the FP7 CODE project @ BDBC
MPhil Lecture on Data Vis for Analysis
Reference at the Metcalf 2018: Digging into data visualisation
Principles of data visualisation 2021
principlesofdatavisualisation2021-210407141546.pdf
Idm unit i ppt (deleted 38112ace3a82cbb8fba22044606fd8dc)
Principles of data visualisation 2020
Introduction to Data Visualization
Cincinnati Tableau User Group Event #8 (Mapping)
Lecture 1
DMDS Winter Workshop 2 Slides
Week_2_Lecture.pdf
Wikidata Introductory Workshop
Accessible Next Level Visualizations
Ad

More from Mia (20)

PPTX
Living with Machines year two update
PPTX
Rethink research, illuminate history with the British Library
PPTX
Living with Machines: one year in
PPTX
Festival of Maintenance talk: Apps, microsites and collections online: innova...
PPTX
Operationalising AI at a national library
PPTX
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
PPTX
In search of the sweet spot: infrastructure at the intersection of cultural h...
PPTX
Living with Machines at The Past, Present and Future of Digital Scholarship w...
PPTX
Enabling digital scholarship through staff training: the British Library's ex...
PPTX
A modest proposal: crowdsourcing in cultural heritage benefits us all.
PPTX
Crowdsourcing at the British Library: lessons learnt and future directions
PPTX
Crowdsourcing 'In the Spotlight' at the British Library
PPTX
Crowdsourcing: the British Library experience
PPT
Chair's welcome, MCG's Museums+Tech 2017
PPTX
Historical thinking in crowdsourcing and citizen history projects
PPTX
Cross-sector collaboration for digital museum and library projects
PPTX
Connected heritage: How should Cultural Institutions Open and Connect Data?
PPTX
Wish upon a star: making crowdsourcing in cultural heritage a reality
PPTX
Doing Digital Research @ British Library
PPTX
Digitised Manuscripts and the British Library's new IIIF viewer
Living with Machines year two update
Rethink research, illuminate history with the British Library
Living with Machines: one year in
Festival of Maintenance talk: Apps, microsites and collections online: innova...
Operationalising AI at a national library
Hopes, dreams and reality: crowdsourcing and the democratisation of knowledge...
In search of the sweet spot: infrastructure at the intersection of cultural h...
Living with Machines at The Past, Present and Future of Digital Scholarship w...
Enabling digital scholarship through staff training: the British Library's ex...
A modest proposal: crowdsourcing in cultural heritage benefits us all.
Crowdsourcing at the British Library: lessons learnt and future directions
Crowdsourcing 'In the Spotlight' at the British Library
Crowdsourcing: the British Library experience
Chair's welcome, MCG's Museums+Tech 2017
Historical thinking in crowdsourcing and citizen history projects
Cross-sector collaboration for digital museum and library projects
Connected heritage: How should Cultural Institutions Open and Connect Data?
Wish upon a star: making crowdsourcing in cultural heritage a reality
Doing Digital Research @ British Library
Digitised Manuscripts and the British Library's new IIIF viewer

Recently uploaded (20)

PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
IMPACT OF LANDSLIDE.....................
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPT
Predictive modeling basics in data cleaning process
PPTX
Managing Community Partner Relationships
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Leprosy and NLEP programme community medicine
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
DOCX
Factor Analysis Word Document Presentation
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PDF
Introduction to the R Programming Language
PPTX
Introduction to Inferential Statistics.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
How to run a consulting project- client discovery
IBA_Chapter_11_Slides_Final_Accessible.pptx
IMPACT OF LANDSLIDE.....................
STERILIZATION AND DISINFECTION-1.ppthhhbx
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Predictive modeling basics in data cleaning process
Managing Community Partner Relationships
retention in jsjsksksksnbsndjddjdnFPD.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Leprosy and NLEP programme community medicine
Pilar Kemerdekaan dan Identi Bangsa.pptx
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Factor Analysis Word Document Presentation
Optimise Shopper Experiences with a Strong Data Estate.pdf
importance of Data-Visualization-in-Data-Science. for mba studnts
Introduction to the R Programming Language
Introduction to Inferential Statistics.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
How to run a consulting project- client discovery

Introduction to information visualisation for humanities PhDs