How is Data Made?
From Dataset Literacy to Data Infrastructure Literacy
30th June, Web Science 2015, University of Oxford
Jonathan Gray | jonathangray.org | @jwyg
Some thoughts on data literacy
beyond the dataset.
Not just reading and using datasets.
Thinking critically and constructively
about their contexts of production.
Bigger picture: role of data in society.
Not just literacies to read and use datasets,
but literacies to read and shape data infrastructures.
What is data literacy?
What is data?
A metaphor.
Data and photography.
Jonathan Gray (2012) “What Data Can and Cannot Do”. The Guardian. Available at: http://www.theguardian.com/news/datablog/2012/may/31/data-journalism-focused-critical
Early optimism about veracity
and fidelity of photography.
“We cannot conceive of a more impartial and truthful
witness than the sun, as its light stamps and seals the
similitude of the wound on the photograph put before
the jury; it would be more accurate than the memory of
witnesses, and as the object of all evidence is to show
truth, why should not this dumb witness show it?”
– Franklin v. State of Georgia, 69 Ga. 36; 1882 Ga
Critical literacy around photography.
Critical literacy to read images:
• How is the camera set up to take shots?
• What is captured and how?
• What is not captured?
• How does equipment mediate the image?
• Selection, framing, arrangement, post-production?
Instead of the camera, the elaborate sprawl
of public information systems.
Data infrastructures as
socio-technical systems.
What do they measure or capture, and how?
But datasets are not photographs.
Specificities of data infrastructures.
Datasets are heterogeneous.
Datasets are generated by a mixture of social and
technical processes, including e.g.:
• Laws and policies
• Administrative protocols
• Registration procedures
• Instruments and equipment
• Software systems
• Financial audits
• Feedback systems
• Management systems
• Metadata from digital services
• Standards bodies/standardisation procedures
Data literacy is not just about
knowing how to use data analysis software
or understanding statistics.
But also understanding methods, rationales,
assumptions, definitions, technologies, institutions,
through which datasets were generated.
Democratising the data revolution.
Not just liberalising access to the
informational by-products of public institutions.
But also bringing data infrastructures back
into the realm of democratic political life.
Recent examples.
1. Beneficial ownership advocacy.
2. “Statactivism” and counting the uncounted.
1. Beneficial ownership advocacy.
Gray, J. & Davies, T. (2015) “Fighting Phantom Firms in the UK: From Opening Up Datasets to
Reshaping Data Infrastructures?”. Available at SSRN: http://ssrn.com/abstract=2610937
In the case of campaigning around company ownership,
the disclosure of existing datasets was not enough.
Civil society organisations had to undertake a more
creative, sustained and holistic engagement with
shaping and influencing the development of data
infrastructures as socio-technical systems.
This included research and advocacy around:
• Costs, functionalities and user interfaces of
software systems that would run the register;
• Changes to primary and secondary legislation;
• Additional administrative requirements and their
impacts on different actors inside and outside the
public sector.
Campaigners had to look beyond the question
of what information is released, towards the
question of what information is collected and
generated by the public sector in the first place,
and how this information is generated through
data infrastructures.
2. “Statactivism” and counting the uncounted.
“Statactivism”
Bruno, I., Didier, E. and Vitale, T. (2014) “Statactivism: Forms of Action between Disclosure
and Affirmation”. Available at SSRN: http://ssrn.com/abstract=2466882
Not just blanket critique or withdrawal of
quantification and “metrification”.
Highlighting limitations of existing forms of
measurement and proposing alternatives.
For example, gender equality, climate
change, working conditions and health.
What should be measured and how?
What is not currently being measured?
Recent examples from data journalism.
New “action repertoires” for civil society actors
to shape data infrastructures.
To what extent do data infrastructures address
needs and interests of civil society actors?
How to broaden the publics that
shape data as well as the publics that use it?
Legal, social and technical measures for
making open data initiatives more
responsive to concerns of civil society?
ROUTE TO PA:
http://routetopa.eu
DEMOCRATISING PUBLIC INFORMATION:
FROM OPENING UP DATASETS TO RESHAPING DATA INFRASTRUCTURES?
JONATHAN GRAY, JULY 2015
Question of what is measured and how.
But also who uses information,
and how information acts.
From “information as resource” to
“information as agent”.
(Sandra Braman, Change of State,
MIT Press, 2009)
“Participatory data infrastructures”
In conclusion…
Going beyond focus on literacy with datasets,
towards literacy with data infrastructures
through which they are generated.
Role of data infrastructures in addressing
global challenges - from climate change
to tax base erosion.
Data infrastructures as crucial part of
democratic politics in 21st century.
Jonathan Gray | jonathangray.org | @jwyg
