Open Data
in Special Collections Libraries;
or, How Can We Be Better Than Data Brokers?
Scott Ziegler
Louisiana State University Libraries
--
NISO Virtual Conference: Open Data Projects
June 13, 2018
Open Data
Open data is data that can be freely used, re-used and
redistributed by anyone - subject only, at most, to the
requirement to attribute and sharealike.
-Open Data Handbook
(http://opendatahandbook.org/guide/en/what-is-open-data/)
@ScottLZiegler 2
Examples of Open Data
Civic
● Philadelphia Open Data (https://www.opendataphilly.org/)
● Baton Rouge Open Data (https://data.brla.gov/)
Weather
● National Weather Service (https://www.weather.gov/)
● Louisiana Office of State Climatology (http://www.losc.lsu.edu/)
3@ScottLZiegler
Open Historic Data
As a subset of open data, open historic data is free for anyone to use for any
purpose and is created from historic material.
Specifically, I’ll be focusing on data created from historic material held within
special collections libraries.
4@ScottLZiegler
Digitization: From Page to Digital Facsimile
5
Dataset: From Facsimile to Computational Data
6@ScottLZiegler
Live Demo:
http://bit.ly/NISOdemo
(Everyone cross your fingers)
7
From Data to Product
Data Product
(This is the part we supply)
(Usually, lots of work needs to go here)
8@ScottLZiegler
We Don’t Open Everything
Cultural Sensitivity
Libraries and archives have
material that represents groups
in ways that are racist, sexist,
etc.
Privacy
Personally identifiable
information about living
individuals
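One way to picture the privacy check is as a screening step run before a dataset is released. The sketch below is a minimal illustration only: the `Record` fields, the 100-year window, and the function names are assumptions made for this example, not a description of any library's actual practice or policy.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical cutoff: anyone born within the last 100 years is treated
# as possibly living. A real policy would be set by the institution.
POSSIBLY_LIVING_WINDOW_YEARS = 100


@dataclass
class Record:
    name: str
    birth_year: Optional[int]  # None when the source gives no birth year
    death_year: Optional[int] = None


def may_describe_living_person(record: Record, current_year: int) -> bool:
    """Return True when the record might describe a living individual."""
    if record.death_year is not None:
        return False
    if record.birth_year is None:
        return True  # unknown: err on the side of withholding
    return current_year - record.birth_year < POSSIBLY_LIVING_WINDOW_YEARS


def split_for_release(
    records: List[Record], current_year: int
) -> Tuple[List[Record], List[Record]]:
    """Separate records that can be opened from those held back for review."""
    open_records = [r for r in records if not may_describe_living_person(r, current_year)]
    held_back = [r for r in records if may_describe_living_person(r, current_year)]
    return open_records, held_back
```

A screen like this only flags records for human review. It does not address cultural sensitivity, which needs the kind of consultation discussed later in the talk.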
9@ScottLZiegler
Meanwhile,
Out in the World
10
Meanwhile, Out in the World
Algorithms of Oppression
Safiya Noble
Automating Inequality
Virginia Eubanks
Weapons of Math Destruction
Cathy O’Neil
11@ScottLZiegler
Meanwhile, Out in the World
● September 2017: Equifax Breach (Reported)
● March 2018: Cambridge Analytica (Reported)
● April 2018: Mark Zuckerberg testifies before Congress
● May 2018: European Union implements new data collection regulations
● June 2018: Facebook gives data to telecom firms (Reported)
12@ScottLZiegler
Data Brokers
Collect information about individuals from a wide variety of sources
Package data to create a profile of a person
Sell this package to advertisers, credit agents, government entities
13@ScottLZiegler
So, Are We Better
Than Data Brokers?
14@ScottLZiegler
Intentions and Subjects
Our intentions are better
● Research, not personal profit
Our subject is different
● Individuals we deal with are often historical
15@ScottLZiegler
Intentions and Subjects
Intent:
Intent is not particularly important.
Outcomes and results are important.
- Safiya Noble, Algorithms of Oppression
Harm should be understood in wider terms than just individuals
16@ScottLZiegler
How Could We
Be Better?
17@ScottLZiegler
Taking Advantage of the Help Already Out There
Benefit from the expertise of others
● Bring the writings of humanities/social science to the development team
18@ScottLZiegler
Standardize the Practice of Asking For Help
Representation Officers
● Person in charge of investigating who is being represented in a digital project
● Research possible partners from that group/community
Tie this closely to the role of outreach/promotion.
● We want to act as though the people being described will be looking closely
at the description
19@ScottLZiegler
Standardize a Path for Feedback and Adjustments
Clarify why we did what we did
● “During the planning phase of this project, we worked with the following scholars and community
groups”
And how anyone can suggest we do it otherwise
● Perhaps a form and/or contact email address
Explain what the process looks like for considering changes
● Though we might not be able to accommodate every request for modification, these are the steps
we will take after we receive your comments
20@ScottLZiegler
This Is Going to Be Lots of Work
● It’s work to read books
● It’s work to apply these ideas to our day jobs
● It’s work to listen to criticism of our projects
● It’s work to try to get people to participate
And Also:
● It’s work to help us
● It’s work to explain things to us in a way that we’ll understand
21@ScottLZiegler
Thank You
Thanks for listening
Please reach out if you want to talk more
@ScottLZiegler
sziegler1@lsu.edu
22

Editor's Notes

  • #2: Talking about (1) open data in special collections: what this is and why we do it; and (2) what it means to work with data in the current social context, in which a shocking amount of data about us is gathered, packaged, and sold. After talking about one specific example of using open historic data to open new types of interaction with archival material, I’ll use the case of data brokers, people and organizations that collect and sell data, as a means for thinking about what we shouldn’t be doing with open historic data. I rely heavily on the work and thoughts of many people. I’ll argue that doing so is the only way to ensure that we’re better than data brokers.
  • #8: While my team and I were busy playing with all this data, a lot was happening out in the world.
  • #10: Beyond the legal constraints (HIPAA, COPPA, etc.) and traditional archival concerns of privacy, we’re also concerned about how our data is
  • #11: While my team and I were busy playing with all this data, a lot was happening out in the world.
  • #12: Significant scholarship on the misuses of data was released.
  • #13: And to complement the scholarship: “Oh shit, this affects us.” Also in April, CNBC reported that data analytics firm CubeYou had its app suspended from Facebook for sharing data with advertisers, suggesting that what Cambridge Analytica was doing is much more common than we knew. On Tuesday of this week, as I was putting this slide together, Mark Zuckerberg was testifying to Congress about the use of Facebook data.
  • #14: It’s in this context that I want to weigh what I do against the creepy business of data brokers, who gather personal information, repackage it, and sell it to anyone for any purpose. Cambridge Analytica doesn’t consider itself a data broker, I should mention. There are plenty of shades of creepiness, to be sure. I’m using the data broker example as a flash point against which to compare my own work. I use it in general terms to mean people who benefit from the data of others, by the representation of others. And what bothers me is the practice of representing other people and benefiting from this representation.
  • #16: Here’s one way of thinking about it. Basically, we’re not creating or sharing this data to make money, or to make anyone’s life harder. And the people that we’re describing are usually deceased, and probably cannot be harmed by our work.
  • #17: Dr. Noble writes, “Many people say to me, ‘But tech companies don’t mean to be racist; that’s not their intent.’ Intent is not particularly important. Outcomes and results are important.”
  • #19: I’ve never thought of myself as someone who doesn’t take the expertise of others seriously, but I need to face the fact that I haven’t always acted like I do. There have been many calls to combine technologists with social scientists and humanists. In my library, I work very closely with the development team, many of whom have a background in the humanities.
  • #20: Beyond bringing the writings of others to us, standardize the way that we ask outside scholars and community members for assistance. Build things with the assumption that they’ll be seen by the people being described in them.
  • #21: I’m not sure how well this works for everything, but a place to start is to be explicit, with any digital project we put out, that we understand that we’re describing people who are not here to speak for themselves. Make the process explicit: “We worked with the following scholars for advice and guidance.” When we are doing our best, there is no reason to be opaque or mysterious. Following Cathy O’Neil’s suggestions from her book Weapons of Math Destruction, I’d like to work on projects in which the description of people is updated as needed, the decisions are transparent, and the conclusions and assumptions are open to scrutiny.
  • #22: And this is just from our point of view. It’s always work to write the books and to continuously point out to us that we inadvertently harm people. It’s work to help us. It’s work to explain things to us in a way that we will understand. Without a way to pay these scholars for their expertise, I’m not sure how many will be willing to help. Identifying individuals based on their interests is the first step I have in mind. However, before anything else, a commitment is necessary from us: a commitment to read what we can and learn what we can; to bring the literature into the shop and struggle with the adoption of it in our everyday work.