Getting Digital Preservation Data Out Of Wikidata
Katherine Thornton and Kenneth Seals-Nutt
18 September 2019
iPres 2019
WikiDP is Part of the EaaSI Program of Work
Data Related to Presentation
• Links embedded in slides
• Links are available in the paper
Wikidata as a Data Garden
• When we plant our data in Wikidata, we can watch it grow and harvest the results.
Planting Data Seeds
• Creating new items
• Adding statements to items
Many Gardeners Collaborating
• Other gardeners will also add statements to items
• Multilingual support so gardeners can collaborate in 300 human languages
• Other gardeners plant and tend in additional domains
Data Harvest
• SPARQL queries are a way to request a specific harvest basket
• Produce from this virtual garden can be harvested at any time, in any amount, by anyone who has access to the internet
• Reuse and remix
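As a rough illustration of requesting a "harvest basket", the sketch below builds a request URL for the public Wikidata Query Service SPARQL endpoint and shows how the results could be fetched. The helper names, the User-Agent string, and the example query are illustrative and do not come from the slides; the item ID Q235557 is assumed to be "file format".

```python
import json
import urllib.parse
import urllib.request

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def build_query_url(sparql: str) -> str:
    """Encode a SPARQL query as a GET URL that asks for JSON results."""
    params = urllib.parse.urlencode({"query": sparql, "format": "json"})
    return f"{WDQS_ENDPOINT}?{params}"


def run_query(sparql: str) -> list:
    """Send the query and return the list of result bindings (needs network)."""
    request = urllib.request.Request(
        build_query_url(sparql),
        # Hypothetical User-Agent; replace with one that identifies your tool.
        headers={"User-Agent": "data-garden-example/0.1 (illustrative)"},
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        payload = json.load(response)
    return payload["results"]["bindings"]


# Assumption: Q235557 is the Wikidata item for "file format".
EXAMPLE_QUERY = "SELECT ?item WHERE { ?item wdt:P31 wd:Q235557 } LIMIT 5"
url = build_query_url(EXAMPLE_QUERY)
print(url)
```

Calling `run_query(EXAMPLE_QUERY)` would perform the actual harvest; it is left uncalled here so the sketch runs without network access.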
Return all software titles known to read .dxf files
Figure 1: Try this query!
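The query behind Figure 1 is not reproduced in this transcript. A minimal sketch of what such a query might look like follows, assuming that P1195 is the "file extension" property, that P1072 is the "readable file format" property, and that extensions are stored without the leading dot. The helper `readers_query` is a hypothetical name, not something from the presentation.

```python
def readers_query(extension: str) -> str:
    """Return a SPARQL query for software that reads files with `extension`."""
    # Normalise ".dxf" to "dxf"; reject anything that is not plain alphanumerics
    # so the value can be embedded safely in the query string.
    ext = extension.lstrip(".").lower()
    if not ext.isalnum():
        raise ValueError(f"unexpected file extension: {extension!r}")
    return f"""
SELECT DISTINCT ?software ?softwareLabel WHERE {{
  ?format wdt:P1195 "{ext}" .        # assumed: P1195 is "file extension"
  ?software wdt:P1072 ?format .      # assumed: P1072 is "readable file format"
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en" . }}
}}
"""


DXF_QUERY = readers_query(".dxf")
print(DXF_QUERY)
```

The printed text can be pasted into the Wikidata Query Service editor; the property IDs should be checked against Wikidata before relying on the results.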
File formats used for 3D graphics
Figure 2: Try this query!
Sequence alignment software with date of publication, programming language and license
Figure 3: Try this query!
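The query in Figure 3 is likewise not included in this transcript. The sketch below shows one way a query of this shape could be assembled, using OPTIONAL clauses so that software with a missing date, language, or license still appears. It assumes P577 is "publication date", P277 is "programmed in", and P275 is the license property. The class item is passed in as a parameter because the item ID for sequence alignment software is not given in the slides; Q7397 (software) is used below only as a broad stand-in.

```python
def software_details_query(class_qid: str) -> str:
    """Return a SPARQL query listing software of a class with optional details."""
    if not (class_qid.startswith("Q") and class_qid[1:].isdigit()):
        raise ValueError(f"not a Wikidata item ID: {class_qid!r}")
    return f"""
SELECT ?software ?softwareLabel ?published ?languageLabel ?licenseLabel WHERE {{
  ?software wdt:P31/wdt:P279* wd:{class_qid} .     # instance of the class or a subclass
  OPTIONAL {{ ?software wdt:P577 ?published . }}   # assumed: publication date
  OPTIONAL {{ ?software wdt:P277 ?language . }}    # assumed: programmed in
  OPTIONAL {{ ?software wdt:P275 ?license . }}     # assumed: license
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en" . }}
}}
"""


# Stand-in class: Q7397 (software). Substitute the item for the class of interest.
QUERY = software_details_query("Q7397")
print(QUERY)
```

Keeping the three detail properties optional trades completeness of each row for coverage of more software titles.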
File formats to which the defining ISO standard has been linked
Figure 4: Try this query!
File format signature of the .stl file format
Figure 5: Try this query!
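Queries like those in Figures 1 through 5 can return results in the standard SPARQL JSON results format, where each row is a set of variable bindings. The sketch below flattens such a payload into plain rows for reuse. The function name and the sample payload are illustrative placeholders, not real Wikidata output.

```python
def flatten_bindings(payload: dict) -> list:
    """Turn a SPARQL JSON results payload into a list of {variable: value} rows."""
    variables = payload["head"]["vars"]
    rows = []
    for binding in payload["results"]["bindings"]:
        # Unbound variables (for example from OPTIONAL clauses) become None.
        rows.append(
            {v: binding[v]["value"] if v in binding else None for v in variables}
        )
    return rows


# Made-up placeholder payload in the SPARQL JSON results shape.
SAMPLE = {
    "head": {"vars": ["format", "formatLabel", "signature"]},
    "results": {
        "bindings": [
            {
                "format": {"type": "uri", "value": "http://example.org/entity/FORMAT"},
                "formatLabel": {"type": "literal", "value": "placeholder format"},
            }
        ]
    },
}

rows = flatten_bindings(SAMPLE)
print(rows)
```

Rows in this form can be written to CSV or loaded into other tools, which is one route to the "reuse and remix" mentioned earlier.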
WikiDP Demo
Thank you!
• Images via Wikimedia Commons, credits:
• Terraced Rice Paddies at Khau Pha Pass, by VŨ HÙNG
• Garden House Brighton, by peganum
• Pond Apple Seeds, by Filo gèn’
• Untitled, by Milada Vigerova
• Basket of Vegetables, by Markus Spiske
Thank you to our EaaSI Sponsors