SlideShare a Scribd company logo
Towards a Common
Deposit API
(the Dataverse example)
Elizabeth Quigley
http://www.iq.harvard.edu/people/elizabeth-quigley
Philip Durbin
http://www.iq.harvard.edu/people/philip-durbin
Data + Article Depositing Now
General Repositories Domain Specific Repositories Journals
Journal + Data Automation
No Automation Full Automation
List of
recommended
repositories - but
data not required
to be deposited
Policy requiring
data to be
deposited
Journal gets
notification
when data is
deposited in a
repository
Journal System
+ Repository
are connected
Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil Durbin
Full Automation Example: OJS + DVN 3.6 (2013)
SWORD support shipped in DVN 3.6 in late 2013 and
has been carried forward into Dataverse 4.0.
http://projects.iq.harvard.edu/ojs-dvn https://youtu.be/ftK1_IvWaVc
Introducing SWORD
Simple Web-service Offering Repository Deposit (SWORD)
● http://en.wikipedia.org/wiki/SWORD_(protocol)
● a "profile" of AtomPub (RFC 5023) from Google
● http://swordapp.org
● https://github.com/swordapp
● https://twitter.com/swordapp
SWORD: The Good Parts
● Well defined standard, based on other standards (AtomPub)
○ http://swordapp.github.io/SWORDv2-Profile/SWORDProfile.html
● Good support for publishing workflows
○ "collections" as containers (dataverses, etc.)
○ "In-Progress" HTTP header for "unpublished"
○ negotiation of packaging (SimpleZip, METS, BagIt)
○ deposit receipt
● Popular in scholarly publishing
○ http://swordapp.org/sword-v2/sword-v2-implementations/
● Server library in Java (used by Dataverse)
● Client libraries in Java, PHP, Python, and Ruby
SWORD: More Good Parts!
● Open Science Framework (OSF) integration
○ https://osf.io/getting-started/#dataverse
● Python package for Dataverse APIs
○ https://github.com/IQSS/dataverse-client-python
● R package for Dataverse APIs
○ https://github.com/ropensci/dvn
● (Future) Archivematica integration
○ http://www.rdc-drc.ca/the-rdc-federated-pilot-for-data-ingest-and-preservation/
● (Future) TOP SECRET integrations! :)
Challenges with SWORD: Metadata
● Lowest common denominator for metadata: dcterms
● http://dublincore.org/documents/dcmi-terms/
○ 15 properties in the "/elements/1.1/" namespace
■ contributor, coverage, creator, date, description, format,
identifier, language, publisher, relation, rights, source, subject,
title, type
○ 55 properties in the "/terms/" namespace
■ isReferencedBy et al.
● 154 metadata fields in base install of Dataverse 4.0 (more to come!)
○ http://guides.dataverse.org/en/4.0/user/appendix.html
[sword-app-tech] client SHOULD add Dublin Core terms to the Atom Entry, MAY add any other metadata formats or foreign
markup - http://www.mail-archive.com/sword-app-tech@lists.sourceforge.net/msg00384.html
Challenges with SWORD: Dev Activity
"I'm hoping, also, that there will be
more funding at this end at some point
for some more 'core' development, but
opportunities are currently vague, so I
don't think there's any point in holding out
for it. Instead it seems much better to try
to increase community engagement with
the code and try to sustain it that way."
-- Richard Jones, SWORD spec lead, in
the "code governance" thread on sword-
app-tech, July 2014
http://www.mail-archive.com/sword-app-tech@lists.
sourceforge.net/msg00400.html
GitHub updates in 2014, 2013
Blog updates in 2013, 2012
Both SWORD and a "Native" API
● Dataverse will continue to support SWORD
● Dataverse 4.0 added a new JSON-based "native" API
○ All metadata fields supported
○ Does more than Data Deposit
■ Search
■ Permissions
■ etc.
http://guides.dataverse.org/en/latest/api
A Common Deposit API?
● Use SWORD as is?
● Develop SWORD more?
● Define new Common API?
How to Get Involved
SWORD mailing list:
http://swordapp.org/contact/
Dataverse API Community Group:
http://community.dataverse.org/community-groups/api.html

More Related Content

PPT
W3C Linked Data Platform Overview
PPTX
REST meets Semantic Web
ODP
SWORD v2 at JISC Depo meeting
ODP
Sword v2 at UKCoRR
PPT
First steps towards publishing library data on the semantic web
PPTX
SWIB14 Weaving repository contents into the Semantic Web
PDF
30° Nexa Lunch Seminar - Linked Data Platform vs real world
PDF
LOD技術解説
W3C Linked Data Platform Overview
REST meets Semantic Web
SWORD v2 at JISC Depo meeting
Sword v2 at UKCoRR
First steps towards publishing library data on the semantic web
SWIB14 Weaving repository contents into the Semantic Web
30° Nexa Lunch Seminar - Linked Data Platform vs real world
LOD技術解説

What's hot (19)

PDF
Client-side storage
PDF
DBpedia as Gaeilge Chapter
PPTX
LDP4j: A framework for the development of interoperable read-write Linked Da...
PDF
Bio2RDF presentation at Combine 2012
PPT
Getting Started With The Talis Platform
PDF
Web of Data Usage Mining
PPT
Semantic HTML
PPT
Using HDF5 tools for performance tuning and troubleshooting
PPTX
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
PDF
IPWB and IPFS at WAC2017
PDF
Minerva: Drill Storage Plugin for IPFS
PPTX
2013 CrossRef Annual Meeting System Update Chuck Koscher
PPTX
Putting some Spark into HDF5
PPT
SPARQL Query Forms
PPTX
Geography in Linked Ancient World Data
PPTX
Grant Fritchey - Common problems in backup and recovery and how to handle them
PPT
Sword Bl 0903[1]
PDF
Tips for Fixing A Hacked WordPress Site - Vlad Lasky
PPTX
Introducing JDBC for SPARQL
Client-side storage
DBpedia as Gaeilge Chapter
LDP4j: A framework for the development of interoperable read-write Linked Da...
Bio2RDF presentation at Combine 2012
Getting Started With The Talis Platform
Web of Data Usage Mining
Semantic HTML
Using HDF5 tools for performance tuning and troubleshooting
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
IPWB and IPFS at WAC2017
Minerva: Drill Storage Plugin for IPFS
2013 CrossRef Annual Meeting System Update Chuck Koscher
Putting some Spark into HDF5
SPARQL Query Forms
Geography in Linked Ancient World Data
Grant Fritchey - Common problems in backup and recovery and how to handle them
Sword Bl 0903[1]
Tips for Fixing A Hacked WordPress Site - Vlad Lasky
Introducing JDBC for SPARQL
Ad

Viewers also liked (8)

PDF
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
PDF
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
PPTX
Research Data Curation _ Grad Humanities Class
PDF
Citing Data in Journal Articles using JATS by Deborah A. Lapeyre
PDF
Geospatial Data Visualization: WorldMap Integration by Raman Prasad
PDF
Sharing Data Through Plots with Plotly by Alex Johnson
PDF
Metadata & Data Curation Services by Thu-Mai Christian
PPTX
API economy
 
Data Analysis in Dataverse & Visualization of Datasets on Historical Maps by ...
TwoRavens: A Graphical, Browser-Based Statistical Interface for Data Reposito...
Research Data Curation _ Grad Humanities Class
Citing Data in Journal Articles using JATS by Deborah A. Lapeyre
Geospatial Data Visualization: WorldMap Integration by Raman Prasad
Sharing Data Through Plots with Plotly by Alex Johnson
Metadata & Data Curation Services by Thu-Mai Christian
API economy
 
Ad

Similar to Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil Durbin (20)

PPTX
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
PPTX
OpenAIRE and the Case of Irish Repositories
PPT
Sword Or2008 Julieallinson
PPTX
Learning W3C Linked Data Platform with examples
PPTX
JavaOne2013 Leveraging Linked Data and OSLC
PPTX
Application integration with the W3C Linked Data standards
PPT
Sword Crig 2007 12 06
PPT
Making Repository Easier With SWORD
PDF
WWW2014 Overview of W3C Linked Data Platform 20140410
PPTX
Better integrations through open interfaces
PDF
DevOps-Roadmap
PPT
Sword Cetis 2007 06 29
PPT
Sword Cetis 2007 06 29
PDF
Enterprise guide to building a Data Mesh
PPTX
Usage of Linked Data: Introduction and Application Scenarios
PDF
Lares from LOW to PWNED
PDF
Research software and Dataverse
PPTX
Ladies Be Architects - Integration - Multi-Org, Security, JSON, Backup & Restore
PPT
Exploring the Semantic Web
PPT
SWORD: The Story So Far
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the Case of Irish Repositories
Sword Or2008 Julieallinson
Learning W3C Linked Data Platform with examples
JavaOne2013 Leveraging Linked Data and OSLC
Application integration with the W3C Linked Data standards
Sword Crig 2007 12 06
Making Repository Easier With SWORD
WWW2014 Overview of W3C Linked Data Platform 20140410
Better integrations through open interfaces
DevOps-Roadmap
Sword Cetis 2007 06 29
Sword Cetis 2007 06 29
Enterprise guide to building a Data Mesh
Usage of Linked Data: Introduction and Application Scenarios
Lares from LOW to PWNED
Research software and Dataverse
Ladies Be Architects - Integration - Multi-Org, Security, JSON, Backup & Restore
Exploring the Semantic Web
SWORD: The Story So Far

More from datascienceiqss (19)

PDF
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
PDF
iRODS/Dataverse Project by Jonathan Crabtree
PDF
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
PDF
DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
PDF
Center for Open Science and the Open Science Framework: Dataverse Add-on by S...
PDF
MIT Libraries Dataverse by Katherine McNeill
PDF
The Project TIER Dataverse: Archiving and Sharing Replicable Student Research...
PDF
Dataverse in China: Internationalization, Curation and Promotion by Yin Shenqin
PPTX
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
PDF
American Journal of Political Science & The Odum Institute: Promoting Researc...
PDF
Political Analysis Dataverse by Jonathan N. Katz
PDF
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
PDF
Dataverse in the Universe of Data by Christine L. Borgman
PDF
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
PDF
Data Publishing Models by Sünje Dallmeier-Tiessen
PDF
Persistent Identifier Services and their Metadata by John Kunze
PDF
Data Citation Implementation Guidelines By Tim Clark
PDF
Contributing Code to Dataverse by Gustavo Durand
PDF
Dataverse 4.0 UX by Elizabeth Quigley
Big Data Repository for Structural Biology: Challenges and Opportunities by P...
iRODS/Dataverse Project by Jonathan Crabtree
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney
Center for Open Science and the Open Science Framework: Dataverse Add-on by S...
MIT Libraries Dataverse by Katherine McNeill
The Project TIER Dataverse: Archiving and Sharing Replicable Student Research...
Dataverse in China: Internationalization, Curation and Promotion by Yin Shenqin
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
American Journal of Political Science & The Odum Institute: Promoting Researc...
Political Analysis Dataverse by Jonathan N. Katz
Data in Brief and Dataverse: Incentivizing Authors to Share Data by Paige Sha...
Dataverse in the Universe of Data by Christine L. Borgman
Data FAIRport Skunkworks: Common Repository Access Via Meta-Metadata Descript...
Data Publishing Models by Sünje Dallmeier-Tiessen
Persistent Identifier Services and their Metadata by John Kunze
Data Citation Implementation Guidelines By Tim Clark
Contributing Code to Dataverse by Gustavo Durand
Dataverse 4.0 UX by Elizabeth Quigley

Recently uploaded (20)

PDF
System and Network Administration Chapter 2
PDF
medical staffing services at VALiNTRY
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
System and Network Administraation Chapter 3
PDF
Understanding Forklifts - TECH EHS Solution
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
top salesforce developer skills in 2025.pdf
PDF
Softaken Excel to vCard Converter Software.pdf
PPTX
Transform Your Business with a Software ERP System
PPTX
Online Work Permit System for Fast Permit Processing
PPTX
ISO 45001 Occupational Health and Safety Management System
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
Design an Analysis of Algorithms I-SECS-1021-03
System and Network Administration Chapter 2
medical staffing services at VALiNTRY
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
System and Network Administraation Chapter 3
Understanding Forklifts - TECH EHS Solution
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Odoo Companies in India – Driving Business Transformation.pdf
Adobe Illustrator 28.6 Crack My Vision of Vector Design
top salesforce developer skills in 2025.pdf
Softaken Excel to vCard Converter Software.pdf
Transform Your Business with a Software ERP System
Online Work Permit System for Fast Permit Processing
ISO 45001 Occupational Health and Safety Management System
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Design an Analysis of Algorithms II-SECS-1021-03
2025 Textile ERP Trends: SAP, Odoo & Oracle
Design an Analysis of Algorithms I-SECS-1021-03

Towards a common deposit api (the dataverse example) Elizabeth Quigley + Phil Durbin

  • 1. Towards a Common Deposit API (the Dataverse example) Elizabeth Quigley http://www.iq.harvard.edu/people/elizabeth-quigley Philip Durbin http://www.iq.harvard.edu/people/philip-durbin
  • 2. Data + Article Depositing Now General Repositories Domain Specific Repositories Journals
  • 3. Journal + Data Automation No Automation Full Automation List of recommended repositories - but data not required to be deposited Policy requiring data to be deposited Journal gets notification when data is deposited in a repository Journal System + Repository are connected
  • 5. Full Automation Example: OJS + DVN 3.6 (2013) SWORD support shipped in DVN 3.6 in late 2013 and has been carried forward into Dataverse 4.0. http://projects.iq.harvard.edu/ojs-dvn https://youtu.be/ftK1_IvWaVc
  • 6. Introducing SWORD Simple Web-service Offering Repository Deposit (SWORD) ● http://en.wikipedia.org/wiki/SWORD_(protocol) ● a "profile" of AtomPub (RFC 5023) from Google ● http://swordapp.org ● https://github.com/swordapp ● https://twitter.com/swordapp
  • 7. SWORD: The Good Parts ● Well defined standard, based on other standards (AtomPub) ○ http://swordapp.github.io/SWORDv2-Profile/SWORDProfile.html ● Good support for publishing workflows ○ "collections" as containers (dataverses, etc.) ○ "In-Progress" HTTP header for "unpublished" ○ negotiation of packaging (SimpleZip, METS, BagIt) ○ deposit receipt ● Popular in scholarly publishing ○ http://swordapp.org/sword-v2/sword-v2-implementations/ ● Server library in Java (used by Dataverse) ● Client libraries in Java, PHP, Python, and Ruby
  • 8. SWORD: More Good Parts! ● Open Science Framework (OSF) integration ○ https://osf.io/getting-started/#dataverse ● Python package for Dataverse APIs ○ https://github.com/IQSS/dataverse-client-python ● R package for Dataverse APIs ○ https://github.com/ropensci/dvn ● (Future) Archivematica integration ○ http://www.rdc-drc.ca/the-rdc-federated-pilot-for-data-ingest-and-preservation/ ● (Future) TOP SECRET integrations! :)
  • 9. Challenges with SWORD: Metadata ● Lowest common denominator for metadata: dcterms ● http://dublincore.org/documents/dcmi-terms/ ○ 15 properties in the "/elements/1.1/" namespace ■ contributor, coverage, creator, date, description, format, identifier, language, publisher, relation, rights, source, subject, title, type ○ 55 properties in the "/terms/" namespace ■ isReferencedBy et al. ● 154 metadata fields in base install of Dataverse 4.0 (more to come!) ○ http://guides.dataverse.org/en/4.0/user/appendix.html [sword-app-tech] client SHOULD add Dublin Core terms to the Atom Entry, MAY add any other metadata formats or foreign markup - http://www.mail-archive.com/sword-app-tech@lists.sourceforge.net/msg00384.html
  • 10. Challenges with SWORD: Dev Activity "I'm hoping, also, that there will be more funding at this end at some point for some more 'core' development, but opportunities are currently vague, so I don't think there's any point in holding out for it. Instead it seems much better to try to increase community engagement with the code and try to sustain it that way." -- Richard Jones, SWORD spec lead, in the "code governance" thread on sword- app-tech, July 2014 http://www.mail-archive.com/sword-app-tech@lists. sourceforge.net/msg00400.html GitHub updates in 2014, 2013 Blog updates in 2013, 2012
  • 11. Both SWORD and a "Native" API ● Dataverse will continue to support SWORD ● Dataverse 4.0 added a new JSON-based "native" API ○ All metadata fields supported ○ Does more than Data Deposit ■ Search ■ Permissions ■ etc. http://guides.dataverse.org/en/latest/api
  • 12. A Common Deposit API? ● Use SWORD as is? ● Develop SWORD more? ● Define new Common API? How to Get Involved SWORD mailing list: http://swordapp.org/contact/ Dataverse API Community Group: http://community.dataverse.org/community-groups/api.html