Closing the gap: connection points
between DMPs and repositories
Sarah Jones, Sam Rust & Magdalena Drafiova
10 June 2019, Open Repositories conference, Hamburg
dmponline@dcc.ac.uk
Workshop aims and scope
• Share current activity on machine-actionable DMPs
• Discuss latest/forthcoming features in DMPonline
• Understand repository needs
• Map out how DMPs and repositories can connect
• Use workshop outcomes to inform DMProadmap
Agenda
Time Session Name
9:00 Aims and introductions Sarah, Sam & Magdalena
9:45 DMPs – moves to machine-actionability Sarah
10:15 Demo of DMPonline features Sam & Magdalena
10:30 Coffee
11:00 Discussion: mapping connections between
DMPs and repositories
All
12:10 Wrap-up discussion and next steps Sarah, Sam & Magdalena
https://tiny.cc/DMP-workshop
Let’s learn about you…
What does a DMP contain?
Info on project
Info on data
Data management
Plans for sharing
Grant number
Project name, description
PI details
…..
Types of data to be created
Description of data / metadata
Data volume
Data format
…..
Where data will be stored and backed-up
Project partners needing access
Ethical issues
…..
Which repository will data be deposited in
Who will have access / permission to reuse
Plans for data licensing
Length of preservation
…..
http://www.dcc.ac.uk/resources/data-management-plans/checklist
Why make DMPs machine-actionable ?
Promote information flow
across systems to support:
• Data discovery
• Capacity planning
• Aggregation/integration
• Policy compliance
• ….
From Flickr by highwaysengland, CC BY 2.0
Planning &
administration
Create, analyse,
manage data
Publishing & reuse
Identifiers:
What should connect? Priorities…
IDCC workshop
• Defining a ‘perfect DMP world’
• Map use cases and priorities
• Resulted in white paper on
recommendations
http://www.dcc.ac.uk/events/workshops
/postcard-future-tools-and-services-
perfect-dmp-world
maDMP priority areas
• Common standards and protocols
• Leveraging persistent identifiers (PIDs)
• Capacity planning (institutional & data centre)
• Share/publish/deposit DMPs
• Evaluation and monitoring
• Disciplinary tailoring and recommendations
https://riojournal.com/article/13086
Who is working on this?
Research Data Alliance (RDA) is a global organisation
supporting collaboration to promote data management and
open sharing
Active DMPs
Interest Group
Exposing DMPs Working Group
- develop use cases of what can be shared,
when and with whom
- provide a reference model and alternative
strategies for exposing DMPs
https://www.rd-alliance.org/groups/exposing-
data-management-plans-wg
DMP Common Standards
Working Group
- develop a common data model with a core
set of elements
- provide reference implementations of the
data model
https://www.rd-alliance.org/groups/dmp-
common-standards-wg
CC BY-SA 4.0
DMP Common Standards - Outputs
• Common data model for machine-actionable DMPs
• to model information from standard DMPs
• NOT a template
• NOT a questionnaire
• modular design
• core set of elements
• domain specific extensions
• Reference implementations
• ready to use models
• JSON, XML, RDF, etc.
• Guidelines for adoption of the common data model
• requirements for supporting systems
• pilot studies
www.rd-alliance.org - @resdatall
CC BY-SA 4.0
Example
• Current DMPs – model questionnaires
<administrative_data>
<question>Who will be the Principle Investigator?</question>
<answer>The PI will be John Smith from our university.</answer>
</administrative_data>
• Machine-actionable DMPs – model information
"dc:creator":[ {
"foaf:name":"John Smith",
"@id":"orcid.org/0000-1111-2222-3333",
"foaf:mbox":"mailto:jsmith@tuwien.ac.at",
"madmp:institution":" AT-Vienna-University-of-Technology"
} ],
www.rd-alliance.org - @resdatall
Principles
• Reuse existing standards,
e.g. Dublin Core, PREMIS
• Use PIDs whenever
possible, e.g. ORCID
• Use controlled
vocabularies
• Develop own concepts
and vocabularies only
when needed
CC BY-SA 4.0
Model in consultation: DCC plans to test
www.rd-alliance.org - @resdatall
https://www.lucidchart.com/invitations/accept/ee26bc71-01a6-442a-b946-
5b9c910fb926
Some recent activities in DMPonline
Activities to gather more structured information for reuse
• Connecting to the RDA Metadata Standards Directory to
allow standards to be selected
• Connecting to the OpenAIRE API to pull in grant details for
H2020 projects
• Supporting export to Zenodo and RIOjournal (in
development/test)
MSD added as an answer type
Tool pulls in information from the API to let users
browse and select metadata standards
Select standards and add comments
Utilising EC grant IDs in plans
• Harvest grant IDs from OpenAIRE API
• Provide look up when entering project details
• Enables join up of DMP with other outputs
https://www.youtube.com/
watch?v=qYQf9viGTHE
Easing DMP publishing workflows
• Integration with Zenodo
• DMPonline provides an export to Zenodo option
• User signs-in to Zenodo
• DMP uploads in PDF version and pre-populates metadata
• Integration with RIOjournal
• User registers for ARPHA writing tool and obtains API key
• DMPonline stores this API key and provides a button to allow
export of DMP to ARPHA writing tool
• DMP import complies with pre-defined DMP article template in
RIO
Full text API
Request from user group to expand API to allow full-
text to be harvested
Potentially to increase functionality – link into piloting
of RDA Common standard for DMPs
Benefits and guidelines on using the API -
http://www.dcc.ac.uk/blog/sam-rust-explains-
benefits-api-dmponline
Questions
Exercise
1. What information held in a DMP could be useful to a repository?
• List all possible examples
• Be precise about the details needed
2. How could the information be used?
• Define use cases e.g. capacity planning, deposit workflow,
recommendations…
3. Map out the workflow of information exchange
• what is needed?
• when – once, at intervals?
• how to transfer info?
• what format is it needed in?
Take 15-20 mins per section. Agree on details to report back.
What next?
1. User groups
- March 2019 in Amsterdam
- April 2019 in Manchester
- Read more here too: http://www.dcc.ac.uk/blog/dmponline-user-group-where-next
- Prioritising requests from the users (full text API, notification one per review, case search sensitivity…)
2. Development plans - https://github.com/DMPRoadmap/roadmap/projects/5
- Conditional questions (#1722)
- Adding school/department field
- Statistics for the administrators
3. Newsletter
What next?
3. Newsletter
Subscribe: https://dcc.us1.list-manage.com/subscribe?u=dfcae3470db3a4ac2196a3671&id=c4710c28b8
Most recent issue: https://mailchi.mp/8c11dbd6093a/dmponline-newsletter-may-2019
- Latest updates (summary from virtual drop in sessions, news …)
- Software updates (latest releases…)
- How to videos (e.g. https://www.youtube.com/watch?v=qYQf9viGTHE)
- Knowledge exchange
- Days to remember (monthly virtual drop in sessions, user groups,
conferences…)
Connect & stay in touch
Helpdesk
Twitter
Blog
User group
Github
Slack
DMPonline@dcc.ac.uk
@DMPonline and #ActiveDMPs
http://www.dcc.ac.uk/news/DMPonline
http://tiny.cc/DMPonline-user-group
https://github.com/DMPRoadmap
http://tiny.cc/DMPRoadmap-slack
INTERNATIONAL
DIGITAL
CURATION
CONFERENCE
Collective
Curation: the many
hands that make
data work
#idcc20
www.dcc.ac.uk/events/idcc20
Organised by
www.dcc.ac.uk www.dri.ie
2020
17 – 20 February 2020
Dublin, Ireland

More Related Content

PDF
Making DMPs actionable and public
PDF
Simms-fsci-madmps-2017
PPTX
Active actionable DMPs
PPTX
International approached to DMPs - Sarah Jones
PPTX
What you need to know about DMPs
PPTX
IDCC Presentation on the Future of Data Management Planning, Feb 2016
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
PPTX
DMPonline: what’s news & what’s coming
Making DMPs actionable and public
Simms-fsci-madmps-2017
Active actionable DMPs
International approached to DMPs - Sarah Jones
What you need to know about DMPs
IDCC Presentation on the Future of Data Management Planning, Feb 2016
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
DMPonline: what’s news & what’s coming

Similar to Connecting DMPs & Repositories (20)

PPTX
DMPOnline by Sarah Jones
PPTX
A landscape survey of Active DMPs
PDF
Simms DataONE webinar 13 Mar 18
PPT
DMP Online: update 2013
PDF
DMPTool Webinar 11: Complementary Tools
PPTX
RDA Active DMP session
PPTX
All an NCP should know about DMPs, but didn't have the time to ask
PDF
Machine-actionable Data Management Plans
PPTX
Dmponline demo 21-04-2015
PPTX
DMP lessons from Europe
PPTX
Australasian dmp interest group international involvement-Kathryn Unsworth
PDF
Workflows for machine-actionable Research Data Management Planning
PPTX
Writing a successful data management plan with the DMPTool
PPTX
Intro to Data Management Plans
PDF
Data Management Plans 2.0: A Hub of Information to Facilitate Research
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
PPTX
DMPTuuli - what's new?
PPTX
DMPonline new directions
PPTX
What is a DMP
PPT
DMP EUDAT
DMPOnline by Sarah Jones
A landscape survey of Active DMPs
Simms DataONE webinar 13 Mar 18
DMP Online: update 2013
DMPTool Webinar 11: Complementary Tools
RDA Active DMP session
All an NCP should know about DMPs, but didn't have the time to ask
Machine-actionable Data Management Plans
Dmponline demo 21-04-2015
DMP lessons from Europe
Australasian dmp interest group international involvement-Kathryn Unsworth
Workflows for machine-actionable Research Data Management Planning
Writing a successful data management plan with the DMPTool
Intro to Data Management Plans
Data Management Plans 2.0: A Hub of Information to Facilitate Research
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
DMPTuuli - what's new?
DMPonline new directions
What is a DMP
DMP EUDAT
Ad

More from Sarah Jones (20)

PPTX
Data training tips and tricks
PPTX
EOSC and libraries
PPTX
EOSC Association priorities and activities
PPTX
Managing and sharing data: lessons from the European context
PPTX
Reflections on Open Science
PPTX
MAR comments analysis
PPTX
Introduction to Open Science and EOSC
PPTX
EOSC-MAR-update.pptx
PPTX
Intro-EOSC.pptx
PPTX
Why is EOSC so hard?
PPTX
The future of FAIR
PPTX
Data Management Planning for researchers
PPTX
Is Europe ready for Open Science
PPTX
DMPonline: 10 years, 10 lessons
PPTX
Do & don't of supporting Open Science
PPTX
Why institutions need to raise their capabilities to support FAIR
PPTX
It takes more than a village: lessons on building global research commons
PPTX
DCC and FAIR initiatives
PPTX
Intro to RDM
PPTX
Reflections on EOSC through the mirror of ARDC
Data training tips and tricks
EOSC and libraries
EOSC Association priorities and activities
Managing and sharing data: lessons from the European context
Reflections on Open Science
MAR comments analysis
Introduction to Open Science and EOSC
EOSC-MAR-update.pptx
Intro-EOSC.pptx
Why is EOSC so hard?
The future of FAIR
Data Management Planning for researchers
Is Europe ready for Open Science
DMPonline: 10 years, 10 lessons
Do & don't of supporting Open Science
Why institutions need to raise their capabilities to support FAIR
It takes more than a village: lessons on building global research commons
DCC and FAIR initiatives
Intro to RDM
Reflections on EOSC through the mirror of ARDC
Ad

Recently uploaded (20)

PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PPTX
Chapter 5: Probability Theory and Statistics
PDF
A proposed approach for plagiarism detection in Myanmar Unicode text
PDF
Flame analysis and combustion estimation using large language and vision assi...
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
Configure Apache Mutual Authentication
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
CloudStack 4.21: First Look Webinar slides
PDF
OpenACC and Open Hackathons Monthly Highlights July 2025
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PDF
The influence of sentiment analysis in enhancing early warning system model f...
PDF
Consumable AI The What, Why & How for Small Teams.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
Hindi spoken digit analysis for native and non-native speakers
PPT
What is a Computer? Input Devices /output devices
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
Chapter 5: Probability Theory and Statistics
A proposed approach for plagiarism detection in Myanmar Unicode text
Flame analysis and combustion estimation using large language and vision assi...
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
A contest of sentiment analysis: k-nearest neighbor versus neural network
Configure Apache Mutual Authentication
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
CloudStack 4.21: First Look Webinar slides
OpenACC and Open Hackathons Monthly Highlights July 2025
Taming the Chaos: How to Turn Unstructured Data into Decisions
The influence of sentiment analysis in enhancing early warning system model f...
Consumable AI The What, Why & How for Small Teams.pdf
Zenith AI: Advanced Artificial Intelligence
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
Hindi spoken digit analysis for native and non-native speakers
What is a Computer? Input Devices /output devices

Connecting DMPs & Repositories

  • 1. Closing the gap: connection points between DMPs and repositories Sarah Jones, Sam Rust & Magdalena Drafiova 10 June 2019, Open Repositories conference, Hamburg dmponline@dcc.ac.uk
  • 2. Workshop aims and scope • Share current activity on machine-actionable DMPs • Discuss latest/forthcoming features in DMPonline • Understand repository needs • Map out how DMPs and repositories can connect • Use workshop outcomes to inform DMProadmap
  • 3. Agenda Time Session Name 9:00 Aims and introductions Sarah, Sam & Magdalena 9:45 DMPs – moves to machine-actionability Sarah 10:15 Demo of DMPonline features Sam & Magdalena 10:30 Coffee 11:00 Discussion: mapping connections between DMPs and repositories All 12:10 Wrap-up discussion and next steps Sarah, Sam & Magdalena https://tiny.cc/DMP-workshop
  • 5. What does a DMP contain? Info on project Info on data Data management Plans for sharing Grant number Project name, description PI details ….. Types of data to be created Description of data / metadata Data volume Data format ….. Where data will be stored and backed-up Project partners needing access Ethical issues ….. Which repository will data be deposited in Who will have access / permission to reuse Plans for data licensing Length of preservation ….. http://www.dcc.ac.uk/resources/data-management-plans/checklist
  • 6. Why make DMPs machine-actionable ? Promote information flow across systems to support: • Data discovery • Capacity planning • Aggregation/integration • Policy compliance • …. From Flickr by highwaysengland, CC BY 2.0
  • 7. Planning & administration Create, analyse, manage data Publishing & reuse Identifiers: What should connect? Priorities…
  • 8. IDCC workshop • Defining a ‘perfect DMP world’ • Map use cases and priorities • Resulted in white paper on recommendations http://www.dcc.ac.uk/events/workshops /postcard-future-tools-and-services- perfect-dmp-world
  • 9. maDMP priority areas • Common standards and protocols • Leveraging persistent identifiers (PIDs) • Capacity planning (institutional & data centre) • Share/publish/deposit DMPs • Evaluation and monitoring • Disciplinary tailoring and recommendations https://riojournal.com/article/13086
  • 10. Who is working on this? Research Data Alliance (RDA) is a global organisation supporting collaboration to promote data management and open sharing Active DMPs Interest Group Exposing DMPs Working Group - develop use cases of what can be shared, when and with whom - provide a reference model and alternative strategies for exposing DMPs https://www.rd-alliance.org/groups/exposing- data-management-plans-wg DMP Common Standards Working Group - develop a common data model with a core set of elements - provide reference implementations of the data model https://www.rd-alliance.org/groups/dmp- common-standards-wg
  • 11. CC BY-SA 4.0 DMP Common Standards - Outputs • Common data model for machine-actionable DMPs • to model information from standard DMPs • NOT a template • NOT a questionnaire • modular design • core set of elements • domain specific extensions • Reference implementations • ready to use models • JSON, XML, RDF, etc. • Guidelines for adoption of the common data model • requirements for supporting systems • pilot studies www.rd-alliance.org - @resdatall
  • 12. CC BY-SA 4.0 Example • Current DMPs – model questionnaires <administrative_data> <question>Who will be the Principle Investigator?</question> <answer>The PI will be John Smith from our university.</answer> </administrative_data> • Machine-actionable DMPs – model information "dc:creator":[ { "foaf:name":"John Smith", "@id":"orcid.org/0000-1111-2222-3333", "foaf:mbox":"mailto:jsmith@tuwien.ac.at", "madmp:institution":" AT-Vienna-University-of-Technology" } ], www.rd-alliance.org - @resdatall Principles • Reuse existing standards, e.g. Dublin Core, PREMIS • Use PIDs whenever possible, e.g. ORCID • Use controlled vocabularies • Develop own concepts and vocabularies only when needed
  • 13. CC BY-SA 4.0 Model in consultation: DCC plans to test www.rd-alliance.org - @resdatall https://www.lucidchart.com/invitations/accept/ee26bc71-01a6-442a-b946- 5b9c910fb926
  • 14. Some recent activities in DMPonline Activities to gather more structured information for reuse • Connecting to the RDA Metadata Standards Directory to allow standards to be selected • Connecting to the OpenAIRE API to pull in grant details for H2020 projects • Supporting export to Zenodo and RIOjournal (in development/test)
  • 15. MSD added as an answer type Tool pulls in information from the API to let users browse and select metadata standards
  • 16. Select standards and add comments
  • 17. Utilising EC grant IDs in plans • Harvest grant IDs from OpenAIRE API • Provide look up when entering project details • Enables join up of DMP with other outputs https://www.youtube.com/ watch?v=qYQf9viGTHE
  • 18. Easing DMP publishing workflows • Integration with Zenodo • DMPonline provides an export to Zenodo option • User signs-in to Zenodo • DMP uploads in PDF version and pre-populates metadata • Integration with RIOjournal • User registers for ARPHA writing tool and obtains API key • DMPonline stores this API key and provides a button to allow export of DMP to ARPHA writing tool • DMP import complies with pre-defined DMP article template in RIO
  • 19. Full text API Request from user group to expand API to allow full- text to be harvested Potentially to increase functionality – link into piloting of RDA Common standard for DMPs Benefits and guidelines on using the API - http://www.dcc.ac.uk/blog/sam-rust-explains- benefits-api-dmponline
  • 21. Exercise 1. What information held in a DMP could be useful to a repository? • List all possible examples • Be precise about the details needed 2. How could the information be used? • Define use cases e.g. capacity planning, deposit workflow, recommendations… 3. Map out the workflow of information exchange • what is needed? • when – once, at intervals? • how to transfer info? • what format is it needed in? Take 15-20 mins per section. Agree on details to report back.
  • 22. What next? 1. User groups - March 2019 in Amsterdam - April 2019 in Manchester - Read more here too: http://www.dcc.ac.uk/blog/dmponline-user-group-where-next - Prioritising requests from the users (full text API, notification one per review, case search sensitivity…) 2. Development plans - https://github.com/DMPRoadmap/roadmap/projects/5 - Conditional questions (#1722) - Adding school/department field - Statistics for the administrators 3. Newsletter
  • 23. What next? 3. Newsletter Subscribe: https://dcc.us1.list-manage.com/subscribe?u=dfcae3470db3a4ac2196a3671&id=c4710c28b8 Most recent issue: https://mailchi.mp/8c11dbd6093a/dmponline-newsletter-may-2019 - Latest updates (summary from virtual drop in sessions, news …) - Software updates (latest releases…) - How to videos (e.g. https://www.youtube.com/watch?v=qYQf9viGTHE) - Knowledge exchange - Days to remember (monthly virtual drop in sessions, user groups, conferences…)
  • 24. Connect & stay in touch Helpdesk Twitter Blog User group Github Slack DMPonline@dcc.ac.uk @DMPonline and #ActiveDMPs http://www.dcc.ac.uk/news/DMPonline http://tiny.cc/DMPonline-user-group https://github.com/DMPRoadmap http://tiny.cc/DMPRoadmap-slack
  • 25. INTERNATIONAL DIGITAL CURATION CONFERENCE Collective Curation: the many hands that make data work #idcc20 www.dcc.ac.uk/events/idcc20 Organised by www.dcc.ac.uk www.dri.ie 2020 17 – 20 February 2020 Dublin, Ireland