SlideShare a Scribd company logo
Research data spring
Filling the Digital Preservation Gap10/12/2015
Investigating Archivematica to preserve research
data for the longer term …because digital
preservation won’t just go away
Team
2
University of Hull:
• Chris Awre
• Richard Green
• Simon Wilson
University of York:
• Julie Allinson
• Jen Mitcham
Project aim
3
“…to investigate
Archivematica and explore
how it might be used to
provide digital preservation
functionality within a wider
infrastructure for Research
Data Management.”
Why?
4
....because we believe that digital preservation
should be a key element of the infrastructure for
managing research data for the long term
(From RDMSS PQQ)
“...preservation actions should ensure that data
remains authentic, reliable, and usable while
maintaining integrity.”
Why Archivematica?
5
....because it is open source,
standards compliant, flexible
and customisable and packages
a range of preservation tools
together
...if you want to know
more you can read our
phase 1 report
Collaborators and partners
6
The Archivematica development model
7
Artefactual develop
Archivematica
Archivematica
released
as open source
Community of users
identifies enhancements
Enhancements
sponsored
by one or more users
RDS /
Research data
UK users
COPPUL
Progress in phase 2
»Planning our own local
implementations
»Hull
»York
»Considering above campus
option for Archivematica
»Liaising with other projects
»Phase 2 Report: now available!
8
Progress in phase 2
»Enhancing Archivematica:
»DIP regeneration
»METS parsing
»Generic search API
»Choice of checksum
»Pronom integration
»Documentation
9
Progress in phase 2
Not all of the work we
have sponsored is ‘visual’
but much of it is
fundamental to the
future development of
Archivematica. Our work
is enabling
10
“The Jisc work has helped to
modernise some of the
internal infrastructure of
Archivematica”
Sarah Romkey, Artefactual Systems,
8th December 2015
Spreading the word
11
Spreading the word
12
Impact and demand
13
Impact and demand
14
Yes….sounds like a
pragmatic solution
Yes! Low down learning
curve and Archivematica
sounds just the ticket :-)
Possible but too
early to say
Do you see Archivematica as a possible digital preservation
solution for your institution? Why?
Yes - University Archivist is an
advocate and want to link in
collaboratively with the
institution's RDM developments
Possibly if it can
integrate with Pure...
Yes
Impact and demand
15
Sustainability
»All developments funded in phase 2 will be
incorporated into the main code base to be
supported for the long term by Artefactual
–look out for these in version 1.6 (due Spring 2016)
»There are already plans to build on some of the
work we have funded
–for example AIP re-ingest work from Zuse Institute
–...and more...see phase 2 report
16
Next phase
» Implement our local proof of concepts at Hull and York
» Outreach
» Paper at IDCC conference
» Presentation at UK Archivematica group meeting
» Poster at Open Repositories conference
» Poster at UK Archives Discovery Forum
» more blogs
» end of project event to disseminate our case studies
» Phase 3 project report (with assessment of success of
PoCs)
17
What we will spend the money on
»Managing and funding our own internal development work
» 2 weeks support from Artefactual Systems
» 4 new research data file signatures from The National
Archives (and further engagement on generic process)
» Outreach (conference fees, travel etc)
» Putting on our own dissemination event
18
Working for other repositories
» Archivematica -> repository
› Our model: Archivematica -> Fedora/Hydra
› Unpack a DIP and create Fedora objects
– Similar model for EPrints/DSpace?
› Could just store the DIP, but this limits access options
» Repository -> Archivematica
› Push content to Archivematica from a repository for
dark archiving
› Possible via DSpace, planned for Fedora/Hydra at Hull
19

More Related Content

PPTX
Research data spring - Jisc Digital Festival 2015
PPTX
Business cases and costs RDN
PPT
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
PPTX
Application of Assent in the safe - Networkshop44
PPTX
Opening up data – Jisc and CNI conference 10 July 2014
PPTX
Text mining and machine learning
PPTX
Closing plenary - John Wilkin and David Maguire
PPTX
RDA UK
Research data spring - Jisc Digital Festival 2015
Business cases and costs RDN
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
Application of Assent in the safe - Networkshop44
Opening up data – Jisc and CNI conference 10 July 2014
Text mining and machine learning
Closing plenary - John Wilkin and David Maguire
RDA UK

What's hot (20)

PPTX
Research data management and Cambridge and our motivations for the Pilot
PPTX
Show me the money - the long path to a sustainable RDM Facility
PPTX
What I wish I’d known at the start!
PPTX
Introduction to data and text mining - Jisc Digifest 2016
PPTX
Uncovering research - what's the standard - Jisc Digital Festival 2015
PDF
Are we failing users? Can open approaches meet their needs? - Maura Marx
PPTX
Repository and preservation systems
PPTX
Research at risk: developing a shared research data management service for UK...
PPTX
Presenting RISE
PPTX
Collaboration through technology: moving from possibility to practice - Marti...
PPTX
Making the most of digital resources - Penny Robertson, Neil Stapleton and Cl...
PPTX
How OA compliant is your institution - Jisc Digifest 2016
PPTX
Lightning Talk - Angela Dappart
PPTX
Kit-Catalogue - Discovering the Value of Equipment Sharing - Universities UK ...
PPTX
Helping you shape infrastructure to implement open access efficiently
PPTX
Archivematica for research data
PPTX
Towards a frictionless data future
PPTX
HESA data, describing research activity and #REF2021
PPTX
Recognising data sharing
PPTX
DMPOnline by Sarah Jones
Research data management and Cambridge and our motivations for the Pilot
Show me the money - the long path to a sustainable RDM Facility
What I wish I’d known at the start!
Introduction to data and text mining - Jisc Digifest 2016
Uncovering research - what's the standard - Jisc Digital Festival 2015
Are we failing users? Can open approaches meet their needs? - Maura Marx
Repository and preservation systems
Research at risk: developing a shared research data management service for UK...
Presenting RISE
Collaboration through technology: moving from possibility to practice - Marti...
Making the most of digital resources - Penny Robertson, Neil Stapleton and Cl...
How OA compliant is your institution - Jisc Digifest 2016
Lightning Talk - Angela Dappart
Kit-Catalogue - Discovering the Value of Equipment Sharing - Universities UK ...
Helping you shape infrastructure to implement open access efficiently
Archivematica for research data
Towards a frictionless data future
HESA data, describing research activity and #REF2021
Recognising data sharing
DMPOnline by Sarah Jones
Ad

Viewers also liked (17)

PPTX
Research data spring: DataVault
PDF
Artivity phase 3 pitch
PPTX
Research data spring: extending the OPD to cover RDM
PDF
Research data spring: giving researchers credit for their data
PPTX
Research data spring: clipper
PDF
Research data spring: streamlining deposit
PPTX
Using Archivemedia to preserve research data
PPTX
National Monographs Strategy - Project Overview
PPTX
The way forward together
PPTX
How have we done?
PPTX
Coming soon
PPTX
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
PPTX
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
PPTX
Stakeholder forum 2015 - Annual review - Martyn Harrow
PPTX
Stakeholder forum 2015 - The way forward together - Phil Richards
PPTX
Stakeholder forum 2015 - 2015 and beyond - Martyn Harrow
PPT
Mining and mapping places with multiple names
Research data spring: DataVault
Artivity phase 3 pitch
Research data spring: extending the OPD to cover RDM
Research data spring: giving researchers credit for their data
Research data spring: clipper
Research data spring: streamlining deposit
Using Archivemedia to preserve research data
National Monographs Strategy - Project Overview
The way forward together
How have we done?
Coming soon
Stakeholder forum 2015 - Jisc engagement architecture - Martyn harrow
Stakeholder forum 2015 - Engaging across the uk - Robert Haymon-Collins
Stakeholder forum 2015 - Annual review - Martyn Harrow
Stakeholder forum 2015 - The way forward together - Phil Richards
Stakeholder forum 2015 - 2015 and beyond - Martyn Harrow
Mining and mapping places with multiple names
Ad

Similar to Research data spring: filling in the digital preservation gap (20)

PPTX
"Filling the digital preservation gap" with Archivematica
PPTX
Research Data Management: CSUC activities & services
PPTX
The Benefits of Sharing - SCURL Launch
PPT
Institutional Repositories
PPT
Services, policy, guidance and training: Improving research data management a...
PDF
Planning, Marketing and Assessing a Digital Library:The Open University Digit...
PPTX
A collaborative approach to "filling the digital preservation gap" for Resear...
PDF
The Future of Finding: Resource Discovery @ The University of Oxford
PDF
The Future of Finding: Resource Discovery @ The University of Oxford
PPTX
Jisc on repositories unleashing data - Daniela Duca
PPTX
"Filling the Digital Preservation Gap" with Archivematica
PPTX
Project update: A collaborative approach to "filling the digital preservation...
PPTX
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
PPT
Digital Preservation: Other Sources of Information
PPT
Approaches to dig_mg
PPT
Preservation Issues: Other Sources of Information and Next Steps
PPT
Preservation Issues: Other Sources of Information and Next Steps
PPTX
TBOS presentation to College Scotland
PPTX
Institutional repositories
PPTX
Institutional repositories
"Filling the digital preservation gap" with Archivematica
Research Data Management: CSUC activities & services
The Benefits of Sharing - SCURL Launch
Institutional Repositories
Services, policy, guidance and training: Improving research data management a...
Planning, Marketing and Assessing a Digital Library:The Open University Digit...
A collaborative approach to "filling the digital preservation gap" for Resear...
The Future of Finding: Resource Discovery @ The University of Oxford
The Future of Finding: Resource Discovery @ The University of Oxford
Jisc on repositories unleashing data - Daniela Duca
"Filling the Digital Preservation Gap" with Archivematica
Project update: A collaborative approach to "filling the digital preservation...
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Digital Preservation: Other Sources of Information
Approaches to dig_mg
Preservation Issues: Other Sources of Information and Next Steps
Preservation Issues: Other Sources of Information and Next Steps
TBOS presentation to College Scotland
Institutional repositories
Institutional repositories

More from Jisc RDM (20)

PDF
2019-06_Eunis_Burland
PPTX
Jisc Research Data Shared Service Open Repositories 2018 Paper
PPTX
Jisc Research Data Shared Service Open Repositories 2018 24x7
PDF
Jisc Research Data Shared Service - a Samvera case study
PPTX
Building a national Data Repository Data Modelling
PPTX
Building a national Data Repository System Integration Architecture Overview
PPTX
Building a National Data Service Open Repositories 2018
PPTX
Research Data Toolkit
PPTX
Pre jisc datachampday_260318
PPTX
Stories from the Field: Data are Messy and that's (kind of) ok
PPTX
Fair data - dinkum research - by Andy Turner
PDF
2018 03 codata - making the case
PPTX
Research Data Shared Service update at DPC
PPTX
Research Data Shared Service Webinar #1
PPTX
Managing data behind creative masterpieces -RCM
PPTX
Managing data behind creative masterpieces
PPTX
Lightning Talks - Intro
PPTX
Lightning Talk - Andrew MacLellan
PPTX
Lightning Talk - Nick Sheppard
PPTX
Lightning talk - Adam Harwood
2019-06_Eunis_Burland
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 24x7
Jisc Research Data Shared Service - a Samvera case study
Building a national Data Repository Data Modelling
Building a national Data Repository System Integration Architecture Overview
Building a National Data Service Open Repositories 2018
Research Data Toolkit
Pre jisc datachampday_260318
Stories from the Field: Data are Messy and that's (kind of) ok
Fair data - dinkum research - by Andy Turner
2018 03 codata - making the case
Research Data Shared Service update at DPC
Research Data Shared Service Webinar #1
Managing data behind creative masterpieces -RCM
Managing data behind creative masterpieces
Lightning Talks - Intro
Lightning Talk - Andrew MacLellan
Lightning Talk - Nick Sheppard
Lightning talk - Adam Harwood

Recently uploaded (20)

PDF
Complications of Minimal Access Surgery at WLH
PPTX
Cell Types and Its function , kingdom of life
PDF
Classroom Observation Tools for Teachers
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
master seminar digital applications in india
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Business Ethics Teaching Materials for college
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Complications of Minimal Access Surgery at WLH
Cell Types and Its function , kingdom of life
Classroom Observation Tools for Teachers
Microbial diseases, their pathogenesis and prophylaxis
Final Presentation General Medicine 03-08-2024.pptx
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
master seminar digital applications in india
PPH.pptx obstetrics and gynecology in nursing
O7-L3 Supply Chain Operations - ICLT Program
Week 4 Term 3 Study Techniques revisited.pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
102 student loan defaulters named and shamed – Is someone you know on the list?
Business Ethics Teaching Materials for college
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Microbial disease of the cardiovascular and lymphatic systems
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf

Research data spring: filling in the digital preservation gap

  • 1. Research data spring Filling the Digital Preservation Gap10/12/2015 Investigating Archivematica to preserve research data for the longer term …because digital preservation won’t just go away
  • 2. Team 2 University of Hull: • Chris Awre • Richard Green • Simon Wilson University of York: • Julie Allinson • Jen Mitcham
  • 3. Project aim 3 “…to investigate Archivematica and explore how it might be used to provide digital preservation functionality within a wider infrastructure for Research Data Management.”
  • 4. Why? 4 ....because we believe that digital preservation should be a key element of the infrastructure for managing research data for the long term (From RDMSS PQQ) “...preservation actions should ensure that data remains authentic, reliable, and usable while maintaining integrity.”
  • 5. Why Archivematica? 5 ....because it is open source, standards compliant, flexible and customisable and packages a range of preservation tools together ...if you want to know more you can read our phase 1 report
  • 7. The Archivematica development model 7 Artefactual develop Archivematica Archivematica released as open source Community of users identifies enhancements Enhancements sponsored by one or more users RDS / Research data UK users COPPUL
  • 8. Progress in phase 2 »Planning our own local implementations »Hull »York »Considering above campus option for Archivematica »Liaising with other projects »Phase 2 Report: now available! 8
  • 9. Progress in phase 2 »Enhancing Archivematica: »DIP regeneration »METS parsing »Generic search API »Choice of checksum »Pronom integration »Documentation 9
  • 10. Progress in phase 2 Not all of the work we have sponsored is ‘visual’ but much of it is fundamental to the future development of Archivematica. Our work is enabling 10 “The Jisc work has helped to modernise some of the internal infrastructure of Archivematica” Sarah Romkey, Artefactual Systems, 8th December 2015
  • 14. Impact and demand 14 Yes….sounds like a pragmatic solution Yes! Low down learning curve and Archivematica sounds just the ticket :-) Possible but too early to say Do you see Archivematica as a possible digital preservation solution for your institution? Why? Yes - University Archivist is an advocate and want to link in collaboratively with the institution's RDM developments Possibly if it can integrate with Pure... Yes
  • 16. Sustainability »All developments funded in phase 2 will be incorporated into the main code base to be supported for the long term by Artefactual –look out for these in version 1.6 (due Spring 2016) »There are already plans to build on some of the work we have funded –for example AIP re-ingest work from Zuse Institute –...and more...see phase 2 report 16
  • 17. Next phase » Implement our local proof of concepts at Hull and York » Outreach » Paper at IDCC conference » Presentation at UK Archivematica group meeting » Poster at Open Repositories conference » Poster at UK Archives Discovery Forum » more blogs » end of project event to disseminate our case studies » Phase 3 project report (with assessment of success of PoCs) 17
  • 18. What we will spend the money on »Managing and funding our own internal development work » 2 weeks support from Artefactual Systems » 4 new research data file signatures from The National Archives (and further engagement on generic process) » Outreach (conference fees, travel etc) » Putting on our own dissemination event 18
  • 19. Working for other repositories » Archivematica -> repository › Our model: Archivematica -> Fedora/Hydra › Unpack a DIP and create Fedora objects – Similar model for EPrints/DSpace? › Could just store the DIP, but this limits access options » Repository -> Archivematica › Push content to Archivematica from a repository for dark archiving › Possible via DSpace, planned for Fedora/Hydra at Hull 19