Preserving repository content: practical steps for repository managers
Open Repositories 2010, 6-9 July, Madrid, Spain
Miggie Pickton, Steve Hitchcock, Simon Coles, Debra Morris, Stephanie Meece
Outline
  • Preservation and repositories
  • The KeepIt project
  • Preservation objectives
  • The KeepIt training course
  • Meeting preservation objectives
  • Steps to preservation readiness
Preservation and repositories
  • Permanent access to scholarly outputs is at the heart of the open access movement: “Open Access means immediate, permanent, free online access to the full text of all refereed research journal articles” [1]
  • Long term or permanent access is a stated goal of many repositories…
  • … but few repositories have implemented practical action plans for preservation
  • Why?
[1] Harnad, S. (2005) The Implementation of the Berlin Declaration on Open Access. D-Lib Magazine, 11(3).
Why so slow to address preservation issues in repositories?
  • Preservation tools and services exist, but few have addressed the specific needs of repositories
  • In practical terms they often necessitate action that is additional to rather than integral to repository workflow
  • Repository content is typically highly varied and complex:
      • Descriptive metadata and file formats used inconsistently
      • Items deposited by those without knowledge or expertise in managing digital assets
  • Repository managers are inexperienced in digital preservation and lack time and confidence to deal with it.
The KeepIt project
  • Preservation tools and services
  • Training and advice
  • Practical and achievable preservation plans for repositories
KeepIt project exemplars
  • NECTAR
  • Research
  • Arts
  • Science
  • Teaching
KeepIt exemplars – starting out
Structured one-to-one meetings between exemplar managers and the preservation specialist to establish the current status of each repository:
  • Policies
  • Management and reporting
  • Repository mission
  • Tools
  • Budget
  • Approach to planning
  • Content profile
  • Storage
  • Services and support
  • Growth predictions
  • Future plans
KeepIt exemplar objectives
Common themes:
  • Tools and procedures, e.g. to deal with different file formats
  • Costs, e.g. for supporting business plans and funding bids
  • Organizational issues, e.g. institutional and user concerns, advocacy, training and documentation
Plus repository-specific objectives, e.g.
  • Responsibilities of content creator and repository (EdShare)
  • Facilitating (automating) preservation actions (eCrystals)
EPrints preservation toolkit
Developed for EPrints 3.2. Incorporates:
  • Storage (including cloud storage)
  • Format management (characterisation)
  • Risk assessment
  • Preservation planning
  • Preservation action, e.g. migration
… enabling a complete preservation workflow
The KeepIt training course
  • Module 1: Organizational issues, audit, selection and appraisal. School of ECS, University of Southampton, 19 January 2010
  • Module 2: Institutional and lifecycle preservation costs. School of ECS, University of Southampton, 5 February 2010
  • Module 3: Primer on preservation workflow, formats and characterisation. Westminster-Kingsway College, London, 2 March 2010
  • Module 4: Putting storage, format management and preservation planning in the repository. University of Southampton, 18-19 March 2010
  • Module 5: Trust, of the repository, of the tools and services it chooses. University of Northampton, 30 March 2010
Course tools
  • Data Asset Framework (DAF) – identify, locate, describe and assess research data assets
  • Assessing Institutional Digital Assets self-assessment toolkit (AIDA)
  • Keeping Research Data Safe (KRDS) – benefits and costs of a repository
  • LIFE3 – predictive costing tool for digital content
  • EPrints preservation toolkit
  • DROID & JHOVE – file format identification and characterisation
  • PREMIS – data dictionary for preservation metadata
  • Plato – preservation planning tool from PLANETS
  • Digital Repository Audit Method Based on Risk Assessment (DRAMBORA) – repository risk assessment and reporting
Meeting preservation objectives – eCrystals
All four exemplars will upgrade to EPrints 3.2 and implement the EPrints preservation tools.
eCrystals:
  • will add their two main file formats (CIF and CML) to DROID as part of an investigation into the automatic validation and verification of content;
  • have published initial cost data as part of the Keeping Research Data Safe (KRDS2) study;
  • are working to understand and develop the relationship between a research data repository and the host institution or research community in terms of migration of preservation plans.
Meeting preservation objectives – EdShare
EdShare:
  • has identified the most prevalent file types in the repository and in the university’s VLE (Blackboard) and is working with IT colleagues to generate statistical information;
  • is planning some comparative analysis with other institutional learning and teaching repositories (how similar/dissimilar is EdShare?);
  • is exploring the specific institutional concerns with respect to preserving resources for learning and teaching – what does this mean for institutional beneficiaries/stakeholders, e.g. teachers, educational leaders, students?
  • is contributing to the ‘Southampton Learning Environment’ – a framework for supporting, delivering and enhancing learning and teaching across the whole university community.
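A rough first pass at a file-type profile like the one EdShare describes can be sketched in a few lines of Python. This is only an illustration, not the project's method: it counts file extensions, whereas tools such as DROID identify formats from file signatures, and the function name `format_profile` and the idea of pointing it at a local copy of the repository's file store are assumptions made for the example.

```python
from collections import Counter
from pathlib import Path


def format_profile(root):
    """Count files under `root` by lower-cased extension, most common first.

    Hypothetical helper: extension counting is a crude stand-in for real
    format identification and will miss mislabelled files.
    """
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            # Files with no extension are grouped under "(none)".
            counts[path.suffix.lower() or "(none)"] += 1
    return counts.most_common()
```

Calling `format_profile("/path/to/files")` returns a list of `(extension, count)` pairs, which may be enough to decide which formats deserve closer attention first.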
Meeting preservation objectives – NECTAR
NECTAR:
  • has upgraded to EPrints 3.2 and is ready for the EPrints preservation tools to be installed;
  • has undertaken a Research Data Project using the DAF methodology to establish how researchers at Northampton create, use and manage their data;
  • is using the results of this survey to inform policy, guidance and preservation planning for the repository;
  • has spread the preservation message by involving colleagues with expertise in technical, metadata and collection management areas in appropriate elements of the KeepIt training course.
Meeting preservation objectives – UAL Research Online
UAL Research Online:
  • is dealing with the preservation implications of a range of image, audio and video formats, and beginning to consider archived websites;
  • is using the DRAMBORA self-assessment tool to identify the activities and assets of the repository and to identify, assess and calculate the associated risks, which will inform guides to digital preservation for researchers, repository staff and senior managers;
  • but Stephanie notes: “We all know that we very much need to assess, manage and minimise risk, but preservation tasks tend to fall into the ‘Important’ category, not the ‘Urgent’ one. It’s easy to spend six months attending to ‘Urgent’ work, and never get to any of the ‘Important’ bits.”
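The idea of identifying, assessing and calculating risks can be illustrated with a very small risk register. The sketch below assumes a simple probability × impact score on hypothetical 1 to 5 scales; it does not reproduce DRAMBORA's own scales or worksheets, and the names `Risk` and `rank_risks` are invented for the example.

```python
from dataclasses import dataclass


@dataclass
class Risk:
    name: str
    probability: int  # hypothetical scale: 1 (rare) to 5 (almost certain)
    impact: int       # hypothetical scale: 1 (negligible) to 5 (severe)

    @property
    def score(self):
        # Assumed scoring rule: probability multiplied by impact.
        return self.probability * self.impact


def rank_risks(risks):
    """Return the risks ordered from highest to lowest score."""
    return sorted(risks, key=lambda risk: risk.score, reverse=True)
```

Ranking a handful of entries this way gives a starting order for discussion; the scores themselves are only as good as the judgments behind them.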
KeepIt wins
Pursuing a ‘preservation’ agenda has delivered wider benefits to the repository managers:
  • Increased our own understanding of the content of our repositories
  • Enabled greater engagement with content providers
  • Raised awareness (of the repository as well as digital preservation) among colleagues and managers
  • Provided the opportunity for us to reflect on our repository’s current status and think strategically about its future
  • Convinced us that it is possible to formulate practical and achievable preservation plans.
Steps to preservation readiness
  • Know the institutional context. A preservation plan must meet the needs of the institution and its stakeholders;
  • Develop preservation policy appropriate to your institution’s and users’ needs;
  • Make a business case for preservation;
  • Identify an appropriate preservation metadata schema to describe your institution’s types of output;
  • Identify tools to support preservation planning and decision-making;
  • Consider storing repository content in multiple locations;
  • Promote the preservation services offered by the repository. Build trust among your user community.
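Storing content in multiple locations only helps if the copies stay identical, so one common follow-up is a periodic checksum comparison. The following is a minimal sketch, assuming both copies are reachable as local directory trees; the function names `sha256_of` and `compare_copies` are hypothetical and this is not part of the EPrints preservation tools.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_copies(primary_root, replica_root):
    """Map each relative path that is missing or differs in the replica
    to a short description of the problem."""
    primary_root, replica_root = Path(primary_root), Path(replica_root)
    problems = {}
    for source in sorted(primary_root.rglob("*")):
        if not source.is_file():
            continue
        relative = source.relative_to(primary_root)
        copy = replica_root / relative
        if not copy.is_file():
            problems[relative.as_posix()] = "missing"
        elif sha256_of(copy) != sha256_of(source):
            problems[relative.as_posix()] = "checksum mismatch"
    return problems
```

An empty result means every file in the primary tree has a byte-identical counterpart in the replica. The check is one-directional, so extra files in the replica are not reported.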
Acknowledgement
We are grateful to the JISC for funding the KeepIt project and the earlier Preserv and Preserv2 projects upon which it draws.
Further information
  • Miggie Pickton, NECTAR Repository Manager, University of Northampton, miggie.pickton@northampton.ac.uk
  • Steve Hitchcock, KeepIt Project Manager, University of Southampton, sh94r@ecs.soton.ac.uk
  • Simon Coles, eCrystals Repository Manager, University of Southampton, S.J.Coles@soton.ac.uk
  • Debra Morris, EdShare Repository Manager, University of Southampton, D.Morris@soton.ac.uk
  • Stephanie Meece, UAL Research Online Repository Manager, University of the Arts, London, s.meece@arts.ac.uk
Or check out the KeepIt project blog at: http://blogs.ecs.soton.ac.uk/keepit/


Editor's Notes

  • #2: For repository managers, by repository managers.
  • #4: Introduction. Few people would disagree that preservation of repository content is important. Indeed, the stated aim of most repositories is to provide permanent open access to the material therein. Why, then, have so few repositories implemented practical action plans for long term preservation of their content?
  • #5: There could be several reasons. Although a number of preservation tools and services already exist, until now few have addressed the specific needs of repositories; in practical terms they have necessitated action that is additional rather than integral to repository workflow. Repository content is typically highly varied and complex: descriptive metadata and file formats are used inconsistently, and items are deposited by those without knowledge or expertise in managing digital assets. Busy repository managers with little, if any, experience in digital preservation have lacked the time and confidence to tackle what is perceived as an important but complex and scary problem.
  • #6: The JISC-funded KeepIt project is bringing together existing preservation tools and services with appropriate training and advice on preservation strategy, policy, costs, metadata, storage, format management and trust to enable the participating repository managers to formulate practical and achievable preservation plans.
  • #7: The KeepIt project. Institutional repositories are host to a range of different materials, including research papers, teaching materials, creative outputs and datasets. The four participating repositories, NECTAR, EdShare, UAL Research Online and eCrystals, are representative of all of these output types. The managers of the four exemplar repositories plus an experienced preservation specialist and a technical developer make up the KeepIt project team.
  • #8: A systematic survey of the exemplar repositories. The structured meetings between repository managers and the preservation specialist established the current status of each repository: its mission; management and reporting structure; policy; approach to planning; budget; tools, services and support; storage; content profile; future plans and growth projections. From each conversation a picture emerged of the whole repository within its institutional context. These conversations not only highlighted areas of need for future preservation-related action; they also gave repository managers the opportunity to reflect on their current position and to share their thoughts regarding preservation.
  • #9: Setting preservation objectives. Repository managers were also asked to set preservation objectives. Common themes included tools (especially to deal with a range of file formats and ideally integrated with repository workflow); costs (for supporting business plans and funding bids); and organisational issues (such as institutional and user concerns, advocacy, training and documentation). Other objectives occurred uniquely in response to each repository’s institutional context, e.g.:
      • EdShare Objective 4: to understand the relationship between the responsibilities of EdShare and the responsibilities of the creator of the content being added to EdShare. (http://blogs.ecs.soton.ac.uk/keepit/2009/09/29/edshare-repository-preservation-objectives/)
      • eCrystals Objective 2: to investigate how performing preservation actions can be made easy! Learning the minimum requirements for the maximum return (the 80% rule). What can be automated and what technologies can be implemented, both unseen by the repository software and as ancillary tools. (http://blogs.ecs.soton.ac.uk/keepit/2009/10/23/ecrystals-repository-preservation-objectives/)
    The collected surveys and contributed objectives can be found under this blog tag: http://blogs.ecs.soton.ac.uk/keepit/tag/exemplar-profiles/
    A synthesis of the four repositories’ objectives is given here: http://blogs.ecs.soton.ac.uk/keepit/2009/12/15/acting-on-repository-preservation-objectives/
  • #10: While the repository managers focused on their separate preservation needs, Dave Tarrant, the project developer, and his colleagues were making rapid progress on a brand new set of tools to manage an integrated repository preservation workflow. These Eprints plugin tools are not the subject of this presentation (they will be fully covered in the Eprints user group session on Friday; see http://or2010.fecyt.es/Resources/documentos/eprints/CompletePreservationWithEPrints.pdf), but they are significant in that they uniquely offer repository managers the opportunity to embed preservation activity, including format management, risk assessment and storage, within the day-to-day life of the repository.
  • #11: Designed with input from repository managers and responding to their preservation objectives, the repository-specific KeepIt preservation training course covered the organisational and financial framework of repository preservation; metadata; some new preservation tools; and issues of trust between repository, users and services.
  • #12: Increasing knowledge about preservation and repositories. Repository managers from both within and outside the project attended the KeepIt training course. This was centred on the tools and services which are available to support repository preservation. Several of these, for example the Data Asset Framework (DAF), the Assessing Institutional Digital Assets (AIDA) self-assessment toolkit, LIFE3, and the Plato preservation planning tool from Planets, were pre-existing tools designed for more general use in digital preservation, but their application was focussed here for the first time on the special needs of repositories. Others, such as DRAMBORA from the Digital Curation Centre, were developed specifically for repositories. Although the project partners were all using Eprints software for their repositories, the KeepIt course attracted participants with a range of repository types. Only one part of the course, the section covering the new Eprints plugins described above, was software-specific; the remainder of the course was of direct relevance to all repositories.
  • #13: The eCrystals repository has three major goals in fulfilling its preservation objectives. The first, a short-term task, is to directly assist a research group in managing the preservation tasks of a research data repository by implementing microservices. To this end the primary file types in the repository (CIF and CML) will be identified to the DROID service as part of an investigation into the automatic validation and verification of content. This work meets original objectives 1, 2 and 3 (preservation actions for non-experts). The second, longer-term goal is to understand and develop the relationship between a research data repository and the host institution or research community in terms of migration of preservation plans. This ranges from the short to medium term in the local repository case to the longer term of the institutional or subject repository, and addresses objectives 3 and 4 (administering repositories). Finally, our original objective 4 was to develop costings for researchers, and we have published initial cost data for our repository as part of the Keeping Research Data Safe study. Keeping Research Data Safe (Phase 2) report: http://www.jisc.ac.uk/publications/reports/2010/keepingresearchdatasafe2.aspx
  • #14: In meeting its preservation objectives, EdShare now has two priorities. The first is to identify the most prevalent file types in EdShare and, as a complement to this piece of work, to identify the most prevalent file types in the institutional VLE, Blackboard. Having identified these file types, EdShare staff are working with the technical team to identify the preservation needs for these types. It is likely that EdShare will use the EPrints 3.2 plugin developed by Dave Tarrant for this purpose. The second priority is to explore and understand the specific institutional concerns of the University of Southampton in the preservation of resources for learning and teaching. This work will align very well with significant ongoing work to develop the “Southampton Learning Environment”, a framework for supporting, delivering and enhancing learning and teaching across the whole University community. EdShare is a key aspect of this work, together with its relationship to the institutional VLE and other aspects of the student experience at Southampton.
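The first EdShare priority, profiling the most prevalent file types, can be approximated before any dedicated tooling is in place. The following is a minimal sketch in Python, assuming the repository's filenames are available as a simple list and using the file extension as a rough proxy for format. Tools such as DROID identify formats from file signatures rather than extensions, so this is only a first pass. The function name and the sample filenames are hypothetical and are not EdShare data.

```python
from collections import Counter
from pathlib import PurePosixPath


def format_profile(filenames):
    """Count files per lower-cased extension, most common first.

    Files with no extension are grouped under '(none)'.
    """
    counts = Counter()
    for name in filenames:
        ext = PurePosixPath(name).suffix.lower()
        counts[ext or "(none)"] += 1
    return counts.most_common()


# Hypothetical sample, for illustration only.
sample = ["lecture1.ppt", "lecture2.PPT", "notes.pdf", "handout.doc", "README"]
profile = format_profile(sample)
```

A profile of this kind only indicates where to look first; extensions can be missing or wrong, which is one reason signature-based identification is preferred for preservation decisions.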
  • #15: Like EdShare, NECTAR’s main objectives were to define the preservation needs of all file types and formats held in NECTAR and to have procedures and tools to support these. As a direct result of the KeepIt training course the NECTAR team is using the DAF methodology to undertake an audit of research data at The University of Northampton and will upgrade their Eprints software to version 3.2 to accommodate the new tools for identifying file types and assessing preservation risks. The DAF project has already raised awareness of preservation issues among the research community and the findings will inform future preservation policy and planning. A third objective, to ensure that preservation training was offered to the broader repository team, was satisfied by inviting technical, metadata and collection management specialists to appropriate elements of the KeepIt training course. This not only spread the acquired knowledge across a wider pool of people, it also promoted engagement with the preservation agenda.
  • #16: UAL Research Online has a different challenge, with most of its content comprising audio, video and other non-text objects. UAL are using the DRAMBORA self-assessment tool to identify the activities and assets of the repository and to identify, assess and calculate the associated risks. DRAMBORA is appropriate for UAL because it is a self-assessment exercise which can be applied to repositories in their infancy, it is appropriate in scale, and it is designed for repositories rather than for all the digital assets of an organisation. The outcomes of the DRAMBORA project will enable repository staff to define appropriate risk management measures for the repository. Stephanie Meece has blogged on this work here: http://blogs.ecs.soton.ac.uk/keepit/2010/06/11/digital-preservation-risk-management-and-ual-research-online/
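To illustrate what it means to identify, assess and calculate risks, here is a small Python sketch of a risk register. It assumes the common convention of scoring each risk as probability multiplied by impact. The scales, the class and function names, and the sample risks are all illustrative assumptions; they are not taken from UAL's assessment and do not reproduce the DRAMBORA toolkit itself.

```python
from dataclasses import dataclass


@dataclass
class Risk:
    name: str
    probability: int  # assumed scale: 1 (rare) to 6 (very frequent)
    impact: int       # assumed scale: 1 (negligible) to 6 (severe)

    @property
    def severity(self) -> int:
        # Assumed scoring convention: probability multiplied by impact.
        return self.probability * self.impact


def rank_risks(risks):
    """Return the risks ordered from highest to lowest severity."""
    return sorted(risks, key=lambda r: r.severity, reverse=True)


# Hypothetical register entries, for illustration only.
register = [
    Risk("Video format becomes obsolete", probability=3, impact=5),
    Risk("Loss of storage hardware", probability=2, impact=6),
    Risk("Incomplete descriptive metadata", probability=4, impact=2),
]
ranked = rank_risks(register)
```

Ranking by a single score is a simplification, but it gives repository staff a starting order in which to define risk management measures.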
  • #18: Steps to preservation readiness. A key aim of both the KeepIt project and its training course was to demystify repository preservation and render it manageable to those responsible for repositories. A priority was to enable repository managers, now informed about preservation and armed with appropriate tools, to take practical steps toward preserving repository content. To this end, the following actions are recommended. The order of the steps is not fixed and there will be some overlap. Know the institutional context. A preservation plan must meet the needs of the institution and its stakeholders. Find out what potential repository content is being produced and by whom. Consider how it is developed, managed and stored (the DAF tool uses this process for research data management, but the principles may be applied to all forms of repository content). Understand your stakeholders’ current ability to support preservation. Develop preservation policy appropriate to your institution’s and users’ needs. Consider the content of your repository: does it all need preserving, and for how long? A clear policy will determine the scope of preservation activity and support the repository manager in future decision-making. Make a business case for preservation: gain the support of your senior managers and demonstrate that preservation can be achieved at realistic cost. The LIFE3 model may be helpful. This should be built into the repository software and will form part of the standard workflow. Identify tools to support preservation planning and decision-making. Use of an appropriate tool will not only provide evidence for future preservation action but in some cases may also facilitate the action itself.
So, for example, the Eprints preservation plugins enable file formats to be identified, characterised and risk-assessed, and the Plato tool creates an appropriate preservation plan based on your defined requirements, an evaluation of potential strategies for migration and an analysis of the results of these strategies. Consider storing repository content in multiple locations, for example in managed 'cloud' storage services. Repository tools such as the EPrints storage plugin, or services such as DuraCloud, can help. Explain to your depositors the benefits of preservation and how the repository can help. Promote the preservation services offered by the repository. Build trust among your user community.
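Storing content in multiple locations is most useful when the copies are periodically checked against each other. The sketch below, in Python, shows one generic way to do this with SHA-256 checksums. It assumes each copy is reachable as a local file path, and the function names are hypothetical. It does not represent how the EPrints storage plugin or DuraCloud work internally.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copies_match(paths):
    """Return True when every copy of a file has the same checksum."""
    checksums = {sha256_of(p) for p in paths}
    return len(checksums) == 1
```

For example, `copies_match(["/local/item.pdf", "/mirror/item.pdf"])` (hypothetical paths) returns True only if the two copies are byte-for-byte identical. A mismatch signals that at least one copy has changed and needs investigation.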