Digital Curation: gaps and challenges

  • 1. Digital Preservation Research: A review of the challenges
      Kevin Ashley, Head of Digital Archives
      This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/2.5/scotland/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
  • 2. Background
      • Some past documents defining research areas:
          • “Invest To Save” - DELOS/NSF 2002/2003 (I2S)
          • “It’s About Time” - NSF/Library of Congress 2002 (IAT)
          • Liz Lyon - “Dealing with Data” (2007)
          • Warwick statement (2005) and others on European agenda
      • Common themes - some common authors
  • 3. Invest to Save
      • Small team of authors
      • 21 (25) research areas; 7 supporting issues (legal, organisational, etc.)
      • Explicitly trans-continental
  • 4. It’s About Time
      • 51-member workshop
      • Small editorial team to draft outputs
      • 64 research issues
      • USA participants and funders only
  • 5. What’s long-term preservation?
      • “A period of time long enough for there to be concern about the impacts of changing technologies, including support for new media and data formats, and of a changing user community, on the information held in the repository. This period extends into the indefinite future.” (OAIS definition)
      • It’s not always forever!
  • 6. Where is the research now
      • Some has been done
      • Some is being done
      • Some has yet to be started
      • There may also be new challenges not yet identified
  • 7. Work that is done
      • Salvage and rescue (digital material) (I2S)
          • Perhaps we can learn to make it easier or cheaper
      • Repository models (Both)
          • But DSpace etc. lack preservation architecture
      • Format repositories (Both)
      • Distributed storage (Both)
          • But what about torrent-type models?
      • Understanding media (Both)
  • 8. Work that is being done
      • Cost modelling and process modelling (Both)
      • Collection completeness (I2S); anomaly detection (Both)
      • Complex entities - web archiving (Both)
      • Scalability (Both)
      • Preservable metadata: PREMIS is a big step (Both)
      • Audio and video preservation (I2S)
  • 9. Work being done 2
      • Certification + trustworthiness (Both)
          • Of repositories and content
      • Effective refreshing of media (IAT)
      • Automated acquisition and description (Both)
      • Representation Information Registries (I2S)
  • 10. Work being done 3
      • Automated policy application
      • Workflow preservation and sharing
      • Distributed content (e.g. comments/data)
      • Disciplines or places?
      • Generic database preservation
  • 11. Work that remains
      • Newer formats: virtual worlds, musical scores, XML (I2S)
      • Accelerated ageing of systems + software (I2S)
      • Software repositories (Both)
          • Architectures
          • Classification schemes and content models
  • 12. Work that remains 2
      • Multilingual entities (I2S)
          • Preservation/migration, not creation
      • Anomaly detection: at ingest; migration; of a collection (IAT)
      • Automated provenance generation (IAT)
      • Migration of authentication information (IAT)
      • Defining designated communities
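The "anomaly detection: at ingest; migration; of a collection" item on slide 12 can be made a little more concrete with a small sketch. This is not a method from the talk or from the I2S/IAT reports. It shows only the simplest form of the idea: record a checksum manifest when material is ingested, then compare a later manifest against it to spot missing, altered, or unexpected files. The function names (`build_manifest`, `find_anomalies`, `demo`) and the choice of SHA-256 are assumptions made for illustration.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file path (relative to root) to its digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def find_anomalies(expected: dict[str, str], actual: dict[str, str]) -> dict[str, list[str]]:
    """Compare two manifests: what is missing, altered, or unexpected."""
    shared = expected.keys() & actual.keys()
    return {
        "missing": sorted(expected.keys() - actual.keys()),
        "altered": sorted(k for k in shared if expected[k] != actual[k]),
        "unexpected": sorted(actual.keys() - expected.keys()),
    }


def demo() -> dict[str, list[str]]:
    """Hypothetical walk-through: ingest two files, then damage the collection."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "report.txt").write_text("original content")
        (root / "data.csv").write_text("a,b\n1,2\n")
        at_ingest = build_manifest(root)

        # Simulate problems that might appear after a migration.
        (root / "report.txt").write_text("corrupted content")
        (root / "data.csv").unlink()
        (root / "stray.tmp").write_text("x")

        return find_anomalies(at_ingest, build_manifest(root))


if __name__ == "__main__":
    print(demo())
```

A check like this catches bit-level change and loss only. Detecting content that is intact but semantically anomalous is a harder problem that byte comparison does not address.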
  • 13. Work that remains 3
      • Self-aware objects (I2S)
      • Scaling down (I2S)
      • Market analysis: customer base and needs (IAT)
      • Formal models for selection (Both)
      • Exchange of content and services between repositories (Both)
      • Migrating ontologies, schemas, etc. (IAT)
  • 14. What type of research?
      • I2S requires pragmatic and theoretical research
      • “Practice informs research in a fashion similar to the ways in which research informs practice.”
      • Need good pathways in both directions
  • 15. Final observations
      • General problem: tension between big abstracts and specifics
          • Sometimes research is too specific
          • Some of the problems are defined too generally
      • Good research can be done by those outside the field
      • Those within it must define the problems better