SlideShare a Scribd company logo
Jisc Research Data Management
Shared Service Workshop:
An institutional perspective
Jenny Mitcham
Digital Archivist
Borthwick Institute for Archives
University of York
22nd February 2016
The RDM Problem at York
• At the University of York, RDM is not *yet* a
solved problem
• We have a repository:
– But not all the necessary workflows in place to ingest
and manage research data
• We have a CRIS that researchers can use to enter
metadata about their research data:
– But ad hoc manual workflows for getting hold of the
dataset
• We are not currently addressing all the long term
preservation needs of the research data
What am I talking about?
You
Digital preservation refers to
the series of managed
activities necessary to ensure
continued access to digital
materials for as long as
necessary.
Digital preservation ...refers
to all of the actions required
to maintain access to digital
materials beyond the limits of
media failure or technological
change.*
Me
Oh I see!
Not just
storage then?
* Text shamelessly stolen from the DPC Preservation Handbook
This is a digital archive
The Open Archival Information System (OAIS)
Filling the digital preservation gap:
Project aim
“…to investigate
Archivematica and explore
how it might be used to
provide digital preservation
functionality within a wider
infrastructure for Research
Data Management.”
To find out more…
Date: 24th February
Time: 13.50-14.10
Place: Zurich 1
Session: B3: Digital Preservation
Paper: "Filling the Preservation Gap" for
Research Data by Jenny Mitcham
We wanted to be able to answer the
following questions...
• What is the nature of current research data at
York (ie: file format, size, sensitivity)?
• How is research data stored, managed and
shared currently?
• What are the barriers to people managing
their data effectively?
• Where are the gaps in current provision, and
what services do we need to provide to fill
these gaps?
RDM questionnaire (DAF audit)
• Questionnaire based
(loosely) on DCC’s DAF
• Informed by examples
from other institutions
• Used Google forms
• Sent to research staff and
students
• March-May 2013
• 188 responses
Where is your digital data stored?
How long do you have to keep your data?
How often is it backed up?
Will you deposit your data with an
archive?
Reasons why not:
• It is not something I had ever considered - 42%
• It is not something my funder requires - 35%
• There isn't a suitable data centre for my discipline – 18%
Data management issues
Large volume of data caused problems managing and accessing it 75 41%
Problem finding or accessing research data from former colleagues e.g.
PhD students or research staff who have left the University
69 38%
Problem locating where files are stored 62 34%
Absence of file naming conventions made it difficult to find the file you were
looking for
56 31%
Insufficient digital storage space 56 31%
Lack of version control caused confusion 52 29%
Inability to read files in old software formats on old media or because of
expired software licences
44 24%
No data management issues 44 24%
Difficulty interpreting data due to inadequate or lost documentation 43 24%
Insufficient physical storage space 23 13%
Problems establishing ownership of data 14 8%
Problems reading files because of security and encryption 10 5%
Other 9 5%
Value of research data
“There has probably been an awful lot
of good data lost due to poor practice
in archiving ...”
“Storing vast datasets which are not part of
the final publication adds a lot of cost for
very little benefit.”
“Unprocessed data is generally large
and difficult to analyse, unless the
analysis tools are provided in the
archive.”
“I hope strongly that in the future I might
contribute to a widely available repository
for musical instruction/examples ....both for
other players/composers and for
musicological researchers.”
Researchers
What does research data look like?
York RDM questionnaire
2013: Please select the main
types of electronic research
data you generate
Top research data applications at York
NDSA Levels
of Digital
Preservation:
Level 2 requires
you to know what
you’ve got ...
and levels 3 and 4
build on this
The importance of Pronom
The importance of identification
How well are our top 20
formats represented in
Pronom?
• Better than expected
• Sometimes partial
• Sometimes quite
generic (without a
version number)
MATLAB N
SPSS Partial
Stata N
R N
EndNote Partial
NVivo N
LaTeX Partial
Python NWolfram
Mathematica Partial
Gaussian N
ChemDraw Partial
SAS Partial
ArcGIS Partial
GraphPad Prism Partial
Adobe Photoshop Partial
ATLAS.ti N
C++ N
Eclipse NA? No native file formats
MS Excel Y
RSB - ImageJ Partial
Let’s all contribute
Viewing unidentified files in
Archivematica
Some final points
• It is great that Jisc has included digital
preservation as an element of the shared
service
• …but it is not just a question of adopting the
tools
– we may also need to enhance them
– and integrate them
• We need to work with the wider digital
preservation community too
Where to find out more
http://www.york.ac.uk/borthwick/

More Related Content

PPTX
Jisc research data shared service overview IDCC 2016
PPTX
Research at risk: developing a shared research data management service for UK...
PPTX
Research data management and Cambridge and our motivations for the Pilot
PPTX
Business cases and costs RDN
PPTX
Text mining and machine learning
PPTX
Archivematica for research data
PPTX
Jisc Research data shared service overview and update - May 2016
PPTX
Rachel Bruce on DMP
Jisc research data shared service overview IDCC 2016
Research at risk: developing a shared research data management service for UK...
Research data management and Cambridge and our motivations for the Pilot
Business cases and costs RDN
Text mining and machine learning
Archivematica for research data
Jisc Research data shared service overview and update - May 2016
Rachel Bruce on DMP

What's hot (20)

PPTX
Recognising data sharing
PPTX
A discovery service for UK research data
PPTX
RDA UK
PPTX
Show me the money - the long path to a sustainable RDM Facility
PPTX
Research Data Shared Service Webinar #1
PPTX
Discovering the research data alliance
PPTX
Data sharing in the Netherlands
PPTX
From Box to Hydra via Archivematica
PPTX
Frances Burton on sensitive data
PPTX
Research Data Shared Service update at DPC
PPTX
Journal research data policy update
PPTX
Gold, silver, bronze - research data network
PPTX
DAF Survey Results, research data network
PDF
2018 03 codata - making the case
PPTX
Recognising data sharing
PPTX
Rubrics for DMPs
PPT
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
PPTX
What I wish I’d known at the start!
PPTX
UK Research Data Discovery Service metadata schema
PPTX
Presenting RISE
Recognising data sharing
A discovery service for UK research data
RDA UK
Show me the money - the long path to a sustainable RDM Facility
Research Data Shared Service Webinar #1
Discovering the research data alliance
Data sharing in the Netherlands
From Box to Hydra via Archivematica
Frances Burton on sensitive data
Research Data Shared Service update at DPC
Journal research data policy update
Gold, silver, bronze - research data network
DAF Survey Results, research data network
2018 03 codata - making the case
Recognising data sharing
Rubrics for DMPs
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
What I wish I’d known at the start!
UK Research Data Discovery Service metadata schema
Presenting RISE
Ad

Similar to Jisc Research Data Management Shared Service Workshop: An institutional perspective (20)

PPTX
A collaborative approach to "filling the digital preservation gap" for Resear...
PPTX
A collaborative approach to "filling the digital preservation gap" for Resear...
PPTX
A collaborative approach to filling the digital preservation gap for RDM
PPT
Digital Archives in Theory and Practice
PDF
Looking After Your Data: RDM @ Edinburgh
PPTX
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
PPTX
Filling the Digital Preservation Gap - Acting on Change
PPTX
“Filling the digital preservation gap” an update from the Jisc Research Data ...
PPTX
"Filling the Digital Preservation Gap" with Archivematica
PDF
Research Data Management at Edinburgh: Effecting Culture Change
PDF
Research Data Management Inititatives at University of Edinburgh
PPT
Introduction to Research Data Management
PPTX
UK data management environment and support
PPTX
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
PPT
Services, policy, guidance and training: Improving research data management a...
PPT
D.3.1: State of the Art - Linked Data and Digital Preservation
PPTX
RDM Programme @ Edinburgh: Data Librarian Experience
PPTX
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
PPT
Introduction to digital curation
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to filling the digital preservation gap for RDM
Digital Archives in Theory and Practice
Looking After Your Data: RDM @ Edinburgh
UKSG Conference 2017 Breakout - Jisc Research Data Shared Service - John Kaye
Filling the Digital Preservation Gap - Acting on Change
“Filling the digital preservation gap” an update from the Jisc Research Data ...
"Filling the Digital Preservation Gap" with Archivematica
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management Inititatives at University of Edinburgh
Introduction to Research Data Management
UK data management environment and support
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Services, policy, guidance and training: Improving research data management a...
D.3.1: State of the Art - Linked Data and Digital Preservation
RDM Programme @ Edinburgh: Data Librarian Experience
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Introduction to digital curation
Ad

More from Jisc RDM (20)

PDF
2019-06_Eunis_Burland
PPTX
Jisc Research Data Shared Service Open Repositories 2018 Paper
PPTX
Jisc Research Data Shared Service Open Repositories 2018 24x7
PDF
Jisc Research Data Shared Service - a Samvera case study
PPTX
Building a national Data Repository Data Modelling
PPTX
Building a national Data Repository System Integration Architecture Overview
PPTX
Building a National Data Service Open Repositories 2018
PPTX
Research Data Toolkit
PPTX
Pre jisc datachampday_260318
PPTX
Stories from the Field: Data are Messy and that's (kind of) ok
PPTX
Fair data - dinkum research - by Andy Turner
PPTX
Managing data behind creative masterpieces -RCM
PPTX
Managing data behind creative masterpieces
PPTX
Lightning Talks - Intro
PPTX
Lightning Talk - Andrew MacLellan
PPTX
Lightning Talk - Nick Sheppard
PPTX
Lightning Talk - Angela Dappart
PPTX
Lightning talk - Adam Harwood
PPTX
Lightning Talk - Chris Awre
PPTX
Researcher engagement
2019-06_Eunis_Burland
Jisc Research Data Shared Service Open Repositories 2018 Paper
Jisc Research Data Shared Service Open Repositories 2018 24x7
Jisc Research Data Shared Service - a Samvera case study
Building a national Data Repository Data Modelling
Building a national Data Repository System Integration Architecture Overview
Building a National Data Service Open Repositories 2018
Research Data Toolkit
Pre jisc datachampday_260318
Stories from the Field: Data are Messy and that's (kind of) ok
Fair data - dinkum research - by Andy Turner
Managing data behind creative masterpieces -RCM
Managing data behind creative masterpieces
Lightning Talks - Intro
Lightning Talk - Andrew MacLellan
Lightning Talk - Nick Sheppard
Lightning Talk - Angela Dappart
Lightning talk - Adam Harwood
Lightning Talk - Chris Awre
Researcher engagement

Recently uploaded (20)

PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PPTX
Lesson notes of climatology university.
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PPTX
UNIT III MENTAL HEALTH NURSING ASSESSMENT
PDF
RMMM.pdf make it easy to upload and study
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
Trump Administration's workforce development strategy
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
LDMMIA Reiki Yoga Finals Review Spring Summer
Lesson notes of climatology university.
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
Final Presentation General Medicine 03-08-2024.pptx
Supply Chain Operations Speaking Notes -ICLT Program
Weekly quiz Compilation Jan -July 25.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Final Presentation General Medicine 03-08-2024.pptx
Anesthesia in Laparoscopic Surgery in India
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
UNIT III MENTAL HEALTH NURSING ASSESSMENT
RMMM.pdf make it easy to upload and study
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
History, Philosophy and sociology of education (1).pptx
Trump Administration's workforce development strategy
Microbial diseases, their pathogenesis and prophylaxis

Jisc Research Data Management Shared Service Workshop: An institutional perspective

  • 1. Jisc Research Data Management Shared Service Workshop: An institutional perspective Jenny Mitcham Digital Archivist Borthwick Institute for Archives University of York 22nd February 2016
  • 2. The RDM Problem at York • At the University of York, RDM is not *yet* a solved problem • We have a repository: – But not all the necessary workflows in place to ingest and manage research data • We have a CRIS that researchers can use to enter metadata about their research data: – But ad hoc manual workflows for getting hold of the dataset • We are not currently addressing all the long term preservation needs of the research data
  • 3. What am I talking about? You Digital preservation refers to the series of managed activities necessary to ensure continued access to digital materials for as long as necessary. Digital preservation ...refers to all of the actions required to maintain access to digital materials beyond the limits of media failure or technological change.* Me Oh I see! Not just storage then? * Text shamelessly stolen from the DPC Preservation Handbook
  • 4. This is a digital archive The Open Archival Information System (OAIS)
  • 5. Filling the digital preservation gap: Project aim “…to investigate Archivematica and explore how it might be used to provide digital preservation functionality within a wider infrastructure for Research Data Management.”
  • 6. To find out more… Date: 24th February Time: 13.50-14.10 Place: Zurich 1 Session: B3: Digital Preservation Paper: "Filling the Preservation Gap" for Research Data by Jenny Mitcham
  • 7. We wanted to be able to answer the following questions... • What is the nature of current research data at York (ie: file format, size, sensitivity)? • How is research data stored, managed and shared currently? • What are the barriers to people managing their data effectively? • Where are the gaps in current provision, and what services do we need to provide to fill these gaps?
  • 8. RDM questionnaire (DAF audit) • Questionnaire based (loosely) on DCC’s DAF • Informed by examples from other institutions • Used Google forms • Sent to research staff and students • March-May 2013 • 188 responses
  • 9. Where is your digital data stored?
  • 10. How long do you have to keep your data?
  • 11. How often is it backed up?
  • 12. Will you deposit your data with an archive? Reasons why not: • It is not something I had ever considered - 42% • It is not something my funder requires - 35% • There isn't a suitable data centre for my discipline – 18%
  • 13. Data management issues Large volume of data caused problems managing and accessing it 75 41% Problem finding or accessing research data from former colleagues e.g. PhD students or research staff who have left the University 69 38% Problem locating where files are stored 62 34% Absence of file naming conventions made it difficult to find the file you were looking for 56 31% Insufficient digital storage space 56 31% Lack of version control caused confusion 52 29% Inability to read files in old software formats on old media or because of expired software licences 44 24% No data management issues 44 24% Difficulty interpreting data due to inadequate or lost documentation 43 24% Insufficient physical storage space 23 13% Problems establishing ownership of data 14 8% Problems reading files because of security and encryption 10 5% Other 9 5%
  • 14. Value of research data “There has probably been an awful lot of good data lost due to poor practice in archiving ...” “Storing vast datasets which are not part of the final publication adds a lot of cost for very little benefit.” “Unprocessed data is generally large and difficult to analyse, unless the analysis tools are provided in the archive.” “I hope strongly that in the future I might contribute to a widely available repository for musical instruction/examples ....both for other players/composers and for musicological researchers.” Researchers
  • 15. What does research data look like? York RDM questionnaire 2013: Please select the main types of electronic research data you generate
  • 16. Top research data applications at York
  • 17. NDSA Levels of Digital Preservation: Level 2 requires you to know what you’ve got ... and levels 3 and 4 build on this
  • 19. The importance of identification How well are our top 20 formats represented in Pronom? • Better than expected • Sometimes partial • Sometimes quite generic (without a version number) MATLAB N SPSS Partial Stata N R N EndNote Partial NVivo N LaTeX Partial Python NWolfram Mathematica Partial Gaussian N ChemDraw Partial SAS Partial ArcGIS Partial GraphPad Prism Partial Adobe Photoshop Partial ATLAS.ti N C++ N Eclipse NA? No native file formats MS Excel Y RSB - ImageJ Partial
  • 21. Viewing unidentified files in Archivematica
  • 22. Some final points • It is great that Jisc has included digital preservation as an element of the shared service • …but it is not just a question of adopting the tools – we may also need to enhance them – and integrate them • We need to work with the wider digital preservation community too
  • 23. Where to find out more http://www.york.ac.uk/borthwick/