Using DAF as a data scoping tool for institutional repositories
Sarah Jones, DCC, University of Glasgow

Background to DAF project
“JISC should develop a Data Audit Framework to enable all universities and colleges to carry out an audit of departmental data collections, awareness, policies and practice for data curation and preservation”
Liz Lyon, Dealing with Data: Roles, Rights, Responsibilities and Relationships (2007)

Scope of work
  • DAF Development (DAFD) Project (University of Glasgow; King’s College London; University of Edinburgh; UKOLN, University of Bath)
  • Four pilot implementation projects: University of Edinburgh; King’s College London; Imperial College London; University College London

The methodology
http://www.data-audit.eu/DAF_Methodology.pdf

Themes addressed in DAF surveys
  • Data: type / format, volume, description, creator, funder
  • Creation: policy, naming, versioning, metadata & documentation
  • Management: storage, backup, roles and responsibilities, planning
  • Access: restrictions, rights, security, frequency, ease of retrieval, publish
  • Sharing: collaborators, requirements to share, methods, concerns
  • Preservation: selection / retention, repository services, obsolescence
  • Gaps / needs: services, advice, support, infrastructure

Subject areas of DAF pilots
  • DAFD test cases: GeoSciences; Archaeology; Mechanical Engineering; Humanities
  • University of Edinburgh: Physiology; Divinity; History; Brain Imaging; Astronomy
  • University College London: Archaeology; Scandinavian Studies; Physics & Astronomy; Life & Medical Sciences
  • Imperial College London: Chemical Engineering; Physics; Business School
  • King’s College London: Geography; Psychiatry; Environmental Research; Biomedical and Health Sciences
  • DataShare examples: Cardiac group; Dept of International Development; Social Sciences

Generalised findings
  • Lots of data were created
  • Few policies for data creation, storage and management
  • Researchers unsure where to begin and were often unaware of available support
  • Often no place of deposit or funds for preservation
Pilot implementation findings: http://www.data-audit.eu/findings.html
IJDC paper: http://www.ijdc.net/ijdc/article/view/91/109

Workshop on next steps for DAF
  • Many of the pilots found the actual process of gathering information on data management was more valuable than the asset register. The DAF approach was felt to be useful for defining requirements to improve data management. (JISC funded DMI projects)
  • A suggestion was made to enhance DAF with practical examples / guidance from the pilot studies. (Implementation Guide)
  • Align the DAF process with other data management planning tools. (IDMP project between AIDA, DAF, DRAMBORA, LIFE)


Editor's Notes

  • #2: I’ll start off with some background context and an overview of DAF. Harry will then explain how it’s been used at Southampton, and then we’ll do a group exercise.
  • #3: DAF was established in response to a recommendation in the Dealing with Data report, which recognised a lack of awareness as to what data were held within HE institutions and how they were being managed. How can universities make the most of their research data when it is unclear what there is, where these data are, how they’re being managed, what the options for reuse are, and so on? DAF tries to help users find these things out. It can be a useful tool for repositories to identify data for ingest, or to see what support is required by researchers left curating data without the necessary resources or skills.
  • #4: Five projects were funded by JISC over a six-month period in 2008: a development project to come up with the methodology and develop an online tool, and implementation projects to test this out and investigate the research data challenge.
  • #5: The methodology has four incremental stages: one for planning, one for wrap-up and two main audit stages. Stages 2 and 3 pick up directly on the two aspects in the original recommendation, i.e. what data exist (inventory stage) and what’s happening to them (assessment stage).
    - Planning: define scope / expected outcomes of the survey; conduct preliminary research; set up interviews / questionnaires.
    - Identifying data: collect basic information (name, description, creator, location); broad mapping to get a feel for the extent of data holdings; classification helps refine the scope of the next stage.
    - Assessing data: look into a few datasets / collections in more depth to identify weaknesses in data management and risks; consider the whole lifecycle.
    - Reporting: collate and analyse the information collected; make recommendations on how to improve data management.
    Information was typically collected by a mix of questionnaires and interviews.
  • #6: Themes covered all activities in the data lifecycle. Some found this model useful as a way to guide discussion. Across all themes there was a tendency to unpick issues and concerns.
  • #7: Pilots covered a mixture of disciplines and sizes of organisation (research groups, departments, schools etc.). The focus of implementations differed slightly too. Some were more repository based, e.g. Imperial College was more concerned with capacity planning, so asked questions about data size, growth rates, planned retention and formats. The DataShare examples were undertaken to identify suitable data for ingest in light of a lack of voluntary deposits.
  • #8:
    - Lots of data, often complex: survey data, 3D visualisations, CAD drawings.
    - Didn’t come across many policies; practice was very ad hoc.
    - People didn’t know what to do and wanted support, but were also unaware of where they could turn, e.g. to the repository.
    - Often nowhere for data to go: researchers didn’t always have data centres in their subject area, or the ability to deposit their data in the institutional repositories.
    Researchers wanted to keep and reuse data but didn’t have the time or skills to do it themselves, so there is a need for data curation infrastructure. There is a role for IRs here.
  • #9: We had a workshop in 2009 to collate lessons from the pilots and decide next steps for DAF. These were the three main recommendations made.
    1. Most institutions were still in the early stages of developing infrastructure, so the approach was more useful for gathering requirements than for identifying data to manage. DAF has been suggested as a tool for new JISC data management infrastructure projects to use for scoping requirements. The exercise today will focus on this usage too: scoping data and gathering requirements for the repository’s role in data management.
    2. Lessons / approaches from the pilots have been brought together to help others; see the implementation guide.
    3. Some new work has been funded (JISC IDMP project) to see how DAF and other tools can be brought together to help institutions develop their data management strategy.
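
As a rough illustration of the inventory and assessment stages described in note #5, and of the capacity-planning questions mentioned in note #7, an asset register could be sketched in Python as below. This is not the DAF online tool or its data model. The class name, field names, classification labels, sample entries and the compound-growth assumption are all hypothetical, chosen only to mirror the information the notes say was collected (name, description, creator, location, data size, growth rates).

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataAsset:
    """One row of a hypothetical asset register (not the DAF tool's schema)."""
    # Basic information collected in the identification stage
    name: str
    description: str
    creator: str
    location: str
    # Capacity-planning details (size and growth rate); units are assumptions
    size_gb: float = 0.0
    annual_growth: float = 0.0  # 0.5 means 50% per year, assumed to compound
    # Illustrative classification label used to narrow the assessment stage
    classification: str = "unclassified"
    # Weaknesses and risks noted during the assessment stage
    risks: List[str] = field(default_factory=list)


def select_for_assessment(register: List[DataAsset], wanted: str) -> List[DataAsset]:
    """Return the assets whose classification matches `wanted`."""
    return [asset for asset in register if asset.classification == wanted]


def projected_size_gb(asset: DataAsset, years: int) -> float:
    """Project an asset's size after `years`, assuming compound annual growth."""
    return asset.size_gb * (1 + asset.annual_growth) ** years


# Made-up sample entries, for illustration only
register = [
    DataAsset("Survey responses", "Questionnaire data", "Research group A",
              "departmental file server", size_gb=10.0, annual_growth=0.5,
              classification="priority"),
    DataAsset("CAD drawings", "Design files", "Research group B",
              "local workstation", size_gb=200.0),
]

# Narrow the inventory to the few assets that will be assessed in depth
shortlist = select_for_assessment(register, "priority")

# Estimate total storage needed in two years' time
total_in_two_years = sum(projected_size_gb(asset, 2) for asset in register)
```

Keeping the register as plain records makes it easy to filter for the in-depth assessment stage and to reuse the same entries for storage estimates, though a real audit would likely capture many more of the survey themes.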