SlideShare a Scribd company logo
A Data Curation Framework
Data Curation and Research Support Services
Research Support Community Day 2016
A Data Curation Framework: Data Curation and Research Support Services
University library research support
● Information literacy, e.g. information seeking, literature searches
● Collection access, e.g. acquisition, document supply, digitisation
● Scholarship, e.g. citation management, open access publishing
● Data literacy, e.g. research data management planning
● Mix of generic and domain services
● Access to resources
● Access to scholarly skills development
University library research support for DH
● Digital literacy, e.g. training in use of tools like Omeka
● Technical literacy, e.g. training in web publishing and site management
● Project management support, e.g. planning, technical development
● Repackaging digitised collection material as data
● Infrastructure literacy, e.g. training in use of infrastructure
● Mix of generic and domain services
● Access to scholarly skills development
● Access to infrastructure
Data Curation Framework
● Data curation framework developed for liaison and engagement, i.e. it is
a tool to highlight areas to tackle and where collaboration may occur
● Align eResearch service providers’ efforts with university research
support service development, and change as services evolve
Identify where:
● support enables researchers to undertake eResearch, i.e. within
university or national infrastructure
● generic versus specialised support services are provided, i.e. within
university or by third party providers
Framework: 4 components
Data processing
Data storage
Data archiving
Research Data Management
Data services Research Project Management
Framework: data processing
● Improve discoverability and reuse of collected data through
repackaging or transforming data ready for publishing or transit.
Framework: data storage
● Enable data transit to and from international, national and local data
storage services.
Framework: data archiving
● Enable data archiving practices to be embedded into data curation and
research project activities.
Framework: data services
● Provide data and services for researchers to exploit for data intensive
research projects.
Capability development
Service/skills development:
● technical processing
● resource analysis and management
● archival processing
Capability Areas:
● data access
● data integration
● data interoperability (to support data and publication discoverability)
Technical processing
● Selection and reuse of appropriate universal, domain specific or archival
ontologies, schemas and vocabularies.
● Crosswalking techniques, and data and metadata structuring methods.
For the improvement of reuse, access to and discoverability of data.
Resource analysis and management
Analysis of:
● data types and storage options
● researcher requirements
For the provision of appropriate storage and access to data.
Archiving processing
Selection of:
● backup or export regimes
● tools and services
● procedures to capture and retain data
To improve immediate and ongoing access to data.
Work Areas
Examples
Examples
● Data derived from digitised material
● Data derived from descriptive metadata
● Bulk data for web download
● Bulk data for remote access
● File network as data backup
● Repository as data archive
● Project support for eResearch
● Data with research value
Data derived from digitised material
Digitisation activities commonly conducted to provide access to library
resources may be extended with data processing activities to enable the
digital material to be treated as a data source by a researcher.
An example output could be the OCR text made available as a corpus.
Data derived from descriptive metadata
Metadata describing unique materials is commonly treated as a series of
catalogue records for display. Metadata may be transformed and published
as a dataset.
An example output could be the metadata made available in its local
schema, crosswalked into a different format, or structured as linked open
data.
Bulk data for web download
Digital material that is within a lower file size range and non-sensitive can be
packaged up with guidelines for reuse and published openly online.
An example output would be a set of steps and a template for file
arrangements and instructions aimed at data users.
Bulk data for remote access
Digital material that is too large to download over the web with standard
internet speeds or sensitive can be packaged up for transfer to shared and
secure storage services. There may also be a need for the data to be near to
applications in the cloud or high performance computing services.
An example output would be a set of steps and a template for setting access
arrangements, file arrangements and instructions aimed at data users.
File network as data backup
Retention copies of data may need to be maintained in file networks as a
short term solution for the purposes of data backup. These storage services
may be provided institutionally, through third parties or on national
infrastructure.
An example output would be guidance information on the services available
to researchers, including costs, security concerns, and backup routines.
Repository as data archive
Archival copies of data may need to be maintained in repositories as a long
term solution for preservation. These repository services may be provided
institutionally, through third parties or on national infrastructure.
An example output would be guidance information on the services available
to researchers, including costs, security concerns, and preservation routines.
Project support for eResearch
Research support services have been focused on provision of access to
information as a scholarly resource and support the generation of research
publications.
An example output would be a list of services already available and the
identification of gaps in service where capacity may be developed to support
data intensive research projects.
Data with research value
Data has research value at different points in its generation and ongoing
retention.
An example output would be a list of collected data available to make
accessible, the selection criteria for prioritising a data collection for
enhancing its accessibility because of its research value, and the rationale
for the techniques (e.g. web publishing, remote storage, repackaging etc)
employed to support accessibility.

More Related Content

PDF
Identifying classes and objects ooad
PPTX
Graph databases
PPTX
The art of implementing data lineage
PPTX
Data platform modernization with Databricks.pptx
PDF
Data modeling for the business 09282010
PPS
Requirements Management
PPTX
Big data and Hadoop
PDF
Web Intelligence - Tutorial1
Identifying classes and objects ooad
Graph databases
The art of implementing data lineage
Data platform modernization with Databricks.pptx
Data modeling for the business 09282010
Requirements Management
Big data and Hadoop
Web Intelligence - Tutorial1

What's hot (20)

PPTX
Data cleansing
PDF
MongoDB Lab Manual (1).pdf used in data science
PDF
Systems Analysis and Design 8th Edition Kendall Solutions Manual
PDF
Developing a Knowledge Graph of your Competency, Skills, and Knowledge at NASA
PDF
Critical Success Factors of Process Redesign
PPTX
What it means to be FAIR
PPTX
Turning Raw Data Into Gold With A Data Lakehouse.pptx
PPT
Fundamental file structure concepts & managing files of records
PPTX
Database design best practices
PDF
AstraZeneca - The promise of graphs & graph-based learning in drug discovery
PPT
Aspect Oriented Software Development
PPTX
Snowflake Data Access.pptx
PPTX
Dama - Protecting Sensitive Data on a Database
PPTX
Unit 4-apache pig
PDF
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
PDF
Data modelling 101
PPT
Transactions in dbms
PDF
Analytics, Business Intelligence, and Data Science - What's the Progression?
PPT
Spatial data mining
Data cleansing
MongoDB Lab Manual (1).pdf used in data science
Systems Analysis and Design 8th Edition Kendall Solutions Manual
Developing a Knowledge Graph of your Competency, Skills, and Knowledge at NASA
Critical Success Factors of Process Redesign
What it means to be FAIR
Turning Raw Data Into Gold With A Data Lakehouse.pptx
Fundamental file structure concepts & managing files of records
Database design best practices
AstraZeneca - The promise of graphs & graph-based learning in drug discovery
Aspect Oriented Software Development
Snowflake Data Access.pptx
Dama - Protecting Sensitive Data on a Database
Unit 4-apache pig
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
Data modelling 101
Transactions in dbms
Analytics, Business Intelligence, and Data Science - What's the Progression?
Spatial data mining
Ad

Viewers also liked (10)

PPT
Data curation and preservation: the Digital Curation Centre
PPT
Local objects global stories: Ingrid Mason
PDF
Digital curation a tool for knowledge management
PDF
Digital Curation and Methods for Teaching Digital Literacy Skills
PDF
Digital Curation in Libraries: An innovative way of content preservation and...
PDF
digital
PDF
HOW TO CHECK WHATSAPP ON ANOTHER PHONE
PPT
ελεύθερο λογισμικό 1
PDF
Data model scorecard (Article 5 of 11)
PPTX
Librarians Conducting Research: Researcher Librarian Partnerships
Data curation and preservation: the Digital Curation Centre
Local objects global stories: Ingrid Mason
Digital curation a tool for knowledge management
Digital Curation and Methods for Teaching Digital Literacy Skills
Digital Curation in Libraries: An innovative way of content preservation and...
digital
HOW TO CHECK WHATSAPP ON ANOTHER PHONE
ελεύθερο λογισμικό 1
Data model scorecard (Article 5 of 11)
Librarians Conducting Research: Researcher Librarian Partnerships
Ad

Similar to A Data Curation Framework: Data Curation and Research Support Services (20)

PPTX
What infrastructure is necessary for successful research data management (RDM...
PPTX
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PPT
Introduction to Research Data Management
PPT
Dc101 oxford sj_16062010
PDF
Data accessibilityandchallenges
PPTX
Staffing Research Data Services at University of Edinburgh
PPTX
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
PPT
Open Data and Institutional Repositories
PPTX
Introduction to Research Data Management
PDF
Planning for Research Data Management
PDF
Planning for Research Data Managment
PPTX
Good Practice in Research Data Management
PDF
Relationship Building and Advocacy Across the Campus
PPTX
RDM & ELNs @ Edinburgh
PPTX
Shareable by Design: Making Better Use of your Research
PPTX
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
PPTX
Scholze liber 2015-06-25_final
PPTX
Resources for Research Data Managers - 2014-05-28 - University of Oxford
What infrastructure is necessary for successful research data management (RDM...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
Introduction to Research Data Management
Dc101 oxford sj_16062010
Data accessibilityandchallenges
Staffing Research Data Services at University of Edinburgh
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Open Data and Institutional Repositories
Introduction to Research Data Management
Planning for Research Data Management
Planning for Research Data Managment
Good Practice in Research Data Management
Relationship Building and Advocacy Across the Campus
RDM & ELNs @ Edinburgh
Shareable by Design: Making Better Use of your Research
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Scholze liber 2015-06-25_final
Resources for Research Data Managers - 2014-05-28 - University of Oxford

More from SusanMRob (20)

PDF
Innovative services across the research lifecyle v1.5 20180209
PPTX
Amanda Lawarence lief linked semantic platforms project summary
PPTX
ERA NTROs no worries RSCD2018
PPTX
Ingrid Mason rscday 2018_eresearch
PPTX
Pru Mitchell rscd2018
PPTX
Lisa Kruesi presentation_kruesi_condronRSCD18
PPTX
Dawn Mc loughlin_researchsupportcommunityday2018
PPTX
Mullumby Charing rscd2018_predatory_scnm
PPTX
Nicola Ivory rscd2018
PPTX
Jayshree Mamtora moving towards a new IR using Pure_rscd2018
PPTX
Julia Phillips rscd_2018
PPTX
Chris Evans Research Support Community Day 2018
PPTX
Research in practice for LIS professionals twitter chat RSCD2018
PPTX
Wikipedia Editing rscd2018
PPTX
The Conversation rscd2018
PPTX
Are New Digital Literacies Skills Neededrscd2018
PPTX
Clarivate ERA Supplier rscd2018
PPTX
Journal Data Sharing Policies rscd2018
PPTX
Wikimedia Australia rscd2018
PPTX
Bibliometric Competencies rscd2018
Innovative services across the research lifecyle v1.5 20180209
Amanda Lawarence lief linked semantic platforms project summary
ERA NTROs no worries RSCD2018
Ingrid Mason rscday 2018_eresearch
Pru Mitchell rscd2018
Lisa Kruesi presentation_kruesi_condronRSCD18
Dawn Mc loughlin_researchsupportcommunityday2018
Mullumby Charing rscd2018_predatory_scnm
Nicola Ivory rscd2018
Jayshree Mamtora moving towards a new IR using Pure_rscd2018
Julia Phillips rscd_2018
Chris Evans Research Support Community Day 2018
Research in practice for LIS professionals twitter chat RSCD2018
Wikipedia Editing rscd2018
The Conversation rscd2018
Are New Digital Literacies Skills Neededrscd2018
Clarivate ERA Supplier rscd2018
Journal Data Sharing Policies rscd2018
Wikimedia Australia rscd2018
Bibliometric Competencies rscd2018

Recently uploaded (20)

DOCX
"Project Management: Ultimate Guide to Tools, Techniques, and Strategies (2025)"
PDF
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
PPTX
Project and change Managment: short video sequences for IBA
PPTX
worship songs, in any order, compilation
PPTX
Relationship Management Presentation In Banking.pptx
DOC
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
PPTX
S. Anis Al Habsyi & Nada Shobah - Klasifikasi Hambatan Depresi.pptx
PPTX
Hydrogel Based delivery Cancer Treatment
PPTX
Tablets And Capsule Preformulation Of Paracetamol
PDF
oil_refinery_presentation_v1 sllfmfls.pdf
PPTX
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
PPTX
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
PPTX
Introduction to Effective Communication.pptx
PDF
Instagram's Product Secrets Unveiled with this PPT
PPTX
Non-Verbal-Communication .mh.pdf_110245_compressed.pptx
PPTX
chapter8-180915055454bycuufucdghrwtrt.pptx
PDF
natwest.pdf company description and business model
PPTX
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
PPTX
fundraisepro pitch deck elegant and modern
DOCX
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
"Project Management: Ultimate Guide to Tools, Techniques, and Strategies (2025)"
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
Project and change Managment: short video sequences for IBA
worship songs, in any order, compilation
Relationship Management Presentation In Banking.pptx
学位双硕士UTAS毕业证,墨尔本理工学院毕业证留学硕士毕业证
S. Anis Al Habsyi & Nada Shobah - Klasifikasi Hambatan Depresi.pptx
Hydrogel Based delivery Cancer Treatment
Tablets And Capsule Preformulation Of Paracetamol
oil_refinery_presentation_v1 sllfmfls.pdf
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
Introduction to Effective Communication.pptx
Instagram's Product Secrets Unveiled with this PPT
Non-Verbal-Communication .mh.pdf_110245_compressed.pptx
chapter8-180915055454bycuufucdghrwtrt.pptx
natwest.pdf company description and business model
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
fundraisepro pitch deck elegant and modern
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY

A Data Curation Framework: Data Curation and Research Support Services

  • 1. A Data Curation Framework Data Curation and Research Support Services Research Support Community Day 2016
  • 3. University library research support ● Information literacy, e.g. information seeking, literature searches ● Collection access, e.g. acquisition, document supply, digitisation ● Scholarship, e.g. citation management, open access publishing ● Data literacy, e.g. research data management planning ● Mix of generic and domain services ● Access to resources ● Access to scholarly skills development
  • 4. University library research support for DH ● Digital literacy, e.g. training in use of tools like Omeka ● Technical literacy, e.g. training in web publishing and site management ● Project management support, e.g. planning, technical development ● Repackaging digitised collection material as data ● Infrastructure literacy, e.g. training in use of infrastructure ● Mix of generic and domain services ● Access to scholarly skills development ● Access to infrastructure
  • 5. Data Curation Framework ● Data curation framework developed for liaison and engagement, i.e. it is a tool to highlight areas to tackle and where collaboration may occur ● Align eResearch service providers’ efforts with university research support service development, and change as services evolve Identify where: ● support enables researchers to undertake eResearch, i.e. within university or national infrastructure ● generic versus specialised support services are provided, i.e. within university or by third party providers
  • 6. Framework: 4 components Data processing Data storage Data archiving Research Data Management Data services Research Project Management
  • 7. Framework: data processing ● Improve discoverability and reuse of collected data through repackaging or transforming data ready for publishing or transit.
  • 8. Framework: data storage ● Enable data transit to and from international, national and local data storage services.
  • 9. Framework: data archiving ● Enable data archiving practices to be embedded into data curation and research project activities.
  • 10. Framework: data services ● Provide data and services for researchers to exploit for data intensive research projects.
  • 11. Capability development Service/skills development: ● technical processing ● resource analysis and management ● archival processing Capability Areas: ● data access ● data integration ● data interoperability (to support data and publication discoverability)
  • 12. Technical processing ● Selection and reuse of appropriate universal, domain specific or archival ontologies, schemas and vocabularies. ● Crosswalking techniques, and data and metadata structuring methods. For the improvement of reuse, access to and discoverability of data.
  • 13. Resource analysis and management Analysis of: ● data types and storage options ● researcher requirements For the provision of appropriate storage and access to data.
  • 14. Archiving processing Selection of: ● backup or export regimes ● tools and services ● procedures to capture and retain data To improve immediate and ongoing access to data.
  • 16. Examples ● Data derived from digitised material ● Data derived from descriptive metadata ● Bulk data for web download ● Bulk data for remote access ● File network as data backup ● Repository as data archive ● Project support for eResearch ● Data with research value
  • 17. Data derived from digitised material Digitisation activities commonly conducted to provide access to library resources may be extended with data processing activities to enable the digital material to be treated as a data source by a researcher. An example output could be the OCR text made available as a corpus.
  • 18. Data derived from descriptive metadata Metadata describing unique materials is commonly treated as a series of catalogue records for display. Metadata may be transformed and published as a dataset. An example output could be the metadata made available in its local schema, crosswalked into a different format, or structured as linked open data.
  • 19. Bulk data for web download Digital material that is within a lower file size range and non-sensitive can be packaged up with guidelines for reuse and published openly online. An example output would be a set of steps and a template for file arrangements and instructions aimed at data users.
  • 20. Bulk data for remote access Digital material that is too large to download over the web with standard internet speeds or sensitive can be packaged up for transfer to shared and secure storage services. There may also be a need for the data to be near to applications in the cloud or high performance computing services. An example output would be a set of steps and a template for setting access arrangements, file arrangements and instructions aimed at data users.
  • 21. File network as data backup Retention copies of data may need to be maintained in file networks as a short term solution for the purposes of data backup. These storage services may be provided institutionally, through third parties or on national infrastructure. An example output would be guidance information on the services available to researchers, including costs, security concerns, and backup routines.
  • 22. Repository as data archive Archival copies of data may need to be maintained in repositories as a long term solution for preservation. These repository services may be provided institutionally, through third parties or on national infrastructure. An example output would be guidance information on the services available to researchers, including costs, security concerns, and preservation routines.
  • 23. Project support for eResearch Research support services have been focused on provision of access to information as a scholarly resource and support the generation of research publications. An example output would be a list of services already available and the identification of gaps in service where capacity may be developed to support data intensive research projects.
  • 24. Data with research value Data has research value at different points in its generation and ongoing retention. An example output would be a list of collected data available to make accessible, the selection criteria for prioritising a data collection for enhancing its accessibility because of its research value, and the rationale for the techniques (e.g. web publishing, remote storage, repackaging etc) employed to support accessibility.