Metadata and data citation
May-15
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmrose
Research Data Management
Workshop 2.5
Learning Outcomes
By the end of this session you will be able to:
• Discuss the varying requirements of metadata
that will enable researchers to identify the
potential of a particular dataset
• Evaluate ways of citing data
• Articulate and reflect upon some of the issues
involved with citing data and datasets
Session 2.5 overview
• EPSRC principles and expectations
• What is sufficient metadata?
• How to cite data?
EPSRC Principle 6
• “Sufficient metadata should be recorded and made openly
available to enable other researchers to understand the
potential for further research and re-use of the data.
Published results should always include information on how
to access the supporting data.”
http://www.epsrc.ac.uk/about/standards/researchdata/Pages/principles.aspx
EPSRC Expectation 5
• “Research organisations will ensure that appropriately
structured metadata describing the research data they hold
is published (normally within 12 months of the data being
generated) and made freely accessible on the internet; in
each case the metadata must be sufficient to allow others to
understand what research data exists, why, when and how it
was generated, and how to access it. Where the research
data referred to in the metadata is a digital object it is
expected that the metadata will include use of a robust
digital object identifier (For example as available through the
DataCite organisation - http://datacite.org).”
http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx
Activity 1: Metadata
• What is “sufficient metadata” that enables
“other researchers to understand the
potential for further research and re-use of
the data”?
Activity 1: Metadata
The University of Poppleton holds a dataset with meteorological
observations, taken at the university’s weather station. In particular, it
contains a set of precipitation measurements since the foundation of
the university. A climatologist, Jenny Fairweather, is interested in this
dataset for her research into climate change. She is looking for trends
in the weather. A meteorologist, Wilson Rainbird, who works for the
UK Met Office wants to use these data for the purposes of weather
prediction. He is mainly interested in combining these precipitation
measurements with other similar datasets. A researcher, Alice Snowe,
from another university’s Accident Research Unit conducts most of her
research in the area of road traffic accidents. She would like to map
the precipitation measurements to another dataset containing
information on road accidents in order to analyse possible
correlations. Lastly, the university’s data repository manager, John
Shower, is concerned with issues regarding data access and IPR.
Activity 1: Metadata
• What is “sufficient metadata” for each of
these stakeholders “to understand the
potential for further research and re-use of
the data”?
Example
• The DaMaRO project at the University of Oxford has developed a
metadata schema for its DataFinder (Rumsey, 2012).
• A three-tier metadata approach:
– Mandatory minimal metadata to enable basic discovery, such as
Creator, Title, Publisher, Date, Location, Access terms & conditions
– Mandatory contextual metadata (mostly administrative and partly
based on EPSRC expectations), such as Funding Agency, Grant Number,
Last access request date, Project Information, Data Generation
Process, Why the data was generated, Date (range) of data collection,
Reasons for embargo
– Optional metadata (including discipline-specific metadata) to enable
reuse, such as machine settings and experimental conditions under
which the data were gathered
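The three tiers could be sketched as a simple completeness check. This is only an illustration, not the DaMaRO schema itself: the field keys are hypothetical spellings of some of the element names listed above, and the sample record is invented for the Poppleton scenario.

```python
# Hypothetical field keys based on the element names on this slide;
# the real DataFinder schema may name and group them differently.
MINIMAL = ["creator", "title", "publisher", "date", "location", "access_terms"]
CONTEXTUAL = ["funding_agency", "grant_number", "project_information",
              "data_generation_process", "why_generated", "collection_dates"]


def missing_fields(record, required):
    """Return the required fields that are absent or empty in the record."""
    return [name for name in required if not record.get(name)]


def tier_report(record):
    """List missing mandatory fields per tier; optional metadata is not checked."""
    return {
        "minimal": missing_fields(record, MINIMAL),
        "contextual": missing_fields(record, CONTEXTUAL),
    }


# Invented example record for the Poppleton precipitation dataset.
record = {
    "creator": "University of Poppleton",
    "title": "Precipitation measurements 1905-2010 taken at Western Bank weather station",
    "publisher": "The University of Poppleton, Meteorological Service",
    "date": "2011",
    "location": "Poppleton",
    "access_terms": "",  # deliberately left empty
    "collection_dates": "1905-2010",
}

report = tier_report(record)
print(report["minimal"])     # fields still needed for basic discovery
print(report["contextual"])  # fields still needed for context
```

A check like this only shows whether a field is filled in. Whether the content is "sufficient" for a climatologist or a repository manager remains a matter of judgement.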
Activity 2: Data citation
• How should data be cited?
• There are no established standards for data
citation yet, although some style manuals
such as the APA’s (in the 5th and 6th editions)
and some repositories such as the UK Data
Archive do provide instructions.
Activity 2: Data citation
• A researcher, Alice Snowe, from another university’s
Accident Research Unit is seeking to use the dataset
with precipitation measurements going back to the
foundation of the University. This dataset was
deposited in 2011 by the University’s meteorologist,
Christopher Oldman Frost, and covers all years up to
and including 2010. It consists of data subsets that are
organised per year, each consisting of several files,
including Excel spreadsheets, Word files, and image
files (digitised observations written down on paper). Of
course, Mr Oldman Frost is not the only meteorologist
who has been involved in taking the measurements
that make up this dataset.
Activity 2: Data citation
• Alice Snowe is now writing a research paper for Science
called ‘The correlation between bicycle accidents and
precipitation in urban centres during the rush hour’.
She needs to cite our institutional repository’s dataset.
In particular she will need to refer to the precipitation
measurements of 4 May 1979. Elsewhere in her article
she also needs to refer to a subset covering the winter
months of the years 1981-1985.
• Write down the references that Alice Snowe needs to
give in her article.
APA
Basic form:
• Rightsholder. (Year). Title of data set (Version number)
[Description of form]. Location: Name of producer.
or
• Rightsholder. (Year). Title of data set (Version number)
[Description of form]. Retrieved from http://
Example:
• University of Poppleton. (2011). Precipitation
measurements 1905-2010 taken at Western Bank
weather station [Data files and documentation].
Poppleton: The University of Poppleton,
Meteorological Service.
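As a rough sketch, the basic form can be written as a small formatting function. The function name and parameters are assumptions made for illustration, and the output follows the pattern on this slide (with a full stop after the rightsholder) rather than every rule in the APA manual.

```python
def apa_dataset_citation(rightsholder, year, title, form,
                         version=None, location=None, producer=None, url=None):
    """Build a data set reference following the basic form shown on this slide."""
    if not url and not (location and producer):
        raise ValueError("give either a URL or both location and producer")
    version_part = f" ({version})" if version else ""
    head = f"{rightsholder}. ({year}). {title}{version_part} [{form}]."
    if url:
        # Second variant: the data set was retrieved online.
        return f"{head} Retrieved from {url}"
    # First variant: location and name of producer.
    return f"{head} {location}: {producer}."


ref = apa_dataset_citation(
    rightsholder="University of Poppleton",
    year=2011,
    title="Precipitation measurements 1905-2010 taken at Western Bank weather station",
    form="Data files and documentation",
    location="Poppleton",
    producer="The University of Poppleton, Meteorological Service",
)
print(ref)
```

The form has no slot for a single day or a range of winter months, which is one of the practical issues Activity 2 is meant to bring out.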
DataCite
• DataCite (http://www.datacite.org) is a not-for-profit
organisation that aims to promote and support the
sharing of research data
• They are developing an infrastructure that supports
methods of data citation, discovery, and access
• They are currently leveraging the DOI (Digital Object
Identifier) infrastructure, which is also used for
research articles
• They can provide DOIs for datasets
• DataCite DOIs have to resolve to a public landing page
with information about the dataset and a direct link to
it
DataCite
Basic form:
• Creator (PublicationYear): Title. Version. Publisher.
ResourceType. Identifier
• Version and ResourceType are optional elements
• For citation purposes, DataCite recommends that DOI
names are displayed as linkable, permanent URLs
• More info in DataCite (2011)
Example:
• University of Poppleton (2011): Precipitation
measurements 1905-2010 taken at Western Bank weather
station. Meteorological service, The University of
Poppleton. http://dx.doi.org/10.1594/UoP.MS.298
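The DataCite form, with its optional elements, might be assembled as follows. This is a minimal sketch: the function name, parameters and the default resolver prefix are assumptions for illustration, and the DOI is the fictional one from the example above.

```python
def datacite_citation(creator, year, title, publisher, doi,
                      version=None, resource_type=None,
                      resolver="http://dx.doi.org/"):
    """Creator (PublicationYear): Title. Version. Publisher. ResourceType. Identifier"""
    # Version and ResourceType are optional, so empty elements are skipped.
    parts = [title, version, publisher, resource_type]
    body = ". ".join(part for part in parts if part)
    # The DOI name is shown as a linkable URL, as DataCite recommends.
    return f"{creator} ({year}): {body}. {resolver}{doi}"


citation = datacite_citation(
    creator="University of Poppleton",
    year=2011,
    title="Precipitation measurements 1905-2010 taken at Western Bank weather station",
    publisher="Meteorological service, The University of Poppleton",
    doi="10.1594/UoP.MS.298",  # fictional DOI from the slide
)
print(citation)
```

Because the identifier points at the dataset as a whole, citing one day or one subset would need either an extra DOI at a finer level or additional wording in the reference.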
Activity 2: Data citation
• What practical issues did you encounter when
writing the references for Alice Snowe’s
research paper? How could these issues be
solved?
Data Citation
• Issues include (Ball & Duke, 2011a, 2011b):
– At what granularity should data be made citable?
– How to credit each contributor in a dataset that is
assembled from very many contributions?
– Where in a research paper should a data citation
be given (e.g. a paper describing a dataset versus
subsequent papers using it)?
– What to do with frequently updated data?
REFERENCES
• American Psychological Association (2010). Publication
Manual of the American Psychological Association (6th
edition). Washington, DC: American Psychological Association,
pp. 210-211.
• Ball, A., & Duke, M. (2011a). Data Citation and Linking. DCC
Briefing Papers. Edinburgh: Digital Curation Centre. Retrieved from
http://www.dcc.ac.uk/resources/briefing-papers/introduction-curation/data-citation-and-linking
• Ball, A., & Duke, M. (2011b). How to Cite Datasets and Link to
Publications. DCC How-To Guides. Edinburgh: Digital Curation
Centre. Retrieved from
http://www.dcc.ac.uk/resources/how-guides/cite-datasets
• DataCite (2011). DataCite Metadata Schema for the Publication and
Citation of Research Data. Version 2.2. London: DataCite. Retrieved from
http://schema.datacite.org/meta/kernel-2.2/doc/DataCite-MetadataKernel_v2.2.pdf
doi:10.5438/0005
• DataCite (n.d.). Why cite data? Hannover. Retrieved from
http://datacite.org/whycitedata
• Rumsey, S. (2012). Just enough metadata: Metadata for research
datasets in institutional data repositories [PowerPoint
presentation]. Oxford: The University of Oxford. Retrieved from
http://damaro.oucs.ox.ac.uk/docs/Just%20enough%20metadata%20v3-1.pdf
• UK Data Archive (n.d.). Citing Data. Colchester. Retrieved from
http://www.data-archive.ac.uk/conditions/citing-data


Editor's Notes

  • #10: Cf UK Data Archive’s distinction between data-level documentation and study-level documentation
  • #14: APA: American Psychological Association