SlideShare a Scribd company logo
Data Citation and DOIs
George Alter
University of Michigan
Data Citation and DOIs
Why is data citation important?
• Creators of data have a right to expect their work
to be acknowledged
• Citation will enhance the careers of data
producers
• Recognition encourages others to share data
• Citations are an element in evaluating the impact
of a data collection
NSF Biosketch
f. Biographical Sketch(es)
(c) Products
A list of: (i) up to five products most closely related to the
proposed project; and (ii) up to five other significant
products, whether or not related to the proposed project.
Acceptable products must be citable and accessible
including but not limited to publications, data sets,
software, patents, and copyrights…
Each product must include full citation information
including (where applicable and practicable) names of all
authors, date of publication or release, title, title of
enclosing work such as journal or book, volume, issue,
pages, website and Uniform Resource Locator (URL) or
other Persistent Identifier.
• Academic
rewards are
often tied to
citations.
• It is easy to
count
citations of
publications.
• Data re-use is
very difficult
to count.
The Problem:
Inconsistent Placement of References
Data-PASS letter to the American Sociological Association,
August 8, 2010
Similar letters sent to American Economics Association, American Education Research
Association, and American Political Science Association.
Persistent Identifiers
• A long-lasting reference to a digital object
• URLs point to locations, which are unstable
• Persistent Identifiers provide a name and a
locator
• Digital Object Identifiers (DOIs) are widely
used for publications
• DOIs are resolved by Registration Agencies
DOIs are used to
resolve rights and
subscriptions
DataCite
• DOI Registration Agency created for scientific data
– Maintains the resolution infrastructure
– Maintains a searchable database of metadata
– Manages the identifiers over the long term
– Establishes and shares best practice
• Focused on improving the scholarly infrastructure around
datasets and other non-textual information
• Founded December 1st 2009 in London
When DOIs are used in citations, the citing
articles can be recovered by search engines.
This data set has
been used in 330
publications, but
only 15 used the
DOI.
It is easy to get a citation and a
DOI from a data repository.
DOI
Where are we now?
• Links between data and publications are not
available, because journals do not cite data
consistently.
• Without consistent citation, aggregators
(Thomson Reuters, Scopus, Google Scholar)
cannot automate links
• The impact of data creation (and data
creators) is difficult to measure
Data citation standards, including DOIs,
are an easy policy for journals to adopt.

More Related Content

PPTX
TAIR ICAR 2010 Presentation
PPTX
Next generation data services at the Marriott Library
PDF
Scientific Data and peer review session at Dryad event, May 2015
PPTX
Payton Eliminating Conflicts in Ebook Metadata
PPTX
Research information management: making sense of it all
PDF
Lcewebinar rdm 5-steps_for_libraries
PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PPTX
Research data management workshop april12 2016
TAIR ICAR 2010 Presentation
Next generation data services at the Marriott Library
Scientific Data and peer review session at Dryad event, May 2015
Payton Eliminating Conflicts in Ebook Metadata
Research information management: making sense of it all
Lcewebinar rdm 5-steps_for_libraries
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
Research data management workshop april12 2016

What's hot (20)

PPTX
RDMRose 2.6 Interviewing a researcher
PPT
Alison McNab - Document management tools for the next decade: writing, citing...
PPTX
NISO Plus: Data Discovery and Reuse: AI Solutions & the Human Factor
PPTX
Burton - Security, Privacy and Trust
PDF
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
PDF
Navigating the data management ecosystem - Dan Valen
PDF
RDAP 16 Poster: Interpreting Local Data Policies in Practice
PDF
RDA FAIR Data Maturity Model
PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PDF
Open Science: Research Data Management
PPTX
Data challenges for researchers
POTX
Using Bibliometrics to Keep Up with the Joneses
PDF
Pikas using bibliometrics to make sense of research proposals
PDF
NIH BD2K DataMed metadata model - Force11, 2016
PDF
Peer Reviewing Data: experiences from a data journal
PPTX
Transparency and reproducibility in research
PDF
Persistent Identifier Services and their Metadata by John Kunze
PPTX
Altmetrics : Rodrigo Costas Comesaña
PPTX
NISO Training Thursday Crafting a Scientific Data Management Plan
PDF
Data availability
RDMRose 2.6 Interviewing a researcher
Alison McNab - Document management tools for the next decade: writing, citing...
NISO Plus: Data Discovery and Reuse: AI Solutions & the Human Factor
Burton - Security, Privacy and Trust
Attribution from a Research Library Perspective, on NISO Webinar: How Librari...
Navigating the data management ecosystem - Dan Valen
RDAP 16 Poster: Interpreting Local Data Policies in Practice
RDA FAIR Data Maturity Model
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
Open Science: Research Data Management
Data challenges for researchers
Using Bibliometrics to Keep Up with the Joneses
Pikas using bibliometrics to make sense of research proposals
NIH BD2K DataMed metadata model - Force11, 2016
Peer Reviewing Data: experiences from a data journal
Transparency and reproducibility in research
Persistent Identifier Services and their Metadata by John Kunze
Altmetrics : Rodrigo Costas Comesaña
NISO Training Thursday Crafting a Scientific Data Management Plan
Data availability
Ad

Similar to Data Citation and DOIs (20)

PPTX
data citation
PDF
Data Publishing Models by Sünje Dallmeier-Tiessen
PPTX
DataONE Education Module 08: Data Citation
PPTX
RDAP13 Elizabeth Moss: The impact of data reuse
PDF
Content Registration at Crossref - LIVE Bangkok
PDF
How can we ensure research data is re-usable? The role of Publishers in Resea...
PPTX
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
PDF
V.3 poster current citations and a future with linked data
PPT
A Data Citation Roadmap for Scholarly Data Repositories
PPTX
[4.1] Data Citation and DOI's - Research Data Management - part of PhD course...
PPTX
Options for online profiles
PDF
New Metadata Developments - Crossref LIVE South Africa
PPTX
DOIs for Research Publication
PPTX
20160607 citation4software panel
PPTX
Fsci 2018 friday3_august_am6
PPTX
ODIN Final Event - Publishing and citing, and the role of persistent identifiers
PPTX
Shareable by Design: Making Better Use of your Research
PDF
New product developments - Jennifer Lin - London LIVE 2017
PPTX
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
PDF
Parsons citation geodata2014
data citation
Data Publishing Models by Sünje Dallmeier-Tiessen
DataONE Education Module 08: Data Citation
RDAP13 Elizabeth Moss: The impact of data reuse
Content Registration at Crossref - LIVE Bangkok
How can we ensure research data is re-usable? The role of Publishers in Resea...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
V.3 poster current citations and a future with linked data
A Data Citation Roadmap for Scholarly Data Repositories
[4.1] Data Citation and DOI's - Research Data Management - part of PhD course...
Options for online profiles
New Metadata Developments - Crossref LIVE South Africa
DOIs for Research Publication
20160607 citation4software panel
Fsci 2018 friday3_august_am6
ODIN Final Event - Publishing and citing, and the role of persistent identifiers
Shareable by Design: Making Better Use of your Research
New product developments - Jennifer Lin - London LIVE 2017
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Parsons citation geodata2014
Ad

More from ARDC (20)

PPTX
Introduction to ADA
PPTX
Architecture and Standards
PPTX
Data Sharing and Release Legislation
PPT
Australian Dementia Network (ADNet)
PPTX
Investigator-initiated clinical trials: a community perspective
PPTX
NCRIS and the health domain
PPTX
International perspective for sharing publicly funded medical research data
PPTX
Clinical trials data sharing
PPTX
Clinical trials and cohort studies
PPTX
Introduction to vision and scope
PPTX
FAIR for the future: embracing all things data
PDF
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
PDF
Skilling-up-in-research-data-management-20181128
PDF
Research data management and sharing of medical data
PPTX
Findable, Accessible, Interoperable and Reusable (FAIR) data
PPTX
Applying FAIR principles to linked datasets: Opportunities and Challenges
PDF
How to make your data count webinar, 26 Nov 2018
PDF
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
PDF
How FAIR is your data? Copyright, licensing and reuse of data
PDF
Peter neish DMPs BoF eResearch 2018
Introduction to ADA
Architecture and Standards
Data Sharing and Release Legislation
Australian Dementia Network (ADNet)
Investigator-initiated clinical trials: a community perspective
NCRIS and the health domain
International perspective for sharing publicly funded medical research data
Clinical trials data sharing
Clinical trials and cohort studies
Introduction to vision and scope
FAIR for the future: embracing all things data
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
Skilling-up-in-research-data-management-20181128
Research data management and sharing of medical data
Findable, Accessible, Interoperable and Reusable (FAIR) data
Applying FAIR principles to linked datasets: Opportunities and Challenges
How to make your data count webinar, 26 Nov 2018
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
How FAIR is your data? Copyright, licensing and reuse of data
Peter neish DMPs BoF eResearch 2018

Recently uploaded (20)

PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
annual-report-2024-2025 original latest.
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Fluorescence-microscope_Botany_detailed content
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Introduction to machine learning and Linear Models
Business Ppt On Nestle.pptx huunnnhhgfvu
Supervised vs unsupervised machine learning algorithms
annual-report-2024-2025 original latest.
IBA_Chapter_11_Slides_Final_Accessible.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
ISS -ESG Data flows What is ESG and HowHow
Fluorescence-microscope_Botany_detailed content
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Introduction to Knowledge Engineering Part 1
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Introduction-to-Cloud-ComputingFinal.pptx
Miokarditis (Inflamasi pada Otot Jantung)
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Introduction to machine learning and Linear Models

Data Citation and DOIs

  • 1. Data Citation and DOIs George Alter University of Michigan
  • 3. Why is data citation important? • Creators of data have a right to expect their work to be acknowledged • Citation will enhance the careers of data producers • Recognition encourages others to share data • Citations are an element in evaluating the impact of a data collection
  • 4. NSF Biosketch f. Biographical Sketch(es) (c) Products A list of: (i) up to five products most closely related to the proposed project; and (ii) up to five other significant products, whether or not related to the proposed project. Acceptable products must be citable and accessible including but not limited to publications, data sets, software, patents, and copyrights… Each product must include full citation information including (where applicable and practicable) names of all authors, date of publication or release, title, title of enclosing work such as journal or book, volume, issue, pages, website and Uniform Resource Locator (URL) or other Persistent Identifier.
  • 5. • Academic rewards are often tied to citations. • It is easy to count citations of publications. • Data re-use is very difficult to count.
  • 6. The Problem: Inconsistent Placement of References Data-PASS letter to the American Sociological Association, August 8, 2010 Similar letters sent to American Economics Association, American Education Research Association, and American Political Science Association.
  • 7. Persistent Identifiers • A long-lasting reference to a digital object • URLs point to locations, which are unstable • Persistent Identifiers provide a name and a locator • Digital Object Identifiers (DOIs) are widely used for publications • DOIs are resolved by Registration Agencies
  • 8. DOIs are used to resolve rights and subscriptions
  • 9. DataCite • DOI Registration Agency created for scientific data – Maintains the resolution infrastructure – Maintains a searchable database of metadata – Manages the identifiers over the long term – Establishes and shares best practice • Focused on improving the scholarly infrastructure around datasets and other non-textual information • Founded December 1st 2009 in London
  • 10. When DOIs are used in citations, the citing articles can be recovered by search engines. This data set has been used in 330 publications, but only 15 used the DOI.
  • 11. It is easy to get a citation and a DOI from a data repository. DOI
  • 12. Where are we now? • Links between data and publications are not available, because journals do not cite data consistently. • Without consistent citation, aggregators (Thomson Reuters, Scopus, Google Scholar) cannot automate links • The impact of data creation (and data creators) is difficult to measure
  • 13. Data citation standards, including DOIs, are an easy policy for journals to adopt.