SlideShare a Scribd company logo
Workflows for Publishing Data
Varsha Khodiyar, PhD
Data Curation Editor, Scientific Data
Nature Publishing Group
varsha.khodiyar@nature.com
@varsha_khodiyar
@scientificdata
Scientific Data's experience as an early adopter
RDA P7, 1st to 3rd March 2016
Mandatory and recommended key components
of data publishing – WG results
Austin et al. in review. Report preprint doi:10.5281/zenodo.34542
Implemented by Scientific Data Under wider consideration
by Springer Nature
Implementation of required elements
• Data PID required to complete
manuscript submission
• Data Citation policy enforced by
editorial process
• Use of structured repositories which
capture subject-specific metadata
• Curation of discovery level metadata
(regardless of repository) by dedicated
Data Curation Editor
• Machine readable metadata aids
discovery
Additional elements - Context
4
• Data Descriptor designed to encourage
full documentation of data generation
• Articles analysing described data are
captured in machine readable metadata
(ISA format)
• Linked as associated publication to Data
Descriptor online
• Analysis articles published in Nature
Publishing Group journals link back to
Data Descriptor
• Software availability statement required
for previously unpublished software and
code
Additional elements - Quality
5
• Provision of manuscript (and metadata
templates) to help authors provide reuse level
metadata
• Dependant on repositories for curation by
domain experts
• Editorial Board selected based on expertise in
data generation/reuse in their field
• Ensure that peer reviewers can access data
easily and confidentially
• Encourage peer reviewers to view and comment
on the actual data as part of their assessment
• Editorial office regularly asked for advice on
data deposition and repository selection
• Data Descriptors aid visibility of data by
considering them as first class publications
• Data Descriptors discoverable via common
publication indices such as PubMed
• Discovery level machine readable metadata
(in ISA format) generated for every Data
Descriptor
• Currently trialling use of metadata for data
discovery (ISAexplorer)
• Open to suggestions for other uses of
Scientific Data’s machine readable metadata
Additional elements – Visibility / Accessibility
6

More Related Content

PDF
Data sharing as part of the research ecosystem
PDF
Data sharing as part of the research workflow
PPTX
Gaining credit for sharing research data: Viewpoints on Data Publishing
PPTX
Data peer review workshop
PPTX
The challenge of sharing data well, how publishers can help
PPTX
Publishing and impact 20141028
PPTX
Transparency and reproducibility in research
PDF
Enhance your rese​arch impact through open science
Data sharing as part of the research ecosystem
Data sharing as part of the research workflow
Gaining credit for sharing research data: Viewpoints on Data Publishing
Data peer review workshop
The challenge of sharing data well, how publishers can help
Publishing and impact 20141028
Transparency and reproducibility in research
Enhance your rese​arch impact through open science

What's hot (20)

PDF
Valen Metadata and the [Data] Repository
PDF
Scientific Data and peer review session at Dryad event, May 2015
PPTX
Why would a publisher care about open data?
PPTX
Data, data, everywhere? Not nearly enough!
PDF
John morrissey c3 dis fair working data.pptx
PDF
Sue cook c3 dis dm-ps 1.pptx
PPTX
Wilson-npg-scientific data-nfdp13
PDF
Natasha intro to rdm c3 dis may 2018.pptx
PPTX
DataONE Education Module 01: Why Data Management?
PPTX
DataONE Education Module 03: Data Management Planning
PPTX
Best practices data collection
PPTX
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
PPTX
data citation
PPTX
Identifying and tracking research resources using RRIDs: a practical approach
PPTX
TAIR ICAR 2010 Presentation
PPTX
Searching beyond datasets in the Social Sciences
PPTX
THOR Workshop - Data Publishing Elsevier
PDF
NIH BD2K DataMed metadata model - Force11, 2016
PPTX
THOR Workshop - Introduction
PDF
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
Valen Metadata and the [Data] Repository
Scientific Data and peer review session at Dryad event, May 2015
Why would a publisher care about open data?
Data, data, everywhere? Not nearly enough!
John morrissey c3 dis fair working data.pptx
Sue cook c3 dis dm-ps 1.pptx
Wilson-npg-scientific data-nfdp13
Natasha intro to rdm c3 dis may 2018.pptx
DataONE Education Module 01: Why Data Management?
DataONE Education Module 03: Data Management Planning
Best practices data collection
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
data citation
Identifying and tracking research resources using RRIDs: a practical approach
TAIR ICAR 2010 Presentation
Searching beyond datasets in the Social Sciences
THOR Workshop - Data Publishing Elsevier
NIH BD2K DataMed metadata model - Force11, 2016
THOR Workshop - Introduction
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
Ad

Viewers also liked (7)

PDF
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
PDF
Usages des réseaux sociaux académiques : enjeux et opportunités (2016)
PDF
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
PDF
Difusión y visibilidad de la producción científica en la web def
PPTX
Curriculum Mapping
PDF
Curriculum Mapping & Analysis: Basic Definitions
PPTX
Introduction to Curriculum Mapping
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Usages des réseaux sociaux académiques : enjeux et opportunités (2016)
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
Difusión y visibilidad de la producción científica en la web def
Curriculum Mapping
Curriculum Mapping & Analysis: Basic Definitions
Introduction to Curriculum Mapping
Ad

Similar to Workflows for Publishing Data; Scientific Data's experience as an early adopter (20)

PDF
Preparing your data for sharing and publishing
PDF
Effective research data management
PDF
Peer Reviewing Data: experiences from a data journal
PPTX
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
PPTX
Applying ocr to extract information : Text mining
PDF
20170621_System requirements of data journal platform
PPTX
Emerging domain agnostic functionalities on the handle-centered networks
PPTX
Shareable by Design: Making Better Use of your Research
PDF
Psp v 1 draft 2016 01 15 (1)
PPTX
Dataverse for Journals
PDF
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
PDF
Praetzellis "Data Management Planning and Tools"
PPTX
20160607 citation4software panel
PPTX
Draux "Working with Scholarly APIs: A NISO Training Series, Session Four: Dig...
PDF
ICIC 2017: Publication Analysis and Publication Strategy
PDF
Data Publishing Models by Sünje Dallmeier-Tiessen
PDF
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
PPTX
Essentials 4 Data Support: a fine course in FAIR Data Support
PPTX
S cook ands_ttt2_perth_rdm_training
PPTX
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Preparing your data for sharing and publishing
Effective research data management
Peer Reviewing Data: experiences from a data journal
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Applying ocr to extract information : Text mining
20170621_System requirements of data journal platform
Emerging domain agnostic functionalities on the handle-centered networks
Shareable by Design: Making Better Use of your Research
Psp v 1 draft 2016 01 15 (1)
Dataverse for Journals
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
Praetzellis "Data Management Planning and Tools"
20160607 citation4software panel
Draux "Working with Scholarly APIs: A NISO Training Series, Session Four: Dig...
ICIC 2017: Publication Analysis and Publication Strategy
Data Publishing Models by Sünje Dallmeier-Tiessen
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
Essentials 4 Data Support: a fine course in FAIR Data Support
S cook ands_ttt2_perth_rdm_training
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...

More from Varsha Khodiyar (20)

PDF
Digital transformation to enable a FAIR approach for health data science
PDF
Lessons from the UK: Data access, patient trust & real-world impact with heal...
PDF
COVID-19 variants, vaccines and tests
PDF
COVID-19 variants and vaccines
PDF
Data citation and sharing during article publication
PDF
The importance of research data repositories
PDF
What role can publishers play in the open data ecosystem?
PDF
Five essentials factors for unlocking the potential for Open Research Data
PPTX
New approaches to data management: supporting FAIR data sharing at Springer N...
PPTX
The value of data curation as part of the publishing process
PDF
Facilitating good research data management practice as part of scholarly publ...
PDF
Practical challenges for researchers in data sharing
PDF
Update from Data policy standardisation and implementation IG
PPTX
Data Publishing and Institutional Repositories
PDF
Gaining credit for sharing research data
PPTX
Clinical Data Publishing at Scientific Data
PPTX
Privacy and Publication: challenges and opportunities for clinical data
PPTX
Why should researchers care about data curation?
PPTX
Share & Flourish workshop, Leiden, August 2014
PPTX
Open science: your questions answered
Digital transformation to enable a FAIR approach for health data science
Lessons from the UK: Data access, patient trust & real-world impact with heal...
COVID-19 variants, vaccines and tests
COVID-19 variants and vaccines
Data citation and sharing during article publication
The importance of research data repositories
What role can publishers play in the open data ecosystem?
Five essentials factors for unlocking the potential for Open Research Data
New approaches to data management: supporting FAIR data sharing at Springer N...
The value of data curation as part of the publishing process
Facilitating good research data management practice as part of scholarly publ...
Practical challenges for researchers in data sharing
Update from Data policy standardisation and implementation IG
Data Publishing and Institutional Repositories
Gaining credit for sharing research data
Clinical Data Publishing at Scientific Data
Privacy and Publication: challenges and opportunities for clinical data
Why should researchers care about data curation?
Share & Flourish workshop, Leiden, August 2014
Open science: your questions answered

Recently uploaded (20)

PPT
Presentation of a Romanian Institutee 2.
PPTX
Biomechanics of the Hip - Basic Science.pptx
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPT
6.1 High Risk New Born. Padetric health ppt
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PPTX
A powerpoint on colorectal cancer with brief background
PPTX
perinatal infections 2-171220190027.pptx
PPTX
Welcome-grrewfefweg-students-of-2024.pptx
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PPTX
Substance Disorders- part different drugs change body
PPTX
Understanding the Circulatory System……..
PPTX
endocrine - management of adrenal incidentaloma.pptx
PDF
Science Form five needed shit SCIENEce so
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPTX
BODY FLUIDS AND CIRCULATION class 11 .pptx
PPTX
BIOMOLECULES PPT........................
PPTX
Fluid dynamics vivavoce presentation of prakash
PPTX
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
Presentation of a Romanian Institutee 2.
Biomechanics of the Hip - Basic Science.pptx
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
6.1 High Risk New Born. Padetric health ppt
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
A powerpoint on colorectal cancer with brief background
perinatal infections 2-171220190027.pptx
Welcome-grrewfefweg-students-of-2024.pptx
Hypertension_Training_materials_English_2024[1] (1).pptx
lecture 2026 of Sjogren's syndrome l .pdf
Substance Disorders- part different drugs change body
Understanding the Circulatory System……..
endocrine - management of adrenal incidentaloma.pptx
Science Form five needed shit SCIENEce so
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
BODY FLUIDS AND CIRCULATION class 11 .pptx
BIOMOLECULES PPT........................
Fluid dynamics vivavoce presentation of prakash
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat

Workflows for Publishing Data; Scientific Data's experience as an early adopter

  • 1. Workflows for Publishing Data Varsha Khodiyar, PhD Data Curation Editor, Scientific Data Nature Publishing Group varsha.khodiyar@nature.com @varsha_khodiyar @scientificdata Scientific Data's experience as an early adopter RDA P7, 1st to 3rd March 2016
  • 2. Mandatory and recommended key components of data publishing – WG results Austin et al. in review. Report preprint doi:10.5281/zenodo.34542 Implemented by Scientific Data Under wider consideration by Springer Nature
  • 3. Implementation of required elements • Data PID required to complete manuscript submission • Data Citation policy enforced by editorial process • Use of structured repositories which capture subject-specific metadata • Curation of discovery level metadata (regardless of repository) by dedicated Data Curation Editor • Machine readable metadata aids discovery
  • 4. Additional elements - Context 4 • Data Descriptor designed to encourage full documentation of data generation • Articles analysing described data are captured in machine readable metadata (ISA format) • Linked as associated publication to Data Descriptor online • Analysis articles published in Nature Publishing Group journals link back to Data Descriptor • Software availability statement required for previously unpublished software and code
  • 5. Additional elements - Quality 5 • Provision of manuscript (and metadata templates) to help authors provide reuse level metadata • Dependant on repositories for curation by domain experts • Editorial Board selected based on expertise in data generation/reuse in their field • Ensure that peer reviewers can access data easily and confidentially • Encourage peer reviewers to view and comment on the actual data as part of their assessment • Editorial office regularly asked for advice on data deposition and repository selection
  • 6. • Data Descriptors aid visibility of data by considering them as first class publications • Data Descriptors discoverable via common publication indices such as PubMed • Discovery level machine readable metadata (in ISA format) generated for every Data Descriptor • Currently trialling use of metadata for data discovery (ISAexplorer) • Open to suggestions for other uses of Scientific Data’s machine readable metadata Additional elements – Visibility / Accessibility 6

Editor's Notes

  • #3: Published product in Scientific Data’s case is the data paper, which we call the Data Descriptor
  • #6: Peer-reviewers are not expected to check every data file or "curate" the data.  This is a task we feel is best performed by expert repositories, and with support from our in-house data curation support.  Rejections after review remain rare, but on at least a few occasions peer-reviewers have identified issues within the actual data files that ultimately led to rejection (e.g. evidence of data contamination or other serious quality issues). We believe that making the data easily available to peer reviewers can actually save them time in these cases, because they do not need to "play detective" -- expects can often make an assessment more rapidly and more accurately when presented with the real data.
  • #7: Data Descriptors are discoverable on nature.com, visited by millions, and via common publication indices (PubMed, MEDLINE, Google Scholar -- Scopus and Thomson Reuters to come soon). This also makes them amenable to tracking by traditional metrics, like citation. Scientific Data also delivers progressively FAIR metadata (Findable, Accessible, Interoperable and Reusable)