SlideShare a Scribd company logo
The challenge of sharing data well and how publishers can help
Varsha Khodiyar, PhD
DREAM Challenges and Epidemium@RECOMB
workshop, Paris
19.04.2018
1
What are the challenges to sharing data?
12%
16%
20%
23%
28%
Costs of
sharing data
Lack of time
to deposit
data
Not knowing
which
repository to
use
Unsure
about
copyright
and licensing
Organising
data in a
presentable
and useful
wayTotal respondents: 7719
Stuart, D. et al. Practical challenges for researchers in data sharing. Springer Nature
whitepaper (2018) https://doi. org/10.6084/m9.figshare.5975011
2
Scientific Data, a Nature Research journal
nature.com/scientificdata
3
The Data Descriptor
Sections
• Title
• Abstract
• Background & Summary
• Methods
• Data Records
• Technical Validation
• Usage Notes
• Figures & Tables
• References
• Data Citations
• Detailed descriptions of the methods and technical analyses supporting the
quality of the measurements.
• Does not contain tests of new scientific hypotheses
• Data must be archived in a repository at submission
• Peer reviewers are asked to comment on the reusability of the work as presented
4
Data peer review
nature.com/sdata/policies/for-referees
Experimental
Rigor and
Technical Data
Quality
Were data produced in a sound manner?
Technical quality of data – appropriate statistical analyses?
Experimental rigor - appropriate depth, coverage?
Completeness
of the
Description
Sufficient detail to allow others to reproduce these steps?
Sufficient detail to allow others to reuse this data?
Consistent with relevant minimum reporting standards?
Integrity of the
Data Files and
Repository
Record
Do data files appear complete and match manuscript
descriptions?
Are data archived to the most appropriate repository?
5
We capture information about the dataset being described in each Data Descriptor.
During the metadata curation process
• Manuscript re-read
• Data archive checked
• Minor issues with the data and/or manuscript often identified
• Metadata captured in ISA-Tab format
Metadata curation and final data checking
6
ISA-Tab metadata files
scientificdata.isa-explorer.org
Structured Summary table appears after abstract
github.com/ScientificDataLabs/ISA-tab
7
Data Descriptors help researchers share their data in a reusable
way
7
“The Data Descriptor made it easier to
use the data, for me it was critical that
everything was there…all the technical
details like voxel size.”
Professor Daniele Marinazzo
8
Helping researchers know where to share their data
nature.com/sdata/data-policies/repositories
Browse our recommended data repositories online.
• We currently list more than 100 repositories, across biological,
medical, physical and social sciences
• When requested we provide guidance to authors on the best place to
store their data
9
Helping researchers understand data licensing
springernature.com/gp/authors/research-data-policy
10
Reuse of Challenge data published at Scientific Data
11
Other Challenge dataset publications at Scientific Data
12
How can we help Challenge participants?
Visit nature.com/scientificdata
Email scientificdata@nature.com
Tweet @ScientificData

More Related Content

PPTX
Gaining credit for sharing research data: Viewpoints on Data Publishing
PDF
Scientific Data and peer review session at Dryad event, May 2015
PDF
Data sharing as part of the research workflow
PDF
Henderson "Institutional Identifiers"
PDF
John morrissey c3 dis fair working data.pptx
PDF
Peer Reviewing Data: experiences from a data journal
PDF
Natasha intro to rdm c3 dis may 2018.pptx
PDF
Sue cook c3 dis dm-ps 1.pptx
Gaining credit for sharing research data: Viewpoints on Data Publishing
Scientific Data and peer review session at Dryad event, May 2015
Data sharing as part of the research workflow
Henderson "Institutional Identifiers"
John morrissey c3 dis fair working data.pptx
Peer Reviewing Data: experiences from a data journal
Natasha intro to rdm c3 dis may 2018.pptx
Sue cook c3 dis dm-ps 1.pptx

What's hot (20)

PPT
Journal Data Requirements
PDF
Preparing your data for sharing and publishing
PDF
Managing sensitive data in your repository
PDF
Data sharing as part of the research ecosystem
PDF
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
PPTX
Workflows for Publishing Data; Scientific Data's experience as an early adopter
PPTX
EDI Training Module 12: An Introduction to Metadata and Data Repositories
PPTX
Data Publishing and Institutional Repositories
PDF
Introduction to the Environmental Data Initiative (EDI)
PDF
Valen Metadata and the [Data] Repository
PPTX
Research data management workshop april12 2016
PDF
Hawkins "Implementation of the CONSER Standard Record"
PDF
Research data management at TU Eindhoven
PPTX
Managing your data paget
PDF
DataShare - Pauline Ward to University of Edinburgh School of Chemistry - 3 f...
PPTX
DataONE Education Module 01: Why Data Management?
PPTX
THOR Workshop - Data Publishing Elsevier
PPTX
Best practices data collection
PDF
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
PPTX
Data Management for Postgraduate students by Lynn Woolfrey
Journal Data Requirements
Preparing your data for sharing and publishing
Managing sensitive data in your repository
Data sharing as part of the research ecosystem
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Workflows for Publishing Data; Scientific Data's experience as an early adopter
EDI Training Module 12: An Introduction to Metadata and Data Repositories
Data Publishing and Institutional Repositories
Introduction to the Environmental Data Initiative (EDI)
Valen Metadata and the [Data] Repository
Research data management workshop april12 2016
Hawkins "Implementation of the CONSER Standard Record"
Research data management at TU Eindhoven
Managing your data paget
DataShare - Pauline Ward to University of Edinburgh School of Chemistry - 3 f...
DataONE Education Module 01: Why Data Management?
THOR Workshop - Data Publishing Elsevier
Best practices data collection
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Data Management for Postgraduate students by Lynn Woolfrey
Ad

Similar to The challenge of sharing data well, how publishers can help (20)

PPTX
Data peer review workshop
PPTX
FAIR for the future: embracing all things data
PPTX
Research data management workshop April 2016
PDF
Gaining credit for sharing research data
PPTX
Fsci 2018 thursday2_august_am6
PDF
Researh data management
PPT
Planning for Research Data Management: 26th January 2016
PDF
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
PDF
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
PPTX
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
PPTX
Rebecca Grant - Publishers and RDM
PPT
Managing data throughout the research lifecycle
PPTX
Transparency and reproducibility in research
PPTX
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
PDF
Enhance your rese​arch impact through open science
PPTX
Talk on Research Data Management
PDF
INSERM - Data Management & Reuse of Health Data - May 2017
PPTX
How and Why to Share Your Data
PDF
Planning for Research Data Management
PPTX
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Data peer review workshop
FAIR for the future: embracing all things data
Research data management workshop April 2016
Gaining credit for sharing research data
Fsci 2018 thursday2_august_am6
Researh data management
Planning for Research Data Management: 26th January 2016
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
Oxford DTP - Sansone - Data publications and Scientific Data - Dec 2014
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
Rebecca Grant - Publishers and RDM
Managing data throughout the research lifecycle
Transparency and reproducibility in research
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Enhance your rese​arch impact through open science
Talk on Research Data Management
INSERM - Data Management & Reuse of Health Data - May 2017
How and Why to Share Your Data
Planning for Research Data Management
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Ad

More from Varsha Khodiyar (19)

PDF
Digital transformation to enable a FAIR approach for health data science
PDF
Lessons from the UK: Data access, patient trust & real-world impact with heal...
PDF
COVID-19 variants, vaccines and tests
PDF
COVID-19 variants and vaccines
PDF
Data citation and sharing during article publication
PDF
The importance of research data repositories
PDF
What role can publishers play in the open data ecosystem?
PDF
Five essentials factors for unlocking the potential for Open Research Data
PPTX
New approaches to data management: supporting FAIR data sharing at Springer N...
PPTX
The value of data curation as part of the publishing process
PDF
Facilitating good research data management practice as part of scholarly publ...
PDF
Practical challenges for researchers in data sharing
PDF
Update from Data policy standardisation and implementation IG
PPTX
Clinical Data Publishing at Scientific Data
PPTX
Privacy and Publication: challenges and opportunities for clinical data
PPTX
Why should researchers care about data curation?
PPTX
Share & Flourish workshop, Leiden, August 2014
PPTX
Open science: your questions answered
PPTX
Open for science to support replication
Digital transformation to enable a FAIR approach for health data science
Lessons from the UK: Data access, patient trust & real-world impact with heal...
COVID-19 variants, vaccines and tests
COVID-19 variants and vaccines
Data citation and sharing during article publication
The importance of research data repositories
What role can publishers play in the open data ecosystem?
Five essentials factors for unlocking the potential for Open Research Data
New approaches to data management: supporting FAIR data sharing at Springer N...
The value of data curation as part of the publishing process
Facilitating good research data management practice as part of scholarly publ...
Practical challenges for researchers in data sharing
Update from Data policy standardisation and implementation IG
Clinical Data Publishing at Scientific Data
Privacy and Publication: challenges and opportunities for clinical data
Why should researchers care about data curation?
Share & Flourish workshop, Leiden, August 2014
Open science: your questions answered
Open for science to support replication

Recently uploaded (20)

PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
BIOMOLECULES PPT........................
PPTX
famous lake in india and its disturibution and importance
PPTX
Microbiology with diagram medical studies .pptx
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
HPLC-PPT.docx high performance liquid chromatography
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PPTX
2Systematics of Living Organisms t-.pptx
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
Placing the Near-Earth Object Impact Probability in Context
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPT
6.1 High Risk New Born. Padetric health ppt
PPT
protein biochemistry.ppt for university classes
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Classification Systems_TAXONOMY_SCIENCE8.pptx
BIOMOLECULES PPT........................
famous lake in india and its disturibution and importance
Microbiology with diagram medical studies .pptx
TOTAL hIP ARTHROPLASTY Presentation.pptx
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
HPLC-PPT.docx high performance liquid chromatography
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
2Systematics of Living Organisms t-.pptx
ECG_Course_Presentation د.محمد صقران ppt
Introduction to Cardiovascular system_structure and functions-1
Placing the Near-Earth Object Impact Probability in Context
POSITIONING IN OPERATION THEATRE ROOM.ppt
Phytochemical Investigation of Miliusa longipes.pdf
6.1 High Risk New Born. Padetric health ppt
protein biochemistry.ppt for university classes
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Biophysics 2.pdffffffffffffffffffffffffff
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...

The challenge of sharing data well, how publishers can help

  • 1. The challenge of sharing data well and how publishers can help Varsha Khodiyar, PhD DREAM Challenges and Epidemium@RECOMB workshop, Paris 19.04.2018
  • 2. 1 What are the challenges to sharing data? 12% 16% 20% 23% 28% Costs of sharing data Lack of time to deposit data Not knowing which repository to use Unsure about copyright and licensing Organising data in a presentable and useful wayTotal respondents: 7719 Stuart, D. et al. Practical challenges for researchers in data sharing. Springer Nature whitepaper (2018) https://doi. org/10.6084/m9.figshare.5975011
  • 3. 2 Scientific Data, a Nature Research journal nature.com/scientificdata
  • 4. 3 The Data Descriptor Sections • Title • Abstract • Background & Summary • Methods • Data Records • Technical Validation • Usage Notes • Figures & Tables • References • Data Citations • Detailed descriptions of the methods and technical analyses supporting the quality of the measurements. • Does not contain tests of new scientific hypotheses • Data must be archived in a repository at submission • Peer reviewers are asked to comment on the reusability of the work as presented
  • 5. 4 Data peer review nature.com/sdata/policies/for-referees Experimental Rigor and Technical Data Quality Were data produced in a sound manner? Technical quality of data – appropriate statistical analyses? Experimental rigor - appropriate depth, coverage? Completeness of the Description Sufficient detail to allow others to reproduce these steps? Sufficient detail to allow others to reuse this data? Consistent with relevant minimum reporting standards? Integrity of the Data Files and Repository Record Do data files appear complete and match manuscript descriptions? Are data archived to the most appropriate repository?
  • 6. 5 We capture information about the dataset being described in each Data Descriptor. During the metadata curation process • Manuscript re-read • Data archive checked • Minor issues with the data and/or manuscript often identified • Metadata captured in ISA-Tab format Metadata curation and final data checking
  • 7. 6 ISA-Tab metadata files scientificdata.isa-explorer.org Structured Summary table appears after abstract github.com/ScientificDataLabs/ISA-tab
  • 8. 7 Data Descriptors help researchers share their data in a reusable way 7 “The Data Descriptor made it easier to use the data, for me it was critical that everything was there…all the technical details like voxel size.” Professor Daniele Marinazzo
  • 9. 8 Helping researchers know where to share their data nature.com/sdata/data-policies/repositories Browse our recommended data repositories online. • We currently list more than 100 repositories, across biological, medical, physical and social sciences • When requested we provide guidance to authors on the best place to store their data
  • 10. 9 Helping researchers understand data licensing springernature.com/gp/authors/research-data-policy
  • 11. 10 Reuse of Challenge data published at Scientific Data
  • 12. 11 Other Challenge dataset publications at Scientific Data
  • 13. 12 How can we help Challenge participants? Visit nature.com/scientificdata Email scientificdata@nature.com Tweet @ScientificData

Editor's Notes

  • #3: Main findings: Researchers do share and use one another’s data but lack places to put it. They would value a high quality data publication
  • #6: We do not expect reviewers to open every file or reuse the data in it’s entirety.
  • #7: Often identify minor issues with the data, files, or archive as part of review process, e.g. Typos in data archive and/or manuscript tables File names differences between archive and manuscript Typos in accession IDs in manuscript Final data citation checking (e.g. changes to data DOI) Essentially facilitates an additional final check that the data and manuscript are as accurate as possible prior to final publication, so that data is maximally reusable. The curation process adds another layer of checks to the manuscript, ensuring that the published article and data archive are as accurate as possible, maximising data reusability.
  • #9: Daniele knew about the dataset prior to Chris’ paper being published, as Chris had shared this in Torrent Exchange.  However he did not access the data from this.  He saw on Twitter when the SciData paper was published and then read the paper. Daniele said “I would never have collected this data myself, as it’s not my primary field of work”. He said the Data Descriptor made it easier to use the data “for me it was critical that everything was there [in the Data Descriptor]…all the technical details like voxel size.”