Towards open and
reproducible neuroscience
in the age of big data
Chris Gorgolewski
@ChrisFiloG
http://bit.ly/2ifdaeX
Who discovered the structure of
the DNA?
Rosalind Franklin
and photograph 51
Hubble
telescope
legacy
https://www.scientificamerican.com/article/how-
old-observations-are-building-hubbles-legacy/
ImageNet
https://qz.com/1034972/the-data-that-
changed-the-direction-of-ai-research-and-
possibly-the-world/
Revisiting the Unreasonable
Effectiveness of Data
https://arxiv.org/abs/1707.02968
Maximizing the impact
Getting more data out there
Removing barriers
Neuroimaging data sharing
hierarchy
Poldrack and Gorgolewski, 2014
Towards open and reproducible neuroscience in the age of big data
NeuroVault stats
• 35,000 maps
• 1440 collections
• 61% public
• 21% linked
• Representing 86 different journals
• 296 mentions on Google Scholar
Ultimate consent form
• Inform participants about your intention to
share data
• Explain the benefits
• Discuss the risks
open-brain-consent.readthedocs.org
Incentive I - policies
Journal policies – case of NPG
Checklist with compulsory data
availability declaration
“Badges are stupid, but they work.” – Brian Nosek
Journal policies – case of PloS
“All data and related metadata
underlying the findings reported in a
submitted manuscript should be
deposited in an appropriate public
repository, unless already provided
as part of the submitted article”
Incentive II - credit
Gorgolewski, Milham, and Margulies,
• Neuroinformatics (Springer)
• GigaScience (BGI, BioMed Central)
• Scientific Data (Nature Publising Group)
• F1000Research (Faculty of 1000)
• Data in Brief (Elsevier)
• Journal of Open Psychology Data (Ubiquity
press)
• Frontiers
Where to publish data papers?
Incentive III - analysis
NeuroVault benefits - decoding
NeuroVault benefits –
similarity search
NeuroVault benefits –
similarity search
NeuroVault benefits –
Gene decoding
Gorgolewski KJ, Fox AS, Chang L et al. Tight
fitting genes: finding relations between
statistical maps and gene expression patterns.
F1000Posters 2014,5:1607 (poster)
NeuroVault benefits -
Gene decoding
a free online platform for sharing and
analysis of neuroimaging data
OpenNeuro.org - Poster #1677 28
Demo
OpenNeuro.org - Poster #1677 29
OpenNeuro - available Pipelines
• FMRIPREP
• MRIQC
• C-PAC
• FreeSurfer
• ndmg
• SPM
• BARACUS
• MAGeTBrain
Coming soon:
• QAP
• OPPNI
• automatic analysis
• TRACULA
• BROCCOLI
30
MRIQC – Quality control
for structural and functional images
MRIQC.ORG
Towards open and reproducible neuroscience in the age of big data
Towards open and reproducible neuroscience in the age of big data
OpenNeuro is free to use by anyone
under the agreement that the data will be made
publicly available after 18 months.
Reproducibility
Data snapshots + software containers
==
reproducibility
Maximizing the impact II
Making data easier to use
Brain Imaging Data Structure
bids.neuroimaging.io
OpenNeuro - Preprocessed data
Towards open and reproducible neuroscience in the age of big data
Towards open and reproducible neuroscience in the age of big data
Science in the age of
Open Data
Faster
Going from hypothesis to answer in a
couple of weeks
Cheaper
~$3 mln
cost of reacquiring data for each of the reuses of
OpenfMRI datasets (2017)
Higher quality
Wicherts JM, Bakker M, Molenaar D (2011) Willingness to Share Research
Data Is Related to the Strength of the Evidence and the Quality of Reporting of
Statistical Results. PLoS ONE 6(11): e26828. doi:
10.1371/journal.pone.0026828
More inclusive and competitive
Same datasets are available to all
researchers.
Conclusion
1. we need to get more data in the
hands of more researchers
2. building tools and platforms is crucial
to achieve this goal
Poldracklab
Know of any neuroinformatics faculty jobs? Get in touch! (asking for a friend ;)

More Related Content

PPTX
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
PPTX
A practical guide to practicing open science
PPTX
ML Researcher’s Guide to Open Brain Imaging Data
PPTX
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
PPTX
Modern tools for sharing and synthesizing neuroimaging results
PPTX
Reproducibility and replicability: a practical approach
PPTX
Containers in Science: neuroimaging use cases
PPTX
Reproducible research: theory
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
A practical guide to practicing open science
ML Researcher’s Guide to Open Brain Imaging Data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
Modern tools for sharing and synthesizing neuroimaging results
Reproducibility and replicability: a practical approach
Containers in Science: neuroimaging use cases
Reproducible research: theory

What's hot (20)

PPT
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
PPTX
2016 07 12_purdue_bigdatainomics_seandavis
PPTX
Laurie Goodman at #SSPBoston: Article+Data+Tools Reproducibility, Reuse, & Ra...
PDF
Data citation metrics : best practice to enable new metrics for research data
PDF
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
PPTX
Knowledge graph construction for research & medicine
PDF
BIOMAG2018 - Denis Engemann - MNE-HCP
PPTX
SEEKing our way to better presentation of data and models from scientific inv...
PDF
An Ontology-Driven Integration Framework for Smart Communities
PPTX
Being Reproducible: SSBSS Summer School 2017
PPTX
Upgrading the Scholarly Infrastructure
PDF
Reproducible Research and the Cloud
PPTX
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
PPTX
Machines are people too
PPTX
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
PDF
Executive Summary - Data Management Hub
PPTX
Networking Materials Data
PPT
Peer Review and Science2.0
PPTX
The Research Object Initiative: Frameworks and Use Cases
PPTX
Why should researchers care about data curation?
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
2016 07 12_purdue_bigdatainomics_seandavis
Laurie Goodman at #SSPBoston: Article+Data+Tools Reproducibility, Reuse, & Ra...
Data citation metrics : best practice to enable new metrics for research data
From Queries to Algorithms to Advanced ML: 3 Pharmaceutical Graph Use Cases
Knowledge graph construction for research & medicine
BIOMAG2018 - Denis Engemann - MNE-HCP
SEEKing our way to better presentation of data and models from scientific inv...
An Ontology-Driven Integration Framework for Smart Communities
Being Reproducible: SSBSS Summer School 2017
Upgrading the Scholarly Infrastructure
Reproducible Research and the Cloud
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
Machines are people too
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Executive Summary - Data Management Hub
Networking Materials Data
Peer Review and Science2.0
The Research Object Initiative: Frameworks and Use Cases
Why should researchers care about data curation?
Ad

Similar to Towards open and reproducible neuroscience in the age of big data (20)

PDF
Share and Reuse: how data sharing can take your research to the next level
PDF
Data sharing in neuroimaging: incentives, tools, and challenges
PPTX
Donders neuroimage toolkit - open science and good practices
PPTX
Towards open and reproducible neuroscience in the age of big data
PPTX
Using Open Science to advance science - advancing open data
PDF
NeuroVault and the vision for data sharing in neuroimaging
PPTX
CuttingEEG - Open Science, Open Data and BIDS for EEG
PPTX
Using Open Science to accelerate advancements in auditory EEG signal processing
PDF
Paul Allen Open Science
PDF
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
PPTX
Connecting GLIMR with the BIDS initiative
PPTX
International perspective for sharing publicly funded medical research data
PPTX
Open Science: Where Theory Meets Practice
PDF
Brain Imaging Data Structure and Center for Reproducible Neuroscince
PDF
Databases and Ontologies: Where do we go from here?
PPTX
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
PPTX
Will Biomedical Research Fundamentally Change in the Era of Big Data?
PPTX
How and Why to Share Your Data
PDF
The OpenCon Intro to Open Data
PPTX
Open repositories for neuroimaging research
Share and Reuse: how data sharing can take your research to the next level
Data sharing in neuroimaging: incentives, tools, and challenges
Donders neuroimage toolkit - open science and good practices
Towards open and reproducible neuroscience in the age of big data
Using Open Science to advance science - advancing open data
NeuroVault and the vision for data sharing in neuroimaging
CuttingEEG - Open Science, Open Data and BIDS for EEG
Using Open Science to accelerate advancements in auditory EEG signal processing
Paul Allen Open Science
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Connecting GLIMR with the BIDS initiative
International perspective for sharing publicly funded medical research data
Open Science: Where Theory Meets Practice
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Databases and Ontologies: Where do we go from here?
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
Will Biomedical Research Fundamentally Change in the Era of Big Data?
How and Why to Share Your Data
The OpenCon Intro to Open Data
Open repositories for neuroimaging research
Ad

More from Krzysztof Gorgolewski (12)

PPTX
Study pre-registration: Benefits and considerations
PPTX
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
PPTX
Evaluation of full brain parcellation schemes using the NeuroVault database o...
PPTX
Quality control for structural and functional MRI
PPTX
Software testing for scientists
PPTX
Docker for scientists
PDF
The Brain Imaging Data Structure (OHBM 2016)
PDF
Brain Imaging Data Structure
PDF
Meta analysis in neuroimaging 101
PDF
Making data sharing count
PDF
If you liked it you should've put a p-value on it ...or not
PPTX
Reusable Science: How not to slip from the shoulders of giants
Study pre-registration: Benefits and considerations
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
Evaluation of full brain parcellation schemes using the NeuroVault database o...
Quality control for structural and functional MRI
Software testing for scientists
Docker for scientists
The Brain Imaging Data Structure (OHBM 2016)
Brain Imaging Data Structure
Meta analysis in neuroimaging 101
Making data sharing count
If you liked it you should've put a p-value on it ...or not
Reusable Science: How not to slip from the shoulders of giants

Recently uploaded (20)

PDF
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
PDF
Cosmology using numerical relativity - what hapenned before big bang?
PPTX
Introduction to Immunology (Unit-1).pptx
PPT
Enhancing Laboratory Quality Through ISO 15189 Compliance
PDF
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
PPTX
2currentelectricity1-201006102815 (1).pptx
PDF
Chapter 3 - Human Development Poweroint presentation
PPTX
LIPID & AMINO ACID METABOLISM UNIT-III, B PHARM II SEMESTER
PPTX
Preformulation.pptx Preformulation studies-Including all parameter
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PPTX
gene cloning powerpoint for general biology 2
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
PDF
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PPTX
ELISA(Enzyme linked immunosorbent assay)
PPTX
limit test definition and all limit tests
PPTX
Platelet disorders - thrombocytopenia.pptx
PPTX
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
PDF
The Future of Telehealth: Engineering New Platforms for Care (www.kiu.ac.ug)
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
Cosmology using numerical relativity - what hapenned before big bang?
Introduction to Immunology (Unit-1).pptx
Enhancing Laboratory Quality Through ISO 15189 Compliance
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
2currentelectricity1-201006102815 (1).pptx
Chapter 3 - Human Development Poweroint presentation
LIPID & AMINO ACID METABOLISM UNIT-III, B PHARM II SEMESTER
Preformulation.pptx Preformulation studies-Including all parameter
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
gene cloning powerpoint for general biology 2
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
ELISA(Enzyme linked immunosorbent assay)
limit test definition and all limit tests
Platelet disorders - thrombocytopenia.pptx
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
The Future of Telehealth: Engineering New Platforms for Care (www.kiu.ac.ug)

Towards open and reproducible neuroscience in the age of big data