SlideShare a Scribd company logo
Brain Imaging
Data Structure
CHRIS	GORGOLEWSKI
STANFORD	UNIVERSITY
Getting lost in your data
Getting lost in your data
• MRI has been used to study
the human brain for over 20
years.
• Despite similarities in
experimental designs and data
types each researcher tends to
organize and describe their
data in their own way.
http://www.nature.com/news/brain-imaging -fmri-2-0-1.10365
Getting lost in your data
Heterogeneity in data description practices causes:
• problems in sharing data (even within the same lab),
• unnecessary manual metadata input when running
processing pipelines,
• no way to automatically validate completeness of a
given dataset,
• difficulties in combining data from multi-center studies.
Brain Imaging Data Structure
Brain Imaging Data Structure (BIDS) is a new
way for standardizing, describing and
organizing results of a human neuroimaging
experiment.
Who is it for?
1. Lab PIs. It will make handing over one dataset from one
student/postdocto another easy.
2. Workflow developers. It’s easier to write pipelines expecting
a particular file organization.
3. Database curators. Accepting one dataset format will make
curation easier.
Principles behind BIDS
1. Adoption is crucial.
2. Don’t reinvent the wheel.
3. Some meta data is better than no metadata
4. Don’t rely on external software (databases) or
complicated file formats (RDF).
5. Aim to capture 80% of experiments but give the
remaining 20% space to extend the standard.
Implementation
1. Some metadata is encoded in the folder structure.
2. Some metadata is replicated in the file name for simplicity.
3. Use of tab separated files for tabular data.
4. Use of NIFTI files for imaging data.
5. Use of JSON files for dictionary type metadata.
6. Use of legacy text file formats for b vectors/values and
physiological data.
7. Make certain folder hierarchy levels optional for simplicity.
8. Allows for arbitrary files not covered by the spec to be
included in any way the researchers deem appropriate.
Why TSV?
1. Simple text format with wide software support.
2. Strings with commas do not need to be escaped by
quotation marks.
Why NiFTI?
Pros:
1. Widest support from software packages.
2. Designed for neuroimaging.
Cons:
1. Poor metadata support.
2. Memory mapped random access to compressed
NifTI is hard to implement.
Why JSON?
1. Simple text (you can use notepad to edit).
2. Wide support from different programming languages.
3. Simpler than XML, but almost as powerful.
4. Extensible with linked data.
BIDS features
1. Handles multiple sessions and runs
2. Supports sparse acquisition (via slice timing)
3. Supports contiguous acquisition covariates (breathing, cardiac
etc.)
4. Supports multiple field map formats
5. Supports multiple types of anatomical scans
6. Supports function MRI: both task based and resting state.
7. Supports diffusions data (together with corresponding bvec, bval
files)
8. Supports behavioral variables on the level of subjects
(demographics), sessions, and runs.
Folder organization (simplified)
sub-control01/
anat/
sub-control01_T1w.nii.gz
sub-control01_T1w.json
sub-control01_T2w.nii.gz
sub-control01_T2w.json
func/
sub-control01_task-
nback_bold.nii.gz
sub-control01_task-nback_bold.json
sub-control01_task-nback_events.tsv
sub-control01_task-nback_cont-
physio.tsv
sub-control01_task-nback_cont-
physio.json
sub-control01_task-
nback_sbref.nii.gz
dwi/
sub-control01_dwi.nii.gz
sub-control01_dwi.bval
sub-control01_dwi.bvec
fmap
sub-control01_phasediff.nii.gz
sub-control01_phasediff.json
sub-control01_magnitude1.nii.gz
sub-control01_scans.tsv
participants.tsv
dataset_description.json
README
CHANGES
Example events file
onset duration trial_type ResponseTime
1.2 0.6 go 1.435
5.6 0.6 stop 1.739
…
Example metadata file
{
"RepetitionTime": 3.0,
"EchoTime": 0.0003,
"FlipAngle": 78,
"SliceTiming": [0.0, 0.2, 0.4, 0.6, 0.8, 1.0,
1.2, 1.4, 1.6, 1.8, 2.0, 2.2, 2.4, 2.6, 2.8],
"MultibandAccellerationFactor": 4,
"ParallelReductionFactorInPlane": 2
}
Example demographics file
participant_id age sex
sub-001 34 M
Sub-002 12 F
Sub-003 33 F
Keys to success
1. Make the community involved in the design process.
2. Provide a good validation tool (browser based!).
3. Build tools/workflows/pipelines that make adopting BIDS
worthwhile (AA, Nipype, C-PAC etc.)
4. Get support from databases (LORIS, COINS, SciTran,
OpenfMRI, XNAT, etc.)
Existing tools
1. bids-validator:https://github.com/INCF/bids-validator (demo)
2. openfmri2bids:https://github.com/INCF/openfmri2bids
3. bidsutils: https://github.com/INCF/bidsutils
4. dcm2niix: https://github.com/neurolabusc/dcm2niix
5. dicm2nii:
http://www.mathworks.com/matlabcentral/fileexchange/42997-
dicom-to-nifti-converter--nifti-tool-and-viewer
6. Quality Assessment Protocol: http://preprocessed-connectomes-
project.github.io/quality-assessment-protocol
7. SciTran: https://scitran.github.io
Upcoming tools
1. OpenfMRI (internal format)
2. XNAT (import)
3. COINS (export)
4. heudiconv (conversion)
5. LORIS (import)
6. C-PAC (import)
7. NIAK (import)
8. Nipype (import)
Why	do	I	care
Data sharing drives
progress
Data sharing drives
progress
$878,400
how much it would cost to perform
studies using OpenfMRI data if it did not exist
Convincing people to share
data is hard
1. Publication as an incentive (data papers – Gorgolewski et al.
2013)
2. Sharing only statistical derivatives (NeuroVault – Gorgolewski
et al. 2014)
Poldrack and Gorgolewski, 2014
Convincing people to share
data is hard
1. Publication as an incentive (data papers – Gorgolewski et al.
2013)
2. Sharing only statistical derivatives (NeuroVault – Gorgolewski
et al. 2014)
3. Journal policies (see PloS One, F1000Research Scientific Data)
Data sharing fears
1. Fear of being scooped
2. Fear of someone finding a mistake
3. Misconceptions about the ownership of the
data
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Stanford | Center for Reproducible
Neurscience
Analyzing for reproducibility
reproducibility.stanford.edu
• Automated quality control reporting
• Data analysis service
• Using cutting edge, robust and well tested methods
• Leveraging supercomputer power not accessible to
most labs
• Quantify reproducibility by out of sample prediction
estimates
• “Glass box” – in depth documentation describing all data
analysis steps
Stanford | Center for Reproducible
Neurscience
Analyzing for reproducibility
reproducibility.stanford.edu
• The service is completely free of charge
• Under one condition: the data will be publicly available
after a grace period
Stanford | Center for Reproducible
Neurscience
Analyzing for reproducibility
reproducibility.stanford.edu
• CRN will:
• Make more data publicly available
• Improve access to best methods and algorithms
(including yours!)
• Enable automatic data exploration and hypothesis
generation
• Foster the culture of looking at out of sample
predictions and effect sizes
Acknowledgments
The Poldrack Lab @ Stanford
Data Sharing Task Force
bids.neuroimaging.io

More Related Content

PDF
Brain Imaging Data Structure
PDF
Share and Reuse: how data sharing can take your research to the next level
PPTX
Towards open and reproducible neuroscience in the age of big data
PDF
A basic course on Research data management, part 1: what and why
PPTX
The habits of highly successful data:
PPTX
Ten habits of highly effective data
PDF
A basic course on Research data management: part 1 - part 4
PPTX
Ten Habits of Highly Successful Data
Brain Imaging Data Structure
Share and Reuse: how data sharing can take your research to the next level
Towards open and reproducible neuroscience in the age of big data
A basic course on Research data management, part 1: what and why
The habits of highly successful data:
Ten habits of highly effective data
A basic course on Research data management: part 1 - part 4
Ten Habits of Highly Successful Data

What's hot (20)

PDF
A basic course on Research data management, part 4: caring for your data, or ...
PPTX
The Donders Repository
PPTX
Introduction to Data Management
PDF
Reproducible research: First steps.
PPT
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
PPTX
Research Data Management and Librarians
PPTX
The Brain Imaging Data Structure and its use for fNIRS
PPTX
Donders neuroimage toolkit - open science and good practices
PPTX
Using Open Science to advance science - advancing open data
PDF
Data management (1)
PPTX
Donders Repository - removing barriers for management and sharing of research...
PPTX
METRO RDM Webinar
PDF
The State of Open Research Data
PPT
Smit-Scrap supplementary material-nfdp13
PDF
BIOMAG2018 - Denis Engemann - MNE-HCP
PPTX
Organizing EEG data using the Brain Imaging Data Structure
PPTX
Modern tools for sharing and synthesizing neuroimaging results
PPTX
DataONE Education Module 07: Metadata
PDF
Research data management: course 0HV90, Behavioral Research Methods
PPTX
Developing a Research Case Study
A basic course on Research data management, part 4: caring for your data, or ...
The Donders Repository
Introduction to Data Management
Reproducible research: First steps.
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
Research Data Management and Librarians
The Brain Imaging Data Structure and its use for fNIRS
Donders neuroimage toolkit - open science and good practices
Using Open Science to advance science - advancing open data
Data management (1)
Donders Repository - removing barriers for management and sharing of research...
METRO RDM Webinar
The State of Open Research Data
Smit-Scrap supplementary material-nfdp13
BIOMAG2018 - Denis Engemann - MNE-HCP
Organizing EEG data using the Brain Imaging Data Structure
Modern tools for sharing and synthesizing neuroimaging results
DataONE Education Module 07: Metadata
Research data management: course 0HV90, Behavioral Research Methods
Developing a Research Case Study
Ad

Viewers also liked (6)

PDF
The Brain Imaging Data Structure (OHBM 2016)
PDF
XNAT Tuning & Monitoring
PPTX
Docker for scientists
PPTX
Software testing for scientists
PPTX
Evaluation of full brain parcellation schemes using the NeuroVault database o...
PPTX
Study pre-registration: Benefits and considerations
The Brain Imaging Data Structure (OHBM 2016)
XNAT Tuning & Monitoring
Docker for scientists
Software testing for scientists
Evaluation of full brain parcellation schemes using the NeuroVault database o...
Study pre-registration: Benefits and considerations
Ad

Similar to Brain Imaging Data Structure and Center for Reproducible Neuroscince (20)

PPTX
Connecting GLIMR with the BIDS initiative
PDF
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
PDF
Data sharing in neuroimaging: incentives, tools, and challenges
PDF
NeuroVault and the vision for data sharing in neuroimaging
PPTX
Open repositories for neuroimaging research
PPTX
CuttingEEG - Open Science, Open Data and BIDS for EEG
PPTX
The real world of ontologies and phenotype representation: perspectives from...
PPTX
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
PPTX
Data-knowledge transition zones within the biomedical research ecosystem
PDF
Bid scoin ohbm2019-poster
PPTX
Big data from small data: A deep survey of the neuroscience landscape data via
PDF
Databases and Ontologies: Where do we go from here?
PPTX
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
PPTX
Neuroscience as networked science
PPTX
Using Open Science to accelerate advancements in auditory EEG signal processing
PPTX
The Neuroscience Information Framework:The present and future of neuroscience...
PPTX
Challenges in the analysis of EEG – How Open Source and Open Data can help
PPTX
The possibility and probability of a global Neuroscience Information Framework
PDF
Introduction to Neuroimaging Informatics
PDF
Shawna - Poster Edit 2
Connecting GLIMR with the BIDS initiative
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Data sharing in neuroimaging: incentives, tools, and challenges
NeuroVault and the vision for data sharing in neuroimaging
Open repositories for neuroimaging research
CuttingEEG - Open Science, Open Data and BIDS for EEG
The real world of ontologies and phenotype representation: perspectives from...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Data-knowledge transition zones within the biomedical research ecosystem
Bid scoin ohbm2019-poster
Big data from small data: A deep survey of the neuroscience landscape data via
Databases and Ontologies: Where do we go from here?
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
Neuroscience as networked science
Using Open Science to accelerate advancements in auditory EEG signal processing
The Neuroscience Information Framework:The present and future of neuroscience...
Challenges in the analysis of EEG – How Open Source and Open Data can help
The possibility and probability of a global Neuroscience Information Framework
Introduction to Neuroimaging Informatics
Shawna - Poster Edit 2

More from Krzysztof Gorgolewski (12)

PPTX
Reproducibility and replicability: a practical approach
PPTX
ML Researcher’s Guide to Open Brain Imaging Data
PPTX
A practical guide to practicing open science
PPTX
Towards open and reproducible neuroscience in the age of big data
PPTX
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
PPTX
Containers in Science: neuroimaging use cases
PPTX
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
PPTX
Quality control for structural and functional MRI
PDF
Meta analysis in neuroimaging 101
PDF
Making data sharing count
PDF
If you liked it you should've put a p-value on it ...or not
PPTX
Reusable Science: How not to slip from the shoulders of giants
Reproducibility and replicability: a practical approach
ML Researcher’s Guide to Open Brain Imaging Data
A practical guide to practicing open science
Towards open and reproducible neuroscience in the age of big data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
Containers in Science: neuroimaging use cases
FMRIPREP - robust and easy to use fMRI preprocessing pipeline
Quality control for structural and functional MRI
Meta analysis in neuroimaging 101
Making data sharing count
If you liked it you should've put a p-value on it ...or not
Reusable Science: How not to slip from the shoulders of giants

Recently uploaded (20)

PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
Placing the Near-Earth Object Impact Probability in Context
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
2. Earth - The Living Planet earth and life
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
An interstellar mission to test astrophysical black holes
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PDF
Sciences of Europe No 170 (2025)
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Introduction to Cardiovascular system_structure and functions-1
Placing the Near-Earth Object Impact Probability in Context
Comparative Structure of Integument in Vertebrates.pptx
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
2. Earth - The Living Planet earth and life
Biophysics 2.pdffffffffffffffffffffffffff
An interstellar mission to test astrophysical black holes
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
AlphaEarth Foundations and the Satellite Embedding dataset
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Classification Systems_TAXONOMY_SCIENCE8.pptx
Sciences of Europe No 170 (2025)
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
Cell Membrane: Structure, Composition & Functions
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS

Brain Imaging Data Structure and Center for Reproducible Neuroscince