SlideShare a Scribd company logo
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Challenges in the analysis of EEG –
how Open Source and Open Data can help
Robert OOSTENVELD
robert.oostenveld@donders.ru.nl
8 Sept 2024 - Chengdu, China
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Challenges in the analysis of EEG
EEG experiments are very flexible
EEG has very high temporal resolution
EEG is relatively affordable
EEG data is noisy
EEG data is highly multivariate
EEG datasets are large
Acquiring EEG is a lot of work
Many EEG analysis methods by different groups that aim for similar results
Not so many standardized pipelines
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Open Science = better science
Open access publications
Open peer review
Pre-registration
Open educational resources
Open methodology
Open software
Open hardware
Open data
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Open Science - improves quality, efficiency, inclusivity, reprodicibility, …
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
About me
Since about 20 years heading the Open Source FieldTrip toolbox.
From 2015 onward heading the Open Source EEGsynth artistic BCI toolbox.
From 2012 onward responsible for MEG in the Human Connectome Project
(HCP).
From 2014 onward involved in setting up the Donders and Radboud
institutional repositories, which now hold >450 public datasets.
From 2019-2023 member of the Brain Imaging Data Structure (BIDS)
steering group.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Why share your code as Open Source
Allows others to …
• re-use and replicate your findings.
• improve on your work and share their improvements back to you.
• to find bugs and errors in your work, so that you don’t continue in the wrong
direction.
Increases the reproducibility and impact of your work.
Reprodicibility is a major principle underpinning the scientific method.
Impact is what drives innovation and what you need to fund future research.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Sharing code can be simple … or complex
It can be as simple as adding your analysis code as appendix, supplementary
material to your paper, or sharing it online.
It can be as complex as setting up a long lasting project like FieldTrip or EEGLAB.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
If you have not shared code before…
You can look at others’ code that you are using.
What makes it useful for you, what is it that you are missing?
Version control
Documentation on the goals -> the aim
A simple example to get started
Documentation on the details -> the implementation
A way to get in touch (email, forum, issue tracking system)
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Learning from others - FieldTrip
FieldTrip is software that we wrote in the lab to work together
with a research team of about 5 people.
Not the aim to make a big project out of it, but just to get our own research done.
Others starting using it internally, guests researchers started using it, and they took it
home with them. Then others (whom we did not know) started using it …
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Learning from others - FieldTrip
We initially started with version control using CVS, then SVN, then git.
Using the latest/greatest tool in general is smart.
In 2011 we published a paper that describes the toolbox.
This is cited a lot, which counts as impact.
We already used it in in-house teaching, hence we initially shared the tutorials and example data.
Knowing how to make and maintain a website is an important skill!
Each piece of code that was meant to be used by others requires documentation. Write and discuss
the function “help” and reference documentation before writing the actual code.
Get others involved. Email discussion list, GitHub issues.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
FieldTrip also learned from others
Early collaboration with EEGLAB (Arno Delorme and Scott Makeig):
They knew how to deal with “users” to give training and provide documentation.
Early collaboration with SPM team (FIL, UCL):
They knew how to make code maintanable using version control, consistency,
dependencies.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Learning from others
Technical aspects of the code:
- Computer science and software engineering
Aspects of the documentation:
- Not very different from teaching material
- A “wiki” like system is easy to set up
- MarkDown documents are nowadays common
Aspects of community management and collaboration:
- Hard to learn, but you can look at others and copy
https://collections.plos.org/collection/ten-simple-rules/
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Managing an Open Source project
Find a convenient place to host your code, documentation and community.
GitHub is designed for “social coding” and working together, not only between
developers, but also with users.
There are alternatives for GitHub, like GitLab or the Chinese counterpart Gitee. Make
sure that people can find it!
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Open Data
Findable
Make your data available in a catalog or repository
with a persistent identifier (DOI, handle) and metadata
Accessible
Be explicit about data usage terms (agreement with downloader)
Interoperable
Make your data human and machine readable, e.g. BIDS
Reusable
Make sure you document enough details, e.g. “data descriptor” paper
that can be cited, along with citing our data -> measurable impact!
ilkinson et al. (2016) The FAIR Guiding Principles. Sci Data.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Making EEG data FAIR
The Brain Imaging Data Structure (BIDS) is a community initiative
to make data more FAIR and to improve data sharing
http://www.bids-standard.org/
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
What is BIDS?
BIDS is a way to organize your existing raw data
To improve consistent and complete documentation
To facilitate re-use by your future self and others
BIDS is not
A new file format
A search engine
A data sharing platform
http://www.bids-standard.org/
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Brain Imaging Data Structure (BIDS)
External reuse of data: publishing, sharing
Internal reuse of data: archiving, curation, collaborating
Fundamental for repositories like OpenNeuro, PublicNeuro, but also for our own
Donders Repository and as the basis for new analysis pipelines and workflow
development
https://openneuro.org
https://publicneuro.eu
https://data.donders.ru.nl
PublicNeuro.EU data.donders.ru.nl
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Brain Imaging Data Structure
http://www.bids-standard.org/
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Brain Imaging Data Structure
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
BIDS for EEG, MEG, iEEG, MRI, NIRS, PET, motion capture, …
Just a bunch of directories and files on disk.
No special software required (although tools are available).
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
BIDS for EEG, MEG, iEEG, MRI, NIRS, PET, motion capture, …
data/README
CHANGES
dataset_description.json
participants.tsv
/sub-01/anat/…
/sub-01/meg/…
/sub-01/eeg/sub-01_task-auditory_eeg.edf
/sub-01/eeg/sub-01_task-auditory_eeg.json
/sub-01/eeg/sub-01_task-auditory_channels.tsv
/sub-01/eeg/sub-01_task-auditory_events.tsv
/sub-01/eeg/sub-01_electrodes.tsv
/sub-01/eeg/sub-01_coordsystem.json
Actual EEG data
Directory structure
Metadata
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
BIDS “sidecar” files for metadata
see also https://github.com/bids-standard/bids-examples
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
BIDS “sidecar” files for metadata
see also https://github.com/bids-standard/bids-examples
1) represent otherwise missing data
2) make it easier to query/search
As example for EEG:
_participants.tsv and json
_sessions.tsv and json
_scans.tsv and json
_eeg.json
_channels.tsv and json
_electrodes.tsv and json
_coordsystem.json
_photos.jpg
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
+
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Putting code and data together
Research group A: dataset A + analysis pipeline A = result A
Research group B: dataset B + analysis pipeline B = result B
The published results are somewhat comparable, but neither group knows whether
their findings generalize or whether their data or methods are the best.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Putting code and data together – sharing data
Research group A: dataset A + analysis pipeline A = result AA
dataset B + analysis pipeline A = result BA
Research group B: dataset B + analysis pipeline B = result BB
dataset A + analysis pipeline B = result AB
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Putting code and data together – sharing data and code
Research group A: dataset A + analysis pipeline A = result AA
dataset B + analysis pipeline A = result BA
Research group B: dataset B + analysis pipeline B = result BB
dataset A + analysis pipeline B = result AB
Research group C: dataset A + analysis pipeline A = result AA
dataset A + analysis pipeline B = result AB
dataset B + analysis pipeline A = result BA
dataset B + analysis pipeline B = result BB
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Putting code and data together – BIDS apps
Large data analyses require well-structured data and efficient tools.
An “BIDS app” is a way of organizing your code to make it easy for others
to use it on their data.
MyBidsApp <inputdir> <outputdir> <participant|group> ...
The input directory must be formatted as a BIDS data structure.
The output directory can be formatted as a BIDS derivative data structure.
The application runs either over all participants, or results in agregate group results.
You can combine the two levels by calling it twice.
Gorgolewski et al. (2017) PLoS Comput Biol. https://doi.org/10.1371/journal.pcbi.1005209
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Putting code and data together – BIDS apps
BIDS apps are implemented as docker or singularity containers and can run on large
computational infrastructure, like HPC clusters, WeBrain, CBRAIN, NeuroDesk,
BrainLife, etc.
Since EEG datasets can be large, transferring and maintaning a copy of the data
(and results) is not always feasible.
BIDS apps allow you to bring the (standardized) computation
towards the (standardized) data.
Dong et al. (2021) WeBrain. NeuroImage
Hayashi et al. (2024) Brainlife. Nature Methods
Renton et al. (2024) NeuroDesk. Nature Methods
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Open Science - improves quality, efficiency, inclusivity, reprodicibility, …
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Challenges in the analysis of EEG
Sophisticated algorithms are very detailed and hard to (re)implement.
Sharing code makes reuse efficient and possible.
Open source makes it possible to collaborate, investigate and improve details.
Data is getting more complex, good quality data is hard and expensive to acquire.
Well-curated existing data is invaluable for reuse!
The BIDS standard ensures data to be FAIR, organized and documented.
Being able to write your own algorithms and acquiring your own data
are still important skills.
Open Source allows early career students to get a head-start.
Open Data allows early career students to start with their research straight away.
Robert OOSTENVELD – 8 Sept 2024 – Chengdu
Challenges in the analysis of EEG –
how Open Source and Open Data can help
Robert Oostenveld
robert.oostenveld@donders.ru.nl
8 Sept 2024 - Chengdu, China

More Related Content

PPTX
Donders neuroimage toolkit - open science and good practices
PPTX
CuttingEEG - Open Science, Open Data and BIDS for EEG
PPTX
Organizing EEG data using the Brain Imaging Data Structure
PPTX
Connecting GLIMR with the BIDS initiative
PPTX
Developing and sharing tools for bioelectromagnetic research
PPTX
The Brain Imaging Data Structure and its use for fNIRS
PPTX
HKU Data Curation MLIM7350 Class 8
PDF
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Donders neuroimage toolkit - open science and good practices
CuttingEEG - Open Science, Open Data and BIDS for EEG
Organizing EEG data using the Brain Imaging Data Structure
Connecting GLIMR with the BIDS initiative
Developing and sharing tools for bioelectromagnetic research
The Brain Imaging Data Structure and its use for fNIRS
HKU Data Curation MLIM7350 Class 8
Brain Imaging Data Structure and Center for Reproducible Neuroscince

Similar to Challenges in the analysis of EEG – How Open Source and Open Data can help (20)

PDF
Better neuroimaging data processing: driven by evidence, open communities, an...
PPTX
Using Open Science to advance science - advancing open data
PPTX
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
PPTX
2014 aus-agta
PPT
Aaas Data Intensive Science And Grid
PPT
eScience: A Transformed Scientific Method
PPTX
A practical guide to practicing open science
PPTX
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
PPTX
AIDR2019 - standards - tools - incentives - what does it take to enable data ...
PPTX
Data, Data Everywhere: What's A Publisher to Do?
PDF
Share and Reuse: how data sharing can take your research to the next level
PPTX
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
PPT
AMIA 2014
PPTX
Introduction to Big Data and its Potential for Dementia Research
PDF
Data sharing in neuroimaging: incentives, tools, and challenges
PPT
ImageJ and the SciJava software stack
PPTX
FAIR for the future: embracing all things data
PPTX
Managing Your Research Data
PPTX
Biomedical Data Sciences - New Name and New Opportunities for Change?
PPTX
Reproducible research: theory
Better neuroimaging data processing: driven by evidence, open communities, an...
Using Open Science to advance science - advancing open data
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
2014 aus-agta
Aaas Data Intensive Science And Grid
eScience: A Transformed Scientific Method
A practical guide to practicing open science
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
AIDR2019 - standards - tools - incentives - what does it take to enable data ...
Data, Data Everywhere: What's A Publisher to Do?
Share and Reuse: how data sharing can take your research to the next level
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
AMIA 2014
Introduction to Big Data and its Potential for Dementia Research
Data sharing in neuroimaging: incentives, tools, and challenges
ImageJ and the SciJava software stack
FAIR for the future: embracing all things data
Managing Your Research Data
Biomedical Data Sciences - New Name and New Opportunities for Change?
Reproducible research: theory
Ad

More from Robert Oostenveld (20)

PPTX
FieldTrip tutorial at WIRED20204 in Paris
PPTX
Spectral-, source-, connectivity- and network analysis of EEG and MEG data
PPTX
EEG, MEG and FieldTrip
PPTX
The Donders Repository
PPTX
OHBM 2020 OSR - Brain research data sharing and personal data privacy
PDF
BIOMAG2018 - Denis Engemann - MNE-HCP
PDF
BIOMAG2018 - Tzvetan Popov - HCP from a user's perspective
PDF
BIOMAG2018 - Vladimir Litvak - Frontiers
PDF
BIOMAG2018 - Jan-Mathijs Schoffelen - COBIDAS
PDF
BIOMAG2018 - Darren Price - CamCAN
PPTX
ChildBrain/Predictable summer school - Open Science
PPTX
Using Open Science to accelerate advancements in auditory EEG signal processing
PPTX
Donders Repository - removing barriers for management and sharing of research...
PPTX
RDM and the Donders Repository
PPTX
Donders Research Data Repository
PDF
Real-time EEG: timing and block size
PPTX
EEG signal background and real-time processing
PPTX
On the large scale of studying dynamics with MEG: Lessons learned from the Hu...
PPTX
Group analyses with FieldTrip
PPTX
Donders Institute - Research Data Management
FieldTrip tutorial at WIRED20204 in Paris
Spectral-, source-, connectivity- and network analysis of EEG and MEG data
EEG, MEG and FieldTrip
The Donders Repository
OHBM 2020 OSR - Brain research data sharing and personal data privacy
BIOMAG2018 - Denis Engemann - MNE-HCP
BIOMAG2018 - Tzvetan Popov - HCP from a user's perspective
BIOMAG2018 - Vladimir Litvak - Frontiers
BIOMAG2018 - Jan-Mathijs Schoffelen - COBIDAS
BIOMAG2018 - Darren Price - CamCAN
ChildBrain/Predictable summer school - Open Science
Using Open Science to accelerate advancements in auditory EEG signal processing
Donders Repository - removing barriers for management and sharing of research...
RDM and the Donders Repository
Donders Research Data Repository
Real-time EEG: timing and block size
EEG signal background and real-time processing
On the large scale of studying dynamics with MEG: Lessons learned from the Hu...
Group analyses with FieldTrip
Donders Institute - Research Data Management
Ad

Recently uploaded (20)

PPT
protein biochemistry.ppt for university classes
PPTX
BIOMOLECULES PPT........................
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PDF
Phytochemical Investigation of Miliusa longipes.pdf
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PDF
An interstellar mission to test astrophysical black holes
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
2. Earth - The Living Planet earth and life
PPTX
2Systematics of Living Organisms t-.pptx
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PDF
The scientific heritage No 166 (166) (2025)
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
famous lake in india and its disturibution and importance
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
protein biochemistry.ppt for university classes
BIOMOLECULES PPT........................
Comparative Structure of Integument in Vertebrates.pptx
Phytochemical Investigation of Miliusa longipes.pdf
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
An interstellar mission to test astrophysical black holes
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
HPLC-PPT.docx high performance liquid chromatography
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Cell Membrane: Structure, Composition & Functions
2. Earth - The Living Planet earth and life
2Systematics of Living Organisms t-.pptx
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
The scientific heritage No 166 (166) (2025)
2. Earth - The Living Planet Module 2ELS
Classification Systems_TAXONOMY_SCIENCE8.pptx
famous lake in india and its disturibution and importance
cpcsea ppt.pptxssssssssssssssjjdjdndndddd

Challenges in the analysis of EEG – How Open Source and Open Data can help

  • 1. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Challenges in the analysis of EEG – how Open Source and Open Data can help Robert OOSTENVELD robert.oostenveld@donders.ru.nl 8 Sept 2024 - Chengdu, China
  • 2. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Challenges in the analysis of EEG EEG experiments are very flexible EEG has very high temporal resolution EEG is relatively affordable EEG data is noisy EEG data is highly multivariate EEG datasets are large Acquiring EEG is a lot of work Many EEG analysis methods by different groups that aim for similar results Not so many standardized pipelines
  • 3. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Open Science = better science Open access publications Open peer review Pre-registration Open educational resources Open methodology Open software Open hardware Open data
  • 4. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Open Science - improves quality, efficiency, inclusivity, reprodicibility, …
  • 5. Robert OOSTENVELD – 8 Sept 2024 – Chengdu About me Since about 20 years heading the Open Source FieldTrip toolbox. From 2015 onward heading the Open Source EEGsynth artistic BCI toolbox. From 2012 onward responsible for MEG in the Human Connectome Project (HCP). From 2014 onward involved in setting up the Donders and Radboud institutional repositories, which now hold >450 public datasets. From 2019-2023 member of the Brain Imaging Data Structure (BIDS) steering group.
  • 6. Robert OOSTENVELD – 8 Sept 2024 – Chengdu
  • 7. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Why share your code as Open Source Allows others to … • re-use and replicate your findings. • improve on your work and share their improvements back to you. • to find bugs and errors in your work, so that you don’t continue in the wrong direction. Increases the reproducibility and impact of your work. Reprodicibility is a major principle underpinning the scientific method. Impact is what drives innovation and what you need to fund future research.
  • 8. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Sharing code can be simple … or complex It can be as simple as adding your analysis code as appendix, supplementary material to your paper, or sharing it online. It can be as complex as setting up a long lasting project like FieldTrip or EEGLAB.
  • 9. Robert OOSTENVELD – 8 Sept 2024 – Chengdu If you have not shared code before… You can look at others’ code that you are using. What makes it useful for you, what is it that you are missing? Version control Documentation on the goals -> the aim A simple example to get started Documentation on the details -> the implementation A way to get in touch (email, forum, issue tracking system)
  • 10. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Learning from others - FieldTrip FieldTrip is software that we wrote in the lab to work together with a research team of about 5 people. Not the aim to make a big project out of it, but just to get our own research done. Others starting using it internally, guests researchers started using it, and they took it home with them. Then others (whom we did not know) started using it …
  • 11. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Learning from others - FieldTrip We initially started with version control using CVS, then SVN, then git. Using the latest/greatest tool in general is smart. In 2011 we published a paper that describes the toolbox. This is cited a lot, which counts as impact. We already used it in in-house teaching, hence we initially shared the tutorials and example data. Knowing how to make and maintain a website is an important skill! Each piece of code that was meant to be used by others requires documentation. Write and discuss the function “help” and reference documentation before writing the actual code. Get others involved. Email discussion list, GitHub issues.
  • 12. Robert OOSTENVELD – 8 Sept 2024 – Chengdu FieldTrip also learned from others Early collaboration with EEGLAB (Arno Delorme and Scott Makeig): They knew how to deal with “users” to give training and provide documentation. Early collaboration with SPM team (FIL, UCL): They knew how to make code maintanable using version control, consistency, dependencies.
  • 13. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Learning from others Technical aspects of the code: - Computer science and software engineering Aspects of the documentation: - Not very different from teaching material - A “wiki” like system is easy to set up - MarkDown documents are nowadays common Aspects of community management and collaboration: - Hard to learn, but you can look at others and copy https://collections.plos.org/collection/ten-simple-rules/
  • 14. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Managing an Open Source project Find a convenient place to host your code, documentation and community. GitHub is designed for “social coding” and working together, not only between developers, but also with users. There are alternatives for GitHub, like GitLab or the Chinese counterpart Gitee. Make sure that people can find it!
  • 15. Robert OOSTENVELD – 8 Sept 2024 – Chengdu
  • 16. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Open Data Findable Make your data available in a catalog or repository with a persistent identifier (DOI, handle) and metadata Accessible Be explicit about data usage terms (agreement with downloader) Interoperable Make your data human and machine readable, e.g. BIDS Reusable Make sure you document enough details, e.g. “data descriptor” paper that can be cited, along with citing our data -> measurable impact! ilkinson et al. (2016) The FAIR Guiding Principles. Sci Data.
  • 17. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Making EEG data FAIR The Brain Imaging Data Structure (BIDS) is a community initiative to make data more FAIR and to improve data sharing http://www.bids-standard.org/
  • 18. Robert OOSTENVELD – 8 Sept 2024 – Chengdu What is BIDS? BIDS is a way to organize your existing raw data To improve consistent and complete documentation To facilitate re-use by your future self and others BIDS is not A new file format A search engine A data sharing platform http://www.bids-standard.org/
  • 19. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Brain Imaging Data Structure (BIDS) External reuse of data: publishing, sharing Internal reuse of data: archiving, curation, collaborating Fundamental for repositories like OpenNeuro, PublicNeuro, but also for our own Donders Repository and as the basis for new analysis pipelines and workflow development https://openneuro.org https://publicneuro.eu https://data.donders.ru.nl PublicNeuro.EU data.donders.ru.nl
  • 20. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Brain Imaging Data Structure http://www.bids-standard.org/
  • 21. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Brain Imaging Data Structure
  • 22. Robert OOSTENVELD – 8 Sept 2024 – Chengdu BIDS for EEG, MEG, iEEG, MRI, NIRS, PET, motion capture, … Just a bunch of directories and files on disk. No special software required (although tools are available).
  • 23. Robert OOSTENVELD – 8 Sept 2024 – Chengdu BIDS for EEG, MEG, iEEG, MRI, NIRS, PET, motion capture, … data/README CHANGES dataset_description.json participants.tsv /sub-01/anat/… /sub-01/meg/… /sub-01/eeg/sub-01_task-auditory_eeg.edf /sub-01/eeg/sub-01_task-auditory_eeg.json /sub-01/eeg/sub-01_task-auditory_channels.tsv /sub-01/eeg/sub-01_task-auditory_events.tsv /sub-01/eeg/sub-01_electrodes.tsv /sub-01/eeg/sub-01_coordsystem.json Actual EEG data Directory structure Metadata
  • 24. Robert OOSTENVELD – 8 Sept 2024 – Chengdu BIDS “sidecar” files for metadata see also https://github.com/bids-standard/bids-examples
  • 25. Robert OOSTENVELD – 8 Sept 2024 – Chengdu BIDS “sidecar” files for metadata see also https://github.com/bids-standard/bids-examples 1) represent otherwise missing data 2) make it easier to query/search As example for EEG: _participants.tsv and json _sessions.tsv and json _scans.tsv and json _eeg.json _channels.tsv and json _electrodes.tsv and json _coordsystem.json _photos.jpg
  • 26. Robert OOSTENVELD – 8 Sept 2024 – Chengdu +
  • 27. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Putting code and data together Research group A: dataset A + analysis pipeline A = result A Research group B: dataset B + analysis pipeline B = result B The published results are somewhat comparable, but neither group knows whether their findings generalize or whether their data or methods are the best.
  • 28. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Putting code and data together – sharing data Research group A: dataset A + analysis pipeline A = result AA dataset B + analysis pipeline A = result BA Research group B: dataset B + analysis pipeline B = result BB dataset A + analysis pipeline B = result AB
  • 29. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Putting code and data together – sharing data and code Research group A: dataset A + analysis pipeline A = result AA dataset B + analysis pipeline A = result BA Research group B: dataset B + analysis pipeline B = result BB dataset A + analysis pipeline B = result AB Research group C: dataset A + analysis pipeline A = result AA dataset A + analysis pipeline B = result AB dataset B + analysis pipeline A = result BA dataset B + analysis pipeline B = result BB
  • 30. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Putting code and data together – BIDS apps Large data analyses require well-structured data and efficient tools. An “BIDS app” is a way of organizing your code to make it easy for others to use it on their data. MyBidsApp <inputdir> <outputdir> <participant|group> ... The input directory must be formatted as a BIDS data structure. The output directory can be formatted as a BIDS derivative data structure. The application runs either over all participants, or results in agregate group results. You can combine the two levels by calling it twice. Gorgolewski et al. (2017) PLoS Comput Biol. https://doi.org/10.1371/journal.pcbi.1005209
  • 31. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Putting code and data together – BIDS apps BIDS apps are implemented as docker or singularity containers and can run on large computational infrastructure, like HPC clusters, WeBrain, CBRAIN, NeuroDesk, BrainLife, etc. Since EEG datasets can be large, transferring and maintaning a copy of the data (and results) is not always feasible. BIDS apps allow you to bring the (standardized) computation towards the (standardized) data. Dong et al. (2021) WeBrain. NeuroImage Hayashi et al. (2024) Brainlife. Nature Methods Renton et al. (2024) NeuroDesk. Nature Methods
  • 32. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Open Science - improves quality, efficiency, inclusivity, reprodicibility, …
  • 33. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Challenges in the analysis of EEG Sophisticated algorithms are very detailed and hard to (re)implement. Sharing code makes reuse efficient and possible. Open source makes it possible to collaborate, investigate and improve details. Data is getting more complex, good quality data is hard and expensive to acquire. Well-curated existing data is invaluable for reuse! The BIDS standard ensures data to be FAIR, organized and documented. Being able to write your own algorithms and acquiring your own data are still important skills. Open Source allows early career students to get a head-start. Open Data allows early career students to start with their research straight away.
  • 34. Robert OOSTENVELD – 8 Sept 2024 – Chengdu Challenges in the analysis of EEG – how Open Source and Open Data can help Robert Oostenveld robert.oostenveld@donders.ru.nl 8 Sept 2024 - Chengdu, China

Editor's Notes

  • #1: For those that do not know me, I am …
  • #5: Besides being a researcher on MEG and EEG data analysis and source reconstruction methods on which I have published more than 150 papers, …
  • #10: As an interesting note: Pedro Valdes-Sosa already downloaded the software in 2004, when it was just released to the public
  • #20: Initiated around 2015 in response to challenges encountered in the OpenFMRI (now OpenNeuro) data repository project
  • #22: For MRI imaging data the “eeg” directory is named “anat”, “func” or “dwi” and the data is stored in nifti files There can also be session (or visit) layer, in between the subject and the modality
  • #25: In the case of MRI imaging data the sidecars contain information about the scanner, protocols, and also the task
  • #34: For those that do not know me, I am …