RDM and the Donders Repository
Robert Oostenveld & Eric Maris
r.oostenveld@donders.ru.nl e.maris@donders.ru.nl
Outline of the presentation
Research data management at the Donders
What is the organization of the institute?
What is the type of research data?
Data are organized in research projects
What is the data workflow?
What are the roles and responsibilities?
Collection types
How FAIR is this?
Demonstration
Reflection and questions
Current and future challenges
Considerations for data stewards of other institutes
The organization of the Donders Institute (DI)
The DI consists of four centres, representing three faculties and two governing boards, with strong links to other research institutes.
Research at the DI is organized along four research themes, involving around 80 principal investigators (PIs) and 500 researchers, who are scattered across the campus.
See http://www.ru.nl/donders/about/organization/
[Organization chart: RU, DI, and the four centres DCC, DCCN, DCMN and DCN; the formal chain of management runs down to principal investigators and researchers.]
Research data at the DI
Estimated 35 TB/year over 100 million files.
Heterogeneous with respect to content (different types of biological and behavioural data, analysis scripts, text, …) and file type (DICOM, Excel, MATLAB and Python scripts, Word, …). Note that there is relatively little tabular data.
Includes data from human volunteers (among them patients), some of which could potentially identify individuals (anatomical MRIs, video, ...).
Personal data (name, address, telephone number, ...) are not stored in the Donders Repository (DR).
Data are organized according to research project
Although the DR can store data of arbitrary volume, it was designed primarily for research projects at the scale of a single peer-reviewed publication.
Projects are identified and managed per centre, outside of the DR.
Projects must have a unique identifier that is used to link to the associated collections inside the DR.
Projects have a clear start and end point.
See also http://www.ru.nl/donders/research/data/rdm-nutshell/
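The link between a project identifier and its collections can be pictured as a simple lookup. The sketch below is only an illustration of that idea, not the DR's actual data model: the class name `CollectionRegistry`, the identifier `P-001` and the collection names are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List


class CollectionRegistry:
    """Hypothetical in-memory index from project identifier to collections."""

    def __init__(self) -> None:
        self._by_project: Dict[str, List[str]] = defaultdict(list)

    def register(self, project_id: str, collection_name: str) -> None:
        # Every collection must be linked to a project identifier that is
        # managed outside of the repository (assumption: a non-empty string).
        if not project_id:
            raise ValueError("a collection must be linked to a project identifier")
        self._by_project[project_id].append(collection_name)

    def collections_for(self, project_id: str) -> List[str]:
        # Return a copy so callers cannot modify the index by accident.
        return list(self._by_project.get(project_id, []))


registry = CollectionRegistry()
registry.register("P-001", "raw-data")      # hypothetical names
registry.register("P-001", "shared-data")
print(registry.collections_for("P-001"))
```

Keeping the project administration outside the registry mirrors the statement that projects are managed per centre and outside of the DR; the registry only stores the identifier.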
What is the data workflow?
The researcher …
1. Obtains approval for the project
2. Acquires data
   – Uploads a copy of the raw data
3. Analyzes data and writes a manuscript
   – Uploads the analysis scripts and research documentation
4. Publishes the manuscript
   – Uploads the to-be-shared data and receives a DOI
5. Publishes the manuscript
   – Adds the shared data DOI to the publication
What are the roles and responsibilities?
The centre director decides who is eligible as a collection manager.
The research administrator initiates a collection in the DR, associates the collection with a project in the centre administration, assigns a manager (typically the responsible PI of the project), and specifies the disk quota and embargo duration (when applicable).
The collection manager assigns other collection managers, collection contributors and viewers (typically postdocs, PhD students, Master's students, and external collaborators).
Collection managers and contributors provide a collection description (i.e. metadata such as title, abstract, keywords) and the collection content (i.e. the data).
A collection viewer can access the collection, but cannot change anything.
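The division of responsibilities between collection manager, contributor and viewer can be read as a small permission table. The following is a minimal sketch under that reading; the action names (`read`, `write_content`, `edit_description`, `assign_roles`) are hypothetical labels chosen for illustration and are not taken from the DR itself.

```python
from enum import Enum


class Role(Enum):
    MANAGER = "manager"
    CONTRIBUTOR = "contributor"
    VIEWER = "viewer"


# Hypothetical permission table derived from the role descriptions:
# managers assign roles, managers and contributors provide the description
# and the content, viewers can only access.
PERMISSIONS = {
    Role.MANAGER: {"read", "write_content", "edit_description", "assign_roles"},
    Role.CONTRIBUTOR: {"read", "write_content", "edit_description"},
    Role.VIEWER: {"read"},
}


def is_allowed(role: Role, action: str) -> bool:
    """Return True if the given collection role may perform the action."""
    return action in PERMISSIONS[role]


print(is_allowed(Role.VIEWER, "write_content"))
print(is_allowed(Role.MANAGER, "assign_roles"))
```

The centre director and research administrator are left out of the table because they act on the creation of a collection rather than within it.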
Collection types
– Data acquisition collection (DAC)
– Research documentation collection (RDC)
– Data sharing collection (DSC)
DAC and RDC are internal; a closed DSC is an external collection.
The difference between DAC and RDC reflects the research at the DI.
The specifics of collection types are not a core feature of the DR, but they are not easily configurable either.
Internal collections: access after strong authentication (e.g. employee credentials, thorough check) and authorisation by the collection manager.
External collections: access after weak authentication (e.g. social ID) and the signing of a data use agreement (DUA).
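The two access routes can be summarised as a single decision function. This is a sketch under stated assumptions, not the DR's business logic: it treats every DSC as external (a simplification), it assumes that strong authentication also satisfies the weak-authentication requirement, and the names `AccessRequest` and `may_access` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class AccessRequest:
    strong_auth: bool            # e.g. employee credentials
    weak_auth: bool              # e.g. social ID
    authorised_by_manager: bool  # explicit authorisation by the collection manager
    signed_dua: bool             # data use agreement signed


INTERNAL_TYPES = {"DAC", "RDC"}
EXTERNAL_TYPES = {"DSC"}  # assumption: every DSC is handled as external


def may_access(collection_type: str, request: AccessRequest) -> bool:
    """Decide whether a request satisfies the access rule for a collection type."""
    if collection_type in INTERNAL_TYPES:
        # Internal: strong authentication plus authorisation by the manager.
        return request.strong_auth and request.authorised_by_manager
    if collection_type in EXTERNAL_TYPES:
        # External: weak authentication suffices (assumption: strong also
        # counts), plus a signed data use agreement.
        authenticated = request.weak_auth or request.strong_auth
        return authenticated and request.signed_dua
    raise ValueError(f"unknown collection type: {collection_type}")


visitor = AccessRequest(strong_auth=False, weak_auth=True,
                        authorised_by_manager=False, signed_dua=True)
print(may_access("DSC", visitor))
print(may_access("DAC", visitor))
```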
How FAIR is this all?
See https://www.force11.org/group/fairgroup/fairprinciples
Findable
DACs and RDCs are internally findable; DSCs are externally findable and have a DOI (also indexed by RIS, DANS, Google).
Accessible
Authorization is linked to authentication and agreement with a DUA (for external collections), or explicit authorization by the manager (for internal collections).
Interoperable
Limited use of formal ontologies (in metadata) and no such requirements on the data, since these are highly discipline specific.
Reusable
OK with respect to the data usage license, but all other aspects are highly discipline specific.
Demonstration
Web interface for managing collection properties and roles (metadata): please go to http://data.donders.ru.nl/ and sign up
File browser interface for managing collection content (data): please go to https://cyberduck.io and install
Online training and documentation (help): please go here for configuration instructions
See http://data.donders.ru.nl
Current and future challenges
The DI-RDM pilot project has reached a state at which we can say that it is successful, but it is not yet completely finalized.
Some topics that are not fully resolved:
– Governance of both technical and procedural aspects
– Long-term sustainable financial model
– Training of researchers
– Procedural compliance
– How to deal with (future) changes in
  – the organizational structure
  – the data workflow
Considerations for data stewards of other institutes
The DR is meant as an archive system, not as network storage. Access to the data requires separate software (at this moment).
The DR website is configurable with respect to a number of DI-specific elements, such as textual information about the research institute, logo, privacy policy, controlled vocabularies, DUAs, ….
The DR has some flexibility in organizational structure, but assumes it to be hierarchical (e.g. in our case /RU/Institute/Centre).
The DR allows for some flexibility in mapping the research workflow onto different collection types with their own requirements, but this requires ISC software development.
The DR business logic with respect to authorization (with distinct responsibilities for centre director, research admin, collection manager, contributor, and viewer) is at the heart of the system, and offers very little flexibility.
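The assumed hierarchy /RU/Institute/Centre can be illustrated by splitting such a path into its three levels. This is a minimal sketch, assuming exactly three levels separated by slashes; the function `parse_org_path` and the type `OrgUnit` are hypothetical names, and the example path uses the abbreviations RU, DI and DCCN from the organization slide.

```python
from typing import NamedTuple


class OrgUnit(NamedTuple):
    organisation: str
    institute: str
    centre: str


def parse_org_path(path: str) -> OrgUnit:
    """Split a path such as /RU/DI/DCCN into its three hierarchical levels."""
    parts = [part for part in path.split("/") if part]
    if len(parts) != 3:
        # Assumption: the hierarchy always has exactly three levels.
        raise ValueError(f"expected /organisation/institute/centre, got: {path!r}")
    return OrgUnit(*parts)


unit = parse_org_path("/RU/DI/DCCN")
print(unit.organisation, unit.institute, unit.centre)
```

An institute with a flatter or deeper structure would not fit this three-level form, which is the kind of mismatch the consideration above points at.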
Questions for data stewards of other institutes
Are projects well-defined in your institute? Specifically, does a project have a unique identifier in your institute’s administration?
Is it clear what the life span of a project is?
Is it clear who should be responsible for a given project, also in the long run?
Is it clear who should contribute to a project, without being responsible for the project as a whole?
Is it clear which research data should be managed (preserved, documented, and shared)?
What are the legal constraints on the management of the data?
See also formal protocols on http://www.ru.nl/donders/research/data/reference-documents/
and other presentations at http://www.ru.nl/donders/research/data/presentations/
www.ru.nl/donders
www.ru.nl/donders/research/data
data.donders.ru.nl

Editor's Notes

  • #11: Register at the start.
    Part 1: agree to a DUA -> get added as viewer; in the collection details click on the WebDAV link -> open in new tab; fill in username and OTP and browse.
    Part 2: I create a DAC with Eric as initial manager; Eric logs in on the Firefox browser; Eric makes the participants contributors; then they click around a bit and change some attributes.
    Part 3: participants install Cyberduck, configure Cyberduck, connect, and add something small to their collection with Eric.
    What we do not show: stager, configurable content, emails, user management by the research admin, documentation (limited).