3TU.Datacentrum 
OpenML Workshop (III) @ Eindhoven 
TU/e, 22-10-2014 
l.osinski@tue.nl, TU/e IEC/Library 
Available under CC BY license, which permits 
unrestricted use, distribution, and reproduction in 
any medium, provided the original author and 
source are credited
Sharing research data
Why?
- It's expected by research funders, journals, professional organizations and research evaluators
- Because of scientific integrity: reproducibility of results
- Because of re-use of results: data-driven science
- You benefit from it: it increases your visibility and enhances the trustworthiness of your research
How?
- On request
- Personal website
- Publishing / archiving in a repository
International Open Access Week
Re-using research data
To be re-used, data should be
- Findable: DOI; metadata (to allow discovery)
- Accessible: not necessarily open access; licenses to use; accessible to humans and machines
- Intelligible, assessable: metadata (to allow understanding)
- Interoperable: can be combined across multiple sources
- Preserved: long-term availability
Source: Research Data Netherlands / 
Marina Noordegraaf
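The "Findable" and "accessible to humans and machines" points can be illustrated with a small sketch. It turns a DOI, in any of the common written forms, into a resolver URL for humans and prepares (without sending) a request that asks for machine-readable metadata. The DOI used is the Reinhart-Rogoff paper from the URL list; the function names, the use of the `https://doi.org/` resolver and the CSL JSON media type are assumptions made for illustration and were not part of the talk.

```python
import urllib.request

DOI_RESOLVER = "https://doi.org/"
# Written forms a DOI commonly appears in (assumed list, extend as needed).
KNOWN_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(value):
    """Strip a resolver URL or 'doi:' label and return the bare DOI."""
    doi = value.strip()
    lowered = doi.lower()
    for prefix in KNOWN_PREFIXES:
        if lowered.startswith(prefix):
            doi = doi[len(prefix):]
            break
    # Every DOI starts with "10." and has a prefix/suffix separator.
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"not a DOI: {value!r}")
    return doi


def landing_page_url(value):
    """URL a human can follow to reach the object's landing page."""
    return DOI_RESOLVER + normalize_doi(value)


def metadata_request(value):
    """Build, but do not send, a request for machine-readable metadata."""
    return urllib.request.Request(
        landing_page_url(value),
        headers={"Accept": "application/vnd.citationstyles.csl+json"},
    )


print(landing_page_url("http://dx.doi.org/10.1257/aer.100.2.573"))
```

Passing the result of `metadata_request` to `urllib.request.urlopen` would perform the actual lookup; that step is left out here so the sketch runs without network access.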
3TU.Datacentrum #1
- Findability + citability: 3TU.DC assigns DOIs; discovery metadata are mandatory; data sets are indexed by DataCite, Google and the Data Citation Index
- Accessibility: 3TU.DC = open access; embargoes (6 months) are allowed
- Intelligible, assessable, interoperable: up to the researcher
- Preservation: 3TU.DC holds the Data Seal of Approval quality mark
3TU.Datacentrum #2
- File format support levels
- Self-upload of simple data sets (≤ 4 GB)
- Tailor-made solutions
- Upload and download statistics
- Collections of data sets
DOIs and OpenML #1
- DataCite Netherlands: assigns and distributes DOIs on behalf of DataCite to research organizations and data centers in NL
- Organizations can register DOIs for their objects by applying for an account at DataCite Netherlands
+ Objects need to be persistent and available long-term
+ Objects are preferably open access; restricted access is allowed
+ Objects should be citable (metadata added)
+ Objects must have a public landing page
DOIs and OpenML #2
- Organizations must ensure the maintenance and supply of metadata
- A contract is signed to ensure the points above; after that, the organization receives its own DOI prefix
- Costs: € 1000 (one-off, subject to change)
- Creating DOIs: manually via web forms, or by uploading XML resource files
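As a rough illustration of the "uploading XML resource files" route, the sketch below builds a minimal DataCite-style metadata record with Python's standard library. It is only a sketch under stated assumptions: the kernel-4 schema namespace, the choice of fields, the function name and every value (including the DOI prefix `10.9999`) are hypothetical and were not given in the talk. The schema version and the required fields should be checked with DataCite Netherlands before anything is uploaded.

```python
import xml.etree.ElementTree as ET

# Assumption: DataCite Metadata Schema kernel-4 namespace.
DATACITE_NS = "http://datacite.org/schema/kernel-4"


def _tag(name):
    """Qualify an element name with the DataCite namespace."""
    return f"{{{DATACITE_NS}}}{name}"


def build_datacite_record(doi, creators, title, publisher, year):
    """Return a minimal DataCite-style XML record as a string."""
    # Serialize with DataCite as the default namespace (no prefix).
    ET.register_namespace("", DATACITE_NS)
    resource = ET.Element(_tag("resource"))

    identifier = ET.SubElement(resource, _tag("identifier"))
    identifier.set("identifierType", "DOI")
    identifier.text = doi

    creators_el = ET.SubElement(resource, _tag("creators"))
    for name in creators:
        creator = ET.SubElement(creators_el, _tag("creator"))
        ET.SubElement(creator, _tag("creatorName")).text = name

    titles = ET.SubElement(resource, _tag("titles"))
    ET.SubElement(titles, _tag("title")).text = title

    ET.SubElement(resource, _tag("publisher")).text = publisher
    ET.SubElement(resource, _tag("publicationYear")).text = str(year)

    resource_type = ET.SubElement(resource, _tag("resourceType"))
    resource_type.set("resourceTypeGeneral", "Dataset")
    resource_type.text = "Dataset"

    return ET.tostring(resource, encoding="unicode")


# Hypothetical values, for illustration only.
record_xml = build_datacite_record(
    doi="10.9999/openml.example.1",
    creators=["Doe, Jane"],
    title="Example data set",
    publisher="OpenML",
    year=2014,
)
print(record_xml)
```

Generating records like this suits a platform with many objects, where filling in a web form per DOI would not scale.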
URLs of mentioned webpages
(in order of appearance)
1. OpenML Workshop (III) @ Eindhoven: http://eindhoven2014.openml.org/
2. Website IEC/Library [TU/e]: http://w3.tue.nl/nl/diensten/bib/
3. Data on request (Reinhart-Rogoff paper): http://dx.doi.org/10.1257/aer.100.2.573
4. Data on personal website (Thomas Piketty): http://piketty.pse.ens.fr/en/capital21c2
5. Publishing data (3TU.Datacentrum): http://data.3tu.org
6. International Open Access Week: http://www.openaccessweek.org
7. DataCite metadata search: http://search.datacite.org/ui
8. Data Citation Index (Thomson Reuters): http://wokinfo.com/products_tools/multidisciplinary/dci/
9. Data Seal of Approval: http://www.datasealofapproval.org
10. File format support levels: http://datacentrum.3tu.nl/fileadmin/editor_upload/File_formats/Digital_Preservation_Support_levels.pdf
11. DataCite Netherlands: http://datacite.tudelft.nl/info/home/

3TU.Datacentrum: presentation for OpenML Workshop (III) at Eindhoven, 22-10-2014 / Leon Osinski


Editor's Notes

  • #2: Introducing myself and IEC/Library
  • #3: Open Access Week. Why share data?
    - Because others may ask for the data underlying a published paper in order to verify or replicate your results (scientific integrity)
    - Because a journal, funder or code of conduct demands that data be accessible
    - Because data are unique and/or valuable (non-repeatable observations)
    - Because data are an asset, worth sharing so that others can reuse or build on them
    UPSIDE: Uniform Principle of Sharing Integral Data and Materials Expeditiously
  • #4: Findable + citable. Accessibility doesn't necessarily mean open access.
    - Findable: easy to find by both humans and computers, based on mandatory metadata that allow researchers to track and trace interesting datasets
    - Accessible: stored long term so that they can easily be accessed and/or downloaded under well-defined license and access conditions (open access when possible), whether at the level of the metadata or at the level of the actual data content
    - Interoperable: ready to be combined (across multiple sources) by humans as well as computers
    - Re-usable: ready to be used for future research and to be processed further using computational methods
    Different levels of accessibility: not accessible; available on request; made available on a personal website; published with a DOI; accessible by machines
  • #5: Costs: 3500 / 4500 euro per TB per 20 years
  • #6: Costs: 3500 / 4500 euro per TB per 20 years
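
To put the cost figures in the notes in perspective, the arithmetic below converts them into a cost per GB per year and into the cost of one data set at the 4 GB self-upload limit. This is a back-of-the-envelope sketch: it assumes that costs scale linearly with size and that 1 TB = 1000 GB, neither of which is stated in the notes, and it simply treats the two quoted figures as a lower and an upper rate.

```python
# Figures from the notes: 3500 or 4500 euro per TB for 20 years of storage.
RATES_EUR_PER_TB = (3500, 4500)
RETENTION_YEARS = 20
GB_PER_TB = 1000  # assumption: decimal terabytes


def storage_cost_eur(size_gb, rate_eur_per_tb):
    """Cost of keeping size_gb for the full 20 years (linear scaling assumed)."""
    return rate_eur_per_tb * size_gb / GB_PER_TB


def yearly_cost_per_gb_eur(rate_eur_per_tb):
    """Cost of keeping 1 GB for one year at the given rate."""
    return rate_eur_per_tb / GB_PER_TB / RETENTION_YEARS


for rate in RATES_EUR_PER_TB:
    print(
        f"{rate} euro/TB: "
        f"4 GB for 20 years = {storage_cost_eur(4, rate):.2f} euro, "
        f"1 GB for 1 year = {yearly_cost_per_gb_eur(rate):.3f} euro"
    )
```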