SlideShare a Scribd company logo
dans.knaw.nl
DANS is een instituut van KNAW en NWO
Open Research Data in H2020
Marjan Grootveld
OpenAIRE webinar, 26 October 2016
Who we are
Open Access Infrastructure for Research in Europe
www.openaire.eu
DANS: Data Archiving and Networked
Services
Institute of Dutch
Academy and
Research Funding
Organisation
(KNAW & NWO)
since 2005
First predecessor
dates back to
1964 (Steinmetz
Foundation),
Historical Data
Archive 1989
Mission: promote
and provide
permanent access
to digital research
information
4
DataverseNL for
short- and mid-
term storage
EASY: certified long-term
Electronic Archiving
System for self-deposit
NARCIS: Gateway
to scholarly
information in the
Netherlands
Research data in context
Contents
• Brief recap from recent OpenAIRE-EUDAT webinars
• The updated Guidelines for FAIR Data Management:
• F, A, I, R
• Costs, data security, ethical aspects, other RDM procedures
• Recommendations
• Links to EC and OpenAIRE information
5
Recent webinars
Introductory RDM webinar, Tony Ross-Hellauer & Sarah Jones, 26 May:
• Reasons to manage data
• How to manage and share data (+ how to respond to concerns about
sharing)
• EUDAT & OpenAIRE services
Q&A document: https://b2drop.eudat.eu/s/0H6qRgwdwkAVFvD#pdfviewer
“How to write a DMP”, Sarah Jones & Marjan Grootveld, 7/14 July:
• What is a Data Management Plan and why to write it?
• Example DMPs in different domains, with lots of links!
• Lessons and guidance (e.g. storing =/= archiving; how to find a
repository; file-naming conventions)
All recordings and slides are on https://eudat.eu/events/webinars
https://www.eudat.eu Research Data Services, Expertise & Technology
6
Recap: why manage data?
(Not for the research funder, but for life we make data management plans)
Make your research easier
Stop yourself drowning in irrelevant stuff
Save data for later
Avoid accusations of fraud or bad science
Write a data paper, connect your nano publications
Share your data for re-use & get them validated in real life
Get credit for it
7
NON PECUNIAE INVESTIGATIONIS CURATORE
SED VITAE FACIMUS PROGRAMMAS DATORUM PROCURATIONIS
Horizon 2020 infographic
Horizon 2020: Open Research Data Pilot
The use of a Data Management Plan (DMP) is
required for projects participating in the Open
Research Data Pilot, detailing what data the
project will generate, whether and how they will
be exploited or made accessible for verification
and re-use, and how they will be curated and
preserved.
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
9
Guidelines on FAIR DM v.3
Structure of the Guidelines:
1.Background: extension of the pilot
2.DMP general definition
3.Proposal, submission and evaluation
4.RDM plans during the project life cycle
5.Support
6.Annex 1: the DMP template
1. Data summary
2. FAIR data
3. Allocation of resources
4. Data security
5. Ethical aspects
6. Other issues
7. Summary table “Fair DM at a glance”
10
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
What’s new?
• You should develop a DMP for your project.
• There is a single DMP template from start to finish.
• The DMP template is inspired by the FAIR principles:
research data should be findable, accessible, interoperable
and re-usable (without suggesting any specific technology,
standard, or implementation solution).
Also explicit in the new guidelines:
• From 1-1-2017 the pilot will cover all thematic areas of
Horizon 2020.
• Costs related to open access to research data are eligible
for reimbursement during the duration of the project under
the conditions defined in the Grant Agreement.
11
Good things that remain
Whether a (proposed) project participates in the
ORD pilot or chooses to opt out does not affect
the evaluation of that project: proposals will not
be penalised for opting out.
Participating in the ORD pilot does
not necessarily mean opening up all
your research data: as open as
possible, as closed as necessary.
The DMP is a living document.
You are not required to
provide detailed answers to all
the questions in the first
version of the DMP (due M6).
Deposit in a research data repository:
a. the data needed to validate the results presented
in scientific publications, including the metadata;
b. any other data, including the metadata, as
specified in the DMP;
c. plus for a-b the documentation and the tools
that are needed to validate the results, e.g.
specialised software or software code, algorithms
and analysis protocols (when possible, these
instruments themselves).
12
DMPonline
A web-based tool to help researchers write DMPs
Guidance from EUDAT and OpenAIRE being added
https://dmponline.dcc.ac.uk
Choose your
funder to get
their specific
template
Choose any
additional
optional
guidance
13
§2 Making data FAIR
Findable
– Assign persistent IDs, provide rich metadata, register in a
searchable resource, ...
Accessible
– Retrievable by their ID using a standard protocol, metadata remain
accessible even if data aren’t...
Interoperable
– Use formal, broadly applicable languages, use standard
vocabularies, qualified references...
Reusable
– Rich, accurate metadata, clear licences, provenance, use of
community standards...
14
www.force11.org/group/fairgroup/fairprinciples and http://www.nature.com/articles/sdata201618
EC in the Guidelines: “This template is not intended as a strict
technical implementation of the FAIR principles, it is rather inspired
by FAIR as a general concept.”
EC Infographic:
http://ec.europa.eu/research/images/infographics/policy/open-data-2016-
w920.png
15
Some F questions
2.1 Making data findable, including provisions for metadata
• Use metadata and specify standards for metadata creation
(if any). If there are no standards in your discipline
describe what type of metadata will be created and how.
• Search keywords
• Persistent and unique identifiers such as DOI
• File and folder naming conventions: see OpenAIRE-EUDAT
July webinar
• Versioning of the datasets and clear version numbers
16
Metadata and documentation
• Metadata and documentation is needed to find and
understand research data.
• Think about what others would need in order to find,
evaluate, understand, and reuse your data.
• Get others to check the metadata to improve quality.
• Use standards to enable interoperability.
http://rd-alliance.github.io/metadata-directory
17
Some A questions
2.2 Making data openly accessible:
• Explain which data can’t be shared openly, if any
• Specify how access will be provided in case of restrictions,
e.g. through a data committee, a license, or arranged with
the repository.
• Will methods or software tools needed to access the data
(if any) be included or documented?
• Deposit the data and associated metadata, documentation
and code preferably in certified repositories which support
Open Access.
Data Seal of Approval
ICSU World Data System
nestor seal
ISO 16363
18
Where to find a repository?
More information: https://www.openaire.eu/opendatapilot-repository
Zenodo: http://www.zenodo.org Re3data.org: http://www.re3data.org
19
File format considerations
No clearcut definitions of “sustainable file format”.
Each archives has its own expertise, related to its designated
community. Examples:
http://dans.knaw.nl/en/deposit/information-about-depositing-data?set_language=en
http://researchdata.4tu.nl/en/publishing-research/data-description-and-formats/
4TU.ResearchData DANS
Level 1 Level 2 or 3 Preferred Accepted
audio .wav .ra, .mp3, .wma .wav, .flac .aiff, .mp3, .aac
chemistry NMR, ChemDoodle, ….pdb, .xyz
databases
delimited flat file
w/DDL .mdb, .dbf, .acdb .sql, .siard, .csv .mdb, .dbf, .hdf5 …
video
.mp1, .mp2, .mp4,
.mov …
.mpg2, .mpg4, .avi,
.mov .mkv
20
Interoperability
Before clocks were invented, people
kept time using different instruments to
observe the Sun’s zenith at noon.
Towns and cities set clocks based on
sunsets and sunrises. Time calculation
became a serious problem for people
travelling by train, sometimes hundreds
of miles in a day. UTC is the World's
Time Standard.
21
Some I questions
2.3 Making data interoperable
• Specify what data and metadata vocabularies, standards or
methodologies you will follow to facilitate interoperability.
• Standard vocabulary to allow inter-disciplinary
interoperability or a mapping from your vocabulary to more
commonly used ontologies?
22
Some R questions
2.4 Increase data re-use (through clarifying licences)
• License the data to permit the widest reuse possible
• Specify a data embargo, if this is needed
• How long will the data remain reusable?
• Describe data quality assurance processes
Re-use over time
23
Licensing research data and software
EUDAT licensing wizard help you pick licence for data & software
http://ufal.github.io/public-license-selector/
You should also license Open Access data, or waive rights.
Horizon 2020 Open Access
guidelines point to:
or
24
Keep everything? For always?
When regenerating data is cheaper than archiving, don’t archive.
Select what data you’ll need and want to retain.
10 years is often stated in data policies and academic codes, but
data can be valuable for ages, in climatology, sociology, health
sciences, astronomy, linguistics, … Look beyond minimal retention
periods where relevant.
“The lifetime of software is generally not as long as that of data”
(Daniel Katz e.a. http://bit.ly/2eScCKp)
RDNL Selection criteria: http://www.researchdata.nl/en/services/data-
management/selecting-research-data/
DCC How-to guide: http://www.dcc.ac.uk/resources/how-guides/appraise-select-data
25
§3 Allocation of resources
• What are the costs for making data FAIR in your project?
• Resources for long term preservation
Check the UK Data Service Costing model.
Rule of thumb: 5% of the project budget is spent on RDM.
The High Level Expert Group on the European Open Science Cloud
recommends that “well budgeted data stewardship plans should be
made mandatory and we expect that on average about 5% of
research expenditure should be spent on properly managing and
stewarding data”.
UKDS model http://www.data-archive.ac.uk/create-manage/planning-for-sharing/costing
HLEG report
http://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.p
df#view=fit&pagemode=none p. 19
26
§4-6
Data security
• Provisions for data recovery, secure storage, transfer of
sensitive data?
• Safely stored in certified repositories for long term
preservation and curation?
Ethical aspects
• Any ethical or legal issues that can impact data sharing?
• Informed consent for data sharing and long term
preservation included in questionnaires dealing with
personal data?
Which other national/funder/sectorial/departmental
procedures for data management do you use (if any)?
27
Closing remarks
Image “Fishbone” CC BY-NC-ND 2.0 by ttps://www.flickr.com/photos/mrjnl/
Recommendations
• Think about the desired end result and plan for this.
• Involve all work packages and partners to get a coherent
plan.
• “Sharing” means “outside the consortium”.
• Approach the DMP in whatever way best fits your project:
• EC template is intended as a service, not an obligation. Read the
background information and the guidance, and use it as a checklist.
• More than one dataset? Describe generically what is
possible and dataset-specific what is necessary.
• Focus effort on datasets you’ll create rather than reuse.
29
The EC Open Research Data pilot
Key sources of information
• Guidelines on Open Access to Scientific Publications and Research Data in Horizon
2020
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilo
t/h2020-hi-oa-pilot-guide_en.pdf
• Guidelines on Data Management in Horizon 2020
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilo
t/h2020-hi-oa-data-mgt_en.pdf
• Annotated model grant agreement, clause 29.3
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/amga/h2
020-amga_en.pdf
• New infographic summarising key policy points
http://ec.europa.eu/research/press/2016/pdf/opendata-infographic_072016.pdf
• Open Access and Data Management
• http://ec.europa.eu/research/participants/docs/h2020-funding-guide/cross-cutting-
issues/open-access-dissemination_en.htm
30
OpenAIRE support materials
• Briefing papers,
factsheets, webinars,
workshops, FAQs
• Information on:
• Open Research Data Pilot
• Creating a data
management plan
• Selecting a data repository
• Personal data
https://www.openaire.eu/opendatapilot
https://www.openaire.eu/support
31
dans.knaw.nl
DANS is een instituut van KNAW en NWO
Thank you!
Acknowledgements:
Thanks to Sarah Jones (DCC), OpenAIRE and EUDAT for slides.
marjan.grootveld@dans.knaw.nl
http://dans.knaw.nl/

More Related Content

PPTX
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
PPTX
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
PPTX
Open Data: Sharing the Main Actor of a Scientific Story - Paola Masuzzo
PPTX
OpenAIRE: Services for Funders - Lightning Talk at #DI4R conference (Krakov, ...
PPTX
Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial ...
PPTX
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
PPTX
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
PPTX
H2020 Open Research Data pilot
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
(Open) Research Data Management in H2020 (ISERD – Tel Aviv, Oct 31, 2016)
Open Data: Sharing the Main Actor of a Scientific Story - Paola Masuzzo
OpenAIRE: Services for Funders - Lightning Talk at #DI4R conference (Krakov, ...
Making Use of the Linked Open Data Services for OpenAIRE (DI4R 2016 tutorial ...
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
H2020 Open Research Data pilot

What's hot (20)

PPTX
OpenAIRE in 8 minutes - Introduction to European einfrastructures session at ...
PPTX
Alma Swan - PASTEUR4OA: Policy alignment and effectiveness
PDF
Making research visible, making research count
PPTX
OpenAIRE: Directrices 3.0, desarrollos y servicios para Gestores de Repositorios
PPTX
LIBER Webinar: Are the FAIR Data Principles really fair?
PPTX
EPSRC Policy Compliance: What researchers need to know
PDF
Open Access to Research Data in H2020
PDF
FAIR Ddata in trustworthy repositories: the basics
PPTX
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
PDF
OpenAIRE@info day_amsterdam_jan_2016
PPTX
Fair data principles for AOASG
PPTX
OpenAIRE: eInfrastructure for Open Science
PPTX
Supporting the development of a national Research Data Discovery Service - A ...
PPTX
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
PPTX
EPSRC research data expectations and PURE for datasets
PPTX
Research Data Management in GLAM: Managing Data for Cultural Heritage
PPTX
Towards a European Research Information Infrastructure
PPTX
OpenAIRE: Open Science as-a-Service - presentation at #DI4R2016
PPTX
Open Science Globally: Some Developments/Dr Simon Hodson
PPTX
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
OpenAIRE in 8 minutes - Introduction to European einfrastructures session at ...
Alma Swan - PASTEUR4OA: Policy alignment and effectiveness
Making research visible, making research count
OpenAIRE: Directrices 3.0, desarrollos y servicios para Gestores de Repositorios
LIBER Webinar: Are the FAIR Data Principles really fair?
EPSRC Policy Compliance: What researchers need to know
Open Access to Research Data in H2020
FAIR Ddata in trustworthy repositories: the basics
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE@info day_amsterdam_jan_2016
Fair data principles for AOASG
OpenAIRE: eInfrastructure for Open Science
Supporting the development of a national Research Data Discovery Service - A ...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
EPSRC research data expectations and PURE for datasets
Research Data Management in GLAM: Managing Data for Cultural Heritage
Towards a European Research Information Infrastructure
OpenAIRE: Open Science as-a-Service - presentation at #DI4R2016
Open Science Globally: Some Developments/Dr Simon Hodson
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
Ad

Viewers also liked (20)

PDF
Zenodo Repository and the Open Research Data in H2020 (OAW2016)
PPTX
OpenAIRE webinar on Open Access in H2020 (OAW2016)
PPTX
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
PDF
Nos systèmes : dossier de partenariat
PDF
Comment diffuser mes données de recherche ?
PDF
Acceso abierto - Open access #infografia
PPTX
Presentation of the OpenAIRE webinars during the Open Access Week 2016
PPT
Transition to Open Science in Europe
PPTX
Open Science: Application and Benefits
PPT
Guerilla Open Access Manifesto
PDF
Copyright management in open access projects
PPT
(Not such) New Challenges for Open Access, Ivana Hebrang Grgić
PPTX
The OpenAIRE Catalogue of Services: Towards Open Science - Workshop: Design y...
PPTX
The Shift to Open Access Publishing
PDF
Mesa redonda: Funcionamiento eficaz en el peer review, reconocimiento a los r...
PDF
Open access (OA) in the Research Excellence Framework (REF) - Ben Johnson, HEFCE
PPTX
Beyond Open Access: Creating Culture By, With, and For the Public
PDF
Open science and the individual researcher
PDF
DOAJ ICDL 2016: The Changing Landscape and Future of Open Access in India
PPTX
Horizon 2020 and the open research data pilot
Zenodo Repository and the Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Access in H2020 (OAW2016)
Overview of the data pilot and OpenAIRE tools, Elly Dijk and Marjan Grootveld...
Nos systèmes : dossier de partenariat
Comment diffuser mes données de recherche ?
Acceso abierto - Open access #infografia
Presentation of the OpenAIRE webinars during the Open Access Week 2016
Transition to Open Science in Europe
Open Science: Application and Benefits
Guerilla Open Access Manifesto
Copyright management in open access projects
(Not such) New Challenges for Open Access, Ivana Hebrang Grgić
The OpenAIRE Catalogue of Services: Towards Open Science - Workshop: Design y...
The Shift to Open Access Publishing
Mesa redonda: Funcionamiento eficaz en el peer review, reconocimiento a los r...
Open access (OA) in the Research Excellence Framework (REF) - Ben Johnson, HEFCE
Beyond Open Access: Creating Culture By, With, and For the Public
Open science and the individual researcher
DOAJ ICDL 2016: The Changing Landscape and Future of Open Access in India
Horizon 2020 and the open research data pilot
Ad

Similar to OpenAIRE webinar on Open Research Data in H2020 (OAW2016) (20)

PPTX
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
PPTX
H2020 Open Data Pilot
PDF
OpenAIRE webinar. Open Research Data in H2020
PPTX
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
PPTX
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
PPTX
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
PPTX
Open Access Week 2017: Research data management and data management plans (Fl...
PPT
H2020 data pilot openaire
PPT
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
PDF
The state of global research data initiatives: observations from a life on th...
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
PPTX
Research Data Management: CSUC activities & services
PPTX
Open Data Strategies and Research Data Realities
PPTX
Open, FAIR data and RDM
PDF
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
PPTX
WEBINAR: Open Research Data in Horizon 2020
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
PPTX
Research data management : [part of] PROOF course Finding and controlling sci...
PPTX
Bosman and Kramer Open Research: A 2024 NISO Training Series, Session Four: O...
PPTX
Data Management and Horizon 2020
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
H2020 Open Data Pilot
OpenAIRE webinar. Open Research Data in H2020
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Open Access Week 2017: Research data management and data management plans (Fl...
H2020 data pilot openaire
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The state of global research data initiatives: observations from a life on th...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
Research Data Management: CSUC activities & services
Open Data Strategies and Research Data Realities
Open, FAIR data and RDM
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
WEBINAR: Open Research Data in Horizon 2020
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
Research data management : [part of] PROOF course Finding and controlling sci...
Bosman and Kramer Open Research: A 2024 NISO Training Series, Session Four: O...
Data Management and Horizon 2020

More from OpenAIRE (20)

PDF
10th OpenAIRE Content Providers Community Call
PDF
9th Content Providers Community Call\
PPTX
OpenAIRE in the European Open Science Cloud (EOSC)
PDF
8th Content Providers Community Call
PDF
7th Content Providers Community Call
PDF
OpenAIRE PROVIDE Dashboard for Turkish repository managers
PDF
What will it cost to manage and share my data?
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
PDF
6th Content Providers Community Call
PPTX
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
PPTX
20200504_Research Data & the GDPR: How Open is Open?
PDF
20200504_Data, Data Ownership and Open Science
PPTX
20200429_Research Data & the GDPR: How Open is Open? (updated version)
PDF
20200429_Data, Data Ownership and Open Science
PPTX
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
PDF
COVID-19: Activities, tools, best practice and contact points in Greece
PDF
5th Content Providers Community Call
PDF
4th Content Providers Community Call
10th OpenAIRE Content Providers Community Call
9th Content Providers Community Call\
OpenAIRE in the European Open Science Cloud (EOSC)
8th Content Providers Community Call
7th Content Providers Community Call
OpenAIRE PROVIDE Dashboard for Turkish repository managers
What will it cost to manage and share my data?
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
6th Content Providers Community Call
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_Research Data & the GDPR: How Open is Open?
20200504_Data, Data Ownership and Open Science
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Data, Data Ownership and Open Science
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
COVID-19: Activities, tools, best practice and contact points in Greece
5th Content Providers Community Call
4th Content Providers Community Call

Recently uploaded (20)

PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
PPT
protein biochemistry.ppt for university classes
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
The scientific heritage No 166 (166) (2025)
PDF
diccionario toefl examen de ingles para principiante
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
An interstellar mission to test astrophysical black holes
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
HPLC-PPT.docx high performance liquid chromatography
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Derivatives of integument scales, beaks, horns,.pptx
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Biophysics 2.pdffffffffffffffffffffffffff
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Classification Systems_TAXONOMY_SCIENCE8.pptx
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
protein biochemistry.ppt for university classes
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
. Radiology Case Scenariosssssssssssssss
The KM-GBF monitoring framework – status & key messages.pptx
The scientific heritage No 166 (166) (2025)
diccionario toefl examen de ingles para principiante
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
ECG_Course_Presentation د.محمد صقران ppt
Cell Membrane: Structure, Composition & Functions
An interstellar mission to test astrophysical black holes
Taita Taveta Laboratory Technician Workshop Presentation.pptx
HPLC-PPT.docx high performance liquid chromatography

OpenAIRE webinar on Open Research Data in H2020 (OAW2016)

  • 1. dans.knaw.nl DANS is een instituut van KNAW en NWO Open Research Data in H2020 Marjan Grootveld OpenAIRE webinar, 26 October 2016
  • 2. Who we are Open Access Infrastructure for Research in Europe www.openaire.eu
  • 3. DANS: Data Archiving and Networked Services Institute of Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005 First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989 Mission: promote and provide permanent access to digital research information
  • 4. 4 DataverseNL for short- and mid- term storage EASY: certified long-term Electronic Archiving System for self-deposit NARCIS: Gateway to scholarly information in the Netherlands Research data in context
  • 5. Contents • Brief recap from recent OpenAIRE-EUDAT webinars • The updated Guidelines for FAIR Data Management: • F, A, I, R • Costs, data security, ethical aspects, other RDM procedures • Recommendations • Links to EC and OpenAIRE information 5
  • 6. Recent webinars Introductory RDM webinar, Tony Ross-Hellauer & Sarah Jones, 26 May: • Reasons to manage data • How to manage and share data (+ how to respond to concerns about sharing) • EUDAT & OpenAIRE services Q&A document: https://b2drop.eudat.eu/s/0H6qRgwdwkAVFvD#pdfviewer “How to write a DMP”, Sarah Jones & Marjan Grootveld, 7/14 July: • What is a Data Management Plan and why to write it? • Example DMPs in different domains, with lots of links! • Lessons and guidance (e.g. storing =/= archiving; how to find a repository; file-naming conventions) All recordings and slides are on https://eudat.eu/events/webinars https://www.eudat.eu Research Data Services, Expertise & Technology 6
  • 7. Recap: why manage data? (Not for the research funder, but for life we make data management plans) Make your research easier Stop yourself drowning in irrelevant stuff Save data for later Avoid accusations of fraud or bad science Write a data paper, connect your nano publications Share your data for re-use & get them validated in real life Get credit for it 7 NON PECUNIAE INVESTIGATIONIS CURATORE SED VITAE FACIMUS PROGRAMMAS DATORUM PROCURATIONIS
  • 9. Horizon 2020: Open Research Data Pilot The use of a Data Management Plan (DMP) is required for projects participating in the Open Research Data Pilot, detailing what data the project will generate, whether and how they will be exploited or made accessible for verification and re-use, and how they will be curated and preserved. http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf 9
  • 10. Guidelines on FAIR DM v.3 Structure of the Guidelines: 1.Background: extension of the pilot 2.DMP general definition 3.Proposal, submission and evaluation 4.RDM plans during the project life cycle 5.Support 6.Annex 1: the DMP template 1. Data summary 2. FAIR data 3. Allocation of resources 4. Data security 5. Ethical aspects 6. Other issues 7. Summary table “Fair DM at a glance” 10 http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
  • 11. What’s new? • You should develop a DMP for your project. • There is a single DMP template from start to finish. • The DMP template is inspired by the FAIR principles: research data should be findable, accessible, interoperable and re-usable (without suggesting any specific technology, standard, or implementation solution). Also explicit in the new guidelines: • From 1-1-2017 the pilot will cover all thematic areas of Horizon 2020. • Costs related to open access to research data are eligible for reimbursement during the duration of the project under the conditions defined in the Grant Agreement. 11
  • 12. Good things that remain Whether a (proposed) project participates in the ORD pilot or chooses to opt out does not affect the evaluation of that project: proposals will not be penalised for opting out. Participating in the ORD pilot does not necessarily mean opening up all your research data: as open as possible, as closed as necessary. The DMP is a living document. You are not required to provide detailed answers to all the questions in the first version of the DMP (due M6). Deposit in a research data repository: a. the data needed to validate the results presented in scientific publications, including the metadata; b. any other data, including the metadata, as specified in the DMP; c. plus for a-b the documentation and the tools that are needed to validate the results, e.g. specialised software or software code, algorithms and analysis protocols (when possible, these instruments themselves). 12
  • 13. DMPonline A web-based tool to help researchers write DMPs Guidance from EUDAT and OpenAIRE being added https://dmponline.dcc.ac.uk Choose your funder to get their specific template Choose any additional optional guidance 13
  • 14. §2 Making data FAIR Findable – Assign persistent IDs, provide rich metadata, register in a searchable resource, ... Accessible – Retrievable by their ID using a standard protocol, metadata remain accessible even if data aren’t... Interoperable – Use formal, broadly applicable languages, use standard vocabularies, qualified references... Reusable – Rich, accurate metadata, clear licences, provenance, use of community standards... 14 www.force11.org/group/fairgroup/fairprinciples and http://www.nature.com/articles/sdata201618
  • 15. EC in the Guidelines: “This template is not intended as a strict technical implementation of the FAIR principles, it is rather inspired by FAIR as a general concept.” EC Infographic: http://ec.europa.eu/research/images/infographics/policy/open-data-2016- w920.png 15
  • 16. Some F questions 2.1 Making data findable, including provisions for metadata • Use metadata and specify standards for metadata creation (if any). If there are no standards in your discipline describe what type of metadata will be created and how. • Search keywords • Persistent and unique identifiers such as DOI • File and folder naming conventions: see OpenAIRE-EUDAT July webinar • Versioning of the datasets and clear version numbers 16
  • 17. Metadata and documentation • Metadata and documentation is needed to find and understand research data. • Think about what others would need in order to find, evaluate, understand, and reuse your data. • Get others to check the metadata to improve quality. • Use standards to enable interoperability. http://rd-alliance.github.io/metadata-directory 17
  • 18. Some A questions 2.2 Making data openly accessible: • Explain which data can’t be shared openly, if any • Specify how access will be provided in case of restrictions, e.g. through a data committee, a license, or arranged with the repository. • Will methods or software tools needed to access the data (if any) be included or documented? • Deposit the data and associated metadata, documentation and code preferably in certified repositories which support Open Access. Data Seal of Approval ICSU World Data System nestor seal ISO 16363 18
  • 19. Where to find a repository? More information: https://www.openaire.eu/opendatapilot-repository Zenodo: http://www.zenodo.org Re3data.org: http://www.re3data.org 19
  • 20. File format considerations No clearcut definitions of “sustainable file format”. Each archives has its own expertise, related to its designated community. Examples: http://dans.knaw.nl/en/deposit/information-about-depositing-data?set_language=en http://researchdata.4tu.nl/en/publishing-research/data-description-and-formats/ 4TU.ResearchData DANS Level 1 Level 2 or 3 Preferred Accepted audio .wav .ra, .mp3, .wma .wav, .flac .aiff, .mp3, .aac chemistry NMR, ChemDoodle, ….pdb, .xyz databases delimited flat file w/DDL .mdb, .dbf, .acdb .sql, .siard, .csv .mdb, .dbf, .hdf5 … video .mp1, .mp2, .mp4, .mov … .mpg2, .mpg4, .avi, .mov .mkv 20
  • 21. Interoperability Before clocks were invented, people kept time using different instruments to observe the Sun’s zenith at noon. Towns and cities set clocks based on sunsets and sunrises. Time calculation became a serious problem for people travelling by train, sometimes hundreds of miles in a day. UTC is the World's Time Standard. 21
  • 22. Some I questions 2.3 Making data interoperable • Specify what data and metadata vocabularies, standards or methodologies you will follow to facilitate interoperability. • Standard vocabulary to allow inter-disciplinary interoperability or a mapping from your vocabulary to more commonly used ontologies? 22
  • 23. Some R questions 2.4 Increase data re-use (through clarifying licences) • License the data to permit the widest reuse possible • Specify a data embargo, if this is needed • How long will the data remain reusable? • Describe data quality assurance processes Re-use over time 23
  • 24. Licensing research data and software EUDAT licensing wizard help you pick licence for data & software http://ufal.github.io/public-license-selector/ You should also license Open Access data, or waive rights. Horizon 2020 Open Access guidelines point to: or 24
  • 25. Keep everything? For always? When regenerating data is cheaper than archiving, don’t archive. Select what data you’ll need and want to retain. 10 years is often stated in data policies and academic codes, but data can be valuable for ages, in climatology, sociology, health sciences, astronomy, linguistics, … Look beyond minimal retention periods where relevant. “The lifetime of software is generally not as long as that of data” (Daniel Katz e.a. http://bit.ly/2eScCKp) RDNL Selection criteria: http://www.researchdata.nl/en/services/data- management/selecting-research-data/ DCC How-to guide: http://www.dcc.ac.uk/resources/how-guides/appraise-select-data 25
  • 26. §3 Allocation of resources • What are the costs for making data FAIR in your project? • Resources for long term preservation Check the UK Data Service Costing model. Rule of thumb: 5% of the project budget is spent on RDM. The High Level Expert Group on the European Open Science Cloud recommends that “well budgeted data stewardship plans should be made mandatory and we expect that on average about 5% of research expenditure should be spent on properly managing and stewarding data”. UKDS model http://www.data-archive.ac.uk/create-manage/planning-for-sharing/costing HLEG report http://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.p df#view=fit&pagemode=none p. 19 26
  • 27. §4-6 Data security • Provisions for data recovery, secure storage, transfer of sensitive data? • Safely stored in certified repositories for long term preservation and curation? Ethical aspects • Any ethical or legal issues that can impact data sharing? • Informed consent for data sharing and long term preservation included in questionnaires dealing with personal data? Which other national/funder/sectorial/departmental procedures for data management do you use (if any)? 27
  • 28. Closing remarks Image “Fishbone” CC BY-NC-ND 2.0 by ttps://www.flickr.com/photos/mrjnl/
  • 29. Recommendations • Think about the desired end result and plan for this. • Involve all work packages and partners to get a coherent plan. • “Sharing” means “outside the consortium”. • Approach the DMP in whatever way best fits your project: • EC template is intended as a service, not an obligation. Read the background information and the guidance, and use it as a checklist. • More than one dataset? Describe generically what is possible and dataset-specific what is necessary. • Focus effort on datasets you’ll create rather than reuse. 29
  • 30. The EC Open Research Data pilot Key sources of information • Guidelines on Open Access to Scientific Publications and Research Data in Horizon 2020 http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilo t/h2020-hi-oa-pilot-guide_en.pdf • Guidelines on Data Management in Horizon 2020 http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilo t/h2020-hi-oa-data-mgt_en.pdf • Annotated model grant agreement, clause 29.3 http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/amga/h2 020-amga_en.pdf • New infographic summarising key policy points http://ec.europa.eu/research/press/2016/pdf/opendata-infographic_072016.pdf • Open Access and Data Management • http://ec.europa.eu/research/participants/docs/h2020-funding-guide/cross-cutting- issues/open-access-dissemination_en.htm 30
  • 31. OpenAIRE support materials • Briefing papers, factsheets, webinars, workshops, FAQs • Information on: • Open Research Data Pilot • Creating a data management plan • Selecting a data repository • Personal data https://www.openaire.eu/opendatapilot https://www.openaire.eu/support 31
  • 32. dans.knaw.nl DANS is een instituut van KNAW en NWO Thank you! Acknowledgements: Thanks to Sarah Jones (DCC), OpenAIRE and EUDAT for slides. marjan.grootveld@dans.knaw.nl http://dans.knaw.nl/

Editor's Notes

  • #9: EC Infographic: http://ec.europa.eu/research/images/infographics/policy/open-data-2016-w920.png
  • #23: Are the data produced in the project interoperable, that is allowing data exchange and re-use between researchers, institutions, organisations, countries, etc. (i.e. adhering to standards for formats, as much as possible compliant with available (open) software applications, and in particular facilitating re-combinations with different datasets from different origins)? What data and metadata vocabularies, standards or methodologies will you follow to make your data interoperable? Will you be using standard vocabularies for all data types present in your data set, to allow inter-disciplinary interoperability? In case it is unavoidable that you use uncommon or generate project specific ontologies or vocabularies, will you provide mappings to more commonly used ontologies?
  • #24: How will the data be licensed to permit the widest re-use possible? When will the data be made available for re-use? If an embargo is sought to give time to publish or seek patents, specify why and how long this will apply, bearing in mind that research data should be made available as soon as possible. Are the data produced and/or used in the project useable by third parties, in particular after the end of the project? If the re-use of some data is restricted, explain why. How long is it intended that the data remains re-usable? Are data quality assurance processes described?
  • #25: Remember to give also your open data and software a proper licence. The OA guidelines under Horizon 2020 point to CC-0 or CC-BY as a straightforward and effective way to make it possible for others to mine, exploit and reproduce the data. See p11 at: http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-pilot-guide_en.pdf
  • #26: Explain your selection criteria in the DMP.
  • #27: http://www.data-archive.ac.uk/create-manage/planning-for-sharing/costing http://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.pdf#view=fit&pagemode=none
  • #28: Ethical and legal issues can also be discussed in the context of the ethics review. If relevant, include references to ethics deliverables and ethics chapter in the Description of the Action.
  • #29: Let’s move on to the considerations to make when managing and sharing data