SlideShare a Scribd company logo
7
Most read
8
Most read
12
Most read
S a r a h J o n e s
A s s o c i a t e D i r e c t o r , D i g i t a l C u r a t i o n C e n t r e
R a p p o r t e u r o f F A I R d a t a e x p e r t g r o u p
s a r a h . j o n e s @ g l a s g o w . a c . u k
T w i t t e r : @ s j D C C
The FAIR data concept
What is FAIR?
A set of principles to ensure that data are
shared in a way that enables & enhances reuse,
by humans and machines
Image CC-BY-SA by SangyaPundir
Origins of FAIR
• Emerged from a workshop held in Leiden in 2014
• Come from life sciences but intended for all data
• Issued by FORCE11 community
• Echo previous principles on open data & curation
OECD Principles and
Guidelines for Access to
Research Data from
Public Funding (2007)
Science as an Open Enterprise (2012)
notion of ‘intelligent openness’ where
data are accessible, intelligible,
assessable and useable G8 Science Ministers
Statement (2013)
Why has FAIR gained traction?
 Meaningful and memorable articulation of concepts
 Natural desire to want to be ‘fair’
 Easy to understand at a high-level, but…
“We understand the basic principle of FAIR, but the terminology is often difficult
to grasp immediately. Things could be explained better in plain language”
“The term interoperable is quite confusing sometimes and mixed with re-use.”
OpenAIRE & FAIR data Expert Group H2020 DMP survey
https://doi.org/10.5281/zenodo.1120245
What FAIR means: 15 principles
Findable
F1. (meta)data are assigned a globally unique and
eternally persistent identifier.
F2. data are described with rich metadata.
F3. (meta)data are registered or indexed in a
searchable resource.
F4. metadata specify the data identifier.
Interoperable
I1. (meta)data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation.
I2. (meta)data use vocabularies that follow FAIR
principles.
I3. (meta)data include qualified references to other
(meta)data.
Accessible
A1 (meta)data are retrievable by their
identifier using a standardized communications
protocol.
A1.1 the protocol is open, free, and universally
implementable.
A1.2 the protocol allows for an authentication and
authorization procedure, where necessary.
A2 metadata are accessible, even when the data are
no longer available.
Reusable
R1. meta(data) have a plurality of accurate and
relevant attributes.
R1.1. (meta)data are released with a clear and
accessible data usage license.
R1.2. (meta)data are associated with
their provenance.
R1.3. (meta)data meet domain-relevant community
standards.
Slide CC-BY by Erik Schultes, Leiden UMC
doi: 10.1038/sdata.2016.18
The FAIR data principles explained
• Clarifications from the Dutch
Techcentre for Life Sciences
• Each principle is a link to
further clarification,
examples and context
https://www.dtls.nl/fair-
data/fair-principles-explained
R1. Meta(data) are richly described with a plurality of accurate and
relevant attributes
• By giving data many ‘labels’, it will be much easier to find and reuse the data.
• Provide not just metadata that allows discovery, but also metadata that
richly describes the context under which that data was generated
• “plurality” indicates that metadata should be as generous as possible, even to
the point of providing information that may seem irrelevant.
FAIR data checklist
 Findable
- Persistent ID
- Metadata online
 Accessible
- Data online
- Restrictions where needed
 Interoperable
- Use standards, controlled vocabs
- Common (open) formats
 Reusable
- Rich documentation
- Clear usage licence
https://doi.org/10.5281/zenodo.1065991
FAIR vs open data
• FAIR data does not have to be open
• Data can be shared under restrictions & still be FAIR
• Making data FAIR ensures it can be found,
understood and reused
• Open data is a subset of all the data shared
"As open as possible, as closed as necessary"
Implementing FAIR
• The principles do not specify particular
technologies or implementations e.g. semantic web
• FAIR is not a standard to be followed or strict
criteria – it’s a spectrum / continuum
• Supporting FAIR will require investment in
infrastructure, coordination across initiatives and
engagement with research communities
Context is king
 Practices varies across research communities
 particle physics mostly shares its data inside the large
consortia attached to its experiments
 social scientists typically provide lots of documentation and
methodological information supporting reuse
 Communities need to self-organise and define what
FAIR data means in their context
Emerging initiatives
Pilot project and several
infrastructure investments
https://eoscpilot.eu
http://tiny.cc/EOSC-projects
https://www.go-fair.org
https://www.rd-alliance.org
Remit of the FAIR data expert group
1. To develop recommendations on what needs to be done to turn each
component of the FAIR data principles into reality
2. To propose indicators to measure progress on each of the FAIR
components
3. To provide input to the proposed European Open Science Cloud
(EOSC) action plan on how to make data FAIR
4. To contribute to the evaluation of the Horizon 2020 Data
Management Plan (DMP) template and development of associated
sector / discipline-specific guidance
5. To provide input on the issue of costing and financing data
management activities
http://tinyurl.com/FAIR-EG
FAIR EG progress and plans
 Most of the report text
is already written
 Preliminary
recommendations
 Working meeting to
initiate FAIR Data
Action plan next week
 Draft due end May
 Public consultation
around EOSC Summit
in June
 Final Report and Action
Plan due end
September
 Announce / launch at
Austrian Presidency
events in October
Recommendations to facilitate change
 Define what actions are required by different stakeholders
(EC, member states, disciplines, international) and when
 Structure and prioritise actions across key topics: policy,
skills, standards, infrastructure, costs, rewards…
 Use these recommendations to develop
a core FAIR Data Action Plan
 Support the definition of more detailed
FAIR Data Action Plans at domain and
member state level
Thanks - questions?
Sarah Jones
sarah.jones@glasgow.ac.uk
Twitter: @sjDCC
www.dcc.ac.uk
http://tinyurl.com/FAIR-EG
www.force11.org/group/fairgroup/fairprinciples

More Related Content

PPTX
Data Quality & Data Governance
PPTX
Research Methodology - Data Collection
PPTX
DLS dynamic light scattering
PPTX
Analytical centrifugation
PDF
Defining a Tech Project Vision in Eight Quick Steps pdf
PDF
The ABCs of Treating Data as Product
PPTX
FRET, FRAP, TIFR MICROSCOPY
PPTX
Data Quality & Data Governance
Research Methodology - Data Collection
DLS dynamic light scattering
Analytical centrifugation
Defining a Tech Project Vision in Eight Quick Steps pdf
The ABCs of Treating Data as Product
FRET, FRAP, TIFR MICROSCOPY

What's hot (20)

PPTX
What are the FAIR data principles?
PDF
Preparing Data for Sharing: The FAIR Principles
PPTX
PPTX
FAIR principles and metrics for evaluation
PPTX
The future of FAIR
PPTX
Introduction to Open Science and EOSC
PPTX
Intro to Data Management Plans
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
PPTX
Intro to RDM
PDF
FAIR Ddata in trustworthy repositories: the basics
PPTX
FAIRy stories: the FAIR Data principles in theory and in practice
PDF
Basics of Research Data Management
PDF
Introduction on Data Science
PPTX
What it means to be FAIR
PDF
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
PDF
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
PDF
Querying the Wikidata Knowledge Graph
PPTX
Creating a Data Management Plan
PDF
Logical Data Fabric: Architectural Components
PPTX
Cloudera SDX
What are the FAIR data principles?
Preparing Data for Sharing: The FAIR Principles
FAIR principles and metrics for evaluation
The future of FAIR
Introduction to Open Science and EOSC
Intro to Data Management Plans
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
Intro to RDM
FAIR Ddata in trustworthy repositories: the basics
FAIRy stories: the FAIR Data principles in theory and in practice
Basics of Research Data Management
Introduction on Data Science
What it means to be FAIR
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Querying the Wikidata Knowledge Graph
Creating a Data Management Plan
Logical Data Fabric: Architectural Components
Cloudera SDX
Ad

Similar to FAIR data (20)

PPTX
FAIR data: what it means, how we achieve it, and the role of RDA
PDF
Introduction to FAIR Data and Research Objects
PPTX
DTL Integrator's meeting
PDF
Dataverse as a FAIR Data Repository (Mercè Crosas)
PPTX
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
PDF
04 findable imming
PPTX
A coordinated framework for open data open science in Botswana/Simon Hodson
PPTX
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
PPTX
Fair data vs 5 star open data final
PDF
I o dav data workshop prof wafula final 19.9.17
PPTX
Why institutions need to raise their capabilities to support FAIR
PDF
LIBER Webinar: Turning FAIR Data Into Reality
PPTX
Turning FAIR data into reality
PDF
Origins of FAIR webinar
PPTX
FAIR play?
PPTX
Essentials 4 Data Support: a fine course in FAIR Data Support
PPTX
Open Science Globally: Some Developments/Dr Simon Hodson
PPTX
LIBER Webinar: Are the FAIR Data Principles really fair?
PPTX
VODAN Africa IN.pptx
PDF
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
FAIR data: what it means, how we achieve it, and the role of RDA
Introduction to FAIR Data and Research Objects
DTL Integrator's meeting
Dataverse as a FAIR Data Repository (Mercè Crosas)
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
04 findable imming
A coordinated framework for open data open science in Botswana/Simon Hodson
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Fair data vs 5 star open data final
I o dav data workshop prof wafula final 19.9.17
Why institutions need to raise their capabilities to support FAIR
LIBER Webinar: Turning FAIR Data Into Reality
Turning FAIR data into reality
Origins of FAIR webinar
FAIR play?
Essentials 4 Data Support: a fine course in FAIR Data Support
Open Science Globally: Some Developments/Dr Simon Hodson
LIBER Webinar: Are the FAIR Data Principles really fair?
VODAN Africa IN.pptx
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Ad

More from Sarah Jones (20)

PPTX
Data training tips and tricks
PPTX
EOSC and libraries
PPTX
EOSC Association priorities and activities
PPTX
Managing and sharing data: lessons from the European context
PPTX
Reflections on Open Science
PPTX
MAR comments analysis
PPTX
EOSC-MAR-update.pptx
PPTX
Intro-EOSC.pptx
PPTX
Why is EOSC so hard?
PPTX
Data Management Planning for researchers
PPTX
Is Europe ready for Open Science
PPTX
DMPonline: 10 years, 10 lessons
PPTX
Do & don't of supporting Open Science
PPTX
It takes more than a village: lessons on building global research commons
PPTX
DMPTuuli - what's new?
PPTX
DCC and FAIR initiatives
PPTX
Reflections on EOSC through the mirror of ARDC
PPTX
Future EOSC roadmap
PPTX
Global Open Research Commons IG
PPTX
EOSC work plan
Data training tips and tricks
EOSC and libraries
EOSC Association priorities and activities
Managing and sharing data: lessons from the European context
Reflections on Open Science
MAR comments analysis
EOSC-MAR-update.pptx
Intro-EOSC.pptx
Why is EOSC so hard?
Data Management Planning for researchers
Is Europe ready for Open Science
DMPonline: 10 years, 10 lessons
Do & don't of supporting Open Science
It takes more than a village: lessons on building global research commons
DMPTuuli - what's new?
DCC and FAIR initiatives
Reflections on EOSC through the mirror of ARDC
Future EOSC roadmap
Global Open Research Commons IG
EOSC work plan

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
KodekX | Application Modernization Development
PPT
Teaching material agriculture food technology
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
cuic standard and advanced reporting.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Encapsulation theory and applications.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Empathic Computing: Creating Shared Understanding
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Spectral efficient network and resource selection model in 5G networks
KodekX | Application Modernization Development
Teaching material agriculture food technology
Per capita expenditure prediction using model stacking based on satellite ima...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
NewMind AI Monthly Chronicles - July 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
cuic standard and advanced reporting.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Encapsulation theory and applications.pdf
Review of recent advances in non-invasive hemoglobin estimation
The Rise and Fall of 3GPP – Time for a Sabbatical?
Agricultural_Statistics_at_a_Glance_2022_0.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Empathic Computing: Creating Shared Understanding

FAIR data

  • 1. S a r a h J o n e s A s s o c i a t e D i r e c t o r , D i g i t a l C u r a t i o n C e n t r e R a p p o r t e u r o f F A I R d a t a e x p e r t g r o u p s a r a h . j o n e s @ g l a s g o w . a c . u k T w i t t e r : @ s j D C C The FAIR data concept
  • 2. What is FAIR? A set of principles to ensure that data are shared in a way that enables & enhances reuse, by humans and machines Image CC-BY-SA by SangyaPundir
  • 3. Origins of FAIR • Emerged from a workshop held in Leiden in 2014 • Come from life sciences but intended for all data • Issued by FORCE11 community • Echo previous principles on open data & curation OECD Principles and Guidelines for Access to Research Data from Public Funding (2007) Science as an Open Enterprise (2012) notion of ‘intelligent openness’ where data are accessible, intelligible, assessable and useable G8 Science Ministers Statement (2013)
  • 4. Why has FAIR gained traction?  Meaningful and memorable articulation of concepts  Natural desire to want to be ‘fair’  Easy to understand at a high-level, but… “We understand the basic principle of FAIR, but the terminology is often difficult to grasp immediately. Things could be explained better in plain language” “The term interoperable is quite confusing sometimes and mixed with re-use.” OpenAIRE & FAIR data Expert Group H2020 DMP survey https://doi.org/10.5281/zenodo.1120245
  • 5. What FAIR means: 15 principles Findable F1. (meta)data are assigned a globally unique and eternally persistent identifier. F2. data are described with rich metadata. F3. (meta)data are registered or indexed in a searchable resource. F4. metadata specify the data identifier. Interoperable I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. I2. (meta)data use vocabularies that follow FAIR principles. I3. (meta)data include qualified references to other (meta)data. Accessible A1 (meta)data are retrievable by their identifier using a standardized communications protocol. A1.1 the protocol is open, free, and universally implementable. A1.2 the protocol allows for an authentication and authorization procedure, where necessary. A2 metadata are accessible, even when the data are no longer available. Reusable R1. meta(data) have a plurality of accurate and relevant attributes. R1.1. (meta)data are released with a clear and accessible data usage license. R1.2. (meta)data are associated with their provenance. R1.3. (meta)data meet domain-relevant community standards. Slide CC-BY by Erik Schultes, Leiden UMC doi: 10.1038/sdata.2016.18
  • 6. The FAIR data principles explained • Clarifications from the Dutch Techcentre for Life Sciences • Each principle is a link to further clarification, examples and context https://www.dtls.nl/fair- data/fair-principles-explained R1. Meta(data) are richly described with a plurality of accurate and relevant attributes • By giving data many ‘labels’, it will be much easier to find and reuse the data. • Provide not just metadata that allows discovery, but also metadata that richly describes the context under which that data was generated • “plurality” indicates that metadata should be as generous as possible, even to the point of providing information that may seem irrelevant.
  • 7. FAIR data checklist  Findable - Persistent ID - Metadata online  Accessible - Data online - Restrictions where needed  Interoperable - Use standards, controlled vocabs - Common (open) formats  Reusable - Rich documentation - Clear usage licence https://doi.org/10.5281/zenodo.1065991
  • 8. FAIR vs open data • FAIR data does not have to be open • Data can be shared under restrictions & still be FAIR • Making data FAIR ensures it can be found, understood and reused • Open data is a subset of all the data shared "As open as possible, as closed as necessary"
  • 9. Implementing FAIR • The principles do not specify particular technologies or implementations e.g. semantic web • FAIR is not a standard to be followed or strict criteria – it’s a spectrum / continuum • Supporting FAIR will require investment in infrastructure, coordination across initiatives and engagement with research communities
  • 10. Context is king  Practices varies across research communities  particle physics mostly shares its data inside the large consortia attached to its experiments  social scientists typically provide lots of documentation and methodological information supporting reuse  Communities need to self-organise and define what FAIR data means in their context
  • 11. Emerging initiatives Pilot project and several infrastructure investments https://eoscpilot.eu http://tiny.cc/EOSC-projects https://www.go-fair.org https://www.rd-alliance.org
  • 12. Remit of the FAIR data expert group 1. To develop recommendations on what needs to be done to turn each component of the FAIR data principles into reality 2. To propose indicators to measure progress on each of the FAIR components 3. To provide input to the proposed European Open Science Cloud (EOSC) action plan on how to make data FAIR 4. To contribute to the evaluation of the Horizon 2020 Data Management Plan (DMP) template and development of associated sector / discipline-specific guidance 5. To provide input on the issue of costing and financing data management activities http://tinyurl.com/FAIR-EG
  • 13. FAIR EG progress and plans  Most of the report text is already written  Preliminary recommendations  Working meeting to initiate FAIR Data Action plan next week  Draft due end May  Public consultation around EOSC Summit in June  Final Report and Action Plan due end September  Announce / launch at Austrian Presidency events in October
  • 14. Recommendations to facilitate change  Define what actions are required by different stakeholders (EC, member states, disciplines, international) and when  Structure and prioritise actions across key topics: policy, skills, standards, infrastructure, costs, rewards…  Use these recommendations to develop a core FAIR Data Action Plan  Support the definition of more detailed FAIR Data Action Plans at domain and member state level
  • 15. Thanks - questions? Sarah Jones sarah.jones@glasgow.ac.uk Twitter: @sjDCC www.dcc.ac.uk http://tinyurl.com/FAIR-EG www.force11.org/group/fairgroup/fairprinciples

Editor's Notes

  • #4: OECD – 13 principles e.g. openness, flexible, transparent, legal, interoperable, quality, secure, accountable, efficient… OECD preconditions: ‘data must be accessible and readily located; they must be intelligible to those who wish to scrutinise them; data must be assessable so that judgments can be made about their reliability and the competence of those who created them; and they must be usable by others.’ G8 statement adopted verbatim in the European Commission’s first data guidelines for the Horizon 2020 framework programme later the same year.