@openaire_eu
Research data discovery in
OpenAIRE
Paolo Manghi
InstituteofInformationScienceandTechnologies -CNR
Populatingthe OpenAIRE
scholarlycommunicationgraph
Searching over the
OpenAIRE graph
OpenAIRE - EOSC Hub - EC meeting | Amsterdam | 15th Dec 2017
The OpenAIRE Graph
Research
communities
Researchers (All)
Content providers
Innovators
Research
managers
Funders
Building the graph and Dashboards
OpenAIRE Dashboards
Validation
Cleaning De-duplication
Inference
Info Space Services
Project communiity
FunderFunding
Result
Publicatio
n
Data Software
Organizatio
n
TERMS
OF USE
Harvesting Uploading
Brokering
Source
ORP
Publications
repositories
Data
repositories
CRIS
systems
Registries
OA
Journals
Software
repositories
Content Providers Research
Infras
GUIDE
LINES
OpenAIRE Data Model and Flows
mining
harvesting
deposition Project community
FunderFunding
Result
Publication
Research
Data
Software
Organization
Source
Other res.
products
Building and maintaining an open metadata scholarly
communication graph of interlinked scientific products, in turn
linked to Open Access information, funding information and
community views
The OpenAIRE scholarly communication graph
Complete
De-duplicated
Participatory
Graph
ALL Literature, Research data, Software, Other research
products
• Respecting the OpenAIRE guidelines (DataCite
metadata)
• Using PIDs with resolvers
Content Acquisition Policy
Harvesting: Revised Classification of Research
Products
Publications
• Article
• Preprint
• Report
• …
Datasets
• Dataset
• Collection
• Clinical Trials
• …
Software
• Research
Software
• …
Other Research
Products
• Service
• Workflow
• Interactive
Resource
• …
Institutional/
publication
repositories
Journals/
publishers
Data
repositories
Other
Products
repositories
Software
repositories
Content acquisition policy transition: from Oct
2018 to November 2018
2600000
2800000
3000000
3200000
3400000
3600000
Oct-18 Nov-18
Other research products
0
50000000
100000000
Oct-18 Nov-18
Literature
0
5000000
10000000
15000000
Oct-18 Nov-18
Research Data
0
20000
40000
60000
80000
Oct-18 Nov-18
Software
100+Mi 10+Mi
80+K
40Mi links
Exploring the graph
• Search, browse, claim, and interlink products
• Navigation between interlinked objects
Disovery of data in OpenAIRE
OpenAIRE - EOSC Hub - EC meeting | Amsterdam | 15th Dec 2017
Search plans in OpenAIRE
• Search datasets used in at least K papers
Data maturity-driven search
• Search for data in a community or used-cross-community
Community-driven search
• From dataset files or from related entities (publications,
project)
Search beyond dataset metadata
General challenges raised by experience
• Scientists should take seriously metadata curation and interlinking with other scientific
products
• Systems should be prepared to include new metadata/link information to existing
depositions, to reflec the ecolution of the domain
Low quality metadata
• Datasets have different descriptions, driven by the intended usage, which drive the
possible searches
Metadata citation Vs metadata for reuse within or across disciplines
• Communities should leverage a granularity level adequate to the intended discovery
Varying granularity among communities
Thank you!
Paolo Manghi
paolo.manghi@isti.cnr.it

More Related Content

PPTX
20191119_The OpenAIRE Research Graph
PPTX
OpenAIRE Open Innovation call: Next Generation Repositories
PPTX
20200130_Mannocci_OpenAIRE_ResearchGraph
PPTX
Gobinda Chowdhury
PPTX
Eva Méndez: Política europea y EOSC
PDF
Erwin Folmer - Congres 'Data gedreven Beleidsontwikkeling'
PDF
Making Research Data Repositories Visible – The re3data.org Registry
PDF
Elab 16 5-13-re3data-scholze-final
20191119_The OpenAIRE Research Graph
OpenAIRE Open Innovation call: Next Generation Repositories
20200130_Mannocci_OpenAIRE_ResearchGraph
Gobinda Chowdhury
Eva Méndez: Política europea y EOSC
Erwin Folmer - Congres 'Data gedreven Beleidsontwikkeling'
Making Research Data Repositories Visible – The re3data.org Registry
Elab 16 5-13-re3data-scholze-final

What's hot (20)

PDF
The Structured Data Hub in 2019
PPTX
The D4Science Infrastructure
PPT
Global registries initiative frumkin omodei
PPTX
The OpenAIRE Infrastructure: A Vision towards e-infrastructure Commons (e-...
PPTX
EOSC pilot STFC
PDF
PPTX
Building data networks: exploring trust and interoperability between authoris...
PPTX
Jisc on repositories unleashing data - Daniela Duca
PPT
Linking Collections Through Linked Open Data
PPT
Altman RDAP11 Policy-based Data Management
PPTX
Lightning Talks - Intro
PPTX
RDN Lightning talk - Open Research Leeds (@OpenResLeeds): networks, metrics a...
PDF
WP3: overzicht van de voortgang van WP# op de CLARIAH-dag
PPTX
Managing data behind creative masterpieces -RCM
PPTX
PPTX
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
PPTX
Managing data behind creative masterpieces
PPT
Smith RDAP11 NSF Data Management Plan Case Studies
PPTX
Report from RDAPlenary 3 to DataCitation Community in Australia
PPT
Open Data Publication - Requirements, Good practices, and Benefits
The Structured Data Hub in 2019
The D4Science Infrastructure
Global registries initiative frumkin omodei
The OpenAIRE Infrastructure: A Vision towards e-infrastructure Commons (e-...
EOSC pilot STFC
Building data networks: exploring trust and interoperability between authoris...
Jisc on repositories unleashing data - Daniela Duca
Linking Collections Through Linked Open Data
Altman RDAP11 Policy-based Data Management
Lightning Talks - Intro
RDN Lightning talk - Open Research Leeds (@OpenResLeeds): networks, metrics a...
WP3: overzicht van de voortgang van WP# op de CLARIAH-dag
Managing data behind creative masterpieces -RCM
Using Open Research Data for Public Policy Making: Opportunities of Virtual R...
Managing data behind creative masterpieces
Smith RDAP11 NSF Data Management Plan Case Studies
Report from RDAPlenary 3 to DataCitation Community in Australia
Open Data Publication - Requirements, Good practices, and Benefits
Ad

Similar to Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018) (20)

PPTX
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
PDF
7th Content Providers Community Call
PPTX
Belgium webinar - openAIRE Research Graph
PPTX
Introduction to OpenAIRE services and the OpenAIRE Research Graph
PPTX
A user journey in OpenAIRE services through the lens of repository managers -...
PPTX
A user journey in OpenAIRE services through the lens of repository managers -...
PPTX
OpenAIRE services and tools - presentation at #DI4R2016
PDF
4th Content Providers Community Call
PPTX
Facilitate Research Communities Adoption of Open Science Publishing Principle...
PPTX
OpenAIRE provide dashboard #OpenAIREweek2020
PPTX
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
PDF
The Services of the OpenAIREplus Infrastructure for Scholarly Communication –...
PPTX
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
PPTX
OpenAIRE services & tools: Zenodo and what's next (Danish OpenAIRE workshop)
PPTX
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
PPTX
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
PPTX
Open aire services_v2.0
PPTX
Moving content across the OpenAIRE infrastructure boundaries (6th RDA Plenary)
PPTX
Make your content count - OpenAIRE Content providers Dashboard: service for r...
PDF
OpenAIRE Content Acquisition Policy: expanding the scope #OpenREPO2019 poster
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
7th Content Providers Community Call
Belgium webinar - openAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research Graph
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...
OpenAIRE services and tools - presentation at #DI4R2016
4th Content Providers Community Call
Facilitate Research Communities Adoption of Open Science Publishing Principle...
OpenAIRE provide dashboard #OpenAIREweek2020
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
The Services of the OpenAIREplus Infrastructure for Scholarly Communication –...
IDCC workshop: OpenAIRE services and tools for Open Research Data in H2020
OpenAIRE services & tools: Zenodo and what's next (Danish OpenAIRE workshop)
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
Open aire services_v2.0
Moving content across the OpenAIRE infrastructure boundaries (6th RDA Plenary)
Make your content count - OpenAIRE Content providers Dashboard: service for r...
OpenAIRE Content Acquisition Policy: expanding the scope #OpenREPO2019 poster
Ad

More from OpenAIRE (20)

PDF
10th OpenAIRE Content Providers Community Call
PDF
9th Content Providers Community Call\
PPTX
OpenAIRE in the European Open Science Cloud (EOSC)
PDF
8th Content Providers Community Call
PDF
OpenAIRE PROVIDE Dashboard for Turkish repository managers
PDF
What will it cost to manage and share my data?
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
PDF
6th Content Providers Community Call
PPTX
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
PPTX
20200504_Research Data & the GDPR: How Open is Open?
PDF
20200504_Data, Data Ownership and Open Science
PPTX
20200429_Research Data & the GDPR: How Open is Open? (updated version)
PDF
20200429_Data, Data Ownership and Open Science
PPTX
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
PDF
COVID-19: Activities, tools, best practice and contact points in Greece
PDF
5th Content Providers Community Call
PDF
3rd Content Providers Community Call
PDF
2nd Content Providers Community Call
10th OpenAIRE Content Providers Community Call
9th Content Providers Community Call\
OpenAIRE in the European Open Science Cloud (EOSC)
8th Content Providers Community Call
OpenAIRE PROVIDE Dashboard for Turkish repository managers
What will it cost to manage and share my data?
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
6th Content Providers Community Call
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_Research Data & the GDPR: How Open is Open?
20200504_Data, Data Ownership and Open Science
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Data, Data Ownership and Open Science
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
COVID-19: Activities, tools, best practice and contact points in Greece
5th Content Providers Community Call
3rd Content Providers Community Call
2nd Content Providers Community Call

Recently uploaded (20)

PPTX
ELISA(Enzyme linked immunosorbent assay)
PPTX
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
PPTX
endocrine - management of adrenal incidentaloma.pptx
PDF
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
PPTX
BPharm_Hospital_Organization_Complete_PPT.pptx
PDF
Social preventive and pharmacy. Pdf
PDF
Packaging materials of fruits and vegetables
PDF
Metabolic Acidosis. pa,oakw,llwla,wwwwqw
PDF
2019UpdateAHAASAAISGuidelineSlideDeckrevisedADL12919.pdf
PPTX
Introduction to Immunology (Unit-1).pptx
PDF
Cosmology using numerical relativity - what hapenned before big bang?
PDF
Sujay Rao Mandavilli IJISRT25AUG764 context based approaches to population ma...
PPTX
HAEMATOLOGICAL DISEASES lack of red blood cells, which carry oxygen throughou...
PPTX
limit test definition and all limit tests
PPT
Cell Structure Description and Functions
PPT
Enhancing Laboratory Quality Through ISO 15189 Compliance
PPTX
2currentelectricity1-201006102815 (1).pptx
PPTX
Understanding the Circulatory System……..
PPTX
AP CHEM 1.2 Mass spectroscopy of elements
PDF
The Future of Telehealth: Engineering New Platforms for Care (www.kiu.ac.ug)
ELISA(Enzyme linked immunosorbent assay)
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
endocrine - management of adrenal incidentaloma.pptx
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
BPharm_Hospital_Organization_Complete_PPT.pptx
Social preventive and pharmacy. Pdf
Packaging materials of fruits and vegetables
Metabolic Acidosis. pa,oakw,llwla,wwwwqw
2019UpdateAHAASAAISGuidelineSlideDeckrevisedADL12919.pdf
Introduction to Immunology (Unit-1).pptx
Cosmology using numerical relativity - what hapenned before big bang?
Sujay Rao Mandavilli IJISRT25AUG764 context based approaches to population ma...
HAEMATOLOGICAL DISEASES lack of red blood cells, which carry oxygen throughou...
limit test definition and all limit tests
Cell Structure Description and Functions
Enhancing Laboratory Quality Through ISO 15189 Compliance
2currentelectricity1-201006102815 (1).pptx
Understanding the Circulatory System……..
AP CHEM 1.2 Mass spectroscopy of elements
The Future of Telehealth: Engineering New Platforms for Care (www.kiu.ac.ug)

Research data discovery in OpenAIRE (Presentation by Paolo Manghi at DI4R2018)

  • 1. @openaire_eu Research data discovery in OpenAIRE Paolo Manghi InstituteofInformationScienceandTechnologies -CNR
  • 2. Populatingthe OpenAIRE scholarlycommunicationgraph Searching over the OpenAIRE graph OpenAIRE - EOSC Hub - EC meeting | Amsterdam | 15th Dec 2017
  • 4. Research communities Researchers (All) Content providers Innovators Research managers Funders Building the graph and Dashboards OpenAIRE Dashboards Validation Cleaning De-duplication Inference Info Space Services Project communiity FunderFunding Result Publicatio n Data Software Organizatio n TERMS OF USE Harvesting Uploading Brokering Source ORP Publications repositories Data repositories CRIS systems Registries OA Journals Software repositories Content Providers Research Infras GUIDE LINES
  • 5. OpenAIRE Data Model and Flows mining harvesting deposition Project community FunderFunding Result Publication Research Data Software Organization Source Other res. products
  • 6. Building and maintaining an open metadata scholarly communication graph of interlinked scientific products, in turn linked to Open Access information, funding information and community views The OpenAIRE scholarly communication graph Complete De-duplicated Participatory Graph
  • 7. ALL Literature, Research data, Software, Other research products • Respecting the OpenAIRE guidelines (DataCite metadata) • Using PIDs with resolvers Content Acquisition Policy
  • 8. Harvesting: Revised Classification of Research Products Publications • Article • Preprint • Report • … Datasets • Dataset • Collection • Clinical Trials • … Software • Research Software • … Other Research Products • Service • Workflow • Interactive Resource • … Institutional/ publication repositories Journals/ publishers Data repositories Other Products repositories Software repositories
  • 9. Content acquisition policy transition: from Oct 2018 to November 2018 2600000 2800000 3000000 3200000 3400000 3600000 Oct-18 Nov-18 Other research products 0 50000000 100000000 Oct-18 Nov-18 Literature 0 5000000 10000000 15000000 Oct-18 Nov-18 Research Data 0 20000 40000 60000 80000 Oct-18 Nov-18 Software 100+Mi 10+Mi 80+K 40Mi links
  • 11. • Search, browse, claim, and interlink products • Navigation between interlinked objects Disovery of data in OpenAIRE OpenAIRE - EOSC Hub - EC meeting | Amsterdam | 15th Dec 2017
  • 12. Search plans in OpenAIRE • Search datasets used in at least K papers Data maturity-driven search • Search for data in a community or used-cross-community Community-driven search • From dataset files or from related entities (publications, project) Search beyond dataset metadata
  • 13. General challenges raised by experience • Scientists should take seriously metadata curation and interlinking with other scientific products • Systems should be prepared to include new metadata/link information to existing depositions, to reflec the ecolution of the domain Low quality metadata • Datasets have different descriptions, driven by the intended usage, which drive the possible searches Metadata citation Vs metadata for reuse within or across disciplines • Communities should leverage a granularity level adequate to the intended discovery Varying granularity among communities