SlideShare a Scribd company logo
CORE APIv3: Seamless machine access to open
access metadata and full texts from across the
global repositories and publishers network
Matteo Cancellieri, Valerii Budko, Samuel Pearce, Petr Knoth
Knowledge Media institute, The Open University
CORE - https://core.ac.uk
API - https://core.ac.uk/services/api
Twitter - @oacore
• What can you do with the CORE API?
• Lessons learned from v2 and new features in v3
Outline
In doing so, we:
● enrich scholarly data using state-of-the-art text and data
mining technologies to aid discoverability,
● enable others to develop new tools and use cases on top
of the CORE platform,
● support the network of open access repositories and
journals with innovative technical, solutions and,
● facilitate a scalable, cost-effective route for the delivery
of open scholarship.
CORE’s mission
CORE’s mission is to aggregate all open access research worldwide and deliver
unrestricted access for all.
CORE services
Content discovery Raw data services Managing content
Discovery
Recommender
API
Dataset
FastSync
Repository Dashboard
Repository Edition
Search
Free to read links to full
text papers
~97 million
Data providers
10,372
Full texts hosted
directly by CORE
28,468,748
Languages
> 90
Countries
190
Metadata records
218,808,331
> 30M
monthly
active
users
Recent success stories using the CORE API
- Plagiarism detection
- Open Access papers discovery
- Fact checking
- New approaches to research evaluation
- Innovation engineering
- Content translation
- Trends detection
- Rapid systematic reviews
- Open Access Compliance Monitoring
Find more success stories at:
https://core.ac.uk/about/endorsements AND https://blog.core.ac.uk
● New model abstraction to represent the
scholarly world
● More coherent search queries
● Easier to access large datasets
● Improved analytical tools
● User management made easy
● Better documentation
● A gallery to kick start your journey with the
API
What's new
🖋 Documentation in Swagger
🖋 PHP + Symfony implementation
🚀 Elasticsearch
API clients
•Java https://github.com/oacore/oacore4j
•Python https://github.com/oacore/pyoacore
•R https://github.com/ropensci/rcoreoa
CORE API: where are we?
> 3,000 registered users​ > 300 active API users
(in the last two months )
Works
A deduplicated and polished item, it is made with the best metadata we can use from multiple articles
from different sources, it includes enrichments.
Article (old name) /Output (new name)
It is data coming directly from the data providers. It mostly comes from OAI-PMH but there also other
different data providers. The data is uniform so all the different data providers lead to a single metadata
format.
Data provider
It contains repositories (institutional
and disciplinary), preprint servers, journals and
publishers.
Journal
This dataset contains all journal titles included
in the CORE collection.
How CORE sees the world
1...n versions
contains contains
Improved search queries
+ better sorting
+ better filtering
Large dataset access
▪ The API now support querying
for medium size datasets
(1,000-100,000 records) through
the scroll parameter.
▪ For large datasets (>100,000),
consider the CORE dataset
Better analytical tools
Meaningful statistics for all
the entities in CORE
Search aggregations to help
you explore the data
User management
CORE
communities!
Great
docs
Easy to start with
Examples
Gallery
Free to
use
Free to
register
Easy to start with
Documentation (https://api.core.ac.uk/docs/v3)
Example from the quick start gallery
Examples
Collaboration network for publications using
the term covid in 2021
Most records using
the terms
“unprecedented
times” 2018-2021
2018 2019
2020 2021
Examples
Roadmap
APIv3 is in
production
Sunset period for
APIv2 (in Q2 2022)
Questions? https://bit.ly/core-apiv3
Feedback
If you are using the API for research, please cite one of our
research outputs https://core.ac.uk/about/research-
outputs
Show us how you are using the API
Get in touch if you have questions.
Questions? https://bit.ly/core-apiv3
Thank you!

More Related Content

PPTX
Access the world’s research outputs through the CORE API
PPTX
Better together: building services for public good on top of content from the...
PPTX
Better together: building services for public good on top of content from the...
PPT
CORE - Petr Knoth, Research Associate
PPTX
Closing the scientific literature access gap with CORE - how to gain free acc...
PPTX
Core @ repositories fringe 2015
PPTX
Developing Infrastructure to Support Closer Collaboration of Aggregators with...
PPTX
Analysing the performance of open access papers discovery tools
Access the world’s research outputs through the CORE API
Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...
CORE - Petr Knoth, Research Associate
Closing the scientific literature access gap with CORE - how to gain free acc...
Core @ repositories fringe 2015
Developing Infrastructure to Support Closer Collaboration of Aggregators with...
Analysing the performance of open access papers discovery tools

Similar to CORE APIv3 (20)

PPTX
Implementation of the RIOXX Metadata Guidelines in the UK's repositories thro...
PDF
Putting Open Access into Practice
PPTX
Open Science and the power of repositories: OpenAIRE OA Broker, Guidelines ad...
PPTX
Text mining in CORE (OR2012)
PDF
OpenCitations
PPTX
A user journey in OpenAIRE services through the lens of repository managers -...
PPTX
My repository is being aggregated: a blessing or a curse?
PDF
4th Content Providers Community Call
PDF
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
PPTX
DCMI webinar - OpenAIRE Guidelines: Promoting Repositories Interoperability a...
PDF
20190527_Paolo Manghi_ OpenAIRE monitoring
PPTX
Towards an Infrastructure for Mining Scientific Publications
PPTX
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
PPTX
Making your Repository or Open Access Journal OpenAIRE compatible with OA Hor...
PPTX
Tracking compliance of the REF2021 policy with the CORE Repository Dashboard
PPTX
Uk CORR presentation
PPTX
A user journey in OpenAIRE services through the lens of repository managers -...
PPTX
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
PDF
Modelling research output expressions : metadata schema modelling of publicat...
PPTX
From Open Access Metadata to Open Access Content: Two Principles for Increase...
Implementation of the RIOXX Metadata Guidelines in the UK's repositories thro...
Putting Open Access into Practice
Open Science and the power of repositories: OpenAIRE OA Broker, Guidelines ad...
Text mining in CORE (OR2012)
OpenCitations
A user journey in OpenAIRE services through the lens of repository managers -...
My repository is being aggregated: a blessing or a curse?
4th Content Providers Community Call
Object Reuse and Exchange (ORE) : Experience in the Open Language Archives Co...
DCMI webinar - OpenAIRE Guidelines: Promoting Repositories Interoperability a...
20190527_Paolo Manghi_ OpenAIRE monitoring
Towards an Infrastructure for Mining Scientific Publications
OpenAIRE content in support of Open Science monitoring (Presentation by Paolo...
Making your Repository or Open Access Journal OpenAIRE compatible with OA Hor...
Tracking compliance of the REF2021 policy with the CORE Repository Dashboard
Uk CORR presentation
A user journey in OpenAIRE services through the lens of repository managers -...
OpenAIRE infrastructure and Services (OpenAIRE Workshop Malta)
Modelling research output expressions : metadata schema modelling of publicat...
From Open Access Metadata to Open Access Content: Two Principles for Increase...
Ad

More from petrknoth (19)

PPTX
Qui Bono? Cumulative advantage in open access publishing
PPTX
OAI Identifiers: Decentralised PIDs for Research Outputs in Repositories
PPTX
UKRI OA policy requirements for repositories and how to meet them
PPTX
Enabling Educators to Locate High-Quality Teaching Resources
PPTX
CORE Analytics Dashboard
PPTX
Assessing Compliance with the UK REF 2021 Open Access Policy
PPTX
Data interoperability toolkit (OpenMinTeD)
PPTX
Integrating research indicators for use in the repositories infrastructure
PPTX
Towards effective research recommender systems for repositories
PPTX
COAR Next Generation Repositories WG - Text mining and Recommender system sto...
PPTX
Seamless access to the world’s open access research papers via ResourceSync
PPTX
Semantometrics: Towards Fulltext-based Research Evaluation
PPTX
Aggregating Research papers from Publishers' Systems to Support Text and Data...
PPTX
FOSTER - Content Delivery (WP3)
PPTX
DiggiCORE: Digging into Connected Repositories
PPTX
DEVCSI Core Mobile
PPTX
CORE: Aggregating and Enriching Content to Support Open Access
PPTX
CORE projects family
PPTX
Core presentation
Qui Bono? Cumulative advantage in open access publishing
OAI Identifiers: Decentralised PIDs for Research Outputs in Repositories
UKRI OA policy requirements for repositories and how to meet them
Enabling Educators to Locate High-Quality Teaching Resources
CORE Analytics Dashboard
Assessing Compliance with the UK REF 2021 Open Access Policy
Data interoperability toolkit (OpenMinTeD)
Integrating research indicators for use in the repositories infrastructure
Towards effective research recommender systems for repositories
COAR Next Generation Repositories WG - Text mining and Recommender system sto...
Seamless access to the world’s open access research papers via ResourceSync
Semantometrics: Towards Fulltext-based Research Evaluation
Aggregating Research papers from Publishers' Systems to Support Text and Data...
FOSTER - Content Delivery (WP3)
DiggiCORE: Digging into Connected Repositories
DEVCSI Core Mobile
CORE: Aggregating and Enriching Content to Support Open Access
CORE projects family
Core presentation
Ad

Recently uploaded (20)

PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
Sciences of Europe No 170 (2025)
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Pharmacology of Autonomic nervous system
PPTX
Microbiology with diagram medical studies .pptx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
The scientific heritage No 166 (166) (2025)
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
TOTAL hIP ARTHROPLASTY Presentation.pptx
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
7. General Toxicologyfor clinical phrmacy.pptx
Sciences of Europe No 170 (2025)
The KM-GBF monitoring framework – status & key messages.pptx
lecture 2026 of Sjogren's syndrome l .pdf
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Pharmacology of Autonomic nervous system
Microbiology with diagram medical studies .pptx
Classification Systems_TAXONOMY_SCIENCE8.pptx
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
The scientific heritage No 166 (166) (2025)
neck nodes and dissection types and lymph nodes levels
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf

CORE APIv3

  • 1. CORE APIv3: Seamless machine access to open access metadata and full texts from across the global repositories and publishers network Matteo Cancellieri, Valerii Budko, Samuel Pearce, Petr Knoth Knowledge Media institute, The Open University CORE - https://core.ac.uk API - https://core.ac.uk/services/api Twitter - @oacore
  • 2. • What can you do with the CORE API? • Lessons learned from v2 and new features in v3 Outline
  • 3. In doing so, we: ● enrich scholarly data using state-of-the-art text and data mining technologies to aid discoverability, ● enable others to develop new tools and use cases on top of the CORE platform, ● support the network of open access repositories and journals with innovative technical, solutions and, ● facilitate a scalable, cost-effective route for the delivery of open scholarship. CORE’s mission CORE’s mission is to aggregate all open access research worldwide and deliver unrestricted access for all.
  • 4. CORE services Content discovery Raw data services Managing content Discovery Recommender API Dataset FastSync Repository Dashboard Repository Edition Search
  • 5. Free to read links to full text papers ~97 million Data providers 10,372 Full texts hosted directly by CORE 28,468,748 Languages > 90 Countries 190 Metadata records 218,808,331 > 30M monthly active users
  • 6. Recent success stories using the CORE API - Plagiarism detection - Open Access papers discovery - Fact checking - New approaches to research evaluation - Innovation engineering - Content translation - Trends detection - Rapid systematic reviews - Open Access Compliance Monitoring Find more success stories at: https://core.ac.uk/about/endorsements AND https://blog.core.ac.uk
  • 7. ● New model abstraction to represent the scholarly world ● More coherent search queries ● Easier to access large datasets ● Improved analytical tools ● User management made easy ● Better documentation ● A gallery to kick start your journey with the API What's new
  • 8. 🖋 Documentation in Swagger 🖋 PHP + Symfony implementation 🚀 Elasticsearch API clients •Java https://github.com/oacore/oacore4j •Python https://github.com/oacore/pyoacore •R https://github.com/ropensci/rcoreoa CORE API: where are we? > 3,000 registered users​ > 300 active API users (in the last two months )
  • 9. Works A deduplicated and polished item, it is made with the best metadata we can use from multiple articles from different sources, it includes enrichments. Article (old name) /Output (new name) It is data coming directly from the data providers. It mostly comes from OAI-PMH but there also other different data providers. The data is uniform so all the different data providers lead to a single metadata format. Data provider It contains repositories (institutional and disciplinary), preprint servers, journals and publishers. Journal This dataset contains all journal titles included in the CORE collection. How CORE sees the world 1...n versions contains contains
  • 10. Improved search queries + better sorting + better filtering
  • 11. Large dataset access ▪ The API now support querying for medium size datasets (1,000-100,000 records) through the scroll parameter. ▪ For large datasets (>100,000), consider the CORE dataset
  • 12. Better analytical tools Meaningful statistics for all the entities in CORE Search aggregations to help you explore the data
  • 14. Great docs Easy to start with Examples Gallery Free to use Free to register
  • 15. Easy to start with Documentation (https://api.core.ac.uk/docs/v3) Example from the quick start gallery
  • 16. Examples Collaboration network for publications using the term covid in 2021
  • 17. Most records using the terms “unprecedented times” 2018-2021 2018 2019 2020 2021 Examples
  • 18. Roadmap APIv3 is in production Sunset period for APIv2 (in Q2 2022) Questions? https://bit.ly/core-apiv3
  • 19. Feedback If you are using the API for research, please cite one of our research outputs https://core.ac.uk/about/research- outputs Show us how you are using the API Get in touch if you have questions. Questions? https://bit.ly/core-apiv3