SlideShare a Scribd company logo
Scaling Usage Statistics across
Repositories as an OpenAIRE Analytics
Service
Dimitris Pierrakos, ATHENA Research & Innovation Center
Jochen Schirrwagen, Bielefeld University
Pedro Miguel Oliveira Bento Príncipe, University of Minho
Ricardo Saraiva, University of Minho
OR2016 Conference – Dublin – June 2016
OR2016 Conference – Dublin – June 2016
Outline
• Introduction
• Methodology
• Pilots & Preliminary Results
• Conclusions & The Future
OR2016 Conference – Dublin – June 2016
Outline
• Introduction
• Methodology
• Pilots & Preliminary Results
• Conclusions & The Future
OR2016 Conference – Dublin – June 2016
OpenAIRE 2020
• A pan-European Research Information platform to
monitor OA research outcomes from EC and other
national funders.
• Research analytics tools to promote new scientific
metrics & support evidence-based decision-making.
• Implementation of an OpenAIRE usage analytics
service for usage data collected from data providers
OR2016 Conference – Dublin – June 2016
Usage Analysis Service: Aims
• Standard alignment across heterogeneous data
providers for gathering usage data & sharing
statistics.
• Taking care of data privacy policies in EU and
member states.
• Collection, measure and analysis of usage data
(downloads and views).
• Correlate with other altmetrics.
OR2016 Conference – Dublin – June 2016
Outline
• Introduction
• Methodology
• Pilots & Preliminary Results
• Conclusions & The Future
OR2016 Conference – Dublin – June 2016
Methodology
• Tracking phase
• Tier 1 approach: direct tracking
• Tier 2 approach: exploit Sushi Lite API
• Analysis phase
• Import
• Process (COUNTER4 compliance)
• Analyze
OR2016 Conference – Dublin – June 2016
Using Piwik in OpenAIRE
• An Open Source analytics platform
• Tracking via JavaScript embedded in Web pages
• Usage parameters:VisitorID, SessionID,Visitor
IP,Timestamp,Country,and many more
• IP anonymization enabled
• Bots handling
OR2016 Conference – Dublin – June 2016
Counter Code of Practice
• An International, extendible Code of Practice for e-
Resources.
• Measures usage information in a credible,
consistent and compatible way using vendor-
generated data.
• Specifications for:
• Data Collection & Processing
• Usage Analysis Reports
• Currently in Release 4
OR2016 Conference – Dublin – June 2016
Tier 1 Tracking Workflow
Repository
Javascript event trackers
Data Anonymization
Usage
Data
Import Process
OR2016 Conference – Dublin – June 2016
Tier 2: Aggregated Statistics
Workflow using SUSHI Lite
Aggregator
service
Repository 1 Repository 2
Anonymization
Import Process
Usage Data
OR2016 Conference – Dublin – June 2016
Usage Data Analysis
Usage
Statistics
OpenAIRE
Usage Data
OR2016 Conference – Dublin – June 2016
Deduplication Process
Deduplication
Item_xxx
rep1_id
Item_xxx
rep2_id
Dedup_xxx
(rep1_Id,
rep2_id)
Repository 2
Repository 1
• Enhances the
calculation of
usage statistics by
having a single id
for common
records
• Disseminate cross-
repository usage
statistics
OR2016 Conference – Dublin – June 2016
Outline
• Introduction
• Methodology
• Pilots & Preliminary Results
• Conclusions & The Future
OR2016 Conference – Dublin – June 2016
Pilots 1st phase
3 Repositories OpenAIRE Portal
OR2016 Conference – Dublin – June 2016
Pilots 2nd phase
31 Repositories OpenAIRE Portal
IRUS-UK
1 Repository
1 Repository
1 Repository
2 Repositories
OR2016 Conference – Dublin – June 2016
Preliminary Results
Metadata Views – Downloads on Pilot Repositories
0
50000
100000
150000
200000
250000
300000
350000
400000
UMINHO UEVORA UCOIMBRA
views
downloads
OR2016 Conference – Dublin – June 2016
Preliminary Results
Metadata Views – Downloads Duplicate Information
0
5
10
15
20
25
30
Duplicate Articles Views Downloads
UMINHO downloads
UEVORA downloads
UMINHO views
UEVORA views
OR2016 Conference – Dublin – June 2016
Outline
• Introduction
• Methodology
• Pilots & Preliminary Results
• Conclusions & The Future
OR2016 Conference – Dublin – June 2016
Conclusions
✓ Usage Analysis in OpenAIRE
✓ Methodology
✓ Pilot Results
✓ Challenges
✓ Better handling of bots tracking and “gaming”
activity in usage data
✓ Tackling of direct downloads
OR2016 Conference – Dublin – June 2016
The Future
✓Collaboration with National Open
Access Desks (NOADs) for usage
service dissemination
✓Beta release in 2016
✓Production release in 2017.
OR2016 Conference – Dublin – June 2016
http://openaire.eu
@openaire_eu
https://www.facebook.com/groups/openaire/
https://www.linkedin.com/groups/3893548/profile
info@openaire.eu

More Related Content

PDF
The FP7 Post-Grant Open Access Pilot: An All-Encompassing Gold Open Access Fu...
PPTX
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
PPTX
Peer Review on the Move from Closed to Open
PPTX
Tony Ross-Hellauer: eInfrastructure for Open Science
PDF
OpenAIRE Presentation @3AMconf - Supporting Research Analytics by OpenAIRE Us...
PDF
OpenAIRE@info day_amsterdam_jan_2016
PDF
20190527_Helena Cousijn _ FREYA
PPTX
OpenAIRE services and tools - presentation at #DI4R2016
The FP7 Post-Grant Open Access Pilot: An All-Encompassing Gold Open Access Fu...
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
Peer Review on the Move from Closed to Open
Tony Ross-Hellauer: eInfrastructure for Open Science
OpenAIRE Presentation @3AMconf - Supporting Research Analytics by OpenAIRE Us...
OpenAIRE@info day_amsterdam_jan_2016
20190527_Helena Cousijn _ FREYA
OpenAIRE services and tools - presentation at #DI4R2016

What's hot (20)

PPTX
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
PPTX
20170530_Open Research Data in Horizon 2020
PPTX
Towards a European Research Information Infrastructure
PDF
Making research visible, making research count
PPTX
The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
PPTX
OpenAIRE: Directrices 3.0, desarrollos y servicios para Gestores de Repositorios
PPTX
OpenAIRE: Services for Funders - Lightning Talk at #DI4R conference (Krakov, ...
PPTX
OpenAIRE webinar on Open Access in H2020 (OAW2016)
PPTX
User engagement in OpenAIRE - panel presentation at #DI4R2016
PPTX
Infrastructure for the Data Revolution: How OpenAIRE supports the EC’s Open ...
PPTX
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
PDF
OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)
PPTX
Open by default: the challenges of research data in Europe
PDF
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
PPTX
OpenAIRE: Open Science as-a-Service - presentation at #DI4R2016
PPTX
OpenAIRE-connect: Services for open science
PPTX
OpenAIRE in 8 minutes - Introduction to European einfrastructures session at ...
PPTX
OpenAIRE presentation - Open Access Week 2014 @EKT Conference (Greece)
PPTX
Open access to publications in Horizon 2020
PPTX
Marina Angelaki - PASTEUR4OA: Supporting Open Access Policies
OpenAIRE services and tools - 6th National Open Access Conference and OpenAIR...
20170530_Open Research Data in Horizon 2020
Towards a European Research Information Infrastructure
Making research visible, making research count
The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
OpenAIRE: Directrices 3.0, desarrollos y servicios para Gestores de Repositorios
OpenAIRE: Services for Funders - Lightning Talk at #DI4R conference (Krakov, ...
OpenAIRE webinar on Open Access in H2020 (OAW2016)
User engagement in OpenAIRE - panel presentation at #DI4R2016
Infrastructure for the Data Revolution: How OpenAIRE supports the EC’s Open ...
OpenAIRE guidelines and broker service for repository managers - OpenAIRE #OA...
OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)
Open by default: the challenges of research data in Europe
OpenAIRE webinar: Horizon 2020 Open Science Policies and beyond, with Emilie ...
OpenAIRE: Open Science as-a-Service - presentation at #DI4R2016
OpenAIRE-connect: Services for open science
OpenAIRE in 8 minutes - Introduction to European einfrastructures session at ...
OpenAIRE presentation - Open Access Week 2014 @EKT Conference (Greece)
Open access to publications in Horizon 2020
Marina Angelaki - PASTEUR4OA: Supporting Open Access Policies
Ad

Similar to Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service - presentation at #OR2016 (20)

PPTX
Everything Counts in Large Amounts: Measuring the Impact of Usage Activity in...
PPTX
Managing active research in the University of Edinburgh
PPTX
OpenAIRE implementing open science
PDF
Quality assurance system for the online journal of Finnish applied unis
PPTX
Presentation of NISO Altmetrics RP - Charleston Library Conference
PDF
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
PPTX
Linked Data Initiatives at Springer Verlag
PDF
The FAIR Principles and the IMI FAIRplus project
PPTX
Aliaksandr Birukou. Linked Data Initiatives at Springer Verlag
PDF
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
PPTX
Edit Grogh - Open Peer Review in OpenUP | OpenUP Final Conference
PPTX
Research at risk: developing a shared research data management service for UK...
PPTX
Research data management: DMP & repository
PPTX
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
PPTX
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
PPTX
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
PPTX
OSFair2017 Workshop | EOSCpilot monitoring framework overview and goals
PPTX
OpenAIRE - Implementing Open Science (presentation by Natalia Manola at Food ...
PPTX
Online UAS Journal promoting networking in research
PPTX
Jisc Monitor Pilot Project: an exploration of how a Jisc managed shared servi...
Everything Counts in Large Amounts: Measuring the Impact of Usage Activity in...
Managing active research in the University of Edinburgh
OpenAIRE implementing open science
Quality assurance system for the online journal of Finnish applied unis
Presentation of NISO Altmetrics RP - Charleston Library Conference
OpenAIRE: Science. Set Free, Iryna Kuchma, EIFL
Linked Data Initiatives at Springer Verlag
The FAIR Principles and the IMI FAIRplus project
Aliaksandr Birukou. Linked Data Initiatives at Springer Verlag
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
Edit Grogh - Open Peer Review in OpenUP | OpenUP Final Conference
Research at risk: developing a shared research data management service for UK...
Research data management: DMP & repository
OpenAIRE: Implementing Open Science in EOSC - crosscutting with RDA (Presenta...
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OSFair2017 Workshop | EOSCpilot monitoring framework overview and goals
OpenAIRE - Implementing Open Science (presentation by Natalia Manola at Food ...
Online UAS Journal promoting networking in research
Jisc Monitor Pilot Project: an exploration of how a Jisc managed shared servi...
Ad

More from OpenAIRE (20)

PDF
10th OpenAIRE Content Providers Community Call
PDF
9th Content Providers Community Call\
PPTX
OpenAIRE in the European Open Science Cloud (EOSC)
PDF
8th Content Providers Community Call
PDF
7th Content Providers Community Call
PDF
OpenAIRE PROVIDE Dashboard for Turkish repository managers
PDF
What will it cost to manage and share my data?
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
PDF
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
PDF
6th Content Providers Community Call
PPTX
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
PPTX
20200504_Research Data & the GDPR: How Open is Open?
PDF
20200504_Data, Data Ownership and Open Science
PPTX
20200429_Research Data & the GDPR: How Open is Open? (updated version)
PDF
20200429_Data, Data Ownership and Open Science
PPTX
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
PDF
COVID-19: Activities, tools, best practice and contact points in Greece
PDF
5th Content Providers Community Call
PDF
4th Content Providers Community Call
10th OpenAIRE Content Providers Community Call
9th Content Providers Community Call\
OpenAIRE in the European Open Science Cloud (EOSC)
8th Content Providers Community Call
7th Content Providers Community Call
OpenAIRE PROVIDE Dashboard for Turkish repository managers
What will it cost to manage and share my data?
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
6th Content Providers Community Call
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_Research Data & the GDPR: How Open is Open?
20200504_Data, Data Ownership and Open Science
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Data, Data Ownership and Open Science
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
COVID-19: Activities, tools, best practice and contact points in Greece
5th Content Providers Community Call
4th Content Providers Community Call

Recently uploaded (20)

PPTX
Module 1 - Cyber Law and Ethics 101.pptx
PPTX
Introuction about ICD -10 and ICD-11 PPT.pptx
PPTX
Introduction about ICD -10 and ICD11 on 5.8.25.pptx
PDF
Decoding a Decade: 10 Years of Applied CTI Discipline
PDF
Introduction to the IoT system, how the IoT system works
PDF
Unit-1 introduction to cyber security discuss about how to secure a system
PPTX
Power Point - Lesson 3_2.pptx grad school presentation
PPT
Design_with_Watersergyerge45hrbgre4top (1).ppt
PPT
isotopes_sddsadsaadasdasdasdasdsa1213.ppt
PDF
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
PPTX
PptxGenJS_Demo_Chart_20250317130215833.pptx
PPTX
E -tech empowerment technologies PowerPoint
PDF
Slides PDF The World Game (s) Eco Economic Epochs.pdf
PPTX
Introduction to Information and Communication Technology
PDF
The Internet -By the Numbers, Sri Lanka Edition
PPTX
Funds Management Learning Material for Beg
PPTX
522797556-Unit-2-Temperature-measurement-1-1.pptx
PDF
SASE Traffic Flow - ZTNA Connector-1.pdf
PPTX
innovation process that make everything different.pptx
PDF
FINAL CALL-6th International Conference on Networks & IOT (NeTIOT 2025)
Module 1 - Cyber Law and Ethics 101.pptx
Introuction about ICD -10 and ICD-11 PPT.pptx
Introduction about ICD -10 and ICD11 on 5.8.25.pptx
Decoding a Decade: 10 Years of Applied CTI Discipline
Introduction to the IoT system, how the IoT system works
Unit-1 introduction to cyber security discuss about how to secure a system
Power Point - Lesson 3_2.pptx grad school presentation
Design_with_Watersergyerge45hrbgre4top (1).ppt
isotopes_sddsadsaadasdasdasdasdsa1213.ppt
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
PptxGenJS_Demo_Chart_20250317130215833.pptx
E -tech empowerment technologies PowerPoint
Slides PDF The World Game (s) Eco Economic Epochs.pdf
Introduction to Information and Communication Technology
The Internet -By the Numbers, Sri Lanka Edition
Funds Management Learning Material for Beg
522797556-Unit-2-Temperature-measurement-1-1.pptx
SASE Traffic Flow - ZTNA Connector-1.pdf
innovation process that make everything different.pptx
FINAL CALL-6th International Conference on Networks & IOT (NeTIOT 2025)

Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service - presentation at #OR2016

  • 1. Scaling Usage Statistics across Repositories as an OpenAIRE Analytics Service Dimitris Pierrakos, ATHENA Research & Innovation Center Jochen Schirrwagen, Bielefeld University Pedro Miguel Oliveira Bento Príncipe, University of Minho Ricardo Saraiva, University of Minho OR2016 Conference – Dublin – June 2016
  • 2. OR2016 Conference – Dublin – June 2016 Outline • Introduction • Methodology • Pilots & Preliminary Results • Conclusions & The Future
  • 3. OR2016 Conference – Dublin – June 2016 Outline • Introduction • Methodology • Pilots & Preliminary Results • Conclusions & The Future
  • 4. OR2016 Conference – Dublin – June 2016 OpenAIRE 2020 • A pan-European Research Information platform to monitor OA research outcomes from EC and other national funders. • Research analytics tools to promote new scientific metrics & support evidence-based decision-making. • Implementation of an OpenAIRE usage analytics service for usage data collected from data providers
  • 5. OR2016 Conference – Dublin – June 2016 Usage Analysis Service: Aims • Standard alignment across heterogeneous data providers for gathering usage data & sharing statistics. • Taking care of data privacy policies in EU and member states. • Collection, measure and analysis of usage data (downloads and views). • Correlate with other altmetrics.
  • 6. OR2016 Conference – Dublin – June 2016 Outline • Introduction • Methodology • Pilots & Preliminary Results • Conclusions & The Future
  • 7. OR2016 Conference – Dublin – June 2016 Methodology • Tracking phase • Tier 1 approach: direct tracking • Tier 2 approach: exploit Sushi Lite API • Analysis phase • Import • Process (COUNTER4 compliance) • Analyze
  • 8. OR2016 Conference – Dublin – June 2016 Using Piwik in OpenAIRE • An Open Source analytics platform • Tracking via JavaScript embedded in Web pages • Usage parameters:VisitorID, SessionID,Visitor IP,Timestamp,Country,and many more • IP anonymization enabled • Bots handling
  • 9. OR2016 Conference – Dublin – June 2016 Counter Code of Practice • An International, extendible Code of Practice for e- Resources. • Measures usage information in a credible, consistent and compatible way using vendor- generated data. • Specifications for: • Data Collection & Processing • Usage Analysis Reports • Currently in Release 4
  • 10. OR2016 Conference – Dublin – June 2016 Tier 1 Tracking Workflow Repository Javascript event trackers Data Anonymization Usage Data Import Process
  • 11. OR2016 Conference – Dublin – June 2016 Tier 2: Aggregated Statistics Workflow using SUSHI Lite Aggregator service Repository 1 Repository 2 Anonymization Import Process Usage Data
  • 12. OR2016 Conference – Dublin – June 2016 Usage Data Analysis Usage Statistics OpenAIRE Usage Data
  • 13. OR2016 Conference – Dublin – June 2016 Deduplication Process Deduplication Item_xxx rep1_id Item_xxx rep2_id Dedup_xxx (rep1_Id, rep2_id) Repository 2 Repository 1 • Enhances the calculation of usage statistics by having a single id for common records • Disseminate cross- repository usage statistics
  • 14. OR2016 Conference – Dublin – June 2016 Outline • Introduction • Methodology • Pilots & Preliminary Results • Conclusions & The Future
  • 15. OR2016 Conference – Dublin – June 2016 Pilots 1st phase 3 Repositories OpenAIRE Portal
  • 16. OR2016 Conference – Dublin – June 2016 Pilots 2nd phase 31 Repositories OpenAIRE Portal IRUS-UK 1 Repository 1 Repository 1 Repository 2 Repositories
  • 17. OR2016 Conference – Dublin – June 2016 Preliminary Results Metadata Views – Downloads on Pilot Repositories 0 50000 100000 150000 200000 250000 300000 350000 400000 UMINHO UEVORA UCOIMBRA views downloads
  • 18. OR2016 Conference – Dublin – June 2016 Preliminary Results Metadata Views – Downloads Duplicate Information 0 5 10 15 20 25 30 Duplicate Articles Views Downloads UMINHO downloads UEVORA downloads UMINHO views UEVORA views
  • 19. OR2016 Conference – Dublin – June 2016 Outline • Introduction • Methodology • Pilots & Preliminary Results • Conclusions & The Future
  • 20. OR2016 Conference – Dublin – June 2016 Conclusions ✓ Usage Analysis in OpenAIRE ✓ Methodology ✓ Pilot Results ✓ Challenges ✓ Better handling of bots tracking and “gaming” activity in usage data ✓ Tackling of direct downloads
  • 21. OR2016 Conference – Dublin – June 2016 The Future ✓Collaboration with National Open Access Desks (NOADs) for usage service dissemination ✓Beta release in 2016 ✓Production release in 2017.
  • 22. OR2016 Conference – Dublin – June 2016 http://openaire.eu @openaire_eu https://www.facebook.com/groups/openaire/ https://www.linkedin.com/groups/3893548/profile info@openaire.eu