SlideShare a Scribd company logo
Kenning Arlitsch, Dean @kenning_msu
Patrick OBrien, Semantic Web Research Director @sempob
RAMP:
Repository Analytics and Metrics Portal
Accurately measuring use of institutional repositories
ARL Assessment Forum
Atlanta, GA, January 20, 2017
Agenda
1. A New Reporting Model
2. Accuracy of Analytics Reporting Tools
Page Tagging
Log file
3. RAMP
New Prototype Web Service for IR reporting
A New Reporting Model
Page Type Definition Examples
Citable Content
Downloads
Non-HTML scholarly content
that may be formally cited in
the research process
● Publication (.pdf)
● Presentation (.ppt)
● Data Sets (.csv)
Item Summary
HTML pages to help user
decide to download the full
publication
● Title & Abstract
● Item Metadata
Ancillary
HTML pages that provide
general information or
navigation
● Search Results
● Browse by Author
● Statistics
RAMP: Repository Analytics and Metrics Portal
RAMP: Repository Analytics and Metrics Portal
RAMP: Repository Analytics and Metrics Portal
RAMP: Repository Analytics and Metrics Portal
Accuracy of Analytics Reporting Tools
Two Classes of Web Analytics
HTML
Analytics Service (SaaS)
1
Log Files
2Page Tagging
{JavaScript}
Page Tagging
Page Tagging does not track non-HTML CCD
HTML
Non-HTML
Show Google Scholar direct link
graphic
Google Analytics use in academic libraries
 Tested 279 academic library websites
 ARL
 DLF
 OCLC-RLP
 90% US libraries contained Google tracking code
51%
54%
52%
63%
35%
27%
20%
20%
14%
19%
28%
17%
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
~51% to 63% of Citable IR Activity is Unreported by Google Analytics
01/05/16 - 05/17/16 (days=134)
Citable Content Downloads Item Sumary PV Ancillary PV
Most IR activity is Citable Content Downloads
54.3%
46.2%
47.9%
42.5%
45.7%
53.8%
52.1%
57.5%
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
Most IR Activity is Unreported by Google Analytics
01/05/16 - 05/17/16 (days=134)
Reported Invisible
Most IR Activity Unreported by Google Analytics
Web Analytics Accuracy Risks
Risks Analytics Method
Area Types Page Tagging Log Files
OverCount
Visits Low High
Downloads Low High
Page Views Low Low
UnderCount
Visits Medium Medium
Downloads High Low
Page Views Low Low
Data set: Jan 5 - May 17, 2016 (n = 134 days)
Study Participant IR Platform URL
Montana State University ScholarWorks DSpace scholarworks.montana.edu
McMaster University MacSphere DSpace macsphere.mcmaster.ca
University of New Mexico LoboVault DSpace repository.unm.edu
University of Utah USpace CONTENTdm uspace.utah.edu
Page Tagging does not track non-HTML
Citable Content Downloads
Non-HTML
Search Analytics
Does!
Typical Page Tagging (GA) Data Types
Page Type
Analytics
Pages Events
Citable Content
Downloads
- ✅
Item Summary ✅ -
Ancillary ✅ -
Montana Method Includes Google Search Console
Page Type
Analytics
Search
Console
Pages Events Search Analytics
Citable Content
Downloads
- ✅ ✅
Item Summary ✅ - -
Ancillary ✅ - -
GA Ancillary Page Views
and Item Summary Page Views vs CCD
All four IR using Google Analytics
Page Type
Analytics
Search
Console
Pages Events Search Analytics
Citable Content
Downloads
- 26,355 562,933
Item Summary 284,303 - -
Ancillary 201,793 - -
CCD Tracking Improvement
2,000%
Montana Method Challenges
❖ Missing non-Google direct link CCD events
● Yahoo
● Bing
● Email
● FB
❖ GSC limits time and access
● Moving 90-day window
● Granular data = programing skills to access API
RAMP: Repository Analytics & Metrics Portal
RAMP - Repository Analytics and Metrics Portal
 Cloud Web service
 Currently accessible and free
 During grant period (through November 2017)
 No training or configuration required
 Consistent method and terminology
 Benchmarking across time and organization
 Request access - mixterj@oclc.org
RAMP: Repository Analytics and Metrics Portal
RAMP: Repository Analytics and Metrics Portal
Publications
Published:
Patrick OBrien, Kenning Arlitsch, Leila Sterman, Jeff Mixter, Jonathan Wheeler,
and Susan Borda. “Undercounting File Downloads from Institutional Repositories,”
Journal of Library Administration, vol. 56, no. 7, 2016
Forthcoming:
Patrick OBrien, Kenning Arlitsch, Leila Sterman, Jeff Mixter, Jonathan Wheeler,
and Susan Borda. “RAMP: Repository Analytics and Metrics Portal: A Prototype
Web Service that Accurately Counts Item Downloads from Institutional
Repositories,” Accepted by Library Hi Tech, expected early 2017
Proposal funded by IMLS:
”Measuring Up: Assessing Accuracy of Reported Use and Impact of Digital
Repositories” - scholarworks.montana.edu/xmlui/handle/1/8924
Undercounting Research Team
❖Montana State University
●Kenning Arlitsch, Dean @kenning_msu
●Patrick OBrien, Semantic Web Research Director @sempob
●Leila Sterman, Scholarly Communication Librarian @calamityleila
●Susan Borda, Digital Technologies Librarian @mutanthumb
❖OCLC Research
●Jeff Mixter, Software Engineer @jeffmixter
❖University of New Mexico
●Jonathan Wheeler, Data Curation Librarian

More Related Content

PPTX
Walk Before You Run: Prerequisites to Linked Data
PPTX
Improving the reported use and impact of institutional repositories
PPTX
Quantitative anthropology hackuarium
PDF
Research Paper
PDF
Insight Consulting Project
PPTX
So much data so many uses
PPTX
Beyond COUNTER Compliant: Ways to Assess E-Resources Reporting Tools
PPTX
Exploring data quality and retrieval strategies for Mendeley reader counts
Walk Before You Run: Prerequisites to Linked Data
Improving the reported use and impact of institutional repositories
Quantitative anthropology hackuarium
Research Paper
Insight Consulting Project
So much data so many uses
Beyond COUNTER Compliant: Ways to Assess E-Resources Reporting Tools
Exploring data quality and retrieval strategies for Mendeley reader counts

What's hot (19)

PPTX
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
PDF
Insight Consulting Project
PPTX
Turning the Corner at High Speed: How Collections Metrics Are Changing in a H...
PDF
demo_teralytics
PDF
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
PDF
Data science with Google Analytics @MeasureCamp
PDF
Data tools ecosystem for non-programmers
PPTX
Yale Library - Google Analytics & Tableau (5/14/2015)
PDF
Xinchao(luke) lu
PDF
Talis Insight Europe 2017 - Taking the pain out of reporting - University of ...
PPTX
NISO Webinar: Making Better Decisions with Usage Statistics
PDF
PPTX
DBtrends Semantics 2016
PDF
Activate 2019 - Search and relevance at scale for online classifieds
PPT
2010 nasig integrating_usage_statistics
PDF
PageRank and Related Methods
PDF
Resume xiaodan(vinci)
PPTX
Data analysis@network programming
PPTX
Linking GtoP <> PubChem <> PubMed
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
Insight Consulting Project
Turning the Corner at High Speed: How Collections Metrics Are Changing in a H...
demo_teralytics
Data Stories: Using Narratives to Reflect on a Data Purchase Pilot Program
Data science with Google Analytics @MeasureCamp
Data tools ecosystem for non-programmers
Yale Library - Google Analytics & Tableau (5/14/2015)
Xinchao(luke) lu
Talis Insight Europe 2017 - Taking the pain out of reporting - University of ...
NISO Webinar: Making Better Decisions with Usage Statistics
DBtrends Semantics 2016
Activate 2019 - Search and relevance at scale for online classifieds
2010 nasig integrating_usage_statistics
PageRank and Related Methods
Resume xiaodan(vinci)
Data analysis@network programming
Linking GtoP <> PubChem <> PubMed
Ad

Viewers also liked (20)

PPTX
Hot & Sexy Naila Nayem | You can't Believe She is a Bangladeshi Ramp Model
ODP
Fuse Service Works Design Time Governance and S-RAMP
PDF
Structure and Metadata: Shortening the On-Ramp to Linked Data
PPT
Case Study: Knowledge Sourcing in Daimler-Benz
PPTX
Top 10 ramp interview questions with answers
PPT
R&R Gage Analysis
PPTX
Ramp safety
PPTX
Ramp safety officer
PDF
La place du power-to-gas dans le système énergétique futur
PPTX
Characteristics Of Cell And Lead Acid Battery
PPTX
Chromium and insulin sensitivity
DOCX
Nouveau microsoft word document
PPTX
40 cfr 261.4(b)(6) The RCRA Exclusion From Hazardous Waste for Trivalent Chro...
PPTX
Removal of chromium
PPT
Objetos De Aprendizaje Nuevo Concepto Instruccional
PDF
A SHORT REVIEW ON ALUMINIUM ANODIZING: AN ECO-FRIENDLY METAL FINISHING PROCESS
PPTX
10 major industrial applications of sulfuric acid
PDF
Chromium problems
PDF
Metabolisme des lipides
PDF
Brochure Meca-19102016-bd
Hot & Sexy Naila Nayem | You can't Believe She is a Bangladeshi Ramp Model
Fuse Service Works Design Time Governance and S-RAMP
Structure and Metadata: Shortening the On-Ramp to Linked Data
Case Study: Knowledge Sourcing in Daimler-Benz
Top 10 ramp interview questions with answers
R&R Gage Analysis
Ramp safety
Ramp safety officer
La place du power-to-gas dans le système énergétique futur
Characteristics Of Cell And Lead Acid Battery
Chromium and insulin sensitivity
Nouveau microsoft word document
40 cfr 261.4(b)(6) The RCRA Exclusion From Hazardous Waste for Trivalent Chro...
Removal of chromium
Objetos De Aprendizaje Nuevo Concepto Instruccional
A SHORT REVIEW ON ALUMINIUM ANODIZING: AN ECO-FRIENDLY METAL FINISHING PROCESS
10 major industrial applications of sulfuric acid
Chromium problems
Metabolisme des lipides
Brochure Meca-19102016-bd
Ad

Similar to RAMP: Repository Analytics and Metrics Portal (20)

PPTX
OCLC Research Update at ALA Chicago. June 26, 2017.
PPTX
Smarter Data for Smarter Libraries
PPTX
Wa mw 2013
PPTX
LITA Forum 2012 Web Analytics Preconference
PPTX
MLA 2010 Improving Library Web Sites with Web Analytics
PDF
Mark Farmer - Google Analytics: Business Intelligence for Non-profits
PPTX
LITA Forum 2012 Web Analytics Strategy Preconference
PDF
Open Source Information Gathering Brucon Edition
PPTX
Web stats
PPTX
Introduction to Google Analytics
PPT
Web analytics webinar
PDF
Search Marketer's Toolkit for Google Tag Manager and Google Analytics
PPTX
analytics.pptx
PDF
Strategic_Web_Design-TestingYourVisualStory
PPT
Web analytics presentation
PPTX
Use Google Analytics Stats to Improve Website
PPT
Analytics
PPTX
Website - meaningful measurement. Stats that matter workshop.
PPTX
Google A
PDF
Government Web Analytics
OCLC Research Update at ALA Chicago. June 26, 2017.
Smarter Data for Smarter Libraries
Wa mw 2013
LITA Forum 2012 Web Analytics Preconference
MLA 2010 Improving Library Web Sites with Web Analytics
Mark Farmer - Google Analytics: Business Intelligence for Non-profits
LITA Forum 2012 Web Analytics Strategy Preconference
Open Source Information Gathering Brucon Edition
Web stats
Introduction to Google Analytics
Web analytics webinar
Search Marketer's Toolkit for Google Tag Manager and Google Analytics
analytics.pptx
Strategic_Web_Design-TestingYourVisualStory
Web analytics presentation
Use Google Analytics Stats to Improve Website
Analytics
Website - meaningful measurement. Stats that matter workshop.
Google A
Government Web Analytics

Recently uploaded (20)

PDF
Introduction to Data Science and Data Analysis
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
Introduction to the R Programming Language
PPTX
Computer network topology notes for revision
PDF
Mega Projects Data Mega Projects Data
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Lecture1 pattern recognition............
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Introduction to machine learning and Linear Models
PPT
Quality review (1)_presentation of this 21
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction to Data Science and Data Analysis
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
climate analysis of Dhaka ,Banglades.pptx
Introduction to the R Programming Language
Computer network topology notes for revision
Mega Projects Data Mega Projects Data
.pdf is not working space design for the following data for the following dat...
Lecture1 pattern recognition............
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
Fluorescence-microscope_Botany_detailed content
Introduction-to-Cloud-ComputingFinal.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Introduction to machine learning and Linear Models
Quality review (1)_presentation of this 21
The THESIS FINAL-DEFENSE-PRESENTATION.pptx

RAMP: Repository Analytics and Metrics Portal

  • 1. Kenning Arlitsch, Dean @kenning_msu Patrick OBrien, Semantic Web Research Director @sempob RAMP: Repository Analytics and Metrics Portal Accurately measuring use of institutional repositories ARL Assessment Forum Atlanta, GA, January 20, 2017
  • 2. Agenda 1. A New Reporting Model 2. Accuracy of Analytics Reporting Tools Page Tagging Log file 3. RAMP New Prototype Web Service for IR reporting
  • 3. A New Reporting Model Page Type Definition Examples Citable Content Downloads Non-HTML scholarly content that may be formally cited in the research process ● Publication (.pdf) ● Presentation (.ppt) ● Data Sets (.csv) Item Summary HTML pages to help user decide to download the full publication ● Title & Abstract ● Item Metadata Ancillary HTML pages that provide general information or navigation ● Search Results ● Browse by Author ● Statistics
  • 8. Accuracy of Analytics Reporting Tools
  • 9. Two Classes of Web Analytics HTML Analytics Service (SaaS) 1 Log Files 2Page Tagging {JavaScript}
  • 10. Page Tagging Page Tagging does not track non-HTML CCD HTML Non-HTML
  • 11. Show Google Scholar direct link graphic
  • 12. Google Analytics use in academic libraries  Tested 279 academic library websites  ARL  DLF  OCLC-RLP  90% US libraries contained Google tracking code
  • 13. 51% 54% 52% 63% 35% 27% 20% 20% 14% 19% 28% 17% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% ~51% to 63% of Citable IR Activity is Unreported by Google Analytics 01/05/16 - 05/17/16 (days=134) Citable Content Downloads Item Sumary PV Ancillary PV Most IR activity is Citable Content Downloads
  • 14. 54.3% 46.2% 47.9% 42.5% 45.7% 53.8% 52.1% 57.5% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Most IR Activity is Unreported by Google Analytics 01/05/16 - 05/17/16 (days=134) Reported Invisible Most IR Activity Unreported by Google Analytics
  • 15. Web Analytics Accuracy Risks Risks Analytics Method Area Types Page Tagging Log Files OverCount Visits Low High Downloads Low High Page Views Low Low UnderCount Visits Medium Medium Downloads High Low Page Views Low Low
  • 16. Data set: Jan 5 - May 17, 2016 (n = 134 days) Study Participant IR Platform URL Montana State University ScholarWorks DSpace scholarworks.montana.edu McMaster University MacSphere DSpace macsphere.mcmaster.ca University of New Mexico LoboVault DSpace repository.unm.edu University of Utah USpace CONTENTdm uspace.utah.edu
  • 17. Page Tagging does not track non-HTML Citable Content Downloads Non-HTML Search Analytics Does!
  • 18. Typical Page Tagging (GA) Data Types Page Type Analytics Pages Events Citable Content Downloads - ✅ Item Summary ✅ - Ancillary ✅ -
  • 19. Montana Method Includes Google Search Console Page Type Analytics Search Console Pages Events Search Analytics Citable Content Downloads - ✅ ✅ Item Summary ✅ - - Ancillary ✅ - -
  • 20. GA Ancillary Page Views and Item Summary Page Views vs CCD
  • 21. All four IR using Google Analytics Page Type Analytics Search Console Pages Events Search Analytics Citable Content Downloads - 26,355 562,933 Item Summary 284,303 - - Ancillary 201,793 - - CCD Tracking Improvement 2,000%
  • 22. Montana Method Challenges ❖ Missing non-Google direct link CCD events ● Yahoo ● Bing ● Email ● FB ❖ GSC limits time and access ● Moving 90-day window ● Granular data = programing skills to access API
  • 23. RAMP: Repository Analytics & Metrics Portal
  • 24. RAMP - Repository Analytics and Metrics Portal  Cloud Web service  Currently accessible and free  During grant period (through November 2017)  No training or configuration required  Consistent method and terminology  Benchmarking across time and organization  Request access - mixterj@oclc.org
  • 27. Publications Published: Patrick OBrien, Kenning Arlitsch, Leila Sterman, Jeff Mixter, Jonathan Wheeler, and Susan Borda. “Undercounting File Downloads from Institutional Repositories,” Journal of Library Administration, vol. 56, no. 7, 2016 Forthcoming: Patrick OBrien, Kenning Arlitsch, Leila Sterman, Jeff Mixter, Jonathan Wheeler, and Susan Borda. “RAMP: Repository Analytics and Metrics Portal: A Prototype Web Service that Accurately Counts Item Downloads from Institutional Repositories,” Accepted by Library Hi Tech, expected early 2017 Proposal funded by IMLS: ”Measuring Up: Assessing Accuracy of Reported Use and Impact of Digital Repositories” - scholarworks.montana.edu/xmlui/handle/1/8924
  • 28. Undercounting Research Team ❖Montana State University ●Kenning Arlitsch, Dean @kenning_msu ●Patrick OBrien, Semantic Web Research Director @sempob ●Leila Sterman, Scholarly Communication Librarian @calamityleila ●Susan Borda, Digital Technologies Librarian @mutanthumb ❖OCLC Research ●Jeff Mixter, Software Engineer @jeffmixter ❖University of New Mexico ●Jonathan Wheeler, Data Curation Librarian