SlideShare a Scribd company logo
Collecting Usage Statistics for E-Government
Resources
Christopher C. Brown
University of Denver, University Libraries
cbrown@du.edu
May 20, 2014 -- Online via GPO’s iCohere Platform
PURL Primer
Source: http://www.fdlp.gov/collections/distribution/709-withdrawal-recall-destruction-procedures
The Problem
 We have statistics for
government documents
print circulation
• But our directors want statistics
• The viability of our depository
status may rest on our ability to
provide statistics
 We don’t have any
statistics for online usage
Statistics we don’t know
 Visits to online docs URLs by our users – we are
clueless!
 How many times URLs are visited by our users
 What titles are visited by our users
 What agencies are most popular with our users
 We don’t know the whole picture
How Many PURLS?
 142,117 records in CGP with PURLS (as of May 13,
2014).
 There are a total of 179,566 PURLS in the GPO
PURL database (as of May 13, 2014)
 At present, GPO creates about 850 PURLS each
month
Source – James Mauldin, GPO
Part 1: GPO Solution
PURL Referral Reporting
 The tool also provides a listing of the top fifty (50) referred PURL resources per
hostname and/or IP address with:
 The PURL path.
 The full path of the target URL for each PURL.
 The total requests for that individual PURL.
 A search link utilizing the CGP to view cataloging records for the individual PURL.
 GPO releases monthly PURL referral reports; however, these reports include
aggregate totals only. Referrals totals strip out bot traffic and focuses on patron
requests.
 The PURL Referral Reporting Tool is locked down to Federal depository libraries
only. Data is current as of the previous day. Historical data is available for twelve
months. Tool functionality may be expanded in the future to include greater
historical data retention and additional functionality based on funding and
community feedback.
 Source: http://www.fdlp.gov/23-about/projects/141-purl-enhancement-and-
stabilization
Since Dec. 1, 2010 the referral reporting system has been operational.
Steps to getting Custom Reports
 Gather the relevant hostnames or IP addresses for
your institution – sites where you have PURLs
 Your library catalog (maybe you have two versions like we do –
classic catalog, next-gen catalog
 Your web discover tool (if you have one)
 Your library instruction guides (like Libguides)
 Other Web pages that may contain PURLs
 Also consider using your institution’s numeric IP address
 Go to http://purlreferrals.fdlp.gov/ (You will need to
login with your depository number and your internal
password).
Run Your Query (login with Internal FDLP
Credentials)
Results of Your Query
Top 50 Results
Export to CSV (Open with Excel)
You can See Exact Titles for Top 50
Older PURL Referrals
http://www.fdlp.gov/file-repository/collection-management/purl-referrals
You can get older PURL referral reports from here:
Compare your hits against
other depositories
PURL Rot
 In theory, it would be a wonderful world if someone
behind a curtain at GPO would check every PURL
every day to check for errors. But that does not
happen. It is up to us – documents librarians – to
report these.
PURL Rot: Reporting a Broken PURL
PURL Rot: Keeping Track
PURL Retrieval Summary
 You can get the total PURL hits by month,
 Or the top 50 most popular hits
 You cannot get all specific URLs. No way to do a
more comprehensive analysis
 Statistics are ONLY for PURLS, not for any other
online government URLs
 Statistics can be incomplete at times (GPO server
down, etc.)
Part 2: Local Solutions
Objective
To track online government
document clickthroughs when
accessed via the online catalog
oNot possible to capture every use of government info by our users
oBut is possible to capture all clickthroughs via the OPAC
Different Approaches
GPO PURL Tracking Local URL Tracking
Any PURL clickthrough from an
institution
Any URL clickthrough via the OPAC
Broad view: top PURLS and overall
numbers
Narrow view: specific PURLS/URLS
and then can derive titles, SuDocs, etc.
Wait for GPO to aggregate data Instant access to data
Basic Idea: How it Works
 A URL is prepended to the PURL (or URL)
 This URL initially directs to a library-hosted web
server which traps for the date/time, PURL (or
URL), URL of requestor
 The user is then instantly redirected to the PURL (or
URL) site
Two Methods to Track Locally
 Prepend to PURL
 Method #1: trap for the URL, date – more difficult at
the end, but easier at first
 Method #2: trap also for a unique record number –
more difficult at first, but benefits later
A Simple Prepend URL
http://library.du.edu/clickthrough/index.php/clicks/?type=gov&url=
Clickthrough Dashboard
Benefits of Clickthrough Project
 We can provide meaningful stats to the library
director
 We can see high-use and low-use areas
 We can tell if users benefit from our special projects
 We can do reactive URL maintenance
 We can see turn-aways and other problems
 We can see search engine attacks
 We can see how our docs work within your discovery
tools
Local Use for Docs by FY
Specs: How to ask for a clickthrough system
 Project hosted on stable server (such as library Web server).
 Should be able to handle long URLs – up to 700 characters.
 Prepended URL sends request to library server.
 Included in prepended URL is cataloger-supplied 3-letter code of
URL type (ex: gov, cou, ran – any 3-letter combination that may
be needed in future).
 Server records date/time, IP address of requestor, 3-letter code
of URL type, and URL requested.
 Server redirects user to desired URL.
 Reporting mechanism available to gather clickthroughs.
 Archiving function available to archive stats.
 Ability to view archived records.
 Secure login for authorized users.
Just give this slide to a code-writer in your library
– and you may have a link-tracking system soon!
Local Solutions to Problem
http://www.fdlp.gov/file-repository/1051-tracking-online-document-usage-from-the-catalog
Further Reading
Questions?
Christopher C. Brown
University of Denver, University Libraries
cbrown@du.edu

More Related Content

PPT
Phrase Based Indexing
DOCX
Seminar report(rohitsahu cs 17 vth sem)
PPT
OpenURL @ Rice U. (2008)
PDF
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
PDF
Smart Crawler for Efficient Deep-Web Harvesting
PDF
Architecture of a search engine
PDF
RabbitMQ Implementation as Message Broker in Distributed Application with RES...
PDF
CASI Fall 2010 Games & Sims poster
Phrase Based Indexing
Seminar report(rohitsahu cs 17 vth sem)
OpenURL @ Rice U. (2008)
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
Smart Crawler for Efficient Deep-Web Harvesting
Architecture of a search engine
RabbitMQ Implementation as Message Broker in Distributed Application with RES...
CASI Fall 2010 Games & Sims poster

What's hot (8)

PDF
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
PPT
KBART update ER&L 2009
PPTX
Link Resolvers, Knowledgebases and the KBART Working Group
PDF
Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm
PPTX
Configuring Knowledgebases for Discovery and Access
PDF
Context Based Web Indexing For Semantic Web
PDF
Proactive Approach to Estimate the Re-crawl Period for Resource Minimization ...
PDF
xml sitemap
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
KBART update ER&L 2009
Link Resolvers, Knowledgebases and the KBART Working Group
Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm
Configuring Knowledgebases for Discovery and Access
Context Based Web Indexing For Semantic Web
Proactive Approach to Estimate the Re-crawl Period for Resource Minimization ...
xml sitemap
Ad

Viewers also liked (17)

PPT
Harvesting HathiTrust Documents: A New Model for Online Access
PDF
Startbizindia Brochure
PPTX
46465
PPTX
Downsizing Your Depository: Dealing with Mandates from Your Administration
DOC
Male sexual health
PPTX
NBA Cares
PDF
http://chiropractor.nearcarmelarea.com
PPT
Item Deselection on the Fast Track
PDF
Lorraine Nagai - Accelerated Reading Program
PPT
Veno strong presentation
PPTX
Fiche Online: A Vision for Digitizing All Documents Fiche
PPTX
Outbound Harvesting with Encore as a Library Space-Saving Strategy : The Cas...
PPTX
Web-scale Discovery Tools and the Backgrounding of Government Information
PPTX
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
PPTX
An analysis of exam oriented education system (1)
PPT
iCBerry - революционный подход в поддержании интимного здоровья женщины. Pres...
Harvesting HathiTrust Documents: A New Model for Online Access
Startbizindia Brochure
46465
Downsizing Your Depository: Dealing with Mandates from Your Administration
Male sexual health
NBA Cares
http://chiropractor.nearcarmelarea.com
Item Deselection on the Fast Track
Lorraine Nagai - Accelerated Reading Program
Veno strong presentation
Fiche Online: A Vision for Digitizing All Documents Fiche
Outbound Harvesting with Encore as a Library Space-Saving Strategy : The Cas...
Web-scale Discovery Tools and the Backgrounding of Government Information
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
An analysis of exam oriented education system (1)
iCBerry - революционный подход в поддержании интимного здоровья женщины. Pres...
Ad

Similar to Collecting Usage Statistics for E-Government Resources (20)

PPTX
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
PPTX
Preserving Public Government Information: The End of Term Web Archive
PDF
Blind Spots and Broken Links: Access to Government Information
PPTX
Writing esl
PPTX
Annotated bib and research strategies
PPTX
Ws rogers spring 2013a
PPTX
Writing Seminar Rogers
PPTX
Writing Seminar
PPTX
Advanced Google Analytics
PPT
Development of the CyberCemetery (2011)
PPTX
ALA cookies n learn update
PDF
Open government data portals: from publishing to use and impact
PPT
Web_Analytics_Part2--Analyzing_and_Acting--1-27-2011
PPTX
WS Scott Spring 2013
PPT
Database_Cache Replacemnt Policies(Lyras)
PPTX
Boundless Opportunity
PPT
LLA ZPortal
KEY
Online Collections Crawlability for Libraries, Archives, and Museums
DOC
( 4 ) Office 2007 Configure The Official Records Site
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
Preserving Public Government Information: The End of Term Web Archive
Blind Spots and Broken Links: Access to Government Information
Writing esl
Annotated bib and research strategies
Ws rogers spring 2013a
Writing Seminar Rogers
Writing Seminar
Advanced Google Analytics
Development of the CyberCemetery (2011)
ALA cookies n learn update
Open government data portals: from publishing to use and impact
Web_Analytics_Part2--Analyzing_and_Acting--1-27-2011
WS Scott Spring 2013
Database_Cache Replacemnt Policies(Lyras)
Boundless Opportunity
LLA ZPortal
Online Collections Crawlability for Libraries, Archives, and Museums
( 4 ) Office 2007 Configure The Official Records Site

More from Christopher Brown (7)

PPTX
Migrating Government Publications without Going South: Our Alma/Primo Experience
PPTX
Downsizing your Depository: Tools and Ideas
PPTX
The Darkening of Government Information
PPTX
The Three Googles: How I Teach Google in an Academic Setting
PPTX
The Front Face of the ERM
PPTX
Planning the Six-State Virtual Government Information Conference
PPTX
Summon and the Art of Discovery
Migrating Government Publications without Going South: Our Alma/Primo Experience
Downsizing your Depository: Tools and Ideas
The Darkening of Government Information
The Three Googles: How I Teach Google in an Academic Setting
The Front Face of the ERM
Planning the Six-State Virtual Government Information Conference
Summon and the Art of Discovery

Recently uploaded (20)

PPTX
11Sept2023_LTIA-Cluster-Training-Presentation.pptx
PPTX
Weekly Report 17-10-2024_cybersecutity.pptx
PPTX
Quiz - Saturday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PDF
buyers sellers meeting of mangoes in mahabubnagar.pdf
PPTX
Omnibus rules on leave administration.pptx
PDF
How FPOs Are Reshaping Agriculture in Maharashtra?
PDF
Creating Memorable Moments_ Personalized Plant Gifts.pdf
PDF
The Detrimental Impacts of Hydraulic Fracturing for Oil and Gas_ A Researched...
PPTX
Portland FPDR Oregon Legislature 2025.pptx
DOCX
EAPP.docxdffgythjyuikuuiluikluikiukuuuuuu
PPTX
Nur Shakila Assesmentlwemkf;m;mwee f.pptx
PPTX
PCCR-ROTC-UNIT-ORGANIZATIONAL-STRUCTURE-pptx-Copy (1).pptx
DOCX
Alexistogel: Solusi Tepat untuk Anda yang Cari Bandar Toto Macau Resmi
PPTX
Introduction_to_the_Study_of_Globalization.pptx
PPTX
The DFARS - Part 251 - Use of Government Sources By Contractors
PDF
Item # 3 - 934 Patterson Final Review.pdf
PDF
About Karen Miner-Romanoff - Academic & nonprofit consultant
PDF
Abhay Bhutada and Other Visionary Leaders Reinventing Governance in India
PDF
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
PPTX
Vocational Education for educational purposes
11Sept2023_LTIA-Cluster-Training-Presentation.pptx
Weekly Report 17-10-2024_cybersecutity.pptx
Quiz - Saturday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
buyers sellers meeting of mangoes in mahabubnagar.pdf
Omnibus rules on leave administration.pptx
How FPOs Are Reshaping Agriculture in Maharashtra?
Creating Memorable Moments_ Personalized Plant Gifts.pdf
The Detrimental Impacts of Hydraulic Fracturing for Oil and Gas_ A Researched...
Portland FPDR Oregon Legislature 2025.pptx
EAPP.docxdffgythjyuikuuiluikluikiukuuuuuu
Nur Shakila Assesmentlwemkf;m;mwee f.pptx
PCCR-ROTC-UNIT-ORGANIZATIONAL-STRUCTURE-pptx-Copy (1).pptx
Alexistogel: Solusi Tepat untuk Anda yang Cari Bandar Toto Macau Resmi
Introduction_to_the_Study_of_Globalization.pptx
The DFARS - Part 251 - Use of Government Sources By Contractors
Item # 3 - 934 Patterson Final Review.pdf
About Karen Miner-Romanoff - Academic & nonprofit consultant
Abhay Bhutada and Other Visionary Leaders Reinventing Governance in India
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
Vocational Education for educational purposes

Collecting Usage Statistics for E-Government Resources

  • 1. Collecting Usage Statistics for E-Government Resources Christopher C. Brown University of Denver, University Libraries cbrown@du.edu May 20, 2014 -- Online via GPO’s iCohere Platform
  • 3. The Problem  We have statistics for government documents print circulation • But our directors want statistics • The viability of our depository status may rest on our ability to provide statistics  We don’t have any statistics for online usage
  • 4. Statistics we don’t know  Visits to online docs URLs by our users – we are clueless!  How many times URLs are visited by our users  What titles are visited by our users  What agencies are most popular with our users  We don’t know the whole picture
  • 5. How Many PURLS?  142,117 records in CGP with PURLS (as of May 13, 2014).  There are a total of 179,566 PURLS in the GPO PURL database (as of May 13, 2014)  At present, GPO creates about 850 PURLS each month Source – James Mauldin, GPO
  • 6. Part 1: GPO Solution
  • 7. PURL Referral Reporting  The tool also provides a listing of the top fifty (50) referred PURL resources per hostname and/or IP address with:  The PURL path.  The full path of the target URL for each PURL.  The total requests for that individual PURL.  A search link utilizing the CGP to view cataloging records for the individual PURL.  GPO releases monthly PURL referral reports; however, these reports include aggregate totals only. Referrals totals strip out bot traffic and focuses on patron requests.  The PURL Referral Reporting Tool is locked down to Federal depository libraries only. Data is current as of the previous day. Historical data is available for twelve months. Tool functionality may be expanded in the future to include greater historical data retention and additional functionality based on funding and community feedback.  Source: http://www.fdlp.gov/23-about/projects/141-purl-enhancement-and- stabilization Since Dec. 1, 2010 the referral reporting system has been operational.
  • 8. Steps to getting Custom Reports  Gather the relevant hostnames or IP addresses for your institution – sites where you have PURLs  Your library catalog (maybe you have two versions like we do – classic catalog, next-gen catalog  Your web discover tool (if you have one)  Your library instruction guides (like Libguides)  Other Web pages that may contain PURLs  Also consider using your institution’s numeric IP address  Go to http://purlreferrals.fdlp.gov/ (You will need to login with your depository number and your internal password).
  • 9. Run Your Query (login with Internal FDLP Credentials)
  • 12. Export to CSV (Open with Excel)
  • 13. You can See Exact Titles for Top 50
  • 14. Older PURL Referrals http://www.fdlp.gov/file-repository/collection-management/purl-referrals You can get older PURL referral reports from here: Compare your hits against other depositories
  • 15. PURL Rot  In theory, it would be a wonderful world if someone behind a curtain at GPO would check every PURL every day to check for errors. But that does not happen. It is up to us – documents librarians – to report these.
  • 16. PURL Rot: Reporting a Broken PURL
  • 18. PURL Retrieval Summary  You can get the total PURL hits by month,  Or the top 50 most popular hits  You cannot get all specific URLs. No way to do a more comprehensive analysis  Statistics are ONLY for PURLS, not for any other online government URLs  Statistics can be incomplete at times (GPO server down, etc.)
  • 19. Part 2: Local Solutions
  • 20. Objective To track online government document clickthroughs when accessed via the online catalog oNot possible to capture every use of government info by our users oBut is possible to capture all clickthroughs via the OPAC
  • 21. Different Approaches GPO PURL Tracking Local URL Tracking Any PURL clickthrough from an institution Any URL clickthrough via the OPAC Broad view: top PURLS and overall numbers Narrow view: specific PURLS/URLS and then can derive titles, SuDocs, etc. Wait for GPO to aggregate data Instant access to data
  • 22. Basic Idea: How it Works  A URL is prepended to the PURL (or URL)  This URL initially directs to a library-hosted web server which traps for the date/time, PURL (or URL), URL of requestor  The user is then instantly redirected to the PURL (or URL) site
  • 23. Two Methods to Track Locally  Prepend to PURL  Method #1: trap for the URL, date – more difficult at the end, but easier at first  Method #2: trap also for a unique record number – more difficult at first, but benefits later
  • 24. A Simple Prepend URL http://library.du.edu/clickthrough/index.php/clicks/?type=gov&url=
  • 26. Benefits of Clickthrough Project  We can provide meaningful stats to the library director  We can see high-use and low-use areas  We can tell if users benefit from our special projects  We can do reactive URL maintenance  We can see turn-aways and other problems  We can see search engine attacks  We can see how our docs work within your discovery tools
  • 27. Local Use for Docs by FY
  • 28. Specs: How to ask for a clickthrough system  Project hosted on stable server (such as library Web server).  Should be able to handle long URLs – up to 700 characters.  Prepended URL sends request to library server.  Included in prepended URL is cataloger-supplied 3-letter code of URL type (ex: gov, cou, ran – any 3-letter combination that may be needed in future).  Server records date/time, IP address of requestor, 3-letter code of URL type, and URL requested.  Server redirects user to desired URL.  Reporting mechanism available to gather clickthroughs.  Archiving function available to archive stats.  Ability to view archived records.  Secure login for authorized users. Just give this slide to a code-writer in your library – and you may have a link-tracking system soon!
  • 29. Local Solutions to Problem http://www.fdlp.gov/file-repository/1051-tracking-online-document-usage-from-the-catalog
  • 31. Questions? Christopher C. Brown University of Denver, University Libraries cbrown@du.edu