SlideShare a Scribd company logo
Research support with optical
character recognition apps
Jim Hahn
Text-shot prototype
2
Introduction
• Uses for OCR in library settings
– The prototype Text-shot module uses OCR software
and a backend search system for subject and title
recommendations.
– The choice to recommend library content to users
from the app stems from the objective to connect
students with library resources, and to help students
integrate library resources into their work.
3
Optical Character Recognition Apps
• Wordlens app: can translate words from
different languages using a digital camera feed
• Google Goggles app: take a picture of a book
cover (or painting)to run a google search on
the topic
• Camscanner app: digitize print documents
with camera on app and store/share
documents with others
4
Literature Review
• Optical Character Recognition APIs
– Evernote API: dev.evernote.com/doc
– Google Drive API: support.google.com/drive
– VuForia SDK:
developer.vuforia.com/resources/sdk/android
5
Methodology
• Formative evaluation
– Small set of test participants to gather feedback
early in the design phase so that the software
development process can progress in a direction
that will support user requirements for the
software
6
Methodology
• Test Participants
– Students were recruited from the General Studies
101 course. They are in their first year of study at
the university and have not yet chosen a major.
– There were a total of five test participants in the
first round of study.
7
Methodology
• Study Process
– Students were given an Android phone with the
Text-shot app loaded. Investigators observed the
students as they used the OCR mobile software to
obtain suggested library resources. Investigators
collected two sources of data: observation of how
students interact with the software and a
debriefing interview.
8
Functionality Tests
• Researchers tested the two main functions for
the software.
– Recognizing a string of text by taking a picture of
the words in a student assignment sheet and;
– suggesting subjects and titles based on the
scanned text.
9
Results
• Themes related to the improvement of
suggestions:
– Show broad subjects first
• Then expand to details subjects
– Prominently display title suggestions
10
Results
• Feature Requests:
– Include articles as well as book titles in
recommendations
• Use article APIs
• LibGuides-like help guides
11
Text-shot prototype
12
Next steps in OCR
• Topic Space app: Scanning call numbers in the
library
– If you scan a call number on a book, you can get
recommendations of other, related books in the
library, and other related digital content in the
library.
13
Topic Space: Book Scan
14
Topic Space: Suggested Topic Spaces
15
Topic Space:
Related Books that are not available
16
Topic Space: View Map
17
Future directions
• Implementing OCR modules in the Minrva
app:
– http://minrvaproject.org/modules_topicspace.ph
p
• Open sourcing OCR technology for use in
library settings:
– http://minrvaproject.org/source.php
18
Sponsors
• Institute of Museum and Library Services
• University of Illinois Campus Research Board
19
Acknowledgements
• My thanks to Ben Ryckman for Topic Space module development and support.
• Many thanks to Chris Diaz, Residency Librarian, Scholarly Communications and
Collections, University of Iowa for help with participant recruitment, observation,
and interviewing support in the user studies
• Thanks to Mayur Sadavarte, Graduate Student in Computer Science at the
University of Illinois and Nate Ryckman, Graduate Student in Information Systems
Management at Carnegie Mellon University for Optical Character recognition
programming support.
• Yinan Zhang, PhD Candidate in Computer Science at the University of Illinois,
Sherry (Mengxue) Zheng, Graduate Student in Computer Science for help
developing the search and suggestion functionality of the Deneb near-semantic
index, Maria Lux, Graphic Designer for laying out the polished recommendations
and prototyping Text-shot integration as a Minrva module.
20

More Related Content

PDF
Design and implementation of optical character recognition using template mat...
PDF
Optical Character Recognition
PPTX
Digitisation Doctor Optical Character Recognition
PPTX
Final Report on Optical Character Recognition
PDF
PDF
Optical character recognition of handwritten Arabic using hidden Markov models
PPTX
MATLAB Based Vehicle Number Plate Identification System using OCR
Design and implementation of optical character recognition using template mat...
Optical Character Recognition
Digitisation Doctor Optical Character Recognition
Final Report on Optical Character Recognition
Optical character recognition of handwritten Arabic using hidden Markov models
MATLAB Based Vehicle Number Plate Identification System using OCR

Viewers also liked (13)

PPTX
Optical character recognition (ocr) ppt
PPTX
Bengali optical character recognition system
PPT
OCR
DOCX
Optical character recognition IEEE Paper Study
PPTX
Optical Character Recognition (OCR)
PPTX
Basics of-optical-character-recognition
PPT
optical character recognition system
DOCX
Project report of OCR Recognition
PPTX
Optical Character Recognition( OCR )
PPTX
Number plate recognition system using matlab.
PPTX
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
PPTX
Text Detection and Recognition
PPTX
Presentation_OCR
Optical character recognition (ocr) ppt
Bengali optical character recognition system
OCR
Optical character recognition IEEE Paper Study
Optical Character Recognition (OCR)
Basics of-optical-character-recognition
optical character recognition system
Project report of OCR Recognition
Optical Character Recognition( OCR )
Number plate recognition system using matlab.
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
Text Detection and Recognition
Presentation_OCR
Ad

Similar to Research support with optical character recognition apps (20)

PPT
The Library Technology Prototyping Service at Illinois
PDF
Customizing Discovery Interfaces: Understanding Users’ Behaviors and Providin...
PPTX
Macon - about the project
PDF
Own the User Experience: Provide Discovery for Your Users
PPTX
Presentation to 2014 University of Guelph Accessibility Conference Perspectiv...
PPTX
Responsive hackfest: Code4Lib2014 Pre-conference
PPTX
AAUP 2016: UPScope (S. Doerr)
PDF
Library website usability study 2012
PPTX
Electricity_Monitoring_Presentation.pptx
PDF
Project Topic Presentation Data and Web Science Group IE686 Large Language Mo...
PDF
1st meeting of PG PUSHPIN
PPT
In Search Of The Lost Book - Improving Library Usability
PPTX
empirical-SLR.pptx
PPTX
Designing e-Learning Objects
PPTX
Jones "Enabling Discovery in the Library"
PPTX
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
PPT
1-Lec - Introduction vhvv,vbvv,v (2).ppt
PPTX
Online Journal Management using Open Journal Systems (OJS)
PPTX
CrowdAsk- A Crowdsourcing Reference System (Internet Librarian 2014)
PPTX
Ocls 4th annual breakfast 2016
The Library Technology Prototyping Service at Illinois
Customizing Discovery Interfaces: Understanding Users’ Behaviors and Providin...
Macon - about the project
Own the User Experience: Provide Discovery for Your Users
Presentation to 2014 University of Guelph Accessibility Conference Perspectiv...
Responsive hackfest: Code4Lib2014 Pre-conference
AAUP 2016: UPScope (S. Doerr)
Library website usability study 2012
Electricity_Monitoring_Presentation.pptx
Project Topic Presentation Data and Web Science Group IE686 Large Language Mo...
1st meeting of PG PUSHPIN
In Search Of The Lost Book - Improving Library Usability
empirical-SLR.pptx
Designing e-Learning Objects
Jones "Enabling Discovery in the Library"
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
1-Lec - Introduction vhvv,vbvv,v (2).ppt
Online Journal Management using Open Journal Systems (OJS)
CrowdAsk- A Crowdsourcing Reference System (Internet Librarian 2014)
Ocls 4th annual breakfast 2016
Ad

Recently uploaded (20)

PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
My India Quiz Book_20210205121199924.pdf
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PPTX
Virtual and Augmented Reality in Current Scenario
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
Introduction to pro and eukaryotes and differences.pptx
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PDF
Weekly quiz Compilation Jan -July 25.pdf
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
Hazard Identification & Risk Assessment .pdf
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
My India Quiz Book_20210205121199924.pdf
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
Share_Module_2_Power_conflict_and_negotiation.pptx
AI-driven educational solutions for real-life interventions in the Philippine...
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
B.Sc. DS Unit 2 Software Engineering.pptx
Practical Manual AGRO-233 Principles and Practices of Natural Farming
Virtual and Augmented Reality in Current Scenario
Paper A Mock Exam 9_ Attempt review.pdf.
History, Philosophy and sociology of education (1).pptx
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Introduction to pro and eukaryotes and differences.pptx
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
Weekly quiz Compilation Jan -July 25.pdf
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...

Research support with optical character recognition apps

  • 1. Research support with optical character recognition apps Jim Hahn
  • 3. Introduction • Uses for OCR in library settings – The prototype Text-shot module uses OCR software and a backend search system for subject and title recommendations. – The choice to recommend library content to users from the app stems from the objective to connect students with library resources, and to help students integrate library resources into their work. 3
  • 4. Optical Character Recognition Apps • Wordlens app: can translate words from different languages using a digital camera feed • Google Goggles app: take a picture of a book cover (or painting)to run a google search on the topic • Camscanner app: digitize print documents with camera on app and store/share documents with others 4
  • 5. Literature Review • Optical Character Recognition APIs – Evernote API: dev.evernote.com/doc – Google Drive API: support.google.com/drive – VuForia SDK: developer.vuforia.com/resources/sdk/android 5
  • 6. Methodology • Formative evaluation – Small set of test participants to gather feedback early in the design phase so that the software development process can progress in a direction that will support user requirements for the software 6
  • 7. Methodology • Test Participants – Students were recruited from the General Studies 101 course. They are in their first year of study at the university and have not yet chosen a major. – There were a total of five test participants in the first round of study. 7
  • 8. Methodology • Study Process – Students were given an Android phone with the Text-shot app loaded. Investigators observed the students as they used the OCR mobile software to obtain suggested library resources. Investigators collected two sources of data: observation of how students interact with the software and a debriefing interview. 8
  • 9. Functionality Tests • Researchers tested the two main functions for the software. – Recognizing a string of text by taking a picture of the words in a student assignment sheet and; – suggesting subjects and titles based on the scanned text. 9
  • 10. Results • Themes related to the improvement of suggestions: – Show broad subjects first • Then expand to details subjects – Prominently display title suggestions 10
  • 11. Results • Feature Requests: – Include articles as well as book titles in recommendations • Use article APIs • LibGuides-like help guides 11
  • 13. Next steps in OCR • Topic Space app: Scanning call numbers in the library – If you scan a call number on a book, you can get recommendations of other, related books in the library, and other related digital content in the library. 13
  • 14. Topic Space: Book Scan 14
  • 15. Topic Space: Suggested Topic Spaces 15
  • 16. Topic Space: Related Books that are not available 16
  • 18. Future directions • Implementing OCR modules in the Minrva app: – http://minrvaproject.org/modules_topicspace.ph p • Open sourcing OCR technology for use in library settings: – http://minrvaproject.org/source.php 18
  • 19. Sponsors • Institute of Museum and Library Services • University of Illinois Campus Research Board 19
  • 20. Acknowledgements • My thanks to Ben Ryckman for Topic Space module development and support. • Many thanks to Chris Diaz, Residency Librarian, Scholarly Communications and Collections, University of Iowa for help with participant recruitment, observation, and interviewing support in the user studies • Thanks to Mayur Sadavarte, Graduate Student in Computer Science at the University of Illinois and Nate Ryckman, Graduate Student in Information Systems Management at Carnegie Mellon University for Optical Character recognition programming support. • Yinan Zhang, PhD Candidate in Computer Science at the University of Illinois, Sherry (Mengxue) Zheng, Graduate Student in Computer Science for help developing the search and suggestion functionality of the Deneb near-semantic index, Maria Lux, Graphic Designer for laying out the polished recommendations and prototyping Text-shot integration as a Minrva module. 20