SlideShare a Scribd company logo
Graph Based Word Spotting Approach for
Large Document Collections
Pau Riba, Josep Lladós and Alicia Fornés
Third Graph-TA, 18 March 2015
Index
 Introduction
 Database
 Graph Construction
 Word spotting approach
 Graph Indexation
 Experiment Results
 Conclusions and Future Work
Introduction
• Word Spotting: Locate a given query word in an image in terms of
shape features
• Most techniques use statistical representations. However, we propose
a graph-based approach.
• The proposed structural representation is suitable to be robust to the
inherent deformations of handwriting.
• To overcome the high computational complexity of subgraph matching
through an Indexation formalism for graph retrieval.
The Barcelona Marriage Licenses
• Old Marriage Licenses of the Cathedral of Barcelona (Spain)
– Pope Benedict XIII established in 1408 a marriage fee (for building the Cathedral)
– 244 books (15th -19th centuries)
– Approx. 700.000 marriage licenses from 90 parish churches
– The books include information on the couples, their parents, their jobs, and the tax
paid depending on their social class
NAME
DATE
JOB
PLACE
FEE
NAME
NAME
Graph Construction
• Overview:
Word Spotting Approach
• Graph Construction.
• Graph Matching:
– Bipartite Graph Matching: Suboptimal approximation of Graph Edit Distance.
• Retrieval depending on the edition cost.
Graph Indexation
• Binary embedding for each node, a vector of attributes representing
their local structure.
– The attributes count the length of a walk of order k originated in a vertex with label l.
(Idea of the Morgan Index).
– Binary-valued hash function to convert the vector to binary.
– Graph retrieval in terms of finding target graphs whose nodes have a small
Hamming distance from the query node.
– Those nodes will vote into regions and afterwards the most voted ones are chosen
as candidates to contain the query.
• Query:
• Some retrieved words:
• Typical errors
Experiments: Qualitative Results
• 27 pages of marriage records
• 6544 segmented words with
1751 transcriptions
• 514 queries with
32 different word classes
• Classic problems:
Experiments: Quantitative Results
Method mAP
DTW 19,20
Graph-based 24,60
BoVW 30,00
Loci-based 40,06
nrHOG 56,06
Proposed 48,64
Binarization
Lexical
Variations
Shared
Letters
Experiments: Qualitative Results
• 11 pages
• 3609 words
• 40 queries with
• 8 different words
Experiments: Quantitative Results
Query Transcription Precision Recall mAP
Eularia 0,0080 0,8462 0,7959
Hieronyma 0,0118 0,7875 0,9329
Jua$ 0,0149 0,5389 0,8490
defunct 0,0271 0,7886 0,6372
donsella 0,0420 0,8215 0,9454
pages 0,0590 0,9352 0,9463
rebere$ 0,0645 0,7676 0,9815
viudo 0,0133 0,6455 0,9231
Total 0,0301 0,7664 0,8764
Conclusions and Future Work
• Graph-based representation is comparable to statistical approches
in terms of performance and time requirements.
• Graphemes based on convexities can be stable under the
deformations of handwriting.
• Graph indexing approach can deal with large collections and avoid
the segmentation of words at the same time.
• Future work will focus on the evaluation of the stability of graph-
based representation in large multiwriter document collections.
Thank you!

More Related Content

PDF
co:op-READ-Convention Marburg - Enrique Vidal
PDF
A NOVEL APPROACH FOR WORD RETRIEVAL FROM DEVANAGARI DOCUMENT IMAGES
PPTX
Towards advanced data retrieval from learning objects repositories
PPT
Recognition as Graph Matching
PDF
PPT
Combining Inverted Indices and Structured Search for Ad-hoc Object Retrieval
PPTX
Image Captioning of Handwritten Mathematical Expressions
PPT
4888009.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
co:op-READ-Convention Marburg - Enrique Vidal
A NOVEL APPROACH FOR WORD RETRIEVAL FROM DEVANAGARI DOCUMENT IMAGES
Towards advanced data retrieval from learning objects repositories
Recognition as Graph Matching
Combining Inverted Indices and Structured Search for Ad-hoc Object Retrieval
Image Captioning of Handwritten Mathematical Expressions
4888009.pptnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn

Similar to Graph Based Word Spotting Approach for Large Document Collections (20)

PPTX
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
PPTX
Segmentation - based Historical Handwritten Word Spotting using document-spec...
PDF
LINSEN an efficient approach to split identifiers and expand abbreviations
PDF
Space-efficient detection of unusual words
PPTX
LSDI 2.pptx
PDF
Fosdem 2013 petra selmer flexible querying of graph data
PDF
A Graph-based Model for multimodal Information Retrieval (partially presented)
PPT
Finite State Queries In Lucene
PPTX
Building a names backbone
PPT
Compact and Distinctive Visual Vocabularies for Efficient Multimedia Data Ind...
PPTX
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
PDF
An evaluation and overview of indices
PDF
Probabilistic Data Structures and Approximate Solutions Oleksandr Pryymak
PPTX
Topical_Facets
PPTX
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
PDF
An Evaluation and Overview of Indices Based on Arabic Documents
PDF
An Evaluation and Overview of Indices Based on Arabic Documents
PDF
40120140501013
PDF
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
PDF
Ch14-Part4-ImageRetrieval.pdf
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Segmentation - based Historical Handwritten Word Spotting using document-spec...
LINSEN an efficient approach to split identifiers and expand abbreviations
Space-efficient detection of unusual words
LSDI 2.pptx
Fosdem 2013 petra selmer flexible querying of graph data
A Graph-based Model for multimodal Information Retrieval (partially presented)
Finite State Queries In Lucene
Building a names backbone
Compact and Distinctive Visual Vocabularies for Efficient Multimedia Data Ind...
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
An evaluation and overview of indices
Probabilistic Data Structures and Approximate Solutions Oleksandr Pryymak
Topical_Facets
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
An Evaluation and Overview of Indices Based on Arabic Documents
An Evaluation and Overview of Indices Based on Arabic Documents
40120140501013
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
Ch14-Part4-ImageRetrieval.pdf
Ad

More from Graph-TA (20)

PDF
Computing on Event-sourced Graphs
PDF
Using Evolutionary Computing for Feature-driven Graph generation
PDF
Reactive Databases for Big Data applications
PDF
The scarcity of crossing dependencies: a direct outcome of a specific constra...
PDF
Holistic Benchmarking of Big Linked Data: HOBBIT
PDF
Identifiability in Dynamic Casual Networks
PDF
Polyglot Graph Databases using OCL as pivot
PDF
Benchmarking Versioning for Big Linked Data
PDF
Synthetic Data Generation using exponential random Graph modeling
PDF
Use of Graphs for Cloud Service Selection in Multi-Cloud Environments
PDF
Graphalytics: A big data benchmark for graph-processing platforms
PDF
Modelling the Clustering Coefficient of a Random graph
PPTX
RDF Graph Data Management in Oracle Database and NoSQL Platforms
PPTX
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
PPTX
On the Discovery of Novel Drug-Target Interactions from Dense SubGraphs
PDF
Graphalytics: A big data benchmark for graph processing platforms
PDF
Autograph: an evolving lightweight graph tool
PPTX
Understanding Graph Structure in Knowledge Bases
PDF
Finding patterns of chronic disease and medication prescriptions from a large...
PDF
Recent Updates on IBM System G — GraphBIG and Temporal Data
Computing on Event-sourced Graphs
Using Evolutionary Computing for Feature-driven Graph generation
Reactive Databases for Big Data applications
The scarcity of crossing dependencies: a direct outcome of a specific constra...
Holistic Benchmarking of Big Linked Data: HOBBIT
Identifiability in Dynamic Casual Networks
Polyglot Graph Databases using OCL as pivot
Benchmarking Versioning for Big Linked Data
Synthetic Data Generation using exponential random Graph modeling
Use of Graphs for Cloud Service Selection in Multi-Cloud Environments
Graphalytics: A big data benchmark for graph-processing platforms
Modelling the Clustering Coefficient of a Random graph
RDF Graph Data Management in Oracle Database and NoSQL Platforms
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
On the Discovery of Novel Drug-Target Interactions from Dense SubGraphs
Graphalytics: A big data benchmark for graph processing platforms
Autograph: an evolving lightweight graph tool
Understanding Graph Structure in Knowledge Bases
Finding patterns of chronic disease and medication prescriptions from a large...
Recent Updates on IBM System G — GraphBIG and Temporal Data
Ad

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Approach and Philosophy of On baking technology
PPTX
A Presentation on Artificial Intelligence
PPTX
1. Introduction to Computer Programming.pptx
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
Hybrid model detection and classification of lung cancer
PPTX
A Presentation on Touch Screen Technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
Mushroom cultivation and it's methods.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
project resource management chapter-09.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Accuracy of neural networks in brain wave diagnosis of schizophrenia
SOPHOS-XG Firewall Administrator PPT.pptx
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Assigned Numbers - 2025 - Bluetooth® Document
Approach and Philosophy of On baking technology
A Presentation on Artificial Intelligence
1. Introduction to Computer Programming.pptx
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Hybrid model detection and classification of lung cancer
A Presentation on Touch Screen Technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
A comparative analysis of optical character recognition models for extracting...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Mushroom cultivation and it's methods.pdf
Chapter 5: Probability Theory and Statistics
Web App vs Mobile App What Should You Build First.pdf
Univ-Connecticut-ChatGPT-Presentaion.pdf
project resource management chapter-09.pdf

Graph Based Word Spotting Approach for Large Document Collections

  • 1. Graph Based Word Spotting Approach for Large Document Collections Pau Riba, Josep Lladós and Alicia Fornés Third Graph-TA, 18 March 2015
  • 2. Index  Introduction  Database  Graph Construction  Word spotting approach  Graph Indexation  Experiment Results  Conclusions and Future Work
  • 3. Introduction • Word Spotting: Locate a given query word in an image in terms of shape features • Most techniques use statistical representations. However, we propose a graph-based approach. • The proposed structural representation is suitable to be robust to the inherent deformations of handwriting. • To overcome the high computational complexity of subgraph matching through an Indexation formalism for graph retrieval.
  • 4. The Barcelona Marriage Licenses • Old Marriage Licenses of the Cathedral of Barcelona (Spain) – Pope Benedict XIII established in 1408 a marriage fee (for building the Cathedral) – 244 books (15th -19th centuries) – Approx. 700.000 marriage licenses from 90 parish churches – The books include information on the couples, their parents, their jobs, and the tax paid depending on their social class NAME DATE JOB PLACE FEE NAME NAME
  • 6. Word Spotting Approach • Graph Construction. • Graph Matching: – Bipartite Graph Matching: Suboptimal approximation of Graph Edit Distance. • Retrieval depending on the edition cost.
  • 7. Graph Indexation • Binary embedding for each node, a vector of attributes representing their local structure. – The attributes count the length of a walk of order k originated in a vertex with label l. (Idea of the Morgan Index). – Binary-valued hash function to convert the vector to binary. – Graph retrieval in terms of finding target graphs whose nodes have a small Hamming distance from the query node. – Those nodes will vote into regions and afterwards the most voted ones are chosen as candidates to contain the query.
  • 8. • Query: • Some retrieved words: • Typical errors Experiments: Qualitative Results
  • 9. • 27 pages of marriage records • 6544 segmented words with 1751 transcriptions • 514 queries with 32 different word classes • Classic problems: Experiments: Quantitative Results Method mAP DTW 19,20 Graph-based 24,60 BoVW 30,00 Loci-based 40,06 nrHOG 56,06 Proposed 48,64 Binarization Lexical Variations Shared Letters
  • 11. • 11 pages • 3609 words • 40 queries with • 8 different words Experiments: Quantitative Results Query Transcription Precision Recall mAP Eularia 0,0080 0,8462 0,7959 Hieronyma 0,0118 0,7875 0,9329 Jua$ 0,0149 0,5389 0,8490 defunct 0,0271 0,7886 0,6372 donsella 0,0420 0,8215 0,9454 pages 0,0590 0,9352 0,9463 rebere$ 0,0645 0,7676 0,9815 viudo 0,0133 0,6455 0,9231 Total 0,0301 0,7664 0,8764
  • 12. Conclusions and Future Work • Graph-based representation is comparable to statistical approches in terms of performance and time requirements. • Graphemes based on convexities can be stable under the deformations of handwriting. • Graph indexing approach can deal with large collections and avoid the segmentation of words at the same time. • Future work will focus on the evaluation of the stability of graph- based representation in large multiwriter document collections.