SlideShare a Scribd company logo
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 251
A REVIEW ON SIGNATURE DETECTION AND SIGNATURE BASED
DOCUMENT IMAGE RETRIEVAL
Chinnu S Gupta1
, Umesh .D. Dixit2
1
M.Tech, E&CE dept, B.L.D.E.A’s College of Engineering and Technology, Vijayapur 586101, India
2
Asst. Prof in E&CE dept, B.L.D.E.A’s College of Engineering and Technology, Vijayapur 586101, India
Abstract
Transforming a paper document to its electronic version in a form suitable for efficient storage, retrieval, and interpretation
continues to be a challenging problem. Signature is an individualistic identification of a person. It is an authentic identification
because a signature cannot be copied by others. Signatures are a special case of handwriting subject to intra personal variation
and inter personal differences. To counter check fraud and forgery of handwritten signatures, Signature extraction from printed
text background and signature based document retrieval from a large dataset is necessary. A lot many techniques have been
implemented successfully for both signature extraction and signature based document retrieval. This paper present techniques
and methods evolved for signature extraction and signature based document retrieval.
Keywords: signature detection, signature extraction, Document image Retrieval, Query image retrieval
---------------------------------------------------------------------***--------------------------------------------------------------------
1. INTRODUCTION
A signature is an individualistic, unique, evidentiary entity.
It provides an important form of indexing that enables
effective image search and retrieval from large
heterogeneous document image collections. In this paper,
we surveyed different technique involved in retrieval system
that automatically detects, segments, and matches signatures
from document image with unconstrained layouts and
complex background. This would involve extracting all the
signatures from the documents and then performing a match
on these signatures. In searching complex documents, a task
of Relevance is relating a signature in a given document to
the closest matches within a database of document given a
database of signed document. The signature based document
retrieval has many of the applications and some of them are
listed below.
 For business documents.
 Government organizations.
 Digital libraries for online books, student thesis etc.
 Security requirements of document.
The main contributions of this paper are summarized as
follows. Firstly, in this paper we have provide general
framework and detailed survey of signature extraction and
signature based document retrieval. Secondly, we have
discussed about issues and challenges.
The paper is organized as follows. In section 2, we given the
general framework and in section 3, we review the related
work. The issues and challenges in section 4, in section 5,
we discuss the performance evaluation metrics. Section 6,
concludes the paper
2. GENERAL FRAMWORK
Fig 1.General framework
The figure shows general framework for signature based
document retrieval.
 Query image: The image containing signature is
called as query image.
 Signature extraction: It is the process of extracting
only the signature part of the document i.e.
separating signature and non-signature part of the
document.
 Preprocessing: The preprocessing phase is a
sequence of image transformations creating the best
possible input for feature extraction algorithms.
Preprocessing step may include filtering, RGB to
gray scale conversion and binarization.
 Feature extraction: Feature extraction is a set of
(usually) independent functions returning a
characteristic feature set for the input image. Features
can either be particular to the whole signature (global
features) or to a part of the signature (topological or
local features).
 Matching and Retrieval: Given a few available query
signature instances and a large database of document
containing signatures, the problem of signature
Query
Image
Signature
extraction
Preprocessing
Matchin
g and
Retrieval
Document
Retrieved
Database
Feature
extraction
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 252
matching is to find the most similar signature
samples from the database. By constructing the list of
best matching signature documents, we effectively
retrieve the set of documents authorized or authored
by the same person.
3. RELATED WORK
Madasu, et al. [1] and Chalechale, et al. [2] used geometric
features including area, circularity, aspect ratio, size and
position to analyze segmented regions. These features are
compared based on certain parameters such as Manhattan
distance. The crop method objective is to locate rectangular
box around the signature which is the object of interest and
remove other objects outside this area, like signatures on
bank cheque. The crop method works by moving four
vectors from the four different directions (namely up, right,
bottom and left) towards the object of interest. Each vector
will mark the border of each side of the rectangular box. The
crop is then applied to area. Once the area approximation
has been done on the first model of the cheque. It is then
easy to extract a signature from a bank cheque and scan it.
The method failed to extract the signature as the signature
was not in the stipulated area hence the system could
approximate the area to be cropped.
Djeziri, et al. [3] dealt with the signature extraction problem
by an approach that was to mimic the human visual
perception. They introduced the filiformity as a criterion for
the curvature characteristics of handwritten signatures.
Filiformity is defined for two topological measures for
binary objects which also includes gray level images. It
differentiates the contour lines of the signatures from the
handwritten lines which are being isolated. The process of
isolating is also used to provide measure for local values
regarding the whole image. This process fails when other
filiformity objects are present in the document.
Madasu, et al. [4] tried to crop the image segment by
estimating the area in which the signature lies using a
sliding window. They then analyzed the local entropy
derived from the pixel-based density of the region to decide
its being signature or not. This approach disregarded the
noise and therefore high-density regions are reported as
signatures incorrectly. This segmentation does not need any
a priori information about the data field and features.
Ritesh Banka, et al. [5] proposed the method of extracting
signature from cluttered background which is scale invariant
technique using the efficiency of symmetric and loop
features that are common in signatures for extracting the
standard signatures from other documents. Signature is a
handwritten text hence this technique can be used because
signatures in English contain most of the characters
exhibiting symmetry in vertical, horizontal or diagonal
directions. The self printed characters from the documents,
as a few papers, lack of proper self symmetry and are
misclassified as handwritten text at the first level. The
misclassified printed characters in the first level contain a
single symmetric loop, which discriminates them from hand
written elements. The algorithm had two domains character
level and document level. For poor quality document where
the numbers of characters joint and broken are very high, the
algorithm efficiency decreases.
R.Jayadeyan, et al. [6] considered the images of bank
document a method has been proposed for the extraction of
signatures, signature localization is done using variance,
signature block extraction is done using entropy,
normalization are used for desired mean and variance, a
hidden Markov model (HMM).Horizontal and vertical
projection, right and left envelope, top and bottom envelope,
horizontal and vertical size variations of the binary format of
the genuine signatures are considered as the feature set of an
individual.
Sha’ashua, et al. [7] introduced the concept of multi-scale
saliency feature for signature detection that defines signature
characteristics by identifying salient structure by grouping
contours. They defined a saliency function which increased
with the curve length and decreased with the curvature when
totally squared. This was done by locally connected
network. Their approach was confined to rigid objects
having 1-D contour. This can be applied to handwritten
notes collected on a tablet pc online since trajectories of the
pen are available the framework used is general as it does
not embed unique assumption on local feature of the object
for example stroke level features for signatures. Hence it is
robust against any changes in shape based object detection
problems and is applicable to different languages.
A three-stage procedure was proposed by Mandal, et al. [8]
to extract signatures. First an algorithm is used to locate the
Signature in the document using word-level feature
extraction. Second stage separates signature strokes that
overlap with the printed text. Final stage uses conditional
random field minimization energy concept with skeleton
analysis to classify real signature strokes. Improper
segmentation of strokes causes error in separating signature
from printed text.
Bassam, et al. [9] used the method for extracting signature
from image on document is the base proposed auto cropping
method. This method improves the performance of security
system based on signature image as possible the region of
interest of the used image for the biometric system and it
also reduces the time cost associated with signature. Auto
cropping is the fast procedure to extract the Region of
interest (ROI). In this method they used image segmentation
and extraction. This cropped signature has no garbage
region it crops only the ROI of signature image. This takes
less processing time then the original signature image. The
performance of this method shows through its speed time,
and keeping the content information of the signature object
without losing of any pixel of image.
Esteban, et al. [10] considered two main stages of extraction
one being comparing distribution of stroke in model
signature against distribution of stroke in the original
signature using a different approaches extraction being the
main concern accumulative evidence technique is used, and
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 253
they achieved a high accuracy with this approach. Their
assumption is not usually true since the bank application
forms are often filled by new customers and this process is
not followed by verification but an archiving step.
Tang, et al. [11] proposed a technique to automatically gain
knowledge from various types of documents because
knowledge acquisition is difficult to obtain. It was proposed
to gain automatic knowledge acquisition in document
images by analyzing the geometric structure and logical
structural images. These two structures play an important
role to process the knowledge acquisition. The limitations of
these techniques are to find which is right in transforming a
geometric structure into a logical structure. Another
difficulty is to find the correct rules involved in obtaining
knowledge from different documents.
Liu, et al. [12] presented an approach to image-based form
document retrieval and they proposed a similarity measure
for forms that is insensitive to translation, scaling, moderate
skew, and image quality fluctuations, and developed a
prototype form retrieval system based on their proposed
similarity measure.
G.Zhu, et al. [13] proposed the method for signature based
document retrieval performance using different shape
representation like salient contour computed by detection
and segmentation of the skeleton that are extracted directly
from labeled signature region of database .The thinning
algorithm shows sensitivity to structural variations due to
noise and neighboring stroke salient contours give a globally
consistent representation of structurally important shape
feature. This retrieval technique has been successful on the
Maryland Arabic dataset in which background handwritten
as well as signature are closely spaced.
Chalechale, et al. [14], proposed signature based
decomposition and retrieval of document images, and he
investigated Arabic/Persian signature recognition and
retrieval. For the conduction of retrieval automatic links
were considered between feature vector of the signature and
the document containing the image using file names. The
retrieval performance was measured by average normalized
retrieval rate. The ANMRR indicates better performance in
comparison with line segmentation distribution method.
A.chalechale, et al. [15] proposed new method for document
image decomposition based on connected component
analysis and geometric properties of labeled region. The
signature is detected and extracted by spatial partitioning by
accumulating pixels and by using magnitude and Fourier
transform they achieved rotation invariance. The main
objective is signature extraction from the original image and
converted into a compact feature vector that supports
measuring signature similarities.
Srihari, et al. [16], proposed a document image retrieval
using signature as queries. Using global shape binary feature
vector, a normalized correlation similarity measure for
signature matching is done. The objective is to retrieve the
closest matching signature obtained from dataset by the
same signature person to remove the printed text from the
signature image. An image enhancement procedure using
chain code information is used. The technique gave
promising result for group based and non grouped base
retrieval with accuracy and precision.
H.Srinivasan, et al. [17] used conditional random fields and
proposed the method for signature based retrieval. In this
they retrieved the document from database using signature
image as query. Isolating the different contents present in
the document and with the help of CRF in extracting
signature from complex document. This method presents the
signature retrieval strategy using document indexing and
retrieval. Indexing is done by using a model based on CRF.
SVM is also supported by this technique, and the CRF is
used to label each patch and identified using the labels of the
neighboring patches. Document retrieval is performed by
using matching algorithm to compare the query with the
signature.
Guangya Zhu, et. Al. [18], proposed a signature-based
document image retrieval system that automatically detects,
segments, and matches signatures from document images
with unconstrained layout and complex backgrounds
.Signature is treated as a non rigid shape and represented by
discrete set of 2-D point to most commonly sited measures
for retrieval R-precision and average precision. The R-
precision emphasizes the ranking among retrieve
documents. Extensive experimental and field test give
excellent performance for document retrieval
4. ISSUES AND CHALLENGES
The issues and challenges for signature extraction and
retrieval are listed below.
 Signature with low resolution in documents make
difficult for detection and segmentation.
 The background of document differs from each other.
 The computer vision faces an important problem of
Detecting, segmenting and matching deformable
objects such as signature.
 Documents are subjected to restricted processing
time due to urgency of applications. Therefore the
detection and retrieval time must be fast.
 The handwritten characters and auxiliary lines
contained in the document overlap and resemble
signatures.
5. PERFORMANCES AND EVALUATION
METRICS
 Accuracy: It is the percentage of number of correctly
detected signatures to the number of groundtruthed
signatures from the document image.
Accuracy =
a
a h
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 254
 Precision: This refers to the percentage of retrieval of
document images that are relevant to query [19].
Precision (N) =Rn / N
Where N is number of retrieval and Rn is number of
relevant matches among retrievals.
 Recall: This refers to the percentage of all the
relevant document images in the search database
which are retrieved.
Re call (N) =Rn /M
Where M is the total number of relevant matches in
the database Rn is the number of relevant matches
among retrievals.
A strategy for evaluating document image retrieval system
involves the following techniques. These techniques have
been partially successful in signature based document
retrieval. The technique involves extraction of signature
from certain documents by different methods. Matching of
signature is done by several method and retrieval by known
successful techniques.
6. CONCLUSION
This paper provides detailed survey of techniques, as it
overcomes the difficulty and challenges faced during
extraction of signatures from the window of different
background documents, cheque etc. This paper provides the
overview of methods to apply according to the type of the
background document and signature window. These newly
developed methods are going to give a high rate of
recognition. The performance of the most renowned
methods provided the higher signature recognition rate with
a greater accuracy in extraction and recognition of the
signatures and signature based documents. It also highlights
issues and challenges in this area.
REFERENCES
[1]. V K Madasu ,M H M Yusof M Hanmandilu K
K b ka,”Automic Extraction Of Signatures From Bank
Cheques And Other Documents”, DICTA’03 2003
[2]. A.Chalechale, G.Naghdya, P.Permaratne, A.Martins ”
Document Image Analysis and Verification using Cursive
S a ”, IEEE I a a C M m a
and Expo, 2004.
[3]. S.Djeziri,F.Nouboud,R.Plamondon, “Extraction Of
Signatures From Check Background Based On A Filiformity
Criterion”, IEEE Trans. Image Process. 7(10), 1425–1438,
1998.
[4]. V.K. Madasu, B C L v ,” Automatic Segmentation
and R O Ba k Ch q F ” Digital Image
Compute. Tech. Appl., 80(1): 33–40. 2005
[5]. R h Ba ka, Fa ha N bakh h,”Exraction Of
Signature &Handwritten Region From Official Binary
D m Ima ”,september, 2008
[6]. R Ja a va a ,” Variance based extraction and
hidden Markov model based verification of signatures
p ba k h q ”, International Conference on
Computation Intelligence and Multimedia Applications.
2007
[7]. G. Zhu, Y. Zheng, D. Doermann, S. Jeager, “Multi-scale
Structural Saliency for Signature D ”, IEEE
Conference on Computer Vision and Pattern Recognition.
2007
[8]. R. Mandal, P.P. Roy, U.Pal, “Signature Segmentation
from Machine Printed Documents using Conditional
Ra m F ”, International Conference on Document
Analysis and Recognition. 2011.
[9]. BassamAl-Mahadeen and Mokhled S,Islam
H A Ta aw h, ”Signature Region of interest using auto
pp ”, IJCSI,march 2010
[10]. J.L.Esteban, J.F. Vé z, Á Sá h z, ”Off-Line
Handwritten Signature Detection By Analysis Of Evidence
A m a ”, IJDAR, 15:359–368. 2012.
[11]. Y.Ta , C D Ya, a C Y S , ”Document
Processing for Automatic Knowledge Acquisition. IEEE
T a K w a Da a E ”, vol.6, no.1, pp.3-21,1994
[12]. J. Liu and A K Ja ”Imaged-Based Form Document
R va ”, Pattern Recognition, vol.33, no.3, pp.503-513,
2000.
[13]. Guangyu Zhu, Yefeg Zheng, and David
D ma ,”Signature Based Document Image Retrieval”,
ECCV, 2008, Part III, LNCS 5304, pp.752-765, 2000.
[14]. Abdul ah Cha ha , G hah Na h , ”Signature
Based Document Retrieval. Faculty Of Information-
Papers”, University of Wollongong.
[15]. Sargur N. Srihari, Shravya Shetty, Gady Agam and
Ophir Frieder .2006. Document Image Retrieval Using
Signature as Queries. In Proceedings of the Second
International Conference on Document Image Analysis for
Libraries (DIAL’06)
[16]. H. Srinivasan and sargur Sridhar,” Signature-Based
Retrieval Of Scanned Document Using Conditional Random
F ”,2009.
[17]. I kha , Ha a O ,”D Ha w
Signature In Scanned Documents “, February, 2014
[18]. M B K ka a M S Sh h ka , “D m
Ima R va : A Ov v w”, I a a J a
Computer Applications, vol. 1, no. 7, (2010), pp. 114-119.
BIOGRAPHIES
Miss. Chinnu s. Gupta has completed her
Bachelor of Engineering in Visvesvaraya
Technological University, Belgaum.
Currently pursuing Master in Technology,
from the same university. Area of interest
in Image Processing.
Mr. Umesh .D. Dixit has completed his BE
And M.Tech from Visvesvaraya
Technological University, Belgaum. He is
working as Asst.Prof.in the department of
E&C, BLDEA’ CET, V ja ap , 11
Years. Currently he is pursuing PhD in
Visvesvaraya Technological University,
Belgaum.

More Related Content

PDF
Recognition of Words in Tamil Script Using Neural Network
PDF
50120130406021
PDF
Proposed technique-for-edge-matching-of-torn-paper
PDF
Text content dependent writer identification
PDF
Applications of Pattern Recognition Algorithms in Agriculture: A Review
PDF
A Review on Geometrical Analysis in Character Recognition
PDF
Finding similarities between structured documents as a crucial stage for gene...
PDF
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...
Recognition of Words in Tamil Script Using Neural Network
50120130406021
Proposed technique-for-edge-matching-of-torn-paper
Text content dependent writer identification
Applications of Pattern Recognition Algorithms in Agriculture: A Review
A Review on Geometrical Analysis in Character Recognition
Finding similarities between structured documents as a crucial stage for gene...
An Efficient Segmentation Technique for Machine Printed Devanagiri Script: Bo...

What's hot (19)

PDF
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
PDF
DEVNAGARI DOCUMENT SEGMENTATION USING HISTOGRAM APPROACH
PDF
Offline signature identification using high intensity variations and cross ov...
PDF
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
PDF
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
PDF
An optimal face recoginition tool
PDF
Handwritten character recognition in
PPTX
offline character recognition for handwritten gujarati text
PDF
IRJET- Automated Document Summarization and Classification using Deep Lear...
PDF
Offline Signature Verification and Recognition using Neural Network
PPTX
Handwriting Recognition
PDF
A Fast and Accurate Palmprint Identification System based on Consistency Orie...
PDF
Devnagari handwritten numeral recognition using geometric features and statis...
PDF
A Comprehensive Study On Handwritten Character Recognition System
PDF
An offline signature recognition and verification system based on neural network
PDF
Automatic authentication-of-handwritten-documents-via-low-density-pixel-measu...
PPTX
Handwriting Recognition Using Deep Learning and Computer Version
PDF
Feature selection, optimization and clustering strategies of text documents
PDF
Algorithm for calculating relevance of documents in information retrieval sys...
­­­­Cursive Handwriting Recognition System using Feature Extraction and Artif...
DEVNAGARI DOCUMENT SEGMENTATION USING HISTOGRAM APPROACH
Offline signature identification using high intensity variations and cross ov...
Handwritten Character Recognition: A Comprehensive Review on Geometrical Anal...
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
An optimal face recoginition tool
Handwritten character recognition in
offline character recognition for handwritten gujarati text
IRJET- Automated Document Summarization and Classification using Deep Lear...
Offline Signature Verification and Recognition using Neural Network
Handwriting Recognition
A Fast and Accurate Palmprint Identification System based on Consistency Orie...
Devnagari handwritten numeral recognition using geometric features and statis...
A Comprehensive Study On Handwritten Character Recognition System
An offline signature recognition and verification system based on neural network
Automatic authentication-of-handwritten-documents-via-low-density-pixel-measu...
Handwriting Recognition Using Deep Learning and Computer Version
Feature selection, optimization and clustering strategies of text documents
Algorithm for calculating relevance of documents in information retrieval sys...
Ad

Similar to A review on signature detection and signature based document image retrieval (20)

PDF
Design of digital signature verification algorithm using relative slope method
PDF
Detection of fabrication in photocopy document using texture features through...
PDF
A novel approach for text extraction using effective pattern matching technique
PDF
A Novel approach for Document Clustering using Concept Extraction
PDF
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
PDF
A S URVEY ON D OCUMENT I MAGE A NALYSIS AND R ETRIEVAL S YSTEMS
PDF
OFFLINE HANDWRITTEN SIGNATURE RECOGNITION USING EDGE HINGE AND EDGE EXTRACTIO...
PDF
F045053236
PDF
Optimal approach for text summarization
PDF
Telugu letters dataset and parallel deep convolutional neural network with a...
PDF
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
PDF
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
PDF
A study and survey on various progressive duplicate detection mechanisms
PDF
Evaluating the efficiency of rule techniques for file classification
PDF
Survey of Machine Learning Techniques in Textual Document Classification
PDF
Evaluating the efficiency of rule techniques for file
PDF
IRJET- Review on Information Retrieval for Desktop Search Engine
PDF
Design and Implementation Recognition System for Handwritten Hindi/Marathi Do...
PDF
P33077080
PDF
Automatic signature verification with chain code using weighted distance and ...
Design of digital signature verification algorithm using relative slope method
Detection of fabrication in photocopy document using texture features through...
A novel approach for text extraction using effective pattern matching technique
A Novel approach for Document Clustering using Concept Extraction
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
A S URVEY ON D OCUMENT I MAGE A NALYSIS AND R ETRIEVAL S YSTEMS
OFFLINE HANDWRITTEN SIGNATURE RECOGNITION USING EDGE HINGE AND EDGE EXTRACTIO...
F045053236
Optimal approach for text summarization
Telugu letters dataset and parallel deep convolutional neural network with a...
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
Knowledge Graph and Similarity Based Retrieval Method for Query Answering System
A study and survey on various progressive duplicate detection mechanisms
Evaluating the efficiency of rule techniques for file classification
Survey of Machine Learning Techniques in Textual Document Classification
Evaluating the efficiency of rule techniques for file
IRJET- Review on Information Retrieval for Desktop Search Engine
Design and Implementation Recognition System for Handwritten Hindi/Marathi Do...
P33077080
Automatic signature verification with chain code using weighted distance and ...
Ad

More from eSAT Journals (20)

PDF
Mechanical properties of hybrid fiber reinforced concrete for pavements
PDF
Material management in construction – a case study
PDF
Managing drought short term strategies in semi arid regions a case study
PDF
Life cycle cost analysis of overlay for an urban road in bangalore
PDF
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
PDF
Laboratory investigation of expansive soil stabilized with natural inorganic ...
PDF
Influence of reinforcement on the behavior of hollow concrete block masonry p...
PDF
Influence of compaction energy on soil stabilized with chemical stabilizer
PDF
Geographical information system (gis) for water resources management
PDF
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
PDF
Factors influencing compressive strength of geopolymer concrete
PDF
Experimental investigation on circular hollow steel columns in filled with li...
PDF
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
PDF
Evaluation of punching shear in flat slabs
PDF
Evaluation of performance of intake tower dam for recent earthquake in india
PDF
Evaluation of operational efficiency of urban road network using travel time ...
PDF
Estimation of surface runoff in nallur amanikere watershed using scs cn method
PDF
Estimation of morphometric parameters and runoff using rs & gis techniques
PDF
Effect of variation of plastic hinge length on the results of non linear anal...
PDF
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Mechanical properties of hybrid fiber reinforced concrete for pavements
Material management in construction – a case study
Managing drought short term strategies in semi arid regions a case study
Life cycle cost analysis of overlay for an urban road in bangalore
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of compaction energy on soil stabilized with chemical stabilizer
Geographical information system (gis) for water resources management
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Factors influencing compressive strength of geopolymer concrete
Experimental investigation on circular hollow steel columns in filled with li...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Evaluation of punching shear in flat slabs
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of operational efficiency of urban road network using travel time ...
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of morphometric parameters and runoff using rs & gis techniques
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of use of recycled materials on indirect tensile strength of asphalt c...

Recently uploaded (20)

PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
web development for engineering and engineering
PPT
Mechanical Engineering MATERIALS Selection
PDF
PPT on Performance Review to get promotions
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
Geodesy 1.pptx...............................................
PPTX
Construction Project Organization Group 2.pptx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
CH1 Production IntroductoryConcepts.pptx
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Automation-in-Manufacturing-Chapter-Introduction.pdf
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
OOP with Java - Java Introduction (Basics)
web development for engineering and engineering
Mechanical Engineering MATERIALS Selection
PPT on Performance Review to get promotions
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Geodesy 1.pptx...............................................
Construction Project Organization Group 2.pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
CH1 Production IntroductoryConcepts.pptx
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Operating System & Kernel Study Guide-1 - converted.pdf
Embodied AI: Ushering in the Next Era of Intelligent Systems
CYBER-CRIMES AND SECURITY A guide to understanding
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf

A review on signature detection and signature based document image retrieval

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 251 A REVIEW ON SIGNATURE DETECTION AND SIGNATURE BASED DOCUMENT IMAGE RETRIEVAL Chinnu S Gupta1 , Umesh .D. Dixit2 1 M.Tech, E&CE dept, B.L.D.E.A’s College of Engineering and Technology, Vijayapur 586101, India 2 Asst. Prof in E&CE dept, B.L.D.E.A’s College of Engineering and Technology, Vijayapur 586101, India Abstract Transforming a paper document to its electronic version in a form suitable for efficient storage, retrieval, and interpretation continues to be a challenging problem. Signature is an individualistic identification of a person. It is an authentic identification because a signature cannot be copied by others. Signatures are a special case of handwriting subject to intra personal variation and inter personal differences. To counter check fraud and forgery of handwritten signatures, Signature extraction from printed text background and signature based document retrieval from a large dataset is necessary. A lot many techniques have been implemented successfully for both signature extraction and signature based document retrieval. This paper present techniques and methods evolved for signature extraction and signature based document retrieval. Keywords: signature detection, signature extraction, Document image Retrieval, Query image retrieval ---------------------------------------------------------------------***-------------------------------------------------------------------- 1. INTRODUCTION A signature is an individualistic, unique, evidentiary entity. It provides an important form of indexing that enables effective image search and retrieval from large heterogeneous document image collections. In this paper, we surveyed different technique involved in retrieval system that automatically detects, segments, and matches signatures from document image with unconstrained layouts and complex background. This would involve extracting all the signatures from the documents and then performing a match on these signatures. In searching complex documents, a task of Relevance is relating a signature in a given document to the closest matches within a database of document given a database of signed document. The signature based document retrieval has many of the applications and some of them are listed below.  For business documents.  Government organizations.  Digital libraries for online books, student thesis etc.  Security requirements of document. The main contributions of this paper are summarized as follows. Firstly, in this paper we have provide general framework and detailed survey of signature extraction and signature based document retrieval. Secondly, we have discussed about issues and challenges. The paper is organized as follows. In section 2, we given the general framework and in section 3, we review the related work. The issues and challenges in section 4, in section 5, we discuss the performance evaluation metrics. Section 6, concludes the paper 2. GENERAL FRAMWORK Fig 1.General framework The figure shows general framework for signature based document retrieval.  Query image: The image containing signature is called as query image.  Signature extraction: It is the process of extracting only the signature part of the document i.e. separating signature and non-signature part of the document.  Preprocessing: The preprocessing phase is a sequence of image transformations creating the best possible input for feature extraction algorithms. Preprocessing step may include filtering, RGB to gray scale conversion and binarization.  Feature extraction: Feature extraction is a set of (usually) independent functions returning a characteristic feature set for the input image. Features can either be particular to the whole signature (global features) or to a part of the signature (topological or local features).  Matching and Retrieval: Given a few available query signature instances and a large database of document containing signatures, the problem of signature Query Image Signature extraction Preprocessing Matchin g and Retrieval Document Retrieved Database Feature extraction
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 252 matching is to find the most similar signature samples from the database. By constructing the list of best matching signature documents, we effectively retrieve the set of documents authorized or authored by the same person. 3. RELATED WORK Madasu, et al. [1] and Chalechale, et al. [2] used geometric features including area, circularity, aspect ratio, size and position to analyze segmented regions. These features are compared based on certain parameters such as Manhattan distance. The crop method objective is to locate rectangular box around the signature which is the object of interest and remove other objects outside this area, like signatures on bank cheque. The crop method works by moving four vectors from the four different directions (namely up, right, bottom and left) towards the object of interest. Each vector will mark the border of each side of the rectangular box. The crop is then applied to area. Once the area approximation has been done on the first model of the cheque. It is then easy to extract a signature from a bank cheque and scan it. The method failed to extract the signature as the signature was not in the stipulated area hence the system could approximate the area to be cropped. Djeziri, et al. [3] dealt with the signature extraction problem by an approach that was to mimic the human visual perception. They introduced the filiformity as a criterion for the curvature characteristics of handwritten signatures. Filiformity is defined for two topological measures for binary objects which also includes gray level images. It differentiates the contour lines of the signatures from the handwritten lines which are being isolated. The process of isolating is also used to provide measure for local values regarding the whole image. This process fails when other filiformity objects are present in the document. Madasu, et al. [4] tried to crop the image segment by estimating the area in which the signature lies using a sliding window. They then analyzed the local entropy derived from the pixel-based density of the region to decide its being signature or not. This approach disregarded the noise and therefore high-density regions are reported as signatures incorrectly. This segmentation does not need any a priori information about the data field and features. Ritesh Banka, et al. [5] proposed the method of extracting signature from cluttered background which is scale invariant technique using the efficiency of symmetric and loop features that are common in signatures for extracting the standard signatures from other documents. Signature is a handwritten text hence this technique can be used because signatures in English contain most of the characters exhibiting symmetry in vertical, horizontal or diagonal directions. The self printed characters from the documents, as a few papers, lack of proper self symmetry and are misclassified as handwritten text at the first level. The misclassified printed characters in the first level contain a single symmetric loop, which discriminates them from hand written elements. The algorithm had two domains character level and document level. For poor quality document where the numbers of characters joint and broken are very high, the algorithm efficiency decreases. R.Jayadeyan, et al. [6] considered the images of bank document a method has been proposed for the extraction of signatures, signature localization is done using variance, signature block extraction is done using entropy, normalization are used for desired mean and variance, a hidden Markov model (HMM).Horizontal and vertical projection, right and left envelope, top and bottom envelope, horizontal and vertical size variations of the binary format of the genuine signatures are considered as the feature set of an individual. Sha’ashua, et al. [7] introduced the concept of multi-scale saliency feature for signature detection that defines signature characteristics by identifying salient structure by grouping contours. They defined a saliency function which increased with the curve length and decreased with the curvature when totally squared. This was done by locally connected network. Their approach was confined to rigid objects having 1-D contour. This can be applied to handwritten notes collected on a tablet pc online since trajectories of the pen are available the framework used is general as it does not embed unique assumption on local feature of the object for example stroke level features for signatures. Hence it is robust against any changes in shape based object detection problems and is applicable to different languages. A three-stage procedure was proposed by Mandal, et al. [8] to extract signatures. First an algorithm is used to locate the Signature in the document using word-level feature extraction. Second stage separates signature strokes that overlap with the printed text. Final stage uses conditional random field minimization energy concept with skeleton analysis to classify real signature strokes. Improper segmentation of strokes causes error in separating signature from printed text. Bassam, et al. [9] used the method for extracting signature from image on document is the base proposed auto cropping method. This method improves the performance of security system based on signature image as possible the region of interest of the used image for the biometric system and it also reduces the time cost associated with signature. Auto cropping is the fast procedure to extract the Region of interest (ROI). In this method they used image segmentation and extraction. This cropped signature has no garbage region it crops only the ROI of signature image. This takes less processing time then the original signature image. The performance of this method shows through its speed time, and keeping the content information of the signature object without losing of any pixel of image. Esteban, et al. [10] considered two main stages of extraction one being comparing distribution of stroke in model signature against distribution of stroke in the original signature using a different approaches extraction being the main concern accumulative evidence technique is used, and
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 253 they achieved a high accuracy with this approach. Their assumption is not usually true since the bank application forms are often filled by new customers and this process is not followed by verification but an archiving step. Tang, et al. [11] proposed a technique to automatically gain knowledge from various types of documents because knowledge acquisition is difficult to obtain. It was proposed to gain automatic knowledge acquisition in document images by analyzing the geometric structure and logical structural images. These two structures play an important role to process the knowledge acquisition. The limitations of these techniques are to find which is right in transforming a geometric structure into a logical structure. Another difficulty is to find the correct rules involved in obtaining knowledge from different documents. Liu, et al. [12] presented an approach to image-based form document retrieval and they proposed a similarity measure for forms that is insensitive to translation, scaling, moderate skew, and image quality fluctuations, and developed a prototype form retrieval system based on their proposed similarity measure. G.Zhu, et al. [13] proposed the method for signature based document retrieval performance using different shape representation like salient contour computed by detection and segmentation of the skeleton that are extracted directly from labeled signature region of database .The thinning algorithm shows sensitivity to structural variations due to noise and neighboring stroke salient contours give a globally consistent representation of structurally important shape feature. This retrieval technique has been successful on the Maryland Arabic dataset in which background handwritten as well as signature are closely spaced. Chalechale, et al. [14], proposed signature based decomposition and retrieval of document images, and he investigated Arabic/Persian signature recognition and retrieval. For the conduction of retrieval automatic links were considered between feature vector of the signature and the document containing the image using file names. The retrieval performance was measured by average normalized retrieval rate. The ANMRR indicates better performance in comparison with line segmentation distribution method. A.chalechale, et al. [15] proposed new method for document image decomposition based on connected component analysis and geometric properties of labeled region. The signature is detected and extracted by spatial partitioning by accumulating pixels and by using magnitude and Fourier transform they achieved rotation invariance. The main objective is signature extraction from the original image and converted into a compact feature vector that supports measuring signature similarities. Srihari, et al. [16], proposed a document image retrieval using signature as queries. Using global shape binary feature vector, a normalized correlation similarity measure for signature matching is done. The objective is to retrieve the closest matching signature obtained from dataset by the same signature person to remove the printed text from the signature image. An image enhancement procedure using chain code information is used. The technique gave promising result for group based and non grouped base retrieval with accuracy and precision. H.Srinivasan, et al. [17] used conditional random fields and proposed the method for signature based retrieval. In this they retrieved the document from database using signature image as query. Isolating the different contents present in the document and with the help of CRF in extracting signature from complex document. This method presents the signature retrieval strategy using document indexing and retrieval. Indexing is done by using a model based on CRF. SVM is also supported by this technique, and the CRF is used to label each patch and identified using the labels of the neighboring patches. Document retrieval is performed by using matching algorithm to compare the query with the signature. Guangya Zhu, et. Al. [18], proposed a signature-based document image retrieval system that automatically detects, segments, and matches signatures from document images with unconstrained layout and complex backgrounds .Signature is treated as a non rigid shape and represented by discrete set of 2-D point to most commonly sited measures for retrieval R-precision and average precision. The R- precision emphasizes the ranking among retrieve documents. Extensive experimental and field test give excellent performance for document retrieval 4. ISSUES AND CHALLENGES The issues and challenges for signature extraction and retrieval are listed below.  Signature with low resolution in documents make difficult for detection and segmentation.  The background of document differs from each other.  The computer vision faces an important problem of Detecting, segmenting and matching deformable objects such as signature.  Documents are subjected to restricted processing time due to urgency of applications. Therefore the detection and retrieval time must be fast.  The handwritten characters and auxiliary lines contained in the document overlap and resemble signatures. 5. PERFORMANCES AND EVALUATION METRICS  Accuracy: It is the percentage of number of correctly detected signatures to the number of groundtruthed signatures from the document image. Accuracy = a a h
  • 4. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 254  Precision: This refers to the percentage of retrieval of document images that are relevant to query [19]. Precision (N) =Rn / N Where N is number of retrieval and Rn is number of relevant matches among retrievals.  Recall: This refers to the percentage of all the relevant document images in the search database which are retrieved. Re call (N) =Rn /M Where M is the total number of relevant matches in the database Rn is the number of relevant matches among retrievals. A strategy for evaluating document image retrieval system involves the following techniques. These techniques have been partially successful in signature based document retrieval. The technique involves extraction of signature from certain documents by different methods. Matching of signature is done by several method and retrieval by known successful techniques. 6. CONCLUSION This paper provides detailed survey of techniques, as it overcomes the difficulty and challenges faced during extraction of signatures from the window of different background documents, cheque etc. This paper provides the overview of methods to apply according to the type of the background document and signature window. These newly developed methods are going to give a high rate of recognition. The performance of the most renowned methods provided the higher signature recognition rate with a greater accuracy in extraction and recognition of the signatures and signature based documents. It also highlights issues and challenges in this area. REFERENCES [1]. V K Madasu ,M H M Yusof M Hanmandilu K K b ka,”Automic Extraction Of Signatures From Bank Cheques And Other Documents”, DICTA’03 2003 [2]. A.Chalechale, G.Naghdya, P.Permaratne, A.Martins ” Document Image Analysis and Verification using Cursive S a ”, IEEE I a a C M m a and Expo, 2004. [3]. S.Djeziri,F.Nouboud,R.Plamondon, “Extraction Of Signatures From Check Background Based On A Filiformity Criterion”, IEEE Trans. Image Process. 7(10), 1425–1438, 1998. [4]. V.K. Madasu, B C L v ,” Automatic Segmentation and R O Ba k Ch q F ” Digital Image Compute. Tech. Appl., 80(1): 33–40. 2005 [5]. R h Ba ka, Fa ha N bakh h,”Exraction Of Signature &Handwritten Region From Official Binary D m Ima ”,september, 2008 [6]. R Ja a va a ,” Variance based extraction and hidden Markov model based verification of signatures p ba k h q ”, International Conference on Computation Intelligence and Multimedia Applications. 2007 [7]. G. Zhu, Y. Zheng, D. Doermann, S. Jeager, “Multi-scale Structural Saliency for Signature D ”, IEEE Conference on Computer Vision and Pattern Recognition. 2007 [8]. R. Mandal, P.P. Roy, U.Pal, “Signature Segmentation from Machine Printed Documents using Conditional Ra m F ”, International Conference on Document Analysis and Recognition. 2011. [9]. BassamAl-Mahadeen and Mokhled S,Islam H A Ta aw h, ”Signature Region of interest using auto pp ”, IJCSI,march 2010 [10]. J.L.Esteban, J.F. Vé z, Á Sá h z, ”Off-Line Handwritten Signature Detection By Analysis Of Evidence A m a ”, IJDAR, 15:359–368. 2012. [11]. Y.Ta , C D Ya, a C Y S , ”Document Processing for Automatic Knowledge Acquisition. IEEE T a K w a Da a E ”, vol.6, no.1, pp.3-21,1994 [12]. J. Liu and A K Ja ”Imaged-Based Form Document R va ”, Pattern Recognition, vol.33, no.3, pp.503-513, 2000. [13]. Guangyu Zhu, Yefeg Zheng, and David D ma ,”Signature Based Document Image Retrieval”, ECCV, 2008, Part III, LNCS 5304, pp.752-765, 2000. [14]. Abdul ah Cha ha , G hah Na h , ”Signature Based Document Retrieval. Faculty Of Information- Papers”, University of Wollongong. [15]. Sargur N. Srihari, Shravya Shetty, Gady Agam and Ophir Frieder .2006. Document Image Retrieval Using Signature as Queries. In Proceedings of the Second International Conference on Document Image Analysis for Libraries (DIAL’06) [16]. H. Srinivasan and sargur Sridhar,” Signature-Based Retrieval Of Scanned Document Using Conditional Random F ”,2009. [17]. I kha , Ha a O ,”D Ha w Signature In Scanned Documents “, February, 2014 [18]. M B K ka a M S Sh h ka , “D m Ima R va : A Ov v w”, I a a J a Computer Applications, vol. 1, no. 7, (2010), pp. 114-119. BIOGRAPHIES Miss. Chinnu s. Gupta has completed her Bachelor of Engineering in Visvesvaraya Technological University, Belgaum. Currently pursuing Master in Technology, from the same university. Area of interest in Image Processing. Mr. Umesh .D. Dixit has completed his BE And M.Tech from Visvesvaraya Technological University, Belgaum. He is working as Asst.Prof.in the department of E&C, BLDEA’ CET, V ja ap , 11 Years. Currently he is pursuing PhD in Visvesvaraya Technological University, Belgaum.