Characterness: An Indicator of Text in the Wild 
ABSTRACT: 
Text in an image provides vital information for interpreting its contents, and text in 
a scene can aid a variety of tasks from navigation to obstacle avoidance and 
odometry. Despite its value, however, detecting general text in images remains a 
challenging research problem. Motivated by the need to consider the widely 
varying forms of natural text, we propose a bottom-up approach to the problem, 
which reflects the characterness of an image region. In this sense, our approach 
mirrors the move from saliency detection methods to measures of objectness. In 
order to measure the characterness, we develop three novel cues that are tailored 
for character detection and a Bayesian method for their integration. Because text is 
made up of sets of characters, we then design a Markov random field model so as 
to exploit the inherent dependencies between characters. We experimentally 
demonstrate the effectiveness of our characterness cues as well as the advantage of 
Bayesian multi-cue integration. The proposed text detector outperforms
state-of-the-art methods on a few benchmark scene text detection data sets. We
also show that our measurement of characterness is superior to state-of-the-art
saliency detection models when applied to the same task.
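
As an illustration of what Bayesian integration of several cues can look like, the following is a minimal sketch, not the authors' implementation. It assumes the cues are conditionally independent given the class (a naive Bayes assumption) and that each cue score has already been quantised into a bin. The function name, the likelihood tables, and the prior are all hypothetical.

```python
from math import prod


def characterness_posterior(cue_bins, lik_char, lik_bg, prior_char=0.5):
    """Posterior probability that a region is a character.

    cue_bins  : one bin index per cue (hypothetical quantised cue scores)
    lik_char  : lik_char[k][b] = p(cue k falls in bin b | character)
    lik_bg    : lik_bg[k][b]   = p(cue k falls in bin b | background)
    prior_char: assumed prior probability of the character class
    """
    # Naive Bayes: multiply the prior by the per-cue likelihoods.
    p_char = prior_char * prod(lik_char[k][b] for k, b in enumerate(cue_bins))
    p_bg = (1.0 - prior_char) * prod(lik_bg[k][b] for k, b in enumerate(cue_bins))
    total = p_char + p_bg
    # Fall back to the prior if both products vanish.
    return p_char / total if total > 0 else prior_char


# Made-up likelihood tables for three cues with two bins each.
LIK_CHAR = [[0.2, 0.8], [0.3, 0.7], [0.4, 0.6]]
LIK_BG = [[0.7, 0.3], [0.6, 0.4], [0.5, 0.5]]

if __name__ == "__main__":
    print(characterness_posterior([1, 1, 1], LIK_CHAR, LIK_BG))
    print(characterness_posterior([0, 0, 0], LIK_CHAR, LIK_BG))
```

In this toy setting a region whose three cues all fall in the "character-like" bin receives a much higher posterior than one whose cues all fall in the other bin, which is the behaviour a combined characterness score is meant to have.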
EXISTING SYSTEM: 
Our basic motivation is the fact that text attracts human attention, even when 
amongst a cluttered background. This has been shown by a range of authors,
including Judd et al. and Cerf et al., who verified that humans tend to focus on text
in natural scenes. Previous work has also demonstrated that saliency detection
models can be used in the early stages of scene text detection. For example, a
saliency map obtained from the model of Itti et al. was used to find regions of
interest. Uchida et al. showed that using both SURF and saliency features achieved
superior character recognition performance over using SURF features alone. More
recently, Shahab et al. compared the performance of four different saliency detection models at scene
text detection. Meng and Song adopted the saliency framework for scene text 
detection. 
DISADVANTAGES OF EXISTING SYSTEM: 
• While the aforementioned approaches have demonstrated that saliency
detection models facilitate scene text detection, they share a common
inherent limitation: they are distracted by other salient
objects in the scene.
The objectness approach has been shown to be very useful as a pre-processing step
for a wide range of problems, including occlusion boundary detection, semantic
segmentation, and training object class detectors.
PROPOSED SYSTEM: 
We propose a similar approach to text detection, in that we seek to develop a
method capable of identifying individual, bounded units of text, rather
than areas with text-like characteristics. The unit in the case of text is the character,
and much like the ‘object’, it has a particular set of characteristics, including a
closed boundary. In contrast to objects, however, text is made up of a set of
inter-related characters. Effective text detection should therefore be able to
compensate for, and exploit, these dependencies between characters. The object
detection method is similar to that proposed here inasmuch as it is based on a
Bayesian framework combining a number of visual cues, including one which
represents the boundary of the object, and one which measures the degree to which
a putative object differs from the background.
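
One way to exploit dependencies between neighbouring characters is a pairwise Markov random field over candidate regions. The sketch below is illustrative only and assumes a simple energy: a unary cost from each region's characterness score plus a Potts-style penalty when two linked regions receive different labels. It is minimised with iterated conditional modes (ICM), chosen here for brevity and not taken from the source. The function name, the edge weights, and the `smoothness` parameter are hypothetical.

```python
import math


def icm_label(char_prob, edges, smoothness=1.0, max_iters=10):
    """Label candidate regions as text (1) or non-text (0).

    char_prob : characterness score in (0, 1) for each region
    edges     : {(i, j): weight} linking neighbouring regions (assumed given)
    smoothness: strength of the penalty for disagreeing neighbours
    """
    n = len(char_prob)
    eps = 1e-9
    # Unary costs are negative log-probabilities: (cost of 0, cost of 1).
    unary = [(-math.log(1.0 - p + eps), -math.log(p + eps)) for p in char_prob]
    neighbours = {i: [] for i in range(n)}
    for (i, j), w in edges.items():
        neighbours[i].append((j, w))
        neighbours[j].append((i, w))

    # Start from the independent, per-region decision.
    labels = [1 if p >= 0.5 else 0 for p in char_prob]

    def energy(i, lab):
        disagree = sum(w for j, w in neighbours[i] if labels[j] != lab)
        return unary[i][lab] + smoothness * disagree

    for _ in range(max_iters):
        changed = False
        for i in range(n):
            other = 1 - labels[i]
            # Flip only when the alternative label is strictly cheaper.
            if energy(i, other) < energy(i, labels[i]):
                labels[i] = other
                changed = True
        if not changed:
            break
    return labels


if __name__ == "__main__":
    # Region 1 is weak on its own but sits between two strong candidates.
    print(icm_label([0.9, 0.4, 0.85, 0.1], {(0, 1): 1.0, (1, 2): 1.0}))
```

With the pairwise term switched on, the weak middle region is pulled towards the label of its two confident neighbours, while the isolated low-scoring region stays non-text. Setting `smoothness=0` reduces the model to thresholding each region independently.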
ADVANTAGES OF PROPOSED SYSTEM: 
We are the first to present a saliency detection model which measures the 
characterness of image regions. This text-specific saliency detection model is less
likely to be distracted by other objects that are usually considered salient by
general saliency detection models.
SYSTEM ARCHITECTURE: 
SYSTEM REQUIREMENTS: 
HARDWARE REQUIREMENTS: 
• System : Pentium IV 2.4 GHz.
• Hard Disk : 40 GB.
• Floppy Drive : 1.44 MB.
• Monitor : 15" VGA Colour.
• Mouse : Logitech.
• RAM : 512 MB.
SOFTWARE REQUIREMENTS: 
• Operating System : Windows XP/7.
• Coding Language : MATLAB.
• Tool : MATLAB R2007b.
REFERENCE: 
Yao Li, Wenjing Jia, Chunhua Shen, and Anton van den Hengel, “Characterness:
An Indicator of Text in the Wild,” IEEE Transactions on Image Processing,
vol. 23, no. 4, April 2014.
