SlideShare a Scribd company logo
2
Most read
5
Most read
10
Most read
OPTICAL CHARACTER
RECOGNITION
Divyanshu Sagar
Ahmed Zaid Faizee
Vidyut Singhania
INTRO
1. Ingenious piece of software.
2. Involves the mechanical/electronic
conversion of scanned images of
typewritten/printed text into machine-
encoded/computer-readable text.
• 3. Heavily used in the
industry.
INTRO ii
• Common method of digitizing printed texts
• Subtle software which is as highly overlooked as it is simple.
• Numerous applications and uses – editing, scanning,
searching, comparison, compact storage and many more!
• OCR is a field of research in pattern recognition, artificial
intelligence and computer vision.
Problem Statement
Ever since Charles Babbage invented the computer back in the early 19th
century, Computer machines have held man's imagination for numerous reasons - the
primary being what all is this collection of nuts, bolts and wires capable of doing.
Character Recognition is one such concept which has beheld mankind’s attention. There
can be no greater testimony to the same than the fact that people were already working on
this idea - a few decades before John McCarthy even coined the term "Artificial
Intelligence".
Today, especially, Character Recognition plays a very important part of our daily lives as
they are incorporated so subtly that we even forget their presence. Some examples are
their implementation in Microsoft Word, Adobe Acrobat and even Pen computing.
Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned
or photoed images of typewritten or printed text into machine-encoded/computer-
readable text. This text can then be used in numerous ways - ranging from assisting the
visually impaired (text-to-speech), extracting information from the image, pen computing
and so on. Optical Character Recognition (OCR) is a result of cross-linking various avenues
of technology like Machine Learning, Artificial Intelligence and Neural Networks. We
propose to develop a system based on mathematical algorithms and principles which
involve all the aforementioned technologies. That being said, Optical Character Recognition
(OCR) also depends on a few other factors : the quality of the image taken, the orientation
of and the dialect being used. Our paper aims to address the aforementioned
problems, which enables its application in numerous new fields as well as the obvious &
established aspects of our surroundings.
Tech Jargon - I
• Pre-processing
Used to improve the successful
recognition of the image (include De-
skew, Layout analysis, Despeckle)
• Character/glyph recognition
• Post-processing
• Application specific optimization
Tweaking the system to better deal
with specific or different inputs.
Tech Jargon - II
Segmentation
Includes two important phases:
1) Obtaining training samples
2) Recognizing new images after
training
Feature Extraction
Feature of the character are extracted
and hence are compared with the glyph
Classification
After the extraction, neural network is
trained using the training data
Our Current Progress
• We started with the Neural Networks / Machine Learning
aspect of the project.
• We have implemented Univariate / Multivariate
Linear/Regularized Linear Regression, Gradient Descent for
Multiple Variables and Logistic/ Regularized Logistic
Regression.
• Currently, we are studying & working on the
implementation of Neural Nets using Forward Propogation.
• We plan on tackling character segmentation and feature
extraction next.
Technology to be used
• We are using the following technology
platforms :
– GNU Octave
To develop and test the OCR software.
– 5MP HD camera (720p @ 30fps)
To take images for detection
Timeline
Literature Review
• Microsoft One Note
• Adobe PDF scanner
• HP scanner
Optical Character Recognition (OCR)

More Related Content

PPTX
Optical Character Recognition
PDF
Optical Character Recognition (OCR) System
PPTX
OCR (Optical Character Recognition)
PDF
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
PPTX
OCR Presentation (Optical Character Recognition)
PPTX
Optical Character Recognition( OCR )
PPTX
Optical Character Recognition (OCR) based Retrieval
PPTX
Optical character recognition (ocr) ppt
Optical Character Recognition
Optical Character Recognition (OCR) System
OCR (Optical Character Recognition)
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
OCR Presentation (Optical Character Recognition)
Optical Character Recognition( OCR )
Optical Character Recognition (OCR) based Retrieval
Optical character recognition (ocr) ppt

What's hot (20)

DOCX
Optical character recognition IEEE Paper Study
PPTX
Basics of-optical-character-recognition
PPTX
Presentation on OCR
PPT
Text reader [OCR]
PPTX
Optical Character Recognition
PPTX
Final Report on Optical Character Recognition
DOC
Ocr abstract
PPTX
Handwriting Recognition Using Deep Learning and Computer Version
DOCX
Project report of OCR Recognition
PPTX
Character Recognition using Machine Learning
PPT
optical character recognition system
PDF
Optical Character Recognition Using Python
PPTX
Handwritten Character Recognition
PDF
A brief introduction to OCR (Optical character recognition)
PPTX
Handwriting Recognition
PPTX
Handwritten character recognition using artificial neural network
DOCX
Hand Written Character Recognition Using Neural Networks
PPTX
Object Recognition
PDF
Automated attendance system using Face recognition
PDF
IRJET- Detection and Classification of Skin Diseases using Different Colo...
Optical character recognition IEEE Paper Study
Basics of-optical-character-recognition
Presentation on OCR
Text reader [OCR]
Optical Character Recognition
Final Report on Optical Character Recognition
Ocr abstract
Handwriting Recognition Using Deep Learning and Computer Version
Project report of OCR Recognition
Character Recognition using Machine Learning
optical character recognition system
Optical Character Recognition Using Python
Handwritten Character Recognition
A brief introduction to OCR (Optical character recognition)
Handwriting Recognition
Handwritten character recognition using artificial neural network
Hand Written Character Recognition Using Neural Networks
Object Recognition
Automated attendance system using Face recognition
IRJET- Detection and Classification of Skin Diseases using Different Colo...
Ad

Similar to Optical Character Recognition (OCR) (20)

PPTX
Face Recognition System
PPTX
Intelligent image processing
PPTX
Traffic Violation Detector using Object Detection
PDF
AIDC India - AI Vision Slides
DOCX
Optical character recognization word
PDF
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
PDF
Computer architecture for vision system
PDF
A Deep Learning Approach to Recognize Cursive Handwriting
PDF
IRJET- Sign Language Interpreter
PPTX
ARTIFICIAL INTELLIGENCE for Human beings MORE SLIDES.pptx
PDF
Using Algorithmia to leverage AI and Machine Learning APIs
PDF
IRJET- Object Detection in an Image using Deep Learning
PDF
Optical Recognition of Handwritten Text
PPTX
OCR Presentation hjhPresentation 23.pptx
PDF
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
PDF
IRJET- Intelligent Character Recognition of Handwritten Characters
PPTX
Saksham presentation
PPTX
AI GRPOUP 4 PRESENTATION.pptx
PDF
Utilization of Machine Learning in Computer Vision
PDF
IRJET- Text Recognization of Product for Blind Person using MATLAB
Face Recognition System
Intelligent image processing
Traffic Violation Detector using Object Detection
AIDC India - AI Vision Slides
Optical character recognization word
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Computer architecture for vision system
A Deep Learning Approach to Recognize Cursive Handwriting
IRJET- Sign Language Interpreter
ARTIFICIAL INTELLIGENCE for Human beings MORE SLIDES.pptx
Using Algorithmia to leverage AI and Machine Learning APIs
IRJET- Object Detection in an Image using Deep Learning
Optical Recognition of Handwritten Text
OCR Presentation hjhPresentation 23.pptx
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Intelligent Character Recognition of Handwritten Characters
Saksham presentation
AI GRPOUP 4 PRESENTATION.pptx
Utilization of Machine Learning in Computer Vision
IRJET- Text Recognization of Product for Blind Person using MATLAB
Ad

Recently uploaded (20)

PDF
Empathic Computing: Creating Shared Understanding
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
cuic standard and advanced reporting.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Encapsulation theory and applications.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
MIND Revenue Release Quarter 2 2025 Press Release
Empathic Computing: Creating Shared Understanding
Understanding_Digital_Forensics_Presentation.pptx
Electronic commerce courselecture one. Pdf
Review of recent advances in non-invasive hemoglobin estimation
Dropbox Q2 2025 Financial Results & Investor Presentation
cuic standard and advanced reporting.pdf
20250228 LYD VKU AI Blended-Learning.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Encapsulation theory and applications.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Machine learning based COVID-19 study performance prediction
Unlocking AI with Model Context Protocol (MCP)
Mobile App Security Testing_ A Comprehensive Guide.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
MYSQL Presentation for SQL database connectivity
Spectral efficient network and resource selection model in 5G networks
MIND Revenue Release Quarter 2 2025 Press Release

Optical Character Recognition (OCR)

  • 2. INTRO 1. Ingenious piece of software. 2. Involves the mechanical/electronic conversion of scanned images of typewritten/printed text into machine- encoded/computer-readable text. • 3. Heavily used in the industry.
  • 3. INTRO ii • Common method of digitizing printed texts • Subtle software which is as highly overlooked as it is simple. • Numerous applications and uses – editing, scanning, searching, comparison, compact storage and many more! • OCR is a field of research in pattern recognition, artificial intelligence and computer vision.
  • 4. Problem Statement Ever since Charles Babbage invented the computer back in the early 19th century, Computer machines have held man's imagination for numerous reasons - the primary being what all is this collection of nuts, bolts and wires capable of doing. Character Recognition is one such concept which has beheld mankind’s attention. There can be no greater testimony to the same than the fact that people were already working on this idea - a few decades before John McCarthy even coined the term "Artificial Intelligence". Today, especially, Character Recognition plays a very important part of our daily lives as they are incorporated so subtly that we even forget their presence. Some examples are their implementation in Microsoft Word, Adobe Acrobat and even Pen computing. Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned or photoed images of typewritten or printed text into machine-encoded/computer- readable text. This text can then be used in numerous ways - ranging from assisting the visually impaired (text-to-speech), extracting information from the image, pen computing and so on. Optical Character Recognition (OCR) is a result of cross-linking various avenues of technology like Machine Learning, Artificial Intelligence and Neural Networks. We propose to develop a system based on mathematical algorithms and principles which involve all the aforementioned technologies. That being said, Optical Character Recognition (OCR) also depends on a few other factors : the quality of the image taken, the orientation of and the dialect being used. Our paper aims to address the aforementioned problems, which enables its application in numerous new fields as well as the obvious & established aspects of our surroundings.
  • 5. Tech Jargon - I • Pre-processing Used to improve the successful recognition of the image (include De- skew, Layout analysis, Despeckle) • Character/glyph recognition • Post-processing • Application specific optimization Tweaking the system to better deal with specific or different inputs.
  • 6. Tech Jargon - II Segmentation Includes two important phases: 1) Obtaining training samples 2) Recognizing new images after training Feature Extraction Feature of the character are extracted and hence are compared with the glyph Classification After the extraction, neural network is trained using the training data
  • 7. Our Current Progress • We started with the Neural Networks / Machine Learning aspect of the project. • We have implemented Univariate / Multivariate Linear/Regularized Linear Regression, Gradient Descent for Multiple Variables and Logistic/ Regularized Logistic Regression. • Currently, we are studying & working on the implementation of Neural Nets using Forward Propogation. • We plan on tackling character segmentation and feature extraction next.
  • 8. Technology to be used • We are using the following technology platforms : – GNU Octave To develop and test the OCR software. – 5MP HD camera (720p @ 30fps) To take images for detection
  • 10. Literature Review • Microsoft One Note • Adobe PDF scanner • HP scanner

Editor's Notes

  • #11: In 1914, Emanuel Goldberg developed a machine that read characters and converted them into standard telegraph code. Around the same time, Edmund Fournied'Albe developed the Otophone, a handheld scanner that when moved across a printed page, produced tones that corresponded to specific letters or characters.