SlideShare a Scribd company logo
2
Most read
8
Most read
9
Most read
Terry Taewoong Um
fb.com/deeplearningtalk fb.com/terryum
์‚ฌ์ง„ ์† ๊ธ€์ž๋ฅผ ์ฝ์–ด์ฃผ๋Š”
Optical Character
Recognition (OCR)
42
What is OCR?
โ€ข Optical Character Recognition (OCR)
Reading typed/printed/handwritten characters
from image sources
Speech
Recognition
What is OCR?
โ€ข Optical Character Recognition (OCR)
Reading typed/printed/handwritten characters
from image sources
OCR
Why OCR?
characters in the
computer
characters in the
physical world
A
Why OCR?
characters in the
computer
characters in the
physical world
Difficult because of the large variations!
(font, size, shape, location, noise, ...)
OCR vs Object detection
Text
Localization
Text
Recognition
โ€ข OCR
โ€ข Object detection
Object
Localization
Object
Recognition
Detect the bounding
boxes that enclose text Read it
โ€ข OCR is more challenging than object detection due to
- various aspect (W:H) ratio - large distortions
- confusion w/ textures (โ€˜Iโ€™, โ€˜Tโ€™)
- few pretrained models- high density
- various languages
Text Localization Text
Localization
Text
Recognition
์ดํ™œ์„, https://www.slideshare.net/deview/111-ai
regression-based
(like object detection)
end-to-end
[Textboxes, Liao et al., AAAI2017] [PixelLink, Deng et al., AAAI2018]
classification-based
(like semantic segmentation)
[FOTS, Liu et al., CVPR2018]
simultaneous local+recog
# of
papers
training unstable stable
Text Recognition Text
Localization
Text
Recognition
Connectionist
Temporal
Classification
r EOSpt i
t pi<GO> r
Attention
# of
papers
speed
rarely used
accuracy
OCR + Translation = SmartLens
Text
Localization
Text
Recognition
Machine
translation
โ€ข What you need to know is
- Machine learning basics - Neural network basics
- Convolutional Neural Networks (+ advanced topics)
- Recurrent Neural Networks (+ advanced topics)

More Related Content

PPTX
Optical Character Recognition (OCR) based Retrieval
PPTX
Optical Character Recognition( OCR )
PPTX
Text extraction From Digital image
PPT
Text reader [OCR]
PPTX
Optical Character Recognition
PPTX
Optical character recognition (ocr) ppt
PPTX
Basics of-optical-character-recognition
PPTX
OCR Presentation (Optical Character Recognition)
Optical Character Recognition (OCR) based Retrieval
Optical Character Recognition( OCR )
Text extraction From Digital image
Text reader [OCR]
Optical Character Recognition
Optical character recognition (ocr) ppt
Basics of-optical-character-recognition
OCR Presentation (Optical Character Recognition)

What's hot (20)

PPTX
Optical Character Recognition (OCR)
DOCX
Optical character recognition IEEE Paper Study
PDF
Optical Character Recognition (OCR) System
ย 
PPTX
Optical Character Recognition
PDF
Natural language processing
PDF
Handwritten Text Recognition and Digital Text Conversion
ย 
PPT
Speech recognition
PDF
Computer Vision
PPTX
Handwritten character recognition using artificial neural network
PPTX
Information retrieval 9 tf idf weights
PPTX
Handwriting Recognition
DOC
Ocr abstract
PPTX
OCR (Optical Character Recognition)
PPTX
Presentation on OCR
PPTX
NLP_KASHK:Text Normalization
PPTX
Character Recognition using Machine Learning
PDF
Generative adversarial text to image synthesis
PDF
Text Extraction from Image using Python
ย 
PPT
An OCR System for recognition of Urdu text in Nastaliq Font
PDF
Natural Language Processing with Python
Optical Character Recognition (OCR)
Optical character recognition IEEE Paper Study
Optical Character Recognition (OCR) System
ย 
Optical Character Recognition
Natural language processing
Handwritten Text Recognition and Digital Text Conversion
ย 
Speech recognition
Computer Vision
Handwritten character recognition using artificial neural network
Information retrieval 9 tf idf weights
Handwriting Recognition
Ocr abstract
OCR (Optical Character Recognition)
Presentation on OCR
NLP_KASHK:Text Normalization
Character Recognition using Machine Learning
Generative adversarial text to image synthesis
Text Extraction from Image using Python
ย 
An OCR System for recognition of Urdu text in Nastaliq Font
Natural Language Processing with Python
Ad

Similar to A brief introduction to OCR (Optical character recognition) (10)

PPTX
[DSC Europe 23] Paweล‚ Ekk-Cierniakowski - Video transcription with deep learn...
PPTX
[DSC Europe 23] Paweล‚ Ekk-Cierniakowski - Video transcription with deep learn...
PPT
Speech Technology Overview
ย 
PDF
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
KEY
Erlang/OTP for Rubyists
PDF
Deep Learning for NLP: An Introduction to Neural Word Embeddings
PDF
Entering the Fourth Dimension of OCR with Tesseract - Talk from Voxxed Days B...
PPTX
Nltk
PDF
Natural Language Processing (NLP)
PPTX
Matt Feiszli at AI Frontiers : Video Understanding
[DSC Europe 23] Paweล‚ Ekk-Cierniakowski - Video transcription with deep learn...
[DSC Europe 23] Paweล‚ Ekk-Cierniakowski - Video transcription with deep learn...
Speech Technology Overview
ย 
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
Erlang/OTP for Rubyists
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Entering the Fourth Dimension of OCR with Tesseract - Talk from Voxxed Days B...
Nltk
Natural Language Processing (NLP)
Matt Feiszli at AI Frontiers : Video Understanding
Ad

More from Terry Taewoong Um (19)

PDF
AI์—ฐ๊ตฌ์ž๊ฐ€ ๋ทฐํ‹ฐํ…Œํฌ ์ฐฝ์—…ํ•˜์—ฌ ์‹คํŒจํ•˜๋ฉฐ ๋ฐฐ์šด 7๊ฐ€์ง€ ๋ ˆ์Šจ๋“ค // 7 lessons an AI researcher learned fr...
PDF
#44. KAIST์—์„œ "๋Œ€ํ•™ ์œ ์ฃ„"๋ฅผ ์™ธ์น˜๋‹ค: ART Lab์˜ ๋„์ „
PPTX
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynami...
PDF
์ธ๊ณต์ง€๋Šฅ์˜ ์‚ฌํšŒ์ •์˜์˜ ํŽธ์ด ๋  ์ˆ˜ ์žˆ์„๊นŒ? (์ธ๊ณต์ง€๋Šฅ๊ณผ ๋ฒ•)
PDF
Deep learning (Machine learning) tutorial for beginners
PPTX
Deep Variational Bayes Filters (2017)
PDF
On Calibration of Modern Neural Networks (2017)
PPTX
Deep Learning: A Critical Appraisal (2018)
PPTX
Understanding Black-box Predictions via Influence Functions (2017)
PPTX
Learning with side information through modality hallucination (2016)
PPTX
Human Motion Forecasting (Generation) with RNNs
PDF
Deformable Convolutional Network (2017)
PPTX
Introduction to Deep Learning with TensorFlow
PPTX
About Two Motion Planning Papers
PPTX
๊ธฐ๊ณ„ํ•™์Šต(Machine learning) ์ž…๋ฌธํ•˜๊ธฐ
PDF
Lie Group Formulation for Robot Mechanics
PDF
๋กœ๋ด‡๊ณผ ์ธ๊ณต์ง€๋Šฅ, ๊ทธ๋ฆฌ๊ณ  ๋ฏธ๋ž˜์˜ ๋…ธ๋™
PDF
Introduction to Machine Learning and Deep Learning
PPTX
Lie Group Formulation for Robot Mechanics
AI์—ฐ๊ตฌ์ž๊ฐ€ ๋ทฐํ‹ฐํ…Œํฌ ์ฐฝ์—…ํ•˜์—ฌ ์‹คํŒจํ•˜๋ฉฐ ๋ฐฐ์šด 7๊ฐ€์ง€ ๋ ˆ์Šจ๋“ค // 7 lessons an AI researcher learned fr...
#44. KAIST์—์„œ "๋Œ€ํ•™ ์œ ์ฃ„"๋ฅผ ์™ธ์น˜๋‹ค: ART Lab์˜ ๋„์ „
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynami...
์ธ๊ณต์ง€๋Šฅ์˜ ์‚ฌํšŒ์ •์˜์˜ ํŽธ์ด ๋  ์ˆ˜ ์žˆ์„๊นŒ? (์ธ๊ณต์ง€๋Šฅ๊ณผ ๋ฒ•)
Deep learning (Machine learning) tutorial for beginners
Deep Variational Bayes Filters (2017)
On Calibration of Modern Neural Networks (2017)
Deep Learning: A Critical Appraisal (2018)
Understanding Black-box Predictions via Influence Functions (2017)
Learning with side information through modality hallucination (2016)
Human Motion Forecasting (Generation) with RNNs
Deformable Convolutional Network (2017)
Introduction to Deep Learning with TensorFlow
About Two Motion Planning Papers
๊ธฐ๊ณ„ํ•™์Šต(Machine learning) ์ž…๋ฌธํ•˜๊ธฐ
Lie Group Formulation for Robot Mechanics
๋กœ๋ด‡๊ณผ ์ธ๊ณต์ง€๋Šฅ, ๊ทธ๋ฆฌ๊ณ  ๋ฏธ๋ž˜์˜ ๋…ธ๋™
Introduction to Machine Learning and Deep Learning
Lie Group Formulation for Robot Mechanics

Recently uploaded (20)

PPTX
Internet of Things (IOT) - A guide to understanding
DOCX
573137875-Attendance-Management-System-original
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
additive manufacturing of ss316l using mig welding
PDF
Well-logging-methods_new................
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Sustainable Sites - Green Building Construction
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
Digital Logic Computer Design lecture notes
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
Construction Project Organization Group 2.pptx
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
web development for engineering and engineering
Internet of Things (IOT) - A guide to understanding
573137875-Attendance-Management-System-original
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
UNIT-1 - COAL BASED THERMAL POWER PLANTS
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
additive manufacturing of ss316l using mig welding
Well-logging-methods_new................
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Sustainable Sites - Green Building Construction
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Digital Logic Computer Design lecture notes
R24 SURVEYING LAB MANUAL for civil enggi
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Construction Project Organization Group 2.pptx
CYBER-CRIMES AND SECURITY A guide to understanding
web development for engineering and engineering

A brief introduction to OCR (Optical character recognition)

  • 1. Terry Taewoong Um fb.com/deeplearningtalk fb.com/terryum ์‚ฌ์ง„ ์† ๊ธ€์ž๋ฅผ ์ฝ์–ด์ฃผ๋Š” Optical Character Recognition (OCR) 42
  • 2. What is OCR? โ€ข Optical Character Recognition (OCR) Reading typed/printed/handwritten characters from image sources Speech Recognition
  • 3. What is OCR? โ€ข Optical Character Recognition (OCR) Reading typed/printed/handwritten characters from image sources OCR
  • 4. Why OCR? characters in the computer characters in the physical world A
  • 5. Why OCR? characters in the computer characters in the physical world Difficult because of the large variations! (font, size, shape, location, noise, ...)
  • 6. OCR vs Object detection Text Localization Text Recognition โ€ข OCR โ€ข Object detection Object Localization Object Recognition Detect the bounding boxes that enclose text Read it โ€ข OCR is more challenging than object detection due to - various aspect (W:H) ratio - large distortions - confusion w/ textures (โ€˜Iโ€™, โ€˜Tโ€™) - few pretrained models- high density - various languages
  • 7. Text Localization Text Localization Text Recognition ์ดํ™œ์„, https://www.slideshare.net/deview/111-ai regression-based (like object detection) end-to-end [Textboxes, Liao et al., AAAI2017] [PixelLink, Deng et al., AAAI2018] classification-based (like semantic segmentation) [FOTS, Liu et al., CVPR2018] simultaneous local+recog # of papers training unstable stable
  • 8. Text Recognition Text Localization Text Recognition Connectionist Temporal Classification r EOSpt i t pi<GO> r Attention # of papers speed rarely used accuracy
  • 9. OCR + Translation = SmartLens Text Localization Text Recognition Machine translation โ€ข What you need to know is - Machine learning basics - Neural network basics - Convolutional Neural Networks (+ advanced topics) - Recurrent Neural Networks (+ advanced topics)