SlideShare a Scribd company logo
OCR Algorithm for Ge’ez
Characters
By:
Awet Haileslassie
Hadush Hailu
Mulu Hailemariam
Negash Desalegn
Contents
፩. Introduction
፪. What is OCR
፫. When and Why OCR
፬. What motivates US
፭. Problem Overview
፮. Proposed solution
፯. Results
፰. Conclusion
፱.
Introduction
 Ge'ez (ግዕዝ) (also known as Ethiopic)
is a script used as an abugida for several languages of Ethiopia
and Eritrea. It was first used to write Ge'ez, now the language of
the Ethiopian Orthodox Tewahedo Church and the Eritrean
Orthodox Tewahedo Church. In Amharic and Tigrinya, the script is
often called fidäl (ፊደል), meaning "script" or "alphabet".
 The Ge'ez script has been adapted to write other, mostly
Semitic, languages, particularly Amharic in Ethiopia, and
Tigrinya in both Eritrea and Ethiopia.
What is OCR
 OCR stand for Optical Character Recognition is a
technology that is used to translate scanned images of
text into computer editable and searchable text.
 • Input: scanned images of printed text
• Output: Computer readable version of input contents
When and Why OCR?
 OCR is used when recreating a similar document in paper
as a document in electronic form takes more time.
 Since we Ethiopian have many historical and religious
books, we have to save them and recreate them. HOW??
 The converted text files take less space than the original
image file.
Problem Overview
 In the running world there is a growing demand for the users to
convert the printed documents in to electronic documents for
maintaining the security of their data. Hence the basic OCR system
was invented to convert the data available on papers in to computer
process able documents, So that the documents can be editable and
reusable.
 It won't be an exaggeration to claim that Ethiopia's intellectual
property is hardly digitized; and is stored in paper, that is in the form
of century old parchment paper in monasteries or in the form of file
cabinets in various regional and federal offices.
 Digitizing these a number of documents by hand is not a feasible.
What Motivates Us
 The absences of locally and or internationally
developed single production for OCR software for
Ge’ez characters.
 The language is not supported by ASCII standard
to use it on the computer.
Proposed Solution
 Our proposed system is OCR Supports in identifying
and digitizing documents made up of Ethiopian
characters.
 By using OCR technology and application Artificial
Neural Network(ANN) we are going to develop an
application software that helps us to recognize
Ge'ez characters from a given images.
Results
Conclusion
 OCR system for Ge’ez Characters can be efficiently used
to digitize:
 Ethiopian books reside in Ethiopia and countries outside
Ethiopia
 Old books of EOTC
 Many other documents written in Ge’ez or Amharic as well
as Tigrigna.
Thank You!!

More Related Content

PDF
ስርዓተ ቅዳሴ (Kidase english-tigrinya-geez)
DOCX
Project report of OCR Recognition
PPT
How To Write A Prospectus For A Research Paper
PDF
CSR MEC600 ENGINEERS IN SOCIETY MAY- JULY 2021
PDF
Project sample
PDF
Optical character recognition for Ge'ez characters
DOCX
Projects Completed at the University of Manchester
PPTX
Machine learning
ስርዓተ ቅዳሴ (Kidase english-tigrinya-geez)
Project report of OCR Recognition
How To Write A Prospectus For A Research Paper
CSR MEC600 ENGINEERS IN SOCIETY MAY- JULY 2021
Project sample
Optical character recognition for Ge'ez characters
Projects Completed at the University of Manchester
Machine learning

Similar to Ocr algorithm for ge’ez characters (20)

PPTX
Optical Character Recognition (OCR) based Retrieval
PDF
D017222226
PDF
Optical Character Recognition (OCR) System
PDF
Volume 2-issue-6-2009-2015
PDF
Volume 2-issue-6-2009-2015
PDF
A Detailed Study And Recent Research On OCR
PDF
Correcting optical character recognition result via a novel approach
PDF
Optical Character Recognition from Text Image
PDF
RECOGNITION OF HANDWRITTEN MEITEI MAYEK SCRIPT BASED ON TEXTURE FEATURE
PDF
Recognition Of Handwritten Meitei Mayek Script Based On Texture Feature
DOCX
A detailed study and recent research on handwritten recognition
PDF
Bj35343348
PDF
E017322833
PDF
A Review of Optical Character Recognition System for Recognition of Printed Text
PDF
A Survey Paper on Character Recognition
PDF
50120130406005
PPTX
Optical Character Recognition (OCR)
PPTX
Team-98 research paper presentation.pptx
PDF
ocrppt-140415204404-phpapp01.pdf
PDF
CRC Final Report
Optical Character Recognition (OCR) based Retrieval
D017222226
Optical Character Recognition (OCR) System
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
A Detailed Study And Recent Research On OCR
Correcting optical character recognition result via a novel approach
Optical Character Recognition from Text Image
RECOGNITION OF HANDWRITTEN MEITEI MAYEK SCRIPT BASED ON TEXTURE FEATURE
Recognition Of Handwritten Meitei Mayek Script Based On Texture Feature
A detailed study and recent research on handwritten recognition
Bj35343348
E017322833
A Review of Optical Character Recognition System for Recognition of Printed Text
A Survey Paper on Character Recognition
50120130406005
Optical Character Recognition (OCR)
Team-98 research paper presentation.pptx
ocrppt-140415204404-phpapp01.pdf
CRC Final Report
Ad

Recently uploaded (20)

PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPT
Project quality management in manufacturing
PPTX
Safety Seminar civil to be ensured for safe working.
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
PPT on Performance Review to get promotions
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
Current and future trends in Computer Vision.pptx
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Construction Project Organization Group 2.pptx
PPTX
Lecture Notes Electrical Wiring System Components
PPT
Mechanical Engineering MATERIALS Selection
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Project quality management in manufacturing
Safety Seminar civil to be ensured for safe working.
UNIT 4 Total Quality Management .pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPT on Performance Review to get promotions
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Current and future trends in Computer Vision.pptx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Foundation to blockchain - A guide to Blockchain Tech
Construction Project Organization Group 2.pptx
Lecture Notes Electrical Wiring System Components
Mechanical Engineering MATERIALS Selection
Automation-in-Manufacturing-Chapter-Introduction.pdf
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
Ad

Ocr algorithm for ge’ez characters

  • 1. OCR Algorithm for Ge’ez Characters By: Awet Haileslassie Hadush Hailu Mulu Hailemariam Negash Desalegn
  • 2. Contents ፩. Introduction ፪. What is OCR ፫. When and Why OCR ፬. What motivates US ፭. Problem Overview ፮. Proposed solution ፯. Results ፰. Conclusion ፱.
  • 3. Introduction  Ge'ez (ግዕዝ) (also known as Ethiopic) is a script used as an abugida for several languages of Ethiopia and Eritrea. It was first used to write Ge'ez, now the language of the Ethiopian Orthodox Tewahedo Church and the Eritrean Orthodox Tewahedo Church. In Amharic and Tigrinya, the script is often called fidäl (ፊደል), meaning "script" or "alphabet".  The Ge'ez script has been adapted to write other, mostly Semitic, languages, particularly Amharic in Ethiopia, and Tigrinya in both Eritrea and Ethiopia.
  • 4. What is OCR  OCR stand for Optical Character Recognition is a technology that is used to translate scanned images of text into computer editable and searchable text.  • Input: scanned images of printed text • Output: Computer readable version of input contents
  • 5. When and Why OCR?  OCR is used when recreating a similar document in paper as a document in electronic form takes more time.  Since we Ethiopian have many historical and religious books, we have to save them and recreate them. HOW??  The converted text files take less space than the original image file.
  • 6. Problem Overview  In the running world there is a growing demand for the users to convert the printed documents in to electronic documents for maintaining the security of their data. Hence the basic OCR system was invented to convert the data available on papers in to computer process able documents, So that the documents can be editable and reusable.  It won't be an exaggeration to claim that Ethiopia's intellectual property is hardly digitized; and is stored in paper, that is in the form of century old parchment paper in monasteries or in the form of file cabinets in various regional and federal offices.  Digitizing these a number of documents by hand is not a feasible.
  • 7. What Motivates Us  The absences of locally and or internationally developed single production for OCR software for Ge’ez characters.  The language is not supported by ASCII standard to use it on the computer.
  • 8. Proposed Solution  Our proposed system is OCR Supports in identifying and digitizing documents made up of Ethiopian characters.  By using OCR technology and application Artificial Neural Network(ANN) we are going to develop an application software that helps us to recognize Ge'ez characters from a given images.
  • 10. Conclusion  OCR system for Ge’ez Characters can be efficiently used to digitize:  Ethiopian books reside in Ethiopia and countries outside Ethiopia  Old books of EOTC  Many other documents written in Ge’ez or Amharic as well as Tigrigna.