Optical Character Recognition of Typewritten Amharic Text

dc.contributor.advisor: Fissaha, Sisay (PhD)
dc.contributor.advisor: Alemu, Worku (PhD)
dc.contributor.author: Teferi, Dereje
dc.date.accessioned: 2020-06-01T10:22:57Z
dc.date.accessioned: 2023-11-18T12:44:57Z
dc.date.available: 2020-06-01T10:22:57Z
dc.date.available: 2023-11-18T12:44:57Z
dc.date.issued: 1999-05
dc.description.abstract: Optical Character Recognition (OCR) is an area of research in which a system accepts a document image and converts it into ASCII code so that it is easy to store, retrieve, and process further. OCR helps to convert the bulk of information available on paper into an electronically processable format without human intervention, saving time, money, and labor. Recently, Optical Character Recognition for the Amharic script has become an area of research interest. Some progress has been made in recognizing characters of a specific type style, font, and font size. All the trials in this regard have used very high quality laser printouts on white paper. In reality, however, most Amharic documents that need to be converted into machine-readable format are typewritten and on non-white paper. In this study an attempt is made to explore the possibilities of developing an OCR system for typewritten Amharic text. To this end, the features of typewritten Amharic characters are thoroughly studied. Some algorithms for noise removal and segmentation are reviewed. These algorithms are implemented to see their performance on typewritten Amharic text. A previously implemented algorithm for recognition of Amharic characters is modified to incorporate the specific features of typewritten Amharic characters. The segmentation and noise removal algorithms are integrated with this algorithm. The result is tested on typewritten Amharic documents, and the test results are presented. Recommendations are also made to point out issues to be investigated further for the development of a typewritten Amharic OCR system with better performance. [en_US]
dc.identifier.uri: http://etd.aau.edu.et/handle/12345678/21380
dc.language.iso: en [en_US]
dc.publisher: Addis Ababa University [en_US]
dc.subject: Information Science [en_US]
dc.title: Optical Character Recognition of Typewritten Amharic Text [en_US]
dc.type: Thesis [en_US]

Files

Original bundle
Name: Dereje Teferi.pdf
Size: 27.02 MB
Format: Adobe Portable Document Format