Designing A Stemmer For Ge’ez Text Using Rule Based Approach

dc.contributor.advisorAbebe, Ermias (PhD)
dc.contributor.authorBelay, Abebe
dc.date.accessioned2018-11-23T13:51:29Z
dc.date.accessioned2023-11-29T04:57:17Z
dc.date.available2018-11-23T13:51:29Z
dc.date.available2023-11-29T04:57:17Z
dc.date.issued2010-07
dc.description.abstractIn this study, a stemmer of Ge’ez text was developed. In designing processes, different concepts such as background for the thesis, literatures on conflation of the stemming algorithms, morphological nature of Ge’ez language, stemming techniques and other realted things were discussed in order to model and develop an automatic procedure for conflation. When inflectional and derivational morphologies of the language were discussed, affixations such as prefixing, infixing and suffixing are the main word formation processes in Ge’ez language. The language is morphologically complex. This is because different words can be formed due to the wide concatenations of affixes. For the experiment, two techniques were used: affix removal and morphological analysis techniques. To evaluate the stemmer, manually error counting technique was used. From the experiment, three types of errors are observed: over stemmed (6%), under stemmed (4.27%) and structural problems (7.31%). When the stemmer runs on the sample texts, it performed with an accuracy of 82.42%. The dictionary reductions of the stemmer were 29.9% to the stemmed words and 62.8% to root words. Lastly, the possible recommendations to future works and improvements of this work were reported.en_US
dc.identifier.urihttp://etd.aau.edu.et/handle/123456789/14460
dc.language.isoenen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectGe’ez Text Using Rule Baseden_US
dc.titleDesigning A Stemmer For Ge’ez Text Using Rule Based Approachen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Abebe Belay.pdf
Size:
655.64 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: