Designing A Stemmer For Ge’ez Text Using Rule Based Approach

Belay, Abebe

Designing A Stemmer For Ge’ez Text Using Rule Based Approach

dc.contributor.advisor	Abebe, Ermias (PhD)
dc.contributor.author	Belay, Abebe
dc.date.accessioned	2018-11-23T13:51:29Z
dc.date.accessioned	2023-11-29T04:57:17Z
dc.date.available	2018-11-23T13:51:29Z
dc.date.available	2023-11-29T04:57:17Z
dc.date.issued	2010-07
dc.description.abstract	In this study, a stemmer of Ge’ez text was developed. In designing processes, different concepts such as background for the thesis, literatures on conflation of the stemming algorithms, morphological nature of Ge’ez language, stemming techniques and other realted things were discussed in order to model and develop an automatic procedure for conflation. When inflectional and derivational morphologies of the language were discussed, affixations such as prefixing, infixing and suffixing are the main word formation processes in Ge’ez language. The language is morphologically complex. This is because different words can be formed due to the wide concatenations of affixes. For the experiment, two techniques were used: affix removal and morphological analysis techniques. To evaluate the stemmer, manually error counting technique was used. From the experiment, three types of errors are observed: over stemmed (6%), under stemmed (4.27%) and structural problems (7.31%). When the stemmer runs on the sample texts, it performed with an accuracy of 82.42%. The dictionary reductions of the stemmer were 29.9% to the stemmed words and 62.8% to root words. Lastly, the possible recommendations to future works and improvements of this work were reported.	en_US
dc.identifier.uri	http://etd.aau.edu.et/handle/123456789/14460
dc.language.iso	en	en_US
dc.publisher	Addis Ababa University	en_US
dc.subject	Ge’ez Text Using Rule Based	en_US
dc.title	Designing A Stemmer For Ge’ez Text Using Rule Based Approach	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Abebe Belay.pdf
Size:: 655.64 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Plain Text
Description:

Download

Collections

Health Informatics