Probabilistic Information Retrieval System for Amharic Language

dc.contributor.advisor: Meshesha, Million (PhD)
dc.contributor.author: Hirpa, Amanuel
dc.date.accessioned: 2018-11-23T15:09:18Z
dc.date.accessioned: 2023-11-29T04:57:23Z
dc.date.available: 2018-11-23T15:09:18Z
dc.date.available: 2023-11-29T04:57:23Z
dc.date.issued: 2012-06
dc.description.abstract: A considerable amount of information is now being produced in Ethiopia. This accumulation makes archiving and searching difficult, particularly for the large volume of text written in Amharic. An information retrieval (IR) system for Amharic lets users search for and retrieve relevant documents that satisfy their information needs. A few such IR systems have been developed. However, they have not achieved promising performance, because they are based on the vector space model, which has no mechanism for refining the user's information need through relevance feedback and query reformulation unless other modules are integrated. Furthermore, the model does not account for the uncertainty inherent in IR systems. To address these issues, a probabilistic retrieval model, which can reweight query terms based on relevance feedback, can be used. In this research, a probabilistic IR system is developed for Amharic. Both indexing and searching modules were constructed. These modules include text operations such as tokenization, normalization, stemming, and stop word removal. The retrieval system was then tested, and the experimental results show that the probabilistic IR system returned encouraging results even without handling the synonymous and polysemous terms that occur in Amharic text. The system registered an average F-measure of 73%. Nevertheless, its performance is greatly affected by synonymous and polysemous terms in the language, as well as by its rich morphology (variant word forms). Keywords: Information Retrieval, Probabilistic Model, Amharic Language. [en_US]
dc.identifier.uri: http://etd.aau.edu.et/handle/123456789/14478
dc.language.iso: en [en_US]
dc.publisher: Addis Ababa University [en_US]
dc.subject: Information Retrieval [en_US]
dc.subject: Probabilistic Model [en_US]
dc.subject: Amharic Language [en_US]
dc.title: Probabilistic Information Retrieval System for Amharic Language [en_US]
dc.type: Thesis [en_US]

Files

Original bundle
Name: Amanuel Hirpa.pdf
Size: 2.65 MB
Format: Adobe Portable Document Format