Word Sense Disambiguation for Afaan Oromo Language

dc.contributor.advisor: Midekso, Dida (PhD)
dc.contributor.author: Kebede, Tesfa
dc.date.accessioned: 2018-06-25T12:41:47Z
dc.date.accessioned: 2023-11-04T12:22:32Z
dc.date.available: 2018-06-25T12:41:47Z
dc.date.available: 2023-11-04T12:22:32Z
dc.date.issued: 2013-11
dc.description.abstract: This thesis presents research on Word Sense Disambiguation (WSD) for the Afaan Oromo language. It takes a corpus-based approach in which supervised machine learning techniques are applied to an Afaan Oromo corpus to acquire disambiguation information automatically. Naïve Bayes' theorem is applied to find the prior probability and likelihood ratio of each sense in the given context. Because no sense-annotated text was available for this kind of study, a total of 1240 Afaan Oromo sense examples were collected for five selected ambiguous words: sanyii, karaa, horii, sirna and qoqhii. The sense examples were manually tagged with their correct senses, preprocessed for experimentation, and then used as the corpus for disambiguation. A standard approach to WSD is to consider the context of the ambiguous word and use information from its neighboring or collocated words. The contextual features used in this thesis were co-occurrence features, which indicate whether a word occurs within some number of words to the left or right of the ambiguous word. The system was evaluated with k-fold cross-validation, a statistical technique, using standard performance evaluation metrics. The achieved result was encouraging, but further experiments on other ambiguous words and with different approaches will be needed for better natural language understanding of Afaan Oromo. Keywords: Natural Language Processing, Word Sense Disambiguation, Supervised Learning Method, Naïve Bayes' theorem (en_US)
dc.identifier.uri: http://etd.aau.edu.et/handle/123456789/3286
dc.language.iso: en (en_US)
dc.publisher: Addis Ababa University (en_US)
dc.subject: Natural Language Processing (en_US)
dc.subject: Word Sense Disambiguation (en_US)
dc.subject: Supervised Learning Method (en_US)
dc.subject: Naïve Bayes' Theorem (en_US)
dc.title: Word Sense Disambiguation for Afaan Oromo Language (en_US)
dc.type: Thesis (en_US)
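
The approach summarized in the abstract (co-occurrence features taken from a window around the ambiguous word, scored with Naïve Bayes) can be sketched roughly as follows. This is a minimal illustration and not the thesis implementation: the names window_features and NaiveBayesWSD, the window size of 3, the add-one smoothing, the sense labels sense_a and sense_b, and the placeholder context tokens (c1, d1, ...) are all assumptions made for the example. Only the target word karaa comes from the record above.

```python
import math
from collections import Counter, defaultdict


def window_features(tokens, target, size=3):
    """Return the words within `size` positions left or right of `target`."""
    i = tokens.index(target)
    return tokens[max(0, i - size):i] + tokens[i + 1:i + 1 + size]


class NaiveBayesWSD:
    """Multinomial Naive Bayes over context words (add-one smoothing assumed)."""

    def fit(self, examples):
        # examples: list of (context_words, sense_label) pairs
        self.sense_counts = Counter()
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for words, sense in examples:
            self.sense_counts[sense] += 1
            self.word_counts[sense].update(words)
            self.vocab.update(words)
        self.total = sum(self.sense_counts.values())
        return self

    def predict(self, words):
        best_sense, best_score = None, float("-inf")
        v = len(self.vocab)
        for sense, count in self.sense_counts.items():
            # log prior P(sense) plus summed log likelihoods P(word | sense)
            score = math.log(count / self.total)
            denom = sum(self.word_counts[sense].values()) + v
            for w in words:
                score += math.log((self.word_counts[sense][w] + 1) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# Hypothetical tagged examples: placeholder tokens stand in for real context words.
train_sentences = [
    (["c1", "c2", "karaa", "c3"], "sense_a"),
    (["c2", "karaa", "c3", "c1"], "sense_a"),
    (["d1", "karaa", "d2", "d3"], "sense_b"),
    (["d2", "d3", "karaa", "d1"], "sense_b"),
]
examples = [(window_features(t, "karaa"), s) for t, s in train_sentences]
model = NaiveBayesWSD().fit(examples)
predicted = model.predict(window_features(["c1", "karaa", "c2"], "karaa"))
print(predicted)
```

Working in log space avoids numeric underflow when many context words are multiplied together. A k-fold evaluation like the one mentioned in the abstract would repeat fit and predict on rotating train/test splits of the tagged examples.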

Files

Original bundle
Name: Tesfa Kebede.pdf
Size: 1.34 MB
Format: Adobe Portable Document Format