A Generic Approach towards All Words Amharic Word Sense Disambiguation

dc.contributor.advisorMeshesha(PhD), Million
dc.contributor.authorSiraj Bekeli, Dureti
dc.date.accessioned2018-11-09T09:15:24Z
dc.date.accessioned2023-11-18T12:45:18Z
dc.date.available2018-11-09T09:15:24Z
dc.date.available2023-11-18T12:45:18Z
dc.date.issued2017-02-05
dc.description.abstractSense disambiguation is an “intermediate task” which is helpful in other NLP tasks like machine translation, information retrieval and hypertext navigation, content and thematic analysis, grammatical analysis, speech processing and text processing. This study attempts to explore a more general approach to develop a WSD for Amharic language. To this end, a WSD system that identifies a sense of an Amharic ambiguous word by using information from tagged example sentences and Word-Net is developed. The system identifies the sense by measuring similarity between the input sentence and tagged example sentences. Two similarity measures are explored: Cosine similarity and Jaccard Coefficient similarity measure. We have collected 100 example sentences for each sense of the selected Amharic ambiguous words. The Word-Net is composed of words with their sysnonyms and gloss definition. The performance of the system is tested using 9 nouns, 3 verbs, 3 adjectives and 2 adverbs, a total 17 words which are selected randomly. The experiments were done for disambiguating one target word in a given text.The experimental step is designed in such a way that, first the performance of Cosine similarity and Jaccard coefficient are checked individually for WSD, next Lesk algorithm is tested on the third experiment and then experiments were conducted to check the performance of the two similarity measures as combined with Lesk algorithm. The result showed that Jaccard coefficient combined with Lesk algorithm come up with the highest result, which is 89.83% accuracy. The major challenge during the disambiguation process is that for those words that are frequently collocated with similar words in their different senses the system come up with a least accuracy.en_US
dc.identifier.urihttp://etd.aau.edu.et/handle/12345678/14026
dc.language.isoenen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectAll Words Amharic Word Senseen_US
dc.titleA Generic Approach towards All Words Amharic Word Sense Disambiguationen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Dureti Siraj_2017.pdf
Size:
1.99 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: