Afaan Oromo Named Entity Recognition Using Hybrid Approach
dc.contributor.advisor | Midekso, Dida (PhD) | |
dc.contributor.author | Sani, Abdi | |
dc.date.accessioned | 2018-06-12T12:13:18Z | |
dc.date.accessioned | 2023-11-04T12:23:46Z | |
dc.date.available | 2018-06-12T12:13:18Z | |
dc.date.available | 2023-11-04T12:23:46Z | |
dc.date.issued | 2015-03 | |
dc.description.abstract | Named Entity Recognition and Classification (NERC) is an essential and challenging task in Natural Language Processing (NLP), particularly for resource scarce language like Afaan Oromo(AO). It seeks to classify words which represent names in text into predefined categories like person name, location, organization, date, time etc.Thus, this paper deals with some attempts in this direction. Mostly researcher have applied Machine Learning for Afaan Oromo Named Entity Recognition(AONER) while no researchers have used hand crafted rules and hybrid approach for Named Entity Recognition(NER) task. This thesis work deals with AONER System using hybrid approach, which contains machine learning(ML) and rule based components. The rule based component has parsing, filtering, grammar rules, whitelist gazetteers, blacklist gazetteers and exact matching components. The ML component has ML model and classifier components. We used General Architecture for Text Engineering (GATE) developer tool for rule based component and Weka in ML part. By using algorithms and rules we developed, we have identified Named Entity (NE) from Afaan Oromo texts, like name of persons, organizations, location, miscellaneous.Feature selection and rules are important factor in recognition of Afaan Oromo Name Entity (AONE). Various rules have been developed like prefix rule, suffix rule, clue word rule, context rule, first name and last name rule. We have used AONER corpus of size 27588, which is developed by Mandefro [1].From this corpus we have used corpus of size 23000 for training and 4588 for testing of our work. And we havean average result of 84.12% Precision, 81.21% Recall and 82.52% F-Score. Keywords: Named Entity Recognition, Named Entities, GATE Developer, Weka, Afaan Oromo | en_US |
dc.identifier.uri | http://etd.aau.edu.et/handle/123456789/547 | |
dc.language.iso | en | en_US |
dc.publisher | Addis Ababa University | en_US |
dc.subject | Named Entity Recognitionᤠamed Entities; Gate Developer; Weka, Afaan Oromo | en_US |
dc.title | Afaan Oromo Named Entity Recognition Using Hybrid Approach | en_US |
dc.type | Thesis | en_US |