Concept -Based Automatic Amharic Document Categorization

dc.contributor.advisorYacob, Daniel (PhD) )
dc.contributor.advisorLibsie, Mulugeta (PhD)
dc.contributor.authorSahlemariam, Meron
dc.date.accessioned2020-06-12T08:27:04Z
dc.date.accessioned2023-11-18T12:45:45Z
dc.date.available2020-06-12T08:27:04Z
dc.date.available2023-11-18T12:45:45Z
dc.date.issued2009-01
dc.description.abstractAlong with the continuously growing volume of information availability, there is a growing interest towards better solutions for finding, filtering and organizing these resources. Automatic text categorization can play an important role in a wide variety of more flexible, dynamic , and personalized information management tasks. The process of automatic text categorization involves calculating similarities between documents and categories using the information extracted from the document. In recent years, ontology-based document categorization method is introduced to solve the problem of document classifier. Previous works on keyword-based document categorization miss some important issues of considering semantic relationships between words. In order to resolve the existing problems, this study proposes a framework that automatically categorizes Amharic documents into predefined categories using knowledge represented in the News ontology. At the heart of the classification system is the knowledge base that enables the representation of different domain concepts. During the classification process, all the documents pass through pre-processing stages. Then index terms are extracted from a given document which is mapped onto their corresponding concepts in the ontology. Finally, the selected document is classified into a predefined category, based on the weighted concept. With the help of News domain entomologist, this study categorizes a given Amharic document into a specific predefined category . The study shows that the use of concepts for Amharic document categorize results in 92.9% accuracy which is a promising outcome. Keywords: Ontology, Keyword-based, Concept-based text categorization, Knowledge representation .en_US
dc.identifier.urihttp://etd.aau.edu.et/handle/12345678/21533
dc.language.isoenen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectOntologyen_US
dc.subjectKeyword-baseden_US
dc.subjectConcept-based text categorizationen_US
dc.subjectKnowledge representationen_US
dc.titleConcept -Based Automatic Amharic Document Categorizationen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Meron Sahlemariam.pdf
Size:
37.46 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: