Amharic Document Categorization Using Itemsets Method
No Thumbnail Available
Date
2013-02
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Addis Ababa University
Abstract
Document categorization or document classification is the process of assigning a document to
one or more classes or categories. Many researches are conducted in the area of Amharic
document categorization. The main focus of those studies is to examine different document
categorization techniques and measuring their performance however itemsets method is not so
far examined. This study focused to extend Apriori algorithm which is traditionally used for
the purpose of knowledge mining in the form of association rules.
The research focused on the basic principles of applying itemsets method to categorize
Amharic documents. In addition to that the implementation of all the required tools which
helps to carry out automatic Amharic Document categorization using itemsets method is
developed and the algorithm is examined. Experiment results show itemsets method is an
efficient method to categorize Amharic documents. The effectiveness and accuracy of the
method to categorize Amharic documents is also evaluated and reported. Finally, factors
affecting the performance of the proposed system and the importance of preprocessing training
dataset in finding useful information are discussed.
Description
Keywords
Itemsets; Method