An Automatic Sentence Parser for Oromo Language Using Supervised Learning Technique
No Thumbnail Available
Date
2002-06
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Addis Ababa University
Abstract
The goal of Informal ion Retrieval has been to reduce human language complexities and as a
result serve users in The mos I efficient way. The decisive in achieving such end is the
Natural language Processing (NLP). NLP has many components in serving such purpose.
Parsing is one of such components in NLP in improving precision and calligraphic is The
goal of Informal ion Retrieval Systems. Moreover, parsing is also used inhere{for warlords
machine Translation which is one of the hear of Natural Language Processing.
Today, difference kinds of parsers have been developed' languages. lhis hare relatively
wider use nationally and/or international/ly since The 1960.1. Un[unalterably Gromo has nol
captured Ihe advanlage of such .Iyslem being Ihe working language of Ihe Slale Government
of Gromiya, and one of Ihe major languages in Elhiopia and Ababa (Abebe 2002) lor Ihere
are no syslems (parsers of any sarI) Ihal parse wril/en lexlS in Ihis language. This siudy is,
Iherefore, an allempl 10 develop a simple aulomalic .lenIence parser for Oromo language
In Ihe sludy, Ihe chari algorilhm 11 '0.1 used lI'ilh some modi/iealion. A module (or
mOlphological analyzer, which splils words inlo roOI form and Iheir wrresponding
morpheme, was also developed in order 10 faeil ilale Ihe preparalion of lexls in a lile 10 be
parsed wilh appropriale lexical calegories. In addition, The unsupervised learning algorilhm
was designed 10 guide The parser in predicting unknown and ambiguous words in a sentence.
Grammar rules, lexicon, morphological rules and lexicon in-formalin were also designed
on The basis of Ihe review Decide on Ihe linguistic propellers of amII/o grumll1alical
categories. This system, facing, is the firslinils kind fiJI' this language.
The study adopts an intelligent (Rule-Based+ learning Inodule) approach to develop a
prototype. which is a simple Drama parser/or the language.
The thesis. in short. describes processes a/automated sentence parsing oj' Free Texts. That
is, it is aimed at developing a prototype and conducting an experimel with it. The result
obtained (95% on the training test and 885% on the test set) using the small manually
parsed sentences encourage birther research to be launched. especially with the aim of
developing fill~fledged Oromo sentence parser.
Description
Keywords
Information Science