An Automatic Sentence Parser for Oromo Language Using Supervised Learning Technique

Date

2002-06

Publisher

Addis Ababa University

Abstract

The goal of Information Retrieval has been to reduce human language complexities and, as a result, serve users in the most efficient way. The decisive factor in achieving such an end is Natural Language Processing (NLP). NLP has many components that serve this purpose. Parsing is one such component, helping to improve precision and recall, which is the goal of Information Retrieval systems. Moreover, parsing is also used in the effort towards machine translation, which is at the heart of Natural Language Processing. Today, different kinds of parsers have been developed for languages that have had relatively wider use nationally and/or internationally since the 1960s. Unfortunately, Oromo has not captured the advantage of such systems, despite being the working language of the State Government of Oromiya and one of the major languages in Ethiopia and Africa (Abebe 2002), for there are no systems (parsers of any sort) that parse written texts in this language. This study is, therefore, an attempt to develop a simple automatic sentence parser for the Oromo language. In the study, the chart algorithm was used with some modification. A module for a morphological analyzer, which splits words into their root form and corresponding morphemes, was also developed in order to facilitate the preparation of texts in a file to be parsed with appropriate lexical categories. In addition, the unsupervised learning algorithm was designed to guide the parser in predicting unknown and ambiguous words in a sentence. Grammar rules, a lexicon, morphological rules and lexicon information were also designed on the basis of the review made of the linguistic properties of Oromo grammatical categories. This system, in fact, is the first of its kind for this language. The study adopts an intelligent (rule-based + learning module) approach to develop a prototype, which is a simple Oromo parser for the language. The thesis, in short, describes processes of automated sentence parsing of free texts. That is, it is aimed at developing a prototype and conducting an experiment with it. The results obtained (95% on the training set and 88.5% on the test set) using the small set of manually parsed sentences encourage further research to be launched, especially with the aim of developing a full-fledged Oromo sentence parser.

Keywords

Information Science
