Automatic Amharic Factual Question Generation from Historic Text Using Rule Based Approach

dc.contributor.advisorTeferi, Dereje (PhD)
dc.contributor.authorDamtie, Getaneh
dc.date.accessioned2021-11-24T08:21:19Z
dc.date.accessioned2023-11-18T12:47:48Z
dc.date.available2021-11-24T08:21:19Z
dc.date.available2023-11-18T12:47:48Z
dc.date.issued2021-06-28
dc.description.abstractNowadays, due to the availability of digital devices, important educational materials in a variety of languages have become available. However, these texts do not have sufficient amount of practical questions and assessments. Manually preparing meaningful and relevant questions from such materials is a time-consuming and difficult endeavor that necessitates expertise, experience, and resources. This research addresses the problem by automatically generating questions from Amharic texts, with a particular focus on automating the construction of factual questions from text. The automatic Amharic factual question generation systems, which is developed in this research, takes a historical text as input and produces a set of possible questions as output. Historical texts contain various named entities such as names of persons, locations name, cities name, countries name, dates and other entities, which helps to generate many questions. The methodology used in this study is design science. It has six main activities namely, problem identification and motivation, defining objectives, design and development, demonstration, evaluation and communication. The current research used Part of Speech (PoS) tagger and Named Entity Recognition (NER). The PoS aids in the development of NER. The NER was also utilized to identify answer keywords and generate probable question phrases. In addition, informative sentence selection is used to select informative sentences from the text based on NER and using a certain rules. Transformation rules are used to construct questions from sentences. A prototype is developed using python. Human-evaluator is used to evaluate the question generation system. The experimental results showed 86.4% accuracy for PoS tagger, 82.0% accuracy for NER and 95.3% accuracy for relevant sentence selection. The experimental results of each question type got 94.1% accuracy for “ስንት” (how much/many), 91.6% accuracy for “ማን” (who), 83.3% accuracy for “መቼ” (When) and 73.0% accuracy for “የት” (where). The overall question generation system come up with 84.6% of accuracy. This shows that the system has high accuracy in question type “ስንት” (how much/many) and needs some improvement in question type “የት” (where). The system gives a good results for some question types. Accordingly, it is concluded that the system gives a good accuracy for a good coverage of domain specific datasets and also defining more rules by adding more word classes. For future works, forming new rules to improve the existing rules by adding more word classes, handling exceptions, preparing more domain specific training datasets, preparing common automatic question generation architecture and evaluation techniques are recommended.en_US
dc.identifier.urihttp://etd.aau.edu.et/handle/12345678/28934
dc.language.isoenen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectAutomatic Question Generationen_US
dc.subjectFactual Questionsen_US
dc.subjectNatural Language Generationen_US
dc.titleAutomatic Amharic Factual Question Generation from Historic Text Using Rule Based Approachen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Getaneh Damtie 2021.pdf
Size:
2.06 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: