Modeling an Automatic Amharic text Summarizer: Abstractive Approach

dc.contributor.advisor: Assabie, Yaregal (PhD)
dc.contributor.author: Abdella, Mohammed
dc.date.accessioned: 2019-11-19T08:32:34Z
dc.date.accessioned: 2023-11-29T04:06:07Z
dc.date.available: 2019-11-19T08:32:34Z
dc.date.available: 2023-11-29T04:06:07Z
dc.date.issued: 2016-09-01
dc.description.abstract: The need for automatic text summarization systems increases as the number of electronic documents on the web that deal with specific information grows. The two basic approaches to text summarization are extractive and abstractive. The extractive approach selects the most important sentences from the input document using different algorithms and presents the selected sentences as a summary of the input document. The abstractive approach tries to generate novel sentences that may not be present in the input document but still represent its main idea. The abstractive approach is based on the semantic representation of the input sentences. This thesis proposes an automatic Amharic text summarizer using an abstractive approach based on the Universal Networking Language (UNL), which is one of the semantic representations of natural language sentences. We use different components that are related to UNL representation. Related sentences in the input document are clustered, and one sentence is generated for each cluster to be used in the summary. Thus, the number of summary sentences is determined by the number of clusters formed from the input document. The text preprocessing stage, which involves processes such as normalization, stop-word removal, and stemming, makes the input data suitable for the clustering component by providing the root forms or stems of the relevant words in an input sentence. The conversion between a natural language sentence and its UNL expression is done using EnConversion or DeConversion rules together with the morphological properties of each word in the input sentence. Another component, UNL analysis, provides the common UNL expression from a group of UNL expressions.
To evaluate the performance of the proposed system, we use Amharic input documents and human evaluators who rate the output on different parameters. The parameters used are the grammar of the summary sentences and the idea represented in the summary. The results of this subjective evaluation of the summary sentences are promising. [en_US]
dc.identifier.uri: http://etd.aau.edu.et/handle/123456789/20142
dc.language.iso: en [en_US]
dc.publisher: Addis Ababa University [en_US]
dc.subject: Text Summarization [en_US]
dc.subject: Universal Networking Language (UNL) [en_US]
dc.subject: Enconversion [en_US]
dc.subject: Deconversion [en_US]
dc.title: Modeling an Automatic Amharic text Summarizer: Abstractive Approach [en_US]
dc.type: Thesis [en_US]

Files

Original bundle
Name:
Mohammed Abdella 2016.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format