Synthetic Speech Trained - Large Vocabulary Amharic Speech Recognition System (SST-LVASR)

dc.contributor.advisorMamo, Mengesha(PhD)
dc.contributor.authorBirile, Mesfin
dc.date.accessioned2018-06-26T07:18:03Z
dc.date.accessioned2023-11-28T14:09:06Z
dc.date.available2018-06-26T07:18:03Z
dc.date.available2023-11-28T14:09:06Z
dc.date.issued2008-07
dc.description.abstractAmharic, the official language of Ethiopia, is characterized by a very large number of morphological word forms. This thesis investigates whether synthesized Amharic speech, generated by concatenating prerecorded morphemes, can be used to train a hidden Markov model (HMM) based automatic speech recognition (ASR) system for Amharic. Developing an HMM-based ASR system requires identifying all possible words and constructing text and speech corpora that contain multiple samples of the words to be recognized by the system. These data are then used as training sets in the development of the models, the final objective being the construction of an HMM for each recognition unit. Because Amharic words have a large number of morphological forms, collecting the words for the text corpus, and recording and labeling the same words for the speech corpus, is extremely difficult. This thesis demonstrates that an automatic morphological expander reduces the effort of developing the text corpus to a manageable level. Additionally, a significant reduction in the effort of developing the speech corpus is achieved by using machine-generated speech to train the HMMs of the ASR system. These reductions in the development effort for the text and speech corpora greatly lower the most prominent obstacles to developing a general-purpose Amharic speech recognizer. The 62.37% word accuracy obtained on naturally recorded speech indicates that a system trained on synthetic speech correctly identifies more than 62% of the words. This suggests that some level of recognition is possible with synthetic training speech and gives impetus for further research into ways of increasing this accuracy.en_US
dc.identifier.urihttp://etd.aau.edu.et/handle/12345678/3510
dc.language.isoenen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectRecognition Systemen_US
dc.subjectAmharic Speechen_US
dc.titleSynthetic Speech Trained - Large Vocabulary Amharic Speech Recognition System (SST-LVASR)en_US
dc.typeThesisen_US

Files

Original bundle
Name:
Mesfin Birile.pdf
Size:
714.96 KB
Format:
Adobe Portable Document Format