Addis Ababa University Libraries Electronic Thesis and Dissertations: AAU-ETD! >
School of Information Science and Computer Science >
Thesis - Information Science >

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/1031

Title: Hidden Markov Model Based Large Vocabulary, Speaker Independent, Continuous Amharic Speech Recognition
Advisors: Ato Tesfaye Biru
Ato Solomon Birhanu
Ato Knfe Taddesse
Copyright: 2003
Date Added: 9-May-2008
Publisher: Addis Ababa University
Abstract: This study investigated the possibility of developing large vocabulary, speaker independent and continuous speech recognizer for Amharic language. The recognizer was developed using Hidden Markov Model; and the Hidden Markov Modeling Toolkit was used to implement it. In the process, a corpus developed by Solomon Tefera was used to get the required data for training and testing the models. It was a database comprised of 8000 utterances that were used for training and 500 plus sentences for development and evaluation. The data was preprocessed in line with the requirements of the HTK toolkit. In order to support the acoustic models, a bigram language model was constructed. In addition, pronunciation dictionary was prepared and used as an input. Since the recognizer was meant to recognize large vocabulary and continuous speech, phonemes were opted as the basic unit of recognition. However phonemes are known to be context independent units, given that the environment in which a sound is put makes a difference in the way it is pronounced. Thus after the monophone based speech recognizer was built, it was promoted to triphone based system in which the left and right contexts were considered and modeled. Besides, the mixture components of the states of the models were incremented in view of optimizing the performance of the recognizers.Performance tests were then conducted at various stages using the development and finally using evaluation test data. In the end, a 79% word level correctness, 76.18% word accuracy, and 30.01% sentence level correctness were obtained. The results are encouraging and with more optimization works better results can be achieved. Finally, conclusions were drawn and recommendations were made in line with the analysis and findings
Description: A thesis submitted to the School of Graduate Studies of Addis Ababa University in partial fulfillment of the requirements for the Degree of Master of Science in Information Science.
URI: http://hdl.handle.net/123456789/1031
Appears in:Thesis - Information Science

Files in This Item:

File Description SizeFormat
ZEGAYE SEIFU.pdf555.36 kBAdobe PDFView/Open

Items in the AAUL Digital Library are protected by copyright, with all rights reserved, unless otherwise indicated.


  Last updated: May 2010. Copyright © Addis Ababa University Libraries - Feedback