|
Addis Ababa University Libraries Electronic Thesis and Dissertations: AAU-ETD! >
Faculty of Informatics >
Thesis - Information Science >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/123456789/1031
|
| Title: | Hidden Markov Model Based Large Vocabulary, Speaker Independent, Continuous Amharic Speech Recognition |
| Authors: | ZEGAYE, SEIFU |
| Advisors: | Ato Tesfaye Biru Ato Solomon Birhanu Ato Knfe Taddesse |
| Copyright: | 2003 |
| Date Added: | 9-May-2008 |
| Publisher: | Addis Ababa University |
| Abstract: | This study investigated the possibility of developing large vocabulary, speaker
independent and continuous speech recognizer for Amharic language. The recognizer was
developed using Hidden Markov Model; and the Hidden Markov Modeling Toolkit was
used to implement it.
In the process, a corpus developed by Solomon Tefera was used to get the required data
for training and testing the models. It was a database comprised of 8000 utterances that
were used for training and 500 plus sentences for development and evaluation. The data
was preprocessed in line with the requirements of the HTK toolkit. In order to support the
acoustic models, a bigram language model was constructed. In addition, pronunciation
dictionary was prepared and used as an input.
Since the recognizer was meant to recognize large vocabulary and continuous speech,
phonemes were opted as the basic unit of recognition. However phonemes are known to be
context independent units, given that the environment in which a sound is put makes a
difference in the way it is pronounced. Thus after the monophone based speech recognizer
was built, it was promoted to triphone based system in which the left and right contexts
were considered and modeled. Besides, the mixture components of the states of the models
were incremented in view of optimizing the performance of the recognizers.Performance tests were then conducted at various stages using the development and finally
using evaluation test data. In the end, a 79% word level correctness, 76.18% word
accuracy, and 30.01% sentence level correctness were obtained. The results are
encouraging and with more optimization works better results can be achieved. Finally,
conclusions were drawn and recommendations were made in line with the analysis and
findings |
| Description: | A thesis submitted to the School of Graduate Studies of Addis Ababa
University in partial fulfillment of the requirements for
the Degree of Master of Science in Information Science. |
| URI: | http://hdl.handle.net/123456789/1031 |
| Appears in: | Thesis - Information Science
|
Items in the AAUL Digital Library are protected by copyright, with all rights reserved, unless otherwise indicated.
|