Signal-based Ethiopian Languages Identification using Gaussian Mixture Model

dc.contributor.advisorMenor, Mr
dc.contributor.authorWondimu, Mikias
dc.date.accessioned2018-06-28T12:01:28Z
dc.date.accessioned2023-11-04T15:14:33Z
dc.date.available2018-06-28T12:01:28Z
dc.date.available2023-11-04T15:14:33Z
dc.date.issued2017-02
dc.description.abstractLanguage Identification (LID) refers to the task of identifying an unknown language from the test utterances. The core problem in solving the language identification (LID) task is to find a way of reducing the complexity of human language such that an automatic algorithm can determine the language and identify it from a relatively brief audio sample. From the review of the existing approaches for LID, it is observed that very few attempts have been made on Language Identification System for African languages. The importance of Language Identification for African languages is seeing a dramatic increase due to the development of telecommunication infrastructure and, as a result, an increase in volumes of data and speech traffic in public networks. By automatically processing the raw speech data the vital assistance given to people in distress can be speeded up, by referring their calls to a person knowledgeable in that language. An LID system for four different Ethiopian languages namely Amharic, Guragegna, Oromiffa and Tigregna is done using Gaussian mixture models (GMM). The system developed here is intended to identify which language is spoken by the speaker from these four languages audio utterances of some phrases for some duration. A dataset consisted of recording of 7 different speakers for each languages were prepared and after preprocessing the database mono channel, the features are extracted using Mel frequency cepstral coefficients (MFCC) and classification is done using GMMs. To test the performance of the LID system experimental scenarios are designed and carried out by taking two, three and four languages at a time. The LID system is tested for both utterance dependent and independent system (i.e. the test is done by taking the same speech for both training and testing (utterance/speech dependent) and also by taking different speech than the training utterance (utterance/speech independent)). It is more challenging to implement and get a better LID system performance with utterance independent system with such a small recorded database. In addition to this the system also tested for the speaker independent system. The utterance dependent LID performance for four language tasks was about 93% accurate and the utterance independent LID performance for four language tasks was about 70% accurate on average. The speaker independent LID system performance for the four language task was about 91%. Keywords: Language identification, Languages, MFCC, GMM, Accuracy, Utteranceen_US
dc.identifier.urihttp://etd.aau.edu.et/handle/123456789/4659
dc.language.isoenen_US
dc.publisherAddis Ababa Universityen_US
dc.subjectLanguage Identificationen_US
dc.subjectLanguagesen_US
dc.subjectAccuracyen_US
dc.subjectUtteranceen_US
dc.subjectMfccen_US
dc.titleSignal-based Ethiopian Languages Identification using Gaussian Mixture Modelen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Mikias Wondimu.pdf
Size:
2.71 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: