Concatenative Speech Synthesis for Amharic Using Unit Selection Method

Bayou, Eyob

Concatenative Speech Synthesis for Amharic Using Unit Selection Method

dc.contributor.advisor	Assabie, Yaregal (PhD)
dc.contributor.author	Bayou, Eyob
dc.date.accessioned	2018-06-19T13:41:43Z
dc.date.accessioned	2023-11-29T04:07:01Z
dc.date.available	2018-06-19T13:41:43Z
dc.date.available	2023-11-29T04:07:01Z
dc.date.issued	2011-06
dc.description.abstract	Speech synthesis takes text as input and generates acoustic signal as output. In the process, the input text is preprocessed to tokenize it into words or other meaningful tokens and to transliterate numbers, abbreviations and acronyms. Text-analysis follows text preprocessing to identify grammatical structures and context. Once the text analysis phase is completed the next step is to convert graphical representation of sounds to their phonetic representation. A phoneme usually has multiple phones that are used in different contexts. Amharic language’s orthography is phonemical in the sense that a grapheme represents exactly one phoneme. However, this statement is true as long as epenthesis and geminations are not considered. The language’s orthography does not also show suprasegmental information that is required to properly model speaking styles. Even though converting grapheme to phoneme is easy in Amharic, converting phoneme to phone is very difficult because of the two necessary and yet orthographically unrepresented components of the language – epenthesis and gemination. Modeling prosodic features of various speaking styles is also the other challenging task in developing Amharic TTS. This is challenging because, in one hand, the task of modeling human speech is very challenging in itself and in the other hand, research works done for Amharic language are relatively few. This project work has tried to address epenthesis and gemination, which are phonologically very important features of the language, by studying and implementing techniques found in various literatures. Making use of orthographic property of verbs in their perfect form, this work introduces rules that can be used to locate phones that need to be stressed. The grapheme to phoneme conversion algorithm also addresses epenthesis. Prosodic differences of declarative and interrogative utterances are represented by making use of unique sentence-final phones recorded and segmented for this purpose. Transliteration of numerals and abbreviations is also addressed in the text preprocessing phase of the system. The results found after being evaluated by ten fluent speakers of the language are encouraging.	en_US
dc.identifier.uri	http://etd.aau.edu.et/handle/123456789/1818
dc.language.iso	en	en_US
dc.publisher	Addis Ababa University	en_US
dc.subject	Concatenative ;Speech Synthesis	en_US
dc.title	Concatenative Speech Synthesis for Amharic Using Unit Selection Method	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Eyob Bayou.pdf
Size:: 800.79 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Plain Text
Description:

Download

Collections

Environmental Science