Repository logo
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Srpski (lat)
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Српски
  • Yкраї́нська
  • Log In
    New user? Click here to register. Have you forgotten your password?
Repository logo
  • Colleges, Institutes & Collections
  • Browse AAU-ETD
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Srpski (lat)
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Српски
  • Yкраї́нська
  • Log In
    New user? Click here to register. Have you forgotten your password?
  1. Home
  2. Browse by Author

Browsing by Author "Tulu, Gelan"

Now showing 1 - 2 of 2
Results Per Page
Sort Options
  • No Thumbnail Available
    Item
    Bidirectional Amharic-Afaan Oromo Machine Translation Using Hybrid Approach
    (Addis Ababa University, 2020-03-03) Tulu, Gelan; Midekso, Dida (PhD)
    Machine translation is the area of Natural Language Processing (NLP) that focuses on obtaining a target language text from a source language text by means of automatic techniques. Machine translation is a multidisciplinary field and the challenge has been approached from various points of view including linguistic and statistics. Hybrid methods focus on combining the best properties of two or more machine translation approaches. Nowadays, it has become very popular to include rules in statistical machine translation approaches. In this study, a bidirectional Amharic-Afaan Oromo machine translation system using hybrid approach has been developed. The system has four components: sentence reordering, language model, decoding and translation model. The sentence reordering is used to pre-process the structure of the source language to be more similar to the structure of the target language by using their Part of Speech (POS) tagging and to better guide the statistical engine. Since there are no publicly available POS tagger tools for both Amharic and Afaan Oromo languages, tagged corpus is prepared manually. The linguistic background and nature of the two languages have been studied in order to design the reordering rules for different types of Amharic/Afaan Oromo phrases and sentences. Language models by using IRSTLM tool and translation models by using GIZA++ have been developed for Afaan Oromo and Amharic languages because the system is bidirectional. A decoder has been used to find the best translation in the target language (Amharic/Afaan Oromo) for a given source language (Afaan Oromo/Amharic) based on the translation and language models. To check the accuracy of the system, two experiments were conducted using two different approaches. The first experiment is conducted by using a statistical approach to translate Amharic to Afaan Oromo and vice versa and has a BLEU score of 89.39% and 80.33% respectively. The second experiment is carried out by using a hybrid approach and has a BLEU score of 91.56% and 82.24% for Amharic to Afaan Oromo and Afaan Oromo to Amharic translation respectively. The result shows that the hybrid approach is slightly better than the statistical approach.
  • No Thumbnail Available
    Item
    Bidirectional Amharic-Afaan Oromo Machine Translation Using Hybrid Approach
    (Addis Ababa University, 3/3/2020) Tulu, Gelan; Midekso, Dida (PhD)
    Machine translation is the area of Natural Language Processing (NLP) that focuses on obtaining a target language text from a source language text by means of automatic techniques. Machine translation is a multidisciplinary field and the challenge has been approached from various points of view including linguistic and statistics. Hybrid methods focus on combining the best properties of two or more machine translation approaches. Nowadays, it has become very popular to include rules in statistical machine translation approaches. In this study, a bidirectional Amharic-Afaan Oromo machine translation system using hybrid approach has been developed. The system has four components: sentence reordering, language model, decoding and translation model. The sentence reordering is used to pre-process the structure of the source language to be more similar to the structure of the target language by using their Part of Speech (POS) tagging and to better guide the statistical engine. Since there are no publicly available POS tagger tools for both Amharic and Afaan Oromo languages, tagged corpus is prepared manually. The linguistic background and nature of the two languages have been studied in order to design the reordering rules for different types of Amharic/Afaan Oromo phrases and sentences. Language models by using IRSTLM tool and translation models by using GIZA++ have been developed for Afaan Oromo and Amharic languages because the system is bidirectional. A decoder has been used to find the best translation in the target language (Amharic/Afaan Oromo) for a given source language (Afaan Oromo/Amharic) based on the translation and language models. To check the accuracy of the system, two experiments were conducted using two different approaches. The first experiment is conducted by using a statistical approach to translate Amharic to Afaan Oromo and vice versa and has a BLEU score of 89.39% and 80.33% respectively. The second experiment is carried out by using a hybrid approach and has a BLEU score of 91.56% and 82.24% for Amharic to Afaan Oromo and Afaan Oromo to Amharic translation respectively. The result shows that the hybrid approach is slightly better than the statistical approach.

Home |Privacy policy |End User Agreement |Send Feedback |Library Website

Addis Ababa University © 2023