Amharic Named Entity Recognition Using Neural Word Embedding as a Feature

No Thumbnail Available

Date

2017-10

Journal Title

Journal ISSN

Volume Title

Publisher

AAU

Abstract

In this paper, Amharic Named Entity Recognition problem is addressed by employing a semi-supervised learning approach based on neural networks. The proposed approach aims at automating manual feature design and avoiding dependency on other natural language processing tasks for classi cation features. In this work potential feature information represented as word vectors are generated using neural network from unlabeled Amharic text les. These generated features are used as features for Amharic Named entity classi cation. SVM, J48, random tree, IBk(Instance based learning with parameter k), attribute selected and OneR(one rule) classi ers are tested with word vector features. Additionally BLSTM(bi-directional long short term memory), LSTM(long short term memory) and MLP(multi layer perceptron) deep neural networks are also tested to investigate the impact of proposed approach. From the experiments the highest F-score achieved was 95.5% using the SVM classi er. Relative to state-of-the-art approaches (SVM and J48) an average F-score improvement of 3.95% was achieved. The results showed that, automatically learned word features can substitute manually designed features for Amharic named entity recognition. Also these features has given better performance while reducing the e ort in manual feature design.

Description

Keywords

Amharic Named Entity Recognition, Neural Word Embedding, Deep Neural Networks, Word2vec, Skip-gram Model, Natural Language Processing

Citation