POS Tagging for Amharic: A Machine Learning Approach

Sintayehu Hirpassa Kefena; Gurpreet Singh Lehal

pdf

Published: Jun 18, 2020

Sintayehu Hirpassa Kefena

Mr.

Gurpreet Singh Lehal

Abstract

In this paper, our focus is the problem of automatic prediction of Parts of Speech of words in Amharic language sentence. We present an experiment that involves the study and implementation of POS tagging model. Four statistical taggers, i.e. Trigrams’n’Tags (TnT) Tagger, Conditional Random Field taggers (CRF), Naive Bays (NB) and Decision Tree (DT) classifier is applying for a morphologically rich language: Amharic. We compare the performances of all taggers with the same size of training and testing Dataset. Various types of language-dependent and independent feature set have formed, and for each algorithm, a combination of them is applied. Based on such inputs CRF based model has achieved outperformed accuracy. The best accuracy obtained from our experiment is 94.08%. Finally, our study shows that linguistic features play a decisive part in overcoming the limitations of the baseline statistical model for Amharic languages.

How to Cite

Kefena, S. H., & Lehal, G. S. (2020). POS Tagging for Amharic: A Machine Learning Approach. INFOCOMP Journal of Computer Science, 19(1). Retrieved from https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/627

Issue

Vol. 19 No. 1 (2020): June 2020

Section

Machine Learning and Computational Intelligence

Upon receipt of accepted manuscripts, authors will be invited to complete a copyright license to publish the paper. At least the corresponding author must send the copyright form signed for publication. It is a condition of publication that authors grant an exclusive licence to the the INFOCOMP Journal of Computer Science. This ensures that requests from third parties to reproduce articles are handled efficiently and consistently and will also allow the article to be as widely disseminated as possible. In assigning the copyright license, authors may use their own material in other publications and ensure that the INFOCOMP Journal of Computer Science is acknowledged as the original publication place.

Author Biography

Gurpreet Singh Lehal

Professor in Department of Computer Science, Punjabi University

Article Sidebar

Main Article Content

Abstract

Article Details

Gurpreet Singh Lehal