Hidden Markov model with rule based approach for part of speech tagging of Myanmar language

  • Authors:
  • Khine Khine Zin

  • Affiliations:
  •  

  • Venue:
  • CIT'09 Proceedings of the 3rd International Conference on Communications and information technology
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Part-of-Speech (POS) Tagging is the process of assigning the words with their categories that best suits the definition of the word as well as the context of the sentence in which it is used. There are different approaches to the problem of POS Tagging. In this paper, we use two approaches (rule based and Hidden Markov Model with rule based approach), and compare the performance of these techniques for Tagging using Myanmar language. These approaches use supervised POS Tagging that requires a large amount of annotated training corpus to tag properly. At this initial stage of POS Tagging for Myanmar Language, we have very limited resource of annotated corpus. We tried to see which technique maximizes the performance with this limited resources. By experiments, the best configuration is investigated using HMM with rule based approach and the accuracy is 97.56%. Therefore, this approach has better performance than rule based approach.