Subject and object identification in Malayalam text

  • Authors:
  • Saranya D. Krishnan;R. R. Rajeev;Mary Priya Sebastian;Sherly Elizabeth

  • Affiliations:
  • Rajagiri School of Engineering & Technology Kakkanad, Kerala;VRCLC, IIITM-K Technopark, Trivandrum;Rajagiri School of Engineering & Technology Kakkanad, Kerala;IIITM-K Technopark, Trivandrum

  • Venue:
  • Proceedings of the International Conference on Advances in Computing, Communications and Informatics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Subject and object identification denotes the process of identifying the syntactic subject and object in a sentence while tagging. Identification of subject/object turned out as very effective for many natural language applications such as machine translation, anaphora resolution, relationship extraction etc. For languages with rigid word order, the subject, object and verb in a sentence can be identified by their position. But for languages like Malayalam with relatively free word order the subject/object identification task is more complex and is significant. This paper presents a method for subject and object identification in Malayalam text using statistical tagging approach using HMM and Viterbi algorithm. In this approach, the tagset for part of speech tagging is modified to include tags that distinguish between the subject and object in a sentence.