Inflectional Morphology Analyzer for Sanskrit

  • Authors:
  • Girish Nath Jha;Muktanand Agrawal; Subash;Sudhir K. Mishra;Diwakar Mani;Diwakar Mishra;Manji Bhadra;Surjit K. Singh

  • Affiliations:
  • Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067

  • Venue:
  • Sanskrit Computational Linguistics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper describes a Sanskrit morphological analyzer that identifies and analyzes inflected noun-forms and verb-forms in any given sandhi-free text. The system which has been developed as java servlet RDBMS can be tested at http://sanskrit.jnu.ac.in (Language Processing Tools Sanskrit Tinanta Analyzer/Subanta Analyzer) with Sanskrit data in Unicode text. Subsequently, the separate systems of subanta and tinanta will be combined into a single system of sentence analysis with karaka interpretation. Currently, the system checks and labels each word as three basic POS categories - subanta, tinanta, and avyaya. Thereafter, each subanta is sent for subanta processing based on an example database and a rule database. The verbs are examined based on a database of verb roots and forms as well by reverse morphology based on Paninian techniques. Future enhancements include plugging in the amarakosa (http://sanskrit.jnu.ac.in/amara) and other noun lexicons with the subanta system. The tinanta will be enhanced by the kṛdanta analysis module being developed separately.