Inflectional Morphology Analyzer for Sanskrit

Authors:
Girish Nath Jha;Muktanand Agrawal; Subash;Sudhir K. Mishra;Diwakar Mani;Diwakar Mishra;Manji Bhadra;Surjit K. Singh
Affiliations:
Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067;Special Centre for Sanskrit Studies, Jawaharlal Nehru University, New Delhi, 110067
Venue:
Sanskrit Computational Linguistics
Year:
2009

Citing 1
Cited 0

The Oxford Handbook of Computational Linguistics (Oxford Handbooks)

The Oxford Handbook of Computational Linguistics (Oxford Handbooks)

Quantified Score

Hi-index	0.00

Visualization

Abstract

The paper describes a Sanskrit morphological analyzer that identifies and analyzes inflected noun-forms and verb-forms in any given sandhi-free text. The system which has been developed as java servlet RDBMS can be tested at http://sanskrit.jnu.ac.in (Language Processing Tools Sanskrit Tinanta Analyzer/Subanta Analyzer) with Sanskrit data in Unicode text. Subsequently, the separate systems of subanta and tinanta will be combined into a single system of sentence analysis with karaka interpretation. Currently, the system checks and labels each word as three basic POS categories - subanta, tinanta, and avyaya. Thereafter, each subanta is sent for subanta processing based on an example database and a rule database. The verbs are examined based on a database of verb roots and forms as well by reverse morphology based on Paninian techniques. Future enhancements include plugging in the amarakosa (http://sanskrit.jnu.ac.in/amara) and other noun lexicons with the subanta system. The tinanta will be enhanced by the kṛdanta analysis module being developed separately.