Automatic identification of cause-effect relations in tamil using CRFs

  • Authors:
  • S. Menaka;Pattabhi R. K. Rao;Sobha Lalitha Devi

  • Affiliations:
  • AU-KBC Research Centre, MIT Campus of Anna Univeristy, Chennai, India;AU-KBC Research Centre, MIT Campus of Anna Univeristy, Chennai, India;AU-KBC Research Centre, MIT Campus of Anna Univeristy, Chennai, India

  • Venue:
  • CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present our work on automatic identification of cause-effect relations in a given Tamil text. Based on the analysis of causal constructions in Tamil, we identified a set of causal markers for Tamil and arrived at certain features used to develop our language model. We manually annotated a Tamil corpus of 8648 sentences for cause-effect relations. With this corpus, we developed the model for identifying causal relations using the machine learning technique, Conditional Random Fields (CRFs). We performed experiments and the results are encouraging. We performed an error analysis of the results and found that the errors can be attributed to some very interesting structural interdependencies between closely occurring causal relations. After comparing these structures in Tamil and English, we claim that at discourse level, the complexity of structural interdependencies between causal relations is more complex in Tamil than in English due to the free word order nature of Tamil.