A register-based annotation scheme for CO3H

  • Authors:
  • Ritesh Kumar

  • Affiliations:
  • Jawaharlal Nehru University, New Delhi, India

  • Venue:
  • Proceedings of the International Conference on Web Intelligence, Mining and Semantics
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper gives a description of an annotation scheme for annotating a corpus of computer-mediated communication in Hindi (CO3H) with certain semantic, pragmatic and situational features. The annotation scheme is based on the theory of register analysis, where it is assumed that a registeral difference entails difference in certain linguistic features. It adapts and integrates the annotation schemes of sense annotation in the Penn Discourse Treebank and dialogue act annotation of DIT++ within this larger registeral framework. The situational and linguistic features that will be used to annotate the corpus for PoRT is described in the paper, along with some proposed labels for these features.