The choice of features for classification of verbs in biomedical texts

  • Authors:
  • Anna Korhonen;Yuval Krymolowski;Nigel Collier

  • Affiliations:
  • University of Cambridge, Cambridge, UK;Haifa University, Israel;National Institute of Informatics, Chiyoda-ku, Tokyo, Japan

  • Venue:
  • COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We conduct large-scale experiments to investigate optimal features for classification of verbs in biomedical texts. We introduce a range of feature sets and associated extraction techniques, and evaluate them thoroughly using a robust method new to the task: cost-based framework for pairwise clustering. Our best results compare favourably with earlier ones. Interestingly, they are obtained with sophisticated feature sets which include lexical and semantic information about selectional preferences of verbs. The latter are acquired automatically from corpus data using a fully unsupervised method.