Hierarchical verb clustering using graph factorization

  • Authors:
  • Lin Sun;Anna Korhonen

  • Affiliations:
  • University of Cambridge, Computer Laboratory, Cambridge, UK;University of Cambridge, Computer Laboratory, Cambridge, UK

  • Venue:
  • EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most previous research on verb clustering has focussed on acquiring flat classifications from corpus data, although many manually built classifications are taxonomic in nature. Also Natural Language Processing (nlp) applications benefit from taxonomic classifications because they vary in terms of the granularity they require from a classification. We introduce a new clustering method called Hierarchical Graph Factorization Clustering (hgfc) and extend it so that it is optimal for the task. Our results show that Hgfc outperforms the frequently used agglomerative clustering on a hierarchical test set extracted from VerbNet, and that it yields state-of-the-art performance also on a flat test set. We demonstrate how the method can be used to acquire novel classifications as well as to extend existing ones on the basis of some prior knowledge about the classification.