New ranking algorithms for parsing and tagging: kernels over discrete structures, and the voted perceptron

  • Authors:
  • Michael Collins;Nigel Duffy

  • Affiliations:
  • AT&T Labs-Research, New Jersey;iKuni Inc., Palo Alto, CA

  • Venue:
  • ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces new learning algorithms for natural language processing based on the perceptron algorithm. We show how the algorithms can be efficiently applied to exponential sized representations of parse trees, such as the "all subtrees" (DOP) representation described by (Bod 1998), or a representation tracking all sub-fragments of a tagged sentence. We give experimental results showing significant improvements on two tasks: parsing Wall Street Journal text, and named-entity extraction from web data.