A context pattern induction method for named entity extraction

  • Authors:
  • Partha Pratim Talukdar;Thorsten Brants;Mark Liberman;Fernando Pereira

  • Affiliations:
  • University of Pennsylvania, Philadelphia, PA;Google, Inc., Mountain View, CA;University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA

  • Venue:
  • CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a novel context pattern induction method for information extraction, specifically named entity extraction. Using this method, we extended several classes of seed entity lists into much larger high-precision lists. Using token membership in these extended lists as additional features, we improved the accuracy of a conditional random field-based named entity tagger. In contrast, features derived from the seed lists decreased extractor accuracy.