Comparing information extraction pattern models

  • Authors:
  • Mark Stevenson;Mark A. Greenwood

  • Affiliations:
  • University of Sheffield, Sheffield, UK;University of Sheffield, Sheffield, UK

  • Venue:
  • IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several recently reported techniques for the automatic acquisition of Information Extraction (IE) systems have used dependency trees as the basis of their extraction pattern representation. These approaches have used a variety of pattern models (schemes for representing IE patterns based on particular parts of the dependency analysis). An appropriate model should be expressive enough to represent the information which is to be extracted from text without being overly complicated. Four previously reported pattern models are evaluated using existing IE evaluation corpora and three dependency parsers. It was found that one model, linked chains, could represent around 95% of the information of interest without generating an unwieldy number of possible patterns.