A Global Rule Induction Approach to Information Extraction

  • Authors:
  • Jing Xiao;Tat-Seng Chua;Jimin Liu

  • Affiliations:
  • -;-;-

  • Venue:
  • ICTAI '03 Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The ability to extract desired pieces of information from natural language texts is an important task with a growing number of potential applications. This paper presents a novel pattern rule induction learning system, GRID, which emphasizes on utilizing global feature distribution in all of the training instances in order to make better decision on rule induction. GRID incorporates features at lexical, syntactical and semantic levels simultaneously. It induces rules by adopting a combination of top-down and bottom-up approaches. The features chosen in GRID are general and they were applied successfully to both semi-structured text and free text. Our experimental results on some publicly available webpage corpora and MUC-4 test set indicate that our approach is effective.