Closing the gap: learning-based information extraction rivaling knowledge-engineering methods

  • Authors:
  • Hai Leong Chieu;Hwee Tou Ng;Yoong Keok Lee

  • Affiliations:
  • DSO National Laboratories, Singapore;National University of Singapore, Singapore;DSO National Laboratories, Singapore

  • Venue:
  • ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a learning approach to the scenario template task of information extraction, where information filling one template could come from multiple sentences. When tested on the MUC-4 task, our learning approach achieves accuracy competitive to the best of the MUC-4 systems, which were all built with manually engineered rules. Our analysis reveals that our use of full parsing and state-of-the-art learning algorithms have contributed to the good performance. To our knowledge, this is the first research to have demonstrated that a learning approach to the full-scale information extraction task could achieve performance rivaling that of the knowledge engineering approach.