A maximum entropy approach to information extraction from semi-structured and free text

  • Authors:
  • Hai Leong Chieu;Hwee Tou Ng

  • Affiliations:
  • DSO National Laboratories, 20 Science Park Drive, Singapore;Department of Computer Science, School of Computing, National University of Singapore, 3 Science Drive 2, Singapore

  • Venue:
  • Eighteenth national conference on Artificial intelligence
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a classification-based approach towards single-slot as well as multi-slot information extraction (IE). For single-slot IE, we worked on the domain of Seminar Announcements, where each document contains information on only one seminar. For multi-slot IE, we worked on the domain of Management Succession. For this domain, we restrict ourselves to extracting information sentence by sentence, in the same way as (Soderland 1999). Each sentence can contain information on several management succession events. By using a classification approach based on a maximum entropy framework, our system achieves higher accuracy than the best previously published results in both domains.