SystemT: a declarative information extraction system

  • Authors:
  • Yunyao Li;Frederick R. Reiss;Laura Chiticariu

  • Affiliations:
  • IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA;IBM Research - Almaden, San Jose, CA

  • Venue:
  • HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Systems Demonstrations
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Emerging text-intensive enterprise applications such as social analytics and semantic search pose new challenges of scalability and usability to Information Extraction (IE) systems. This paper presents SystemT, a declarative IE system that addresses these challenges and has been deployed in a wide range of enterprise applications. SystemT facilitates the development of high quality complex annotators by providing a highly expressive language and an advanced development environment. It also includes a cost-based optimizer and a high-performance, flexible runtime with minimum memory footprint. We present SystemT as a useful resource that is freely available, and as an opportunity to promote research in building scalable and usable IE systems.