Integrating top-down and bottom-up strategies in a text processing system

  • Authors:
  • Lisa F. Rau;Paul S. Jacobs

  • Affiliations:
  • GE Company, Corporate R&D, Schenectady, NY;GE Company, Corporate R&D, Schenectady, NY

  • Venue:
  • ANLC '88 Proceedings of the second conference on Applied natural language processing
  • Year:
  • 1988

Abstract

The SCISOR system is a computer program designed to scan naturally occurring texts in constrained domains, extract information, and answer questions about that information. The system currently reads newspaper stories in the domain of corporate mergers and acquisitions. The language analysis strategy used by SCISOR combines full syntactic (bottom-up) parsing and conceptual expectation-driven (top-down) parsing. Four knowledge sources, including syntactic and semantic information and domain knowledge, interact in a flexible manner. This integration produces a more robust semantic analyzer that is designed to deal gracefully with gaps in lexical and syntactic knowledge, transport easily to new domains, and facilitate the extraction of information from texts.
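
The combination of bottom-up and top-down analysis described above can be illustrated with a small toy sketch. This is not SCISOR's implementation, and the abstract does not specify how its knowledge sources interact. The sketch assumes the simplest possible integration, a fallback: a pattern-based "bottom-up" pass runs first, and a domain-expectation "top-down" pass fills the same slots when the verb is missing from the lexicon. The lexicon, the function names (`bottom_up`, `top_down`, `analyze`), and the slot names are all hypothetical.

```python
import re

# Hypothetical toy lexicon; a real system would need far broader coverage.
LEXICON = {"acquired": "ACQUIRE", "bought": "ACQUIRE"}


def tokenize(sentence):
    """Split a sentence into alphanumeric tokens."""
    return re.findall(r"[A-Za-z0-9&]+", sentence)


def bottom_up(tokens):
    """Stand-in for a syntactic parse: succeeds only when a known verb
    sits between a non-empty subject and a non-empty object."""
    for i, tok in enumerate(tokens):
        if LEXICON.get(tok.lower()) == "ACQUIRE" and 0 < i < len(tokens) - 1:
            return {
                "event": "ACQUIRE",
                "buyer": " ".join(tokens[:i]),
                "target": " ".join(tokens[i + 1:]),
                "source": "bottom-up",
            }
    return None


def top_down(tokens):
    """Stand-in for expectation-driven analysis: the domain (assumed here
    to be takeovers) expects two company names, so capitalized tokens
    fill the slots even when the verb is unknown."""
    names = [t for t in tokens if t[0].isupper()]
    if len(names) >= 2:
        return {
            "event": "ACQUIRE",
            "buyer": names[0],
            "target": names[-1],
            "source": "top-down",
        }
    return None


def analyze(sentence):
    """Try the bottom-up pass first; fall back to domain expectations."""
    tokens = tokenize(sentence)
    return bottom_up(tokens) or top_down(tokens)
```

With this sketch, `analyze("Acme acquired Widgets")` is handled by the bottom-up pass, while `analyze("Acme gobbled up Widgets")` has no known verb and is filled in by the top-down pass. A sentence with neither a known verb nor two capitalized names yields `None`. A fallback is only one way to combine the two strategies; a more flexible interaction between them would need a richer control scheme than this.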