SCISOR: extracting information from on-line news
Communications of the ACM
TCS: a shell for content-based text categorization
Proceedings of the sixth conference on Artificial intelligence applications
Conceptual Information Processing
Conceptual Information Processing
ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
An equipment model and its role in the interpretation of nominal compounds
HLT '86 Proceedings of the workshop on Strategic computing natural language
Information extraction from free-text business documents
Effective databases for text & document management
An information extraction core system for real world German text processing
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Automatic processing of proper names in texts
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Accurate information extraction for quantitative financial events
Proceedings of the 20th ACM international conference on Information and knowledge management
Extraction of financial information from online business reports
ACM SIGMIS Database
Hi-index | 0.00 |
While complete understanding of arbitrary input text remains in the future, it is currently possible to construct natural language processing systems that provide a partial understanding of text with limited accuracy. Moreover, such systems can provide cost-effective solutions to commercially-significant business problems. This paper describes one such system: JASPER. JASPER is a fact extraction system recently developed and deployed by Carnegie Group for Reuters Ltd. JASPER uses a template-driven approach, partial understanding techniques, and heuristic procedures to extract certain key pieces of information from a limited range of text.We believe that many significant business problems can be solved by fact extraction applications which involve locating and extracting specific, predefined types of information from a limited range of text. The information extracted by such systems can be used in a variety of ways, such as filling in values in a database, generating summaries of the input text, serving as a part of the knowledge in an expert system, or feeding into another program which bases decisions on it. We expect to develop many such applications in the future using similar techniques.