A trainable system for the extraction of meaning from text

Authors:
Amit Bagga;Joyce Chai;Alan W. Biermann;Curry I. Guinn;Alan Hui
Affiliations:
Department of Computer Science, Duke University;Department of Computer Science, Duke University;Department of Computer Science, Duke University;Department of Computer Science, Duke University;IBM Software Solutions, Research Triangle Park, NC
Venue:
CASCON '95 Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research
Year:
1995

Citing 4
Cited 1

Automatic structuring and retrieval of large text files

Communications of the ACM
Automated learning of decision rules for text categorization

ACM Transactions on Information Systems (TOIS)
Acquiring disambiguation rules from text

ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
FASTUS: a system for extracting information from text

HLT '93 Proceedings of the workshop on Human Language Technology

Introduction to information extraction

AI Communications

Quantified Score

Hi-index	0.01

Visualization

Abstract

This project is developing a trainable system that can extract meaning from texts in different domains (example: various Internet news-groups). The system does partial parsing based on a large dictionary containing approximately 150,000 words. The system assists the user in extracting a semantic network representation for each member of a set of training articles contained in some large database. Based on the user's training, the system forms statistical tables, a knowledge base, and a set of rules mirroring the user's actions. The system then generalizes these rules. Using statistically based semantic classification, the system applies these rules to new articles from the database for automatically building semantic networks.