Using a semantic network for information extraction

Authors:
Robert Gaizauskas;Kevin Humphreys
Affiliations:
Department of Computer Science, University of Sheffield, Regent Court, Portobello Road, Sheffield S1 4DP, UK/ e-mail: {r.gaizauskas,k.humphreys}@dcs.shef.ac.uk;Department of Computer Science, University of Sheffield, Regent Court, Portobello Road, Sheffield S1 4DP, UK/ e-mail: {r.gaizauskas,k.humphreys}@dcs.shef.ac.uk
Venue:
Natural Language Engineering
Year:
1997

Citing 20
Cited 5

Formal semantics: an introduction

Formal semantics: an introduction
Information extraction

Communications of the ACM
Naive Semantics for Natural Language Understanding

Naive Semantics for Natural Language Understanding
Conception vs. Lexicons: An Architecture for Multilingual Information Extraction

SCIE '97 International Summer School on Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology
Message Understanding Conference-6: a brief history

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
SRI International: description of the TACITUS system as used for MUC-3

MUC3 '91 Proceedings of the 3rd conference on Message understanding
GE-CMU: description of the SHOGUN system used for MUC-5

MUC5 '93 Proceedings of the 5th conference on Message understanding
New York University: description of the Proteus system as used for MUC-5

MUC5 '93 Proceedings of the 5th conference on Message understanding
SRI: description of the JV-FASTUS system used for MUC-5

MUC5 '93 Proceedings of the 5th conference on Message understanding
GE NLToolset: description of the system as used for MUC-4

MUC4 '92 Proceedings of the 4th conference on Message understanding
SRI International: description of the FASTUS system used for MUC-4

MUC4 '92 Proceedings of the 4th conference on Message understanding
BEN: description of the PLUM system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
University of Durham: description of the LOLITA system as used in MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
University of Manitoba: description of the PIE system used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
MITRE: description of the Alembic system used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
The NYU system for MUC-6 or where's the syntax?

MUC6 '95 Proceedings of the 6th conference on Message understanding
University of Sheffield: description of the LaSIE system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
SRA: description of the SRA system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
SRI International FASTUS system: MUC-6 test results and analysis

MUC6 '95 Proceedings of the 6th conference on Message understanding
One sense per discourse

HLT '91 Proceedings of the workshop on Speech and Natural Language

Approximate Information Filtering on the Semantic Web

KI '02 Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence
Evaluation-driven design of a robust coreference resolution system

Natural Language Engineering
An ontology-driven approach for semantic information retrieval on the Web

ACM Transactions on Internet Technology (TOIT)
Event coreference for information extraction

ANARESOLUTION '97 Proceedings of a Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts
Towards a context sensitive approach to searching information based on domain specific knowledge sources

Web Semantics: Science, Services and Agents on the World Wide Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes the approach to knowledge representation taken in the LaSIE Information Extraction (IE) system. Unlike many IE systems that skim texts and use large collections of shallow, domain-specific patterns and heuristics to fill in templates, LaSIE attempts a fuller text analysis, first translating individual sentences to a quasi-logical form, and then constructing a weak discourse model of the entire text from which template fills are finally derived. Underpinning the system is a general ‘world model’, represented as a semantic net, which is extended during the processing of a text by adding the classes and instances described in that text. In the paper we describe the system's knowledge representation formalisms, their use in the IE task, and how the knowledge represented in them is acquired, including experiments to extend the system's coverage using the WordNet general purpose semantic network. Preliminary evaluations of our approach, through the Sixth DARPA Message Understanding Conference, indicate comparable performance to shallower approaches. However, we believe its generality and extensibility offer a route towards the higher precision that is required of IE systems if they are to become genuinely usable technologies.