Ontologies and text retrieval

Authors:
James Mayfield
Affiliations:
The Johns Hopkins University Applied Physics Laboratory, 11100 Johns Hopkins Rd., Laurel MD 20723–/6099, USA/ e-mail: James.Mayfield@jhuapl.edu
Venue:
The Knowledge Engineering Review
Year:
2002

Citing 8
Cited 2

Organizing information: principles of data base and retrieval systems

Organizing information: principles of data base and retrieval systems
Translingual information retrieval: learning from bilingual corpora

Artificial Intelligence - Special issue: artificial intelligence 40 years later
Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
The nature of indexing: how humans and machines analyze messages and texts for retrieval: part I: research, and the nature of human indexing

Information Processing and Management: an International Journal
Explorations in Automatic Thesaurus Discovery

Explorations in Automatic Thesaurus Discovery
Building Large Knowledge-Based Systems; Representation and Inference in the Cyc Project

Building Large Knowledge-Based Systems; Representation and Inference in the Cyc Project
Retrieving with Good Sense

Information Retrieval
Semantic indexing using WordNet senses

RANLPIR '00 Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11

A methodology for engineering ontology acquisition and validation

Artificial Intelligence for Engineering Design, Analysis and Manufacturing
Harmonisation of soft logical inference rules in distributed decision systems

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Analogues to much of today's work in ontologies have existed for centuries in text retrieval. The use of controlled vocabularies, or thesauri, has been fundamental to document indexing in library science. Thesauri serve several purposes, including:• Knowledge organisation A thesaurus provides a hierarchy of concepts that organises domain-specific knowledge.• Terminology normalisation By selecting a unique word or phrase to represent each domain concept, then linking synonymous terms to it, a thesaurus enforces terminological consistency.• Query expansion A thesaurus facilitates the addition of terms to a query by providing explicit hierarchical and lateral relationships among terms.These properties serve to mediate the information flow from indexer to user. Thesauri thus serve many of the same functions for people that ontologies are designed to serve for software agents. As automated retrieval has developed over the decades since the inception of computer processing of text, many techniques have been introduced to apply this typically manual work to the automated arena (see Soergel (1985) for an introduction to library information systems, also Anderson and Pélrez-Carballo (2001a, 2001b) for a summary of the intersection of human and machine indexing).