Composite document extended retrieval: an overview

Authors:
Edward A. Fox
Affiliations:
Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA
Venue:
SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
1985

Citing 31
Cited 4

Relevance feedback and a fuzzy set of search terms in an information retrieval system

Information Technology Research Development Applications
Soft evaluation of Boolean search queries in information retrieval systems

Information Technology Research Development Applications
Building expert systems

Building expert systems
Logic for problem-solving

Logic for problem-solving
Improved retrieval using a relational thesaurus for automatic expansion of boolean logic queries

Relational models of the lexicon
Approximate String Matching

ACM Computing Surveys (CSUR)
Document processing in a relational database system

ACM Transactions on Information Systems (TOIS)
The computer science research network CSNET: a history and status report

Communications of the ACM
Extended Boolean information retrieval

Communications of the ACM
Grapevine: an exercise in distributed computing

Communications of the ACM
The proposed new Computing Reviews classification scheme

Communications of the ACM
String similarity and misspellings

Communications of the ACM
Retrieval of misspelled names in an airlines passenger record system

Communications of the ACM
Applications for information retrieval techniques in the office

SIGIR '83 Proceedings of the 6th annual international ACM SIGIR conference on Research and development in information retrieval
PROLOG Database System

PROLOG Database System
Information Retrieval

Information Retrieval
An intelligent terminal for implementing relevance feedback on large operational retrieval systems

SIGIR '82 Proceedings of the 5th annual ACM conference on Research and development in information retrieval
Adapting a data organization to the structure of stored information

SIGIR '82 Proceedings of the 5th annual ACM conference on Research and development in information retrieval
On the architecture of a system integrating data base management and information retrieval

SIGIR '82 Proceedings of the 5th annual ACM conference on Research and development in information retrieval
Implementing SMART for minicomputers via relational processing With abstract data types

SIGSMALL '81 Proceedings of the 1981 ACM SIGSMALL symposium on Small systems and SIGMOD workshop on Small database systems
OTTER - An information retrieval system for office automation

COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
An object-oriented Office Document Architecture model for processing and interchange of documents

COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
The structure of abstract document objects

COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
Officeaid: An integrated document management system

COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
A conceptual approach to document retrieval

COCS '84 Proceedings of the second ACM-SIGOA conference on Office information systems
Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types

Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types
Computing story trees

Computational Linguistics
Flexible parsing

Computational Linguistics
Coping with extragrammaticality

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
Experiments with automatic text filing and retrieval in the office environment

ACM SIGIR Forum
Language As a Cognitive Process: Syntax

Language As a Cognitive Process: Syntax

Coefficients of combining concept classes in a collection

SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
The automatic generation of extended queries

SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
An interpretation of index term weighting schemes based on document components

Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
A survey in indexing and searching XML documents

Journal of the American Society for Information Science and Technology - XML

Quantified Score

Hi-index	0.00

Visualization

Abstract

Experimental information retrieval (IR) systems, some dating back to the sixties, have demonstrated the viability of fully automatic document storage and retrieval methodologies with small to medium size bibliographic collections [72]. Many of these experimental systems utilize the vector space model in which each important term (such as a word stem) identifies a different dimension in a space, so that matrix methods and vector operations can be defined on queries and documents. Statistical techniques have been very effective, and probabilistic enhancements have given additional improvements [84]. However, the basic vector space model is oriented towards recording the essential information in the text of a title/abstract combination rather than describing more complex document structures. It is necessary to extend the model in order to handle composite documents.On the other hand, commonly available retrieval systems that employ Boolean logic queries and utilize inverted file storage schemes can without modification accommodate such documents, albeit with somewhat less effectiveness than is possible with more sophisticated systems. Hence, it is also of interest to consider how Boolean logic systems can be extended to give better performance, especially with composite documents, and to integrate those approaches with vector methods.