ACM Computing Surveys (CSUR) - Annals of discrete mathematics, 24
Another look at automatic text-retrieval systems
Communications of the ACM
Parallel free-text search on the connection machine system
Communications of the ACM - Special issue on parallelism
Optimal signature extraction and information loss
ACM Transactions on Database Systems (TODS)
Signature files: an access method for documents and its analytical performance evaluation
ACM Transactions on Information Systems (TOIS)
ACM Transactions on Information Systems (TOIS)
Design of Database Structures
A Method for Speeding Up Text Retrieval
Databases for Business and Office Applications, Database Week
Addressing the requirements of a dynamic corporate textual information base
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Hi-index | 0.00 |
This paper presents a new data structure and an associated strategy to be utilized by indexing facilities for text retrieval systems. The paper starts by reviewing some of the goals that may be considered when designing such an index and continues with a small survey of various current strategies. It then presents an indexing strategy referred to as surrogate subsets discussing its appropriateness in the light of the specified goals. Various design issues and implementation details are discussed. Our strategy requires that a surrogate file be divided into a large number of subsets separated by free space which will allow the index to expand when new material is appended to the database. Experimental results report on the utilization of free space when the database is enlarged.