The Utah text retrieval project -- a status report
Proc. of the third joint BCS and ACM symposium on Research and development in information retrieval
An integrated fact/document information system for office automation
Information Technology Research Development Applications - Lecture notes in computer science 178
Database machines and database management
Database machines and database management
Concepts of the cover coefficient-based clustering methodology
SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Dynamic information and library processing
Dynamic information and library processing
A dynamic cluster maintenance system for information retrieval
SIGIR '87 Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval
Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases
ACM Transactions on Database Systems (TODS)
Analysis of multiterm queries in a dynamic signature file organization
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Multi-media document representation and retrieval
CSC '91 Proceedings of the 19th annual conference on Computer Science
Hi-index | 0.00 |
In this article we present an interactive automatic document indexing software together with various index tuning/optimization strategies. After stems are generated from the raw text, the initial index vocabulary is narrowed down and tuned with the use of indexing versus clustering theory relationships. The narrowed down vocabulary is further optimized with the inclusion of term phrases and virtual terms corresponding to high and low frequency terms respectively. The results of performance experimentation which proved significant improvements of index vocabulary optimization are presented. The exploitation of the term discrimination value concept in index and retrieval system tuning and optimization is discussed.