Split size-rank models for the distribution of index terms
Journal of the American Society for Information Science
Stochastic models for the distribution of index terms
Journal of Documentation
Information Processing and Management: an International Journal - Special issue on Informetrics
Information Processing and Management: an International Journal - Special issue on Informetrics
Automatic indexing based on Bayesian inference networks
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Information storage and retrieval
Information storage and retrieval
Automatic subject indexing using an associative neural network
Proceedings of the third ACM conference on Digital libraries
A theory of term weighting based on exploratory data analysis
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
DARE: distance and angle retrieval environment: a tale of the two measures
Journal of the American Society for Information Science
The feature quantity: an information theoretic perspective of Tfidf-like measures
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Visualization of term discrimination analysis
Journal of the American Society for Information Science and Technology
Automatic identification and organization of index terms for interactive browsing
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
TOFIR: a tool of facilitating information retrieval — introduce a visual retrieval model
Information Processing and Management: an International Journal
Theory of Indexing
Journal of the American Society for Information Science and Technology
Expansion of multi-word terms for indexing and retrieval using morphology and syntax
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Improving automatic indexing through concept combination and term enrichment
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
A New Term Significance Weighting Approach
Journal of Intelligent Information Systems
Hi-index | 0.00 |
Index modeling and computer simulation techniques are used to examine the influence of indexing frequency distributions, indexing exhaustivity distributions, and three weighting methods on hypothetical document spaces in a vector-based information retrieval (IR) system. The way documents are indexed plays an important role in retrieval. The authors demonstrate the influence of different indexing characteristics on document space density (DSD) changes and document space discriminative capacity for IR. Document environments that contain a relatively higher percentage of infrequently occurring terms provide lower density outcomes than do environments where a higher percentage of frequently occurring terms exists. Different indexing exhaustivity levels, however, have little influence on the document space densities. A weighting algorithm that favors higher weights for infrequently occurring terms results in the lowest overall document space densities, which allows documents to be more readily differentiated from one another. This in turn can positively influence IR. The authors also discuss the influence on outcomes using two methods of normalization of term weights (i.e., means and ranges) for the different weighting methods. © 2008 Wiley Periodicals, Inc.