Term position ranking: some new test results
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Self-indexing inverted files for fast text retrieval
ACM Transactions on Information Systems (TOIS)
Exploring the similarity space
ACM SIGIR Forum
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Shortest-substring retrieval and ranking
ACM Transactions on Information Systems (TOIS)
Biterm language models for document retrieval
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Internet Document Filtering Using Fourier Domain Scoring
PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
A Novel Web Text Mining Method Using the Discrete Cosine Transform
PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
A new implementation technique for fast Spectral based document retrieval systems
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Fourier Domain Scoring: A Novel Document Ranking Method
IEEE Transactions on Knowledge and Data Engineering
Dependence language model for information retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A Novel Document Ranking Method Using the Discrete Cosine Transform
IEEE Transactions on Pattern Analysis and Machine Intelligence
Hybrid Pre-Query Term Expansion using Latent Semantic Analysis
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Hi-index | 0.00 |
The vector space model (VSM) of information retrieval suffers in two areas, it does not utilise term positions and it treats every term as being independent. We examine two information retrieval methods based on the simple vector space model. The first uses the query term position flow within the documents to calculate the document score, the second includes related terms in the query by making use of term correlations. Both of these methods show significant improvements over the VSM precision while keeping the query time to speeds similar to those of the VSM.