On modeling of information retrieval concepts in vector spaces
ACM Transactions on Database Systems (TODS)
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
A user-centred evaluation of ranking algorithms for interactive query expansion
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic thesaurus generation for an electronic community system
Journal of the American Society for Information Science
Journal of the American Society for Information Science
Improving automatic query expansion
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
On the necessity of term dependence in a query space for weighted retrieval
Journal of the American Society for Information Science
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval based on context distance and morphology
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Semantic Clustering of Index Terms
Journal of the ACM (JACM)
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
The text retrieval conferences (TRECS)
TIPSTER '98 Proceedings of a workshop on held at Baltimore, Maryland: October 13-15, 1998
Using genetic algorithms to find suboptimal retrieval expert combinations
Proceedings of the 2002 ACM symposium on Applied computing
Learning retrieval expert combinations with genetic algorithms
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
Multi-Dimensional Evaluation of Information Retrieval Results
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Set-based vector model: An efficient approach for correlation-based ranking
ACM Transactions on Information Systems (TOIS)
A Novel Context Matching Based Technique for Web Document Retrieval
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Locating thematic pinpoints in narrative texts with short phrases: a test study on Don Quixote
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
A collaborative filtering-based approach to personalized document clustering
Decision Support Systems
Integrated Computer-Aided Engineering
A "Bag" or a "Window" of Words for Information Filtering?
SETN '08 Proceedings of the 5th Hellenic conference on Artificial Intelligence: Theories, Models and Applications
Managing Word Mismatch Problems in Information Retrieval: A Topic-Based Query Expansion Approach
Journal of Management Information Systems
Contextual proximity based term-weighting for improved web information retrieval
KSEM'07 Proceedings of the 2nd international conference on Knowledge science, engineering and management
Integration of descriptors for software component retrieval
KSEM'07 Proceedings of the 2nd international conference on Knowledge science, engineering and management
Knowledge-level management of web information
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
A general fuzzy-based framework for text representation and its application to text categorization
FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Dynamic context for document search and recovery
ICCSA'13 Proceedings of the 13th international conference on Computational Science and Its Applications - Volume 1
Hi-index | 0.00 |
In the vector space model for information retrieval, term vectors are pair-wise orthogonal, that is, terms are assumed to be independent. It is well known that this assumption is too restrictive. In this article, we present our work on an indexing and retrieval method that, based on the vector space model, incorporates term dependencies and thus obtains semantically richer representations of documents. First, we generate term context vectors based on the co-occurrence of terms in the same documents. These vectors are used to calculate context vectors for documents. We present different techniques for estimating the dependencies among terms. We also define term weights that can be employed in the model. Experimental results on four text collections (MED, CRANFIELD, CISI, and CACM) show that the incorporation of term dependencies in the retrieval process performs statistically significantly better than the classical vector space model with IDF weights. We also show that the degree of semantic matching versus direct word matching that performs best varies on the four collections. We conclude that the model performs well for certain types of queries and, generally, for information tasks with high recall requirements. Therefore, we propose the use of the context vector model in combination with other, direct word-matching methods.