A context vector model for information retrieval

  • Authors:
  • Holger Billhardt; Daniel Borrajo; Victor Maojo

  • Affiliations:
  • Univ. Politécnica de Madrid, Spain; Univ. Carlos III de Madrid, Spain; Univ. Politécnica de Madrid, Spain

  • Venue:
  • Journal of the American Society for Information Science and Technology
  • Year:
  • 2002


Abstract

In the vector space model for information retrieval, term vectors are pairwise orthogonal, that is, terms are assumed to be independent. It is well known that this assumption is too restrictive. In this article, we present our work on an indexing and retrieval method that, based on the vector space model, incorporates term dependencies and thus obtains semantically richer representations of documents. First, we generate term context vectors based on the co-occurrence of terms in the same documents. These vectors are used to calculate context vectors for documents. We present different techniques for estimating the dependencies among terms. We also define term weights that can be employed in the model. Experimental results on four text collections (MED, CRANFIELD, CISI, and CACM) show that incorporating term dependencies in the retrieval process yields statistically significantly better results than the classical vector space model with IDF weights. We also show that the degree of semantic matching versus direct word matching that performs best varies across the four collections. We conclude that the model performs well for certain types of queries and, generally, for information tasks with high recall requirements. Therefore, we propose the use of the context vector model in combination with other, direct word-matching methods.
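
The abstract only outlines the approach, so the sketch below illustrates the general context vector idea rather than the authors' exact estimation techniques or weighting scheme: term context vectors are derived from co-occurrence of terms in the same documents, document vectors are formed by summing the context vectors of their terms, and ranking uses cosine similarity. The corpus, normalization, and function names are illustrative assumptions, not the published formulation.

```python
# Minimal sketch of a context vector retrieval model (illustrative only).
import numpy as np

docs = [
    "heart disease diagnosis",
    "disease treatment drug",
    "drug trial heart",
]

# Vocabulary and raw term-document counts.
vocab = sorted({t for d in docs for t in d.split()})
idx = {t: i for i, t in enumerate(vocab)}
td = np.zeros((len(vocab), len(docs)))
for j, d in enumerate(docs):
    for t in d.split():
        td[idx[t], j] += 1

# Term context vectors: row-normalized term-term co-occurrence matrix.
# (The article studies several ways to estimate these dependencies.)
cooc = td @ td.T
context = cooc / np.linalg.norm(cooc, axis=1, keepdims=True)

def context_vector(counts):
    """Context vector for a document or query: weighted sum of the
    context vectors of its terms, normalized to unit length."""
    v = counts @ context
    n = np.linalg.norm(v)
    return v / n if n > 0 else v

doc_vecs = np.array([context_vector(td[:, j]) for j in range(len(docs))])

def retrieve(query):
    """Rank documents by cosine similarity to the query's context vector."""
    q = np.zeros(len(vocab))
    for t in query.split():
        if t in idx:
            q[idx[t]] += 1
    scores = doc_vecs @ context_vector(q)
    return sorted(enumerate(scores), key=lambda s: -s[1])

print(retrieve("heart drug"))
```

Because every document and query is expanded through the co-occurrence matrix, two texts can match without sharing any term; the article's proposal to combine this with direct word matching corresponds to mixing these scores with a conventional term-overlap score.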