Generalized vector spaces model in information retrieval
SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
The Vector Space Model (VSM) has been adopted in information retrieval as a means of coping with inexact representation of documents and queries, and the resulting difficulties in determining the relevance of a document relative to a given query. The major problem in employing this approach is that the explicit representation of term vectors is not known a priori. Consequently, earlier researchers made the assumption that the vectors corresponding to terms are pairwise orthogonal. Such an assumption is clearly unrealistic. Although attempts have been made to compensate for this assumption by some separate, corrective steps, such methods are ad hoc and, in most cases, formally inconsistent.

In this paper, a generalization of the VSM, called the GVSM, is advanced. The developments provide a solution not only for the computation of a measure of similarity (correlation) between terms, but also for the incorporation of these similarities into the retrieval process.

The major strength of the GVSM derives from the fact that it is theoretically sound and elegant. Furthermore, experimental evaluation of the model on several test collections indicates that the performance is better than that of the VSM. Experiments have been performed on some variations of the GVSM, and all these results have also been compared to those of the VSM, based on inverse document frequency weighting. These results and some ideas for the efficient implementation of the GVSM are discussed.
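
To make the role of the orthogonality assumption concrete, the following minimal sketch compares a document and a query under two term-correlation (Gram) matrices. It is an illustration only, not the construction used in the paper: the three-term vocabulary, the correlation value 0.8, and the function names are all assumptions chosen for the example, whereas the GVSM derives term correlations from the collection itself. With the identity matrix (pairwise orthogonal terms, as in the classical VSM) a document and a query that share no terms score zero; with correlated terms they can still match.

```python
import math

def dot(u, v):
    """Plain dot product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))

def apply_matrix(G, v):
    """Multiply square matrix G (list of rows) by vector v."""
    return [dot(row, v) for row in G]

def similarity(d, q, G):
    """Cosine-style similarity of d and q when G[i][j] is the
    inner product of term vectors i and j (hypothetical helper)."""
    Gq = apply_matrix(G, q)
    Gd = apply_matrix(G, d)
    denom = math.sqrt(dot(d, Gd)) * math.sqrt(dot(q, Gq))
    return dot(d, Gq) / denom if denom else 0.0

# Assumed toy vocabulary: term 0 and term 1 are near-synonyms, term 2 is unrelated.
ORTHOGONAL = [[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]]
CORRELATED = [[1.0, 0.8, 0.0],   # 0.8 is an invented correlation value
              [0.8, 1.0, 0.0],
              [0.0, 0.0, 1.0]]

doc = [1.0, 0.0, 0.0]    # document contains only term 0
query = [0.0, 1.0, 0.0]  # query contains only term 1

vsm_score = similarity(doc, query, ORTHOGONAL)
gvsm_style_score = similarity(doc, query, CORRELATED)
print(vsm_score, gvsm_style_score)
```

Under the orthogonal matrix the score is zero because the document and query share no term; under the correlated matrix the score equals the assumed correlation between the two terms. How such correlations are actually computed is the subject of the paper and is not reproduced here.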