On modeling of information retrieval concepts in vector spaces

Authors:
S. K.M. Wong;W. Ziarko;V. V. Raghavan;P. C.N. Wong
Affiliations:
Univ. of Regina, Regina, Sask., Canada;Univ. of Regina, Regina, Sask., Canada;Univ. of Regina, Regina, Sask., Canada;Univ. of Regina, Regina, Sask., Canada
Venue:
ACM Transactions on Database Systems (TODS)
Year:
1987

Citing 8
Cited 48

On extending the vector space model for Boolean query processing

Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Generalized vector spaces model in information retrieval

SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
A learning algorithm applied to document redescription

SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Experiments on the determination of the relationships between terms

ACM Transactions on Database Systems (TODS)
Computer Evaluation of Indexing and Text Processing

Journal of the ACM (JACM)
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
An evaluation of term dependence models in information retrieval

SIGIR '82 Proceedings of the 5th annual ACM conference on Research and development in information retrieval
Dynamic information and library processing

Dynamic information and library processing

A neural network for probabilistic information retrieval

SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
An object-oriented modeling of the history of optimal retrievals

SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Computation of term associations by a neural network

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Concept based query expansion

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
On modeling information retrieval with probabilistic inference

ACM Transactions on Information Systems (TOIS)
A network approach to probabilistic information retrieval

ACM Transactions on Information Systems (TOIS)
Information filtering: the computation of similarities in large corpora of legal texts

ICAIL '95 Proceedings of the 5th international conference on Artificial intelligence and law
An information-theoretic approach to automatic query expansion

ACM Transactions on Information Systems (TOIS)
A context vector model for information retrieval

Journal of the American Society for Information Science and Technology
Set-based model: a new approach for information retrieval

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A New Approach to Clustering Records in Information Retrieval Systems

Information Retrieval
Design of an Integrated Information Retrieval/Database Management System

IEEE Transactions on Knowledge and Data Engineering
Vector-based approach to analysis of file space properties

Progress in computer research
A Probabilistic Framework for Vague Queries and Imprecise Information in Databases

VLDB '90 Proceedings of the 16th International Conference on Very Large Data Bases
A Theory and Approach to Improving Relevance Ranking in Web Retrieval

WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
On Modeling of Concept Based Retrieval in Generalized Vector Spaces

ISMIS '00 Proceedings of the 12th International Symposium on Foundations of Intelligent Systems
Using User Profiles in Intelligent Information Retrieval

ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
Enhancing the Set-Based Model Using Proximity Information

SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing

Technologies for constructing intelligent systems
CALA: a web analysis algorithm combined with content correlation analysis method

Journal of Computer Science and Technology
Efficient Semantic-Based Content Search in P2P Network

IEEE Transactions on Knowledge and Data Engineering
SimFusion: measuring similarity using unified relationship matrix

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Comparing vector space retrieval with the RUBRIC expert system

ACM SIGIR Forum
Set-based vector model: An efficient approach for correlation-based ranking

ACM Transactions on Information Systems (TOIS)
Application of ART neural network to development of technology for functional feature-based reference design retrieval

Computers in Industry
Context modeling and discovery using vector space bases

Proceedings of the 14th ACM international conference on Information and knowledge management
Two-stage statistical language models for text database selection

Information Retrieval
Concept Based Retrieval Using Generalized Retrieval Functions

Fundamenta Informaticae - Intelligent Systems
Energy and quality aware query processing in wireless sensor database systems

Information Sciences: an International Journal
A graph method for keyword-based selection of the top-K databases

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Exploiting Morphological Query Structure Using Genetic Optimisation

NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
Adaptive indexing for content-based search in P2P systems

Data & Knowledge Engineering
An analysis of latent semantic term self-correlation

ACM Transactions on Information Systems (TOIS)
Community-supported collaborative navigation with FoxPeer

International Journal of Web Based Communities
A generalized vector space model for text retrieval based on semantic relatedness

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop
Application of ART neural network to development of technology for functional feature-based reference design retrieval

Computers in Industry
Representing Context Information for Document Retrieval

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Structure of morphologically expanded queries: A genetic algorithm approach

Data & Knowledge Engineering
A survey of Chinese text similarity computation

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
A vector space approach to tag cloud similarity ranking

Information Processing Letters
Estimating intrinsic dimensionality using the multi-criteria decision weighted model and the average standard estimator

Information Sciences: an International Journal
Query clauses and term independence

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Lexical and Syntactic knowledge for Information Retrieval

Information Processing and Management: an International Journal
Preference-based top-k spatial keyword queries

Proceedings of the 1st international workshop on Mobile location-based service
A Survey of Automatic Query Expansion in Information Retrieval

ACM Computing Surveys (CSUR)
Using genetic algorithms for query reformulation

FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access
Co-spatial searcher: efficient tag-based collaborative spatial search on geo-social network

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Concept Based Retrieval Using Generalized Retrieval Functions

Fundamenta Informaticae - Intelligent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Vector Space Model (VSM) has been adopted in information retrieval as a means of coping with inexact representation of documents and queries, and the resulting difficulties in determining the relevance of a document relative to a given query. The major problem in employing this approach is that the explicit representation of term vectors is not known a priori. Consequently, earlier researchers made the assumption that the vectors corresponding to terms are pairwise orthogonal. Such an assumption is clearly unrealistic. Although attempts have been made to compensate for this assumption by some separate, corrective steps, such methods are ad hoc and, in most cases, formally inconsistent.In this paper, a generalization of the VSM, called the GVSM, is advanced. The developments provide a solution not only for the computation of a measure of similarity (correlation) between terms, but also for the incorporation of these similarities into the retrieval process.The major strength of the GVSM derives from the fact that it is theoretically sound and elegant. Furthermore, experimental evaluation of the model on several test collections indicates that the performance is better than that of the VSM. Experiments have been performed on some variations of the GVSM, and all these results have also been compared to those of the VSM, based on inverse document frequency weighting. These results and some ideas for the efficient implementation of the GVSM are discussed.