The effect multiple query representations on information retrieval system performance
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Passage-level evidence in document retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Analyses of multiple evidence combination
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance based language models
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating document clustering for interactive information retrieval
Proceedings of the tenth international conference on Information and knowledge management
Passage retrieval based on language models
Proceedings of the eleventh international conference on Information and knowledge management
Corpus structure, language models, and ad hoc information retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Unified utility maximization framework for resource selection
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Diffusion Kernels on Statistical Manifolds
The Journal of Machine Learning Research
Riemannian geometry and statistical machine learning
Riemannian geometry and statistical machine learning
Estimation and use of uncertainty in pseudo-relevance feedback
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
CCVisu: automatic visual software decomposition
Companion of the 30th international conference on Software engineering
Blog site search using resource selection
Proceedings of the 17th ACM conference on Information and knowledge management
It pays to be picky: an evaluation of thread retrieval in online forums
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Online community search using thread structure
Proceedings of the 18th ACM conference on Information and knowledge management
Sided and symmetrized Bregman centroids
IEEE Transactions on Information Theory
Utilizing passage-based language models for document retrieval
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Evaluating text representations for retrieval of the best group of documents
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Cluster-based fusion of retrieved lists
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Is document frequency important for PRF?
ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
The opposite of smoothing: a language model approach to ranking query-specific document clusters
Journal of Artificial Intelligence Research
Predicting document effectiveness in pseudo relevance feedback
Proceedings of the 20th ACM international conference on Information and knowledge management
Online community search using conversational structures
Information Retrieval
Predicting Query Performance by Query-Drift Estimation
ACM Transactions on Information Systems (TOIS)
Structured event retrieval over microblog archives
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Query-performance prediction and cluster ranking: two sides of the same coin
Proceedings of the 21st ACM international conference on Information and knowledge management
Exploring the cluster hypothesis, and cluster-based retrieval, over the web
Proceedings of the 21st ACM international conference on Information and knowledge management
Ranking document clusters using markov random fields
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A Theoretical Analysis of Pseudo-Relevance Feedback Models
Proceedings of the 2013 Conference on the Theory of Information Retrieval
Hi-index | 0.00 |
Combining multiple documents to represent an information object is well-known as an effective approach for many Information Retrieval tasks. For example, passages can be combined to represent a document for retrieval, document clusters are represented using combinations of the documents they contain, and feedback documents can be combined to represent a query model. Various techniques for combination have been introduced, and among them, representation techniques based on concatenation and the arithmetic mean are frequently used. Some recent work has shown the potential of a new representation technique using the geometric mean. However, these studies lack a theoretical foundation explaining why the geometric mean should have advantages for representing multiple documents. In this paper, we show that the arithmetic mean and the geometric mean are approximations to the center of mass in certain geometries, and show empirically that the geometric mean is closer to the center. Through experiments with two IR tasks, we show the potential benefits for geometric representations, including a geometry-based pseudo-relevance feedback method that outperforms state-of-the-art techniques.