Regularizing ad hoc retrieval scores

Authors:
Fernando Diaz
Affiliations:
University of Massachusetts, Amherst, MA
Venue:
Proceedings of the 14th ACM international conference on Information and knowledge management
Year:
2005

Citing 19
Cited 44

On the use of spreading activation methods in automatic information

SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieving documents by plausible inference: a priliminary study

SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Adaptive information retrieval: using a connectionist representation to retrieve and learn about documents

SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
A neural network for probabilistic information retrieval

SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
Inference networks for document retrieval

SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Using the cosine measure in a neural network for document retrieval

SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Viewing morphology as an inference process

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
The first text retrieval conference (TREC-1) Rockville, MD, U.S.A., 4–6 November, 1992

Information Processing and Management: an International Journal
Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Cluster-based language models for distributed retrieval

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Language Modeling for Information Retrieval

Language Modeling for Information Retrieval
Semi-Supervised Learning on Riemannian Manifolds

Machine Learning
Cluster-based retrieval using language models

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Corpus structure, language models, and ad hoc information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Combining the language model and inference network approaches to retrieval

Information Processing and Management: an International Journal - Special issue: Bayesian networks and information retrieval
PageRank without hyperlinks: structural re-ranking using links induced by language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A study of relevance propagation for web search

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A generative theory of relevance

A generative theory of relevance
Semi-supervised learning with graphs

Semi-supervised learning with graphs

Respect my authority!: HITS without hyperlinks, utilizing cluster-based language models

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Document re-ranking using cluster validation and label propagation

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Performance prediction using spatial autocorrelation

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Semiautomatic evaluation of retrieval systems using document similarities

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Towards a unified approach to document similarity search using manifold-ranking of blocks

Information Processing and Management: an International Journal
The opposite of smoothing: a language model approach to ranking query-specific document clusters

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A cluster-based resampling method for pseudo-relevance feedback

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A rank-aggregation approach to searching for optimal query-specific clusters

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A general optimization framework for smoothing language models on graph structures

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Re-ranking search results using document-passage graphs

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Effective latent space graph-based re-ranking model with global consistency

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Clusters, language models, and ad hoc information retrieval

ACM Transactions on Information Systems (TOIS)
Enhancing Expert Finding Using Organizational Hierarchies

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Re-ranking search results using language models of query-specific clusters

Information Retrieval
A generalized Co-HITS algorithm and its application to bipartite graphs

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Building enriched document representations using aggregated anchor text

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
From "Identical" to "Similar": Fusing Retrieved Lists Based on Inter-document Similarities

ICTIR '09 Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory
Pseudo-aligned multilingual corpora

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
The use of categorization information in language models for question retrieval

Proceedings of the 18th ACM conference on Information and knowledge management
Utilizing inter-passage and inter-document similarities for re-ranking search results

Proceedings of the 18th ACM conference on Information and knowledge management
Re-ranking Documents Based on Query-Independent Document Specificity

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Latent document re-ranking

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Context-aware citation recommendation

Proceedings of the 19th international conference on World wide web
Utilizing passage-based language models for ad hoc document retrieval

Information Retrieval
PageRank without hyperlinks: Structural reranking using links induced by language models

ACM Transactions on Information Systems (TOIS)
On identifying representative relevant documents

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Utilizing inter-passage and inter-document similarities for reranking search results

ACM Transactions on Information Systems (TOIS)
Learning to re-rank web search results with multiple pairwise features

Proceedings of the fourth ACM international conference on Web search and data mining
Dual-space re-ranking model for document retrieval

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Latent semantic indexing (LSI) fails for TREC collections

ACM SIGKDD Explorations Newsletter
Smoothing click counts for aggregated vertical search

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Cluster-based fusion of retrieved lists

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Re-ranking search results using an additional retrieved list

Information Retrieval
Negation for document re-ranking in ad-hoc retrieval

ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
From "identical" to "similar": fusing retrieved lists based on inter-document similarities

Journal of Artificial Intelligence Research
The opposite of smoothing: a language model approach to ranking query-specific document clusters

Journal of Artificial Intelligence Research
A study of the integration of passage-, document-, and cluster-based information for re-ranking search results

Information Retrieval
Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

ACM Transactions on Information Systems (TOIS)
The optimum clustering framework: implementing the cluster hypothesis

Information Retrieval
Modeling and exploiting heterogeneous bibliographic networks for expertise ranking

Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Exploiting pairwise recommendation and clustering strategies for image re-ranking

Information Sciences: an International Journal
Employing document dependency in blog search

Journal of the American Society for Information Science and Technology
Document Re-ranking Using Partial Social Tagging

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
A deterministic resampling method using overlapping document clusters for pseudo-relevance feedback

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.01

Visualization

Abstract

The cluster hypothesis states: closely related documents tend to be relevant to the same request. We exploit this hypothesis directly by adjusting ad hoc retrieval scores from an initial retrieval so that topically related documents receive similar scores. We refer to this process as score regularization. Score regularization can be presented as an optimization problem, allowing the use of results from semi-supervised learning. We demonstrate that regularized scores consistently and significantly rank documents better than unregularized scores, given a variety of initial retrieval algorithms. We evaluate our method on two large corpora across a substantial number of topics.