Standard deviation as a query hardness estimator

Authors:
Joaquín Pérez-Iglesias;Lourdes Araujo
Affiliations:
Universidad Nacional de Educación a Distancia, Madrid, Spain;Universidad Nacional de Educación a Distancia, Madrid, Spain
Venue:
SPIRE'10 Proceedings of the 17th international conference on String processing and information retrieval
Year:
2010

Citing 7
Cited 9

A probabilistic model of information retrieval: development and comparative experiments

Information Processing and Management: an International Journal
Predicting query performance

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Precision prediction based on ranked list coherence

Information Retrieval
Performance prediction using spatial autocorrelation

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Improved query difficulty prediction for the web

Proceedings of the 17th ACM conference on Information and knowledge management
Estimating retrieval effectiveness using rank distributions

Proceedings of the 17th ACM conference on Information and knowledge management
On score distributions and relevance

ECIR'07 Proceedings of the 29th European conference on IR research

Improved query performance prediction using standard deviation

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Navigating the user query space

SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
Predicting Query Performance by Query-Drift Estimation

ACM Transactions on Information Systems (TOIS)
Predicting query performance directly from score distributions

AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Investigating performance predictors using monte carlo simulation and score distribution models

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Query performance prediction for IR

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Predicting query performance for fusion-based retrieval

Proceedings of the 21st ACM international conference on Information and knowledge management
Query-performance prediction and cluster ranking: two sides of the same coin

Proceedings of the 21st ACM international conference on Information and knowledge management
Document Score Distribution Models for Query Performance Inference and Prediction

ACM Transactions on Information Systems (TOIS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper a new Query Performance Prediction method is introduced. This method is based on the hypothesis that different score distributions appear for 'hard' and 'easy' queries. Following we propose a set of measures which try to capture the differences between both types of distributions, focusing on the dispersion degree among the scores. We have applied some variants of the classic standard deviation and have studied methods to find out the most suitable size of the ranking list for these measures. Finally, we present the results obtained performing the experiments on two different data-sets.