A comprehensive analysis of parameter settings for novelty-biased cumulative gain
Proceedings of the 21st ACM international conference on Information and knowledge management
Novelty-biased cumulative gain (α-NDCG) has become the de facto measure within the information retrieval (IR) community for evaluating retrieval systems in the context of sub-topic retrieval. Setting an inappropriate value for the parameter α prevents the measure from behaving as desired in particular circumstances. In fact, when α is set according to common practice (i.e. α = 0.5), the measure favours systems that promote redundant relevant sub-topics over those that provide novel relevant ones. Recognising this characteristic of the measure is important because it affects the comparison and ranking of retrieval systems. We propose an approach to overcome this problem by defining a safe threshold for the value of α on a per-query basis. Moreover, we study its impact on system rankings through a comprehensive simulation.
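To make the role of α concrete, the following is a minimal sketch of the α-DCG gain computation, the unnormalised core of α-NDCG, assuming binary sub-topic relevance judgments; the function name and the toy rankings are illustrative, not part of the paper.

```python
import math

def alpha_dcg(ranking, alpha):
    """Compute α-DCG for a ranked list.

    ranking: list of documents, each represented as the set of
             sub-topic ids it is relevant to (binary judgments).
    alpha:   redundancy penalty in [0, 1]; each repeat coverage of a
             sub-topic is discounted by a further factor of (1 - α).
    """
    seen = {}            # sub-topic id -> times covered so far
    score = 0.0
    for rank, subtopics in enumerate(ranking, start=1):
        gain = sum((1 - alpha) ** seen.get(t, 0) for t in subtopics)
        score += gain / math.log2(rank + 1)
        for t in subtopics:
            seen[t] = seen.get(t, 0) + 1
    return score

# Two toy rankings over sub-topics {a, b}:
novel     = [{"a"}, {"b"}]   # covers a new sub-topic at rank 2
redundant = [{"a"}, {"a"}]   # repeats sub-topic a at rank 2

# With α = 0.5 the repeated sub-topic still earns half a gain unit,
# so the redundant ranking stays close to the novel one; a larger α
# (e.g. 0.9) separates them much more sharply.
for a in (0.5, 0.9):
    print(a, alpha_dcg(novel, a), alpha_dcg(redundant, a))
```

This illustrates the sensitivity the abstract describes: the gap between a novelty-promoting and a redundancy-promoting ranking depends directly on the chosen α, which is why a safe, per-query threshold for α matters.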