A noise-tolerant graphical model for ranking

Authors:
Xiubo Geng;Tao Qin;Tie-Yan Liu;Xue-Qi Cheng
Affiliations:
Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Zhongguancun, Haidian District, Beijing 100190, PR China;Microsoft Research Asia, No. 49 Zhichun Road, Haidian District, Beijing 100190, PR China;Microsoft Research Asia, No. 49 Zhichun Road, Haidian District, Beijing 100190, PR China;Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Zhongguancun, Haidian District, Beijing 100190, PR China
Venue:
Information Processing and Management: an International Journal
Year:
2012

Citing 16
Cited 0

A re-examination of relevance: toward a dynamic, situational definition

Information Processing and Management: an International Journal
Variations in relevance assessments and the measurement of retrieval effectiveness

Journal of the American Society for Information Science - Special issue: evaluation of information retrieval systems
From highly relevant to not relevant: examining different regions of relevance

Information Processing and Management: an International Journal
User perspectives on relevance criteria: a comparison among relevant, partially relevant , and not-relevant judgements

Journal of the American Society for Information Science and Technology
Modern Information Retrieval

Modern Information Retrieval
Cumulated gain-based evaluation of IR techniques

ACM Transactions on Information Systems (TOIS)
Learning From Noisy Examples

Machine Learning
Estimating a Kernel Fisher Discriminant in the Presence of Label Noise

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
An efficient boosting algorithm for combining preferences

The Journal of Machine Learning Research
Learning to rank using gradient descent

ICML '05 Proceedings of the 22nd international conference on Machine learning
Learning to rank: from pairwise approach to listwise approach

Proceedings of the 24th international conference on Machine learning
Relevance assessment: are judges exchangeable and does it matter

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
BoltzRank: learning to maximize expected ranking gain

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Learning to Rank for Information Retrieval

Foundations and Trends in Information Retrieval
Improving quality of training data for learning to rank using click-through data

Proceedings of the third ACM international conference on Web search and data mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper studies how to learn accurate ranking functions from noisy training data for information retrieval. Most previous work on learning to rank assumes that the relevance labels in the training data are reliable. In reality, however, the labels usually contain noise due to the difficulties of relevance judgments and several other reasons. To tackle the problem, in this paper we propose a novel approach to learning to rank, based on a probabilistic graphical model. Considering that the observed label might be noisy, we introduce a new variable to indicate the true label of each instance. We then use a graphical model to capture the joint distribution of the true labels and observed labels given features of documents. The graphical model distinguishes the true labels from observed labels, and is specially designed for ranking in information retrieval. Therefore, it helps to learn a more accurate model from noisy training data. Experiments on a real dataset for web search show that the proposed approach can significantly outperform previous approaches.