An alternative ranking problem for search engines

Authors:
Corinna Cortes;Mehryar Mohri;Ashish Rastogi
Affiliations:
Google Research, New York, NY;Courant Institute of Mathematical Sciences and Google Research, New York, NY;Courant Institute of Mathematical Sciences, New York, NY
Venue:
WEA'07 Proceedings of the 6th international conference on Experimental algorithms
Year:
2007

Citing 5
Cited 4

An Efficient Boosting Algorithm for Combining Preferences

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Stability and generalization

The Journal of Machine Learning Research
New approaches to support vector ordinal regression

ICML '05 Proceedings of the 22nd international conference on Machine learning
Stability and generalization of bipartite ranking algorithms

COLT'05 Proceedings of the 18th annual conference on Learning Theory
Margin-Based ranking meets boosting in the middle

COLT'05 Proceedings of the 18th annual conference on Learning Theory

An efficient algorithm for learning to rank from preference graphs

Machine Learning
An experimental comparison of cross-validation techniques for estimating the area under the ROC curve

Computational Statistics & Data Analysis
Graph-based alignment of narratives for automated neurological assessment

BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
Learning performance of coefficient-based regularized ranking

Neurocomputing

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper examines in detail an alternative ranking problem for search engines, movie recommendation, and other similar ranking systems motivated by the requirement to not just accurately predict pairwise ordering but also preserve the magnitude of the preferences or the difference between ratings. We describe and analyze several cost functions for this learning problem and give stability bounds for their generalization error, extending previously known stability results to nonbipartite ranking and magnitude of preference-preserving algorithms. We present algorithms optimizing these cost functions, and, in one instance, detail both a batch and an on-line version. For this algorithm, we also show how the leave-one-out error can be computed and approximated efficiently, which can be used to determine the optimal values of the trade-off parameter in the cost function. We report the results of experiments comparing these algorithms on several datasets and contrast them with those obtained using an AUC-maximization algorithm. We also compare training times and performance results for the on-line and batch versions, demonstrating that our on-line algorithm scales to relatively large datasets with no significant loss in accuracy.