On Equivalence Relationships Between Classification and Ranking Algorithms

Authors:
Şeyda Ertekin;Cynthia Rudin
Affiliations:
-;-
Venue:
The Journal of Machine Learning Research
Year:
2011

Citing 15
Cited 1

A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
Soft Margins for AdaBoost

Machine Learning
Logistic Regression, AdaBoost and Bregman Distances

Machine Learning
AdaCost: Misclassification Cost-Sensitive Boosting

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
A Geometric Approach to Leveraging Weak Learners

EuroCOLT '99 Proceedings of the 4th European Conference on Computational Learning Theory
An efficient boosting algorithm for combining preferences

The Journal of Machine Learning Research
Learning to rank using gradient descent

ICML '05 Proceedings of the 22nd international conference on Machine learning
An empirical comparison of supervised learning algorithms

ICML '06 Proceedings of the 23rd international conference on Machine learning
Boosted Classification Trees and Class Probability/Quantile Estimation

The Journal of Machine Learning Research
Cost-sensitive boosting for classification of imbalanced data

Pattern Recognition
Robust reductions from ranking to classification

Machine Learning
Multi-class cost-sensitive boosting with p-norm loss functions

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Margin-based Ranking and an Equivalence between AdaBoost and RankBoost

The Journal of Machine Learning Research
The P-Norm Push: A Simple Convex Ranking Algorithm that Concentrates at the Top of the List

The Journal of Machine Learning Research
No free lunch theorems for optimization

IEEE Transactions on Evolutionary Computation

Editorial: Parameter-free classification in multi-class imbalanced data sets

Data & Knowledge Engineering

Quantified Score

Hi-index	0.01

Visualization

Abstract

We demonstrate that there are machine learning algorithms that can achieve success for two separate tasks simultaneously, namely the tasks of classification and bipartite ranking. This means that advantages gained from solving one task can be carried over to the other task, such as the ability to obtain conditional density estimates, and an order-of-magnitude reduction in computational time for training the algorithm. It also means that some algorithms are robust to the choice of evaluation metric used; they can theoretically perform well when performance is measured either by a misclassification error or by a statistic of the ROC curve (such as the area under the curve). Specifically, we provide such an equivalence relationship between a generalization of Freund et al.'s RankBoost algorithm, called the "P-Norm Push," and a particular cost-sensitive classification algorithm that generalizes AdaBoost, which we call "P-Classification." We discuss and validate the potential benefits of this equivalence relationship, and perform controlled experiments to understand P-Classification's empirical performance. There is no established equivalence relationship for logistic regression and its ranking counterpart, so we introduce a logistic-regression-style algorithm that aims in between classification and ranking, and has promising experimental performance with respect to both tasks.