Kantorovich distances between rankings with applications to rank aggregation

  • Authors:
  • Stéphan Clémençon;Jérémie Jakubowicz

  • Affiliations:
  • LTCI, Telecom Paristech, TSI, UMR Institut Telecom, CNRS;LTCI, Telecom Paristech, TSI, UMR Institut Telecom, CNRS

  • Venue:
  • ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The goal of this paper is threefold. It first describes a novel way of measuring disagreement between rankings of a finite set χ of n ≥ 1 elements, that can be viewed as a (mass transportation) Kantorovich metric, once the collection rankings of χ is embedded in the set κn of n× n doubly-stochastic matrices. It also shows that such an embedding makes it possible to define a natural notion of median, that can be interpreted in a probabilistic fashion. In addition, from a computational perspective, the convexification induced by this approach makes median computation more tractable, in contrast to the standard metric-based method that generally yields NP-hard optimization problems. As an illustration, this novel methodology is applied to the issue of ranking aggregation, and is shown to compete with state of the art techniques.