Noisy sorting without resampling

Authors:
Mark Braverman;Elchanan Mossel
Affiliations:
C.S. University of Toronto;U.C. Berkeley
Venue:
Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
Year:
2008

Citing 7
Cited 9

Computing with unreliable information

STOC '90 Proceedings of the twenty-second annual ACM symposium on Theory of computing
Introduction to coding and information theory

Introduction to coding and information theory
Aggregating inconsistent information: ranking and clustering

Proceedings of the thirty-seventh annual ACM symposium on Theory of computing
Ordering by weighted number of wins gives a good ranking for weighted tournaments

SODA '06 Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm
Ranking Tournaments

SIAM Journal on Discrete Mathematics
How to rank with few errors

Proceedings of the thirty-ninth annual ACM symposium on Theory of computing
Noisy binary search and its applications

SODA '07 Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms

Correlation clustering with noisy input

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
Ranking from pairs and triplets: information quality, evaluation methods and query complexity

Proceedings of the fourth ACM international conference on Web search and data mining
Tolerant algorithms

ESA'11 Proceedings of the 19th European conference on Algorithms
Randomized Shellsort: A Simple Data-Oblivious Sorting Algorithm

Journal of the ACM (JACM)
An active learning algorithm for ranking from pairwise preferences with an almost optimal query complexity

The Journal of Machine Learning Research
Manipulating stochastically generated single-elimination tournaments for nearly all players

WINE'11 Proceedings of the 7th international conference on Internet and Network Economics
Rigging tournament brackets for weaker players

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Sorting noisy data with partial information

Proceedings of the 4th conference on Innovations in Theoretical Computer Science
When do noisy votes reveal the truth?

Proceedings of the fourteenth ACM conference on Electronic commerce

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we study noisy sorting without re-sampling. In this problem there is an unknown order {display equation} where π is a permutation on n elements. The input is the status of (n2) queries of the form q(ai, aj), for i j, where q(ai, aj) = + (-) with probability 1/2 + γ if π(i) π(j)(π(i) j)) for all pairs i ≠ j, where γ 0 is a constant. It is assumed that the errors are independent. Given the status of the queries the goal is to find the maximum likelihood order. In other words, the goal is find a permutation σ that minimizes the number of pairs σ(i) σ(j) where q(σ(i), σ(j)) = -. The problem so defined is the feedback arc set problem on distributions of inputs, each of which is a tournament obtained as a noisy perturbation of a linear order. Note that when γ n is large, it is impossible to recover the original order π. It is known that the weighted feedback arc set problem on tournaments is NP-hard in general. Here we present an algorithm of running time nO(γ-4)) and sampling complexity Oγ (n log n) that with high probability solves the noisy sorting without re-sampling problem. We also show that if aσ(1), aσ(2), …, aσ(n) is an optimal solution of the problem then it is "close" to the original order. More formally, with high probability it holds that {display equation}. Our results are of interest in applications to ranking, such as ranking in sports, or ranking of search items based on comparisons by experts.