Efficient clustering earth mover's distance

Authors:
Jenny Wagner;Björn Ommer
Affiliations:
Interdisciplinary Center for Scientific Computing, Heidelberg University, Germany;ommer@uni-heidelberg.de
Venue:
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Year:
2010

Citing 9
Cited 0

A Unified Approach to the Change of Resolution: Space and Gray-Level

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Earth Mover's Distance as a Metric for Image Retrieval

International Journal of Computer Vision
"GrabCut": interactive foreground extraction using iterated graph cuts

ACM SIGGRAPH 2004 Papers
Smooth minimization of non-smooth functions

Mathematical Programming: Series A and B
Spectral Segmentation with Multiscale Graph Decomposition

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
A Unifying Approach to Hard and Probabilistic Clustering

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Dominant Sets and Pairwise Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
An Efficient Earth Mover's Distance Algorithm for Robust Histogram Comparison

IEEE Transactions on Pattern Analysis and Machine Intelligence
Fast evolutionary maximum margin clustering

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

The two-class clustering problem is formulated as an integer convex optimisation problem which determines the maximum of the Earth Movers Distance (EMD) between two classes, constructing a bipartite graph with minimum flow and maximum inter-class EMD between two sets. Subsequently including the nearest neighbours of the start point in feature space and calculating the EMD for this labellings quickly converges to a robust optimum. A histogram of grey values with the number of bins b as the only parameter is used as feature, which makes run time complexity independent of the number of pixels. After convergence in O(b) steps, spatial correlations can be taken into account by total variational smoothing. Testing the algorithm on real world images from commonly used databases reveals that it is competitive to state-of-theart methods, while it deterministically yields hard assignments without requiring any a priori knowledge of the input data or similarity matrices to be calculated.