Distribution-based similarity measures for multi-dimensional point set retrieval applications

Authors:
Jie Shao;Zi Huang;Heng Tao Shen;Jialie Shen;Xiaofang Zhou
Affiliations:
The University of Queensland, Brisbane, Australia;The University of Queensland, Brisbane, Australia;The University of Queensland, Brisbane, Australia;Singapore Management University, Singapore, Singapore;The University of Queensland, Brisbane, Australia
Venue:
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Year:
2008

Citing 19
Cited 1

Two-sample test statistics for measuring discrepancies between two multivariate probability density functions using kernel-based density estimates

Journal of Multivariate Analysis
Nonlinear Modeling of Scattered Multivariate Data and Its Application to Shape Change

IEEE Transactions on Pattern Analysis and Machine Intelligence
Classification with Nonmetric Distances: Image Retrieval and Class Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multispace KL for Pattern Representation and Classification

IEEE Transactions on Pattern Analysis and Machine Intelligence
Empirical evaluation of dissimilarity measures for color and texture

Computer Vision and Image Understanding - Special issue on empirical evaluation of computer vision algorithms
Non-parametric Similarity Measures for Unsupervised Texture Segmentation and Image Retrieval

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Similarity Search for Multidimensional Data Sequences

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Discovering Similar Multidimensional Trajectories

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
On the influence of the kernel on the consistency of support vector machines

The Journal of Machine Learning Research
Segmentation and recognition of multi-attribute motion sequences

Proceedings of the 12th annual ACM international conference on Multimedia
Temporal classification: extending the classification paradigm to multivariate time series

Temporal classification: extending the classification paradigm to multivariate time series
Robust and fast similarity search for moving object trajectories

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Integrating structured biological data by Kernel Maximum Mean Discrepancy

Bioinformatics
Multivariate image similarity in the compressed domain using statistical graph matching

Pattern Recognition
An efficient k nearest neighbor search for multivariate time series

Information and Computation
Exact indexing of dynamic time warping

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Statistical summarization of content features for fast near-duplicate video detection

Proceedings of the 15th international conference on Multimedia
Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning

IEEE Transactions on Multimedia
On the asymptotic properties of a nonparametric L1-test statistic of homogeneity

IEEE Transactions on Information Theory

A Visual Interactive System for Spatial Querying and Ranking of Geographic Regions

Proceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies

Quantified Score

Hi-index	0.01

Visualization

Abstract

Effective and efficient method of similarity assessment continues to be one of the most fundamental problems in multimedia data analysis. In case of retrieving relevant items from a collection of objects based on series of multivariate observations (e.g., searching the similar video clips in a repository to a query example), satisfactory performance cannot be expected using many conventional similarity measures based on the aggregation of element pairwise comparisons. Some correlation information among the individual elements has also been investigated to characterize each set of multi-dimensional points for ranked retrieval, by making use of an unwarranted assumption that the underlying data distribution has a particular parametric form. Motivated by this observation, this paper introduces a novel collective gauge of relevance ranking by evaluating the probabilities that point sets are consistent with the same distribution of the query. Two non-parametric hypothesis tests in statistics are justified to exploit the distributional discrepancy of samples for assessing the similarity between two ensembles of points. While our methodology is mainly presented in the context of video similarity search, it enjoys great flexibility and can be easily adapted to other applications involving generic multi-dimensional point set representation for each object such as human gesture recognition.