A sample-based hierarchical adaptive K-means clustering method for large-scale video retrieval

Authors:
Kaiyang Liao;Guizhong Liu;Li Xiao;Chaoteng Liu
Venue:
Knowledge-Based Systems
Year:
2013

Abstract

Finding useful patterns in large datasets has attracted considerable interest in recent years, and one of the most widely studied problems in this area is identifying clusters in a multi-dimensional dataset. This paper introduces a sample-based hierarchical adaptive K-means (SHAKM) clustering algorithm for large-scale video retrieval. To handle large databases efficiently, SHAKM employs a multilevel random sampling strategy. It uses an adaptive K-means clustering algorithm to determine the correct number of clusters and to construct an unbalanced cluster tree, and a fast labelling scheme to assign each pattern in the dataset to its closest cluster. The method is evaluated on several datasets, and the results show that SHAKM is fast and effective on very large datasets. The results also demonstrate that the proposed method can be used efficiently and successfully in a content-based video copy detection project.
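
The abstract does not give algorithmic details, so the following is only a minimal sketch of the general sample-then-label idea it describes: cluster a random sample, then assign every pattern in the full dataset to its closest centroid. It is not the SHAKM algorithm itself. It uses plain K-means with a fixed k on a single sample, whereas the paper describes multilevel sampling, an adaptive choice of the number of clusters, and an unbalanced cluster tree. The function names (`kmeans`, `sample_then_label`) and the parameters (`sample_size`, `iters`, `seed`) are hypothetical and chosen for illustration.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centroids):
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))


def kmeans(points, k, iters=20, seed=0):
    """Plain K-means with a fixed k (assumption: NOT the paper's adaptive variant)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids drawn from the data
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for i, g in enumerate(groups):
            if g:  # an empty cluster keeps its previous centroid
                centroids[i] = tuple(sum(c) / len(g) for c in zip(*g))
    return centroids


def sample_then_label(points, k, sample_size, seed=0):
    """Cluster a random sample, then label every point in the full dataset."""
    rng = random.Random(seed)
    sample = rng.sample(points, min(sample_size, len(points)))
    centroids = kmeans(sample, k, seed=seed)
    labels = [nearest(p, centroids) for p in points]  # brute-force labelling pass
    return centroids, labels


# Toy data (hypothetical): two well-separated groups of 2-D points.
group_a = [(float(i % 10), float(i // 10)) for i in range(100)]
group_b = [(x + 100.0, y + 100.0) for x, y in group_a]
centroids, labels = sample_then_label(group_a + group_b, k=2, sample_size=40)
print(len(centroids), len(labels))
```

In this sketch, only the sample is passed through the iterative clustering step, so the cost of the K-means iterations depends on `sample_size` rather than on the size of the full dataset; the full dataset is touched once, in the labelling pass.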