LEGClust—A Clustering Algorithm Based on Layered Entropic Subgraphs

Authors:
Jorge M. Santos;Joaquim Marques de Sa;Luis A. Alexandre
Affiliations:
-;-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2008

Citing 27
Cited 2

Algorithms for clustering data

Algorithms for clustering data
Min-cut clustering

Mathematical Programming: Series A and B
BIRCH: an efficient data clustering method for very large databases

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
CURE: an efficient clustering algorithm for large databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
An algorithm for clustering cDNAs for gene expression analysis

RECOMB '99 Proceedings of the third annual international conference on Computational molecular biology
Entropy-based subspace clustering for mining numerical data

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Data clustering: a review

ACM Computing Surveys (CSUR)
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
ROCK: a robust clustering algorithm for categorical attributes

Information Systems
A clustering algorithm based on graph connectivity

Information Processing Letters
Information Theoretic Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
Mean Shift: A Robust Approach Toward Feature Space Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence
BIRCH: A New Data Clustering Algorithm and Its Applications

Data Mining and Knowledge Discovery
Chameleon: Hierarchical Clustering Using Dynamic Modeling

Computer
An Optimal Graph Theoretic Approach to Data Clustering: Theory and Its Application to Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Mean Shift, Mode Seeking, and Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
Interpreting and Extending Classical Agglomerative Clustering Algorithms using a Model-Based approach

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
A Min-max Cut Algorithm for Graph Partitioning and Data Clustering

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Path-Based Clustering for Grouping of Smooth Curves and Texture Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
On clusterings-good, bad and spectral

FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Mean Shift Analysis and Applications

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Minimum Entropy Clustering and Applications to Gene Expression Analysis

CSB '04 Proceedings of the 2004 IEEE Computational Systems Bioinformatics Conference
Landscape of Clustering Algorithms

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Training MLPs layer-by-layer with the information potential

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 02
Maximum within-cluster association

Pattern Recognition Letters
Neural networks trained with the EEM algorithm: tuning the smoothing parameter

NN'05 Proceedings of the 6th WSEAS international conference on Neural networks
UBIRIS: a noisy iris image database

ICIAP'05 Proceedings of the 13th international conference on Image Analysis and Processing

The mee principle in data classification: A perceptron-based analysis

Neural Computation
Feature selection using genetic algorithm and cluster validation

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.15

Visualization

Abstract

Hierarchical clustering is a stepwise clustering method usually based on proximity measures between objects or sets of objects from a given data set. The most common proximity measures are distance measures. The derived proximity matrices can be used to build graphs, which provide the basic structure for some clustering methods. We present here a new proximity matrix based on an entropic measure and also a clustering algorithm (LEGClust) that builds layers of subgraphs based on this matrix, and uses them and a hierarchical agglomerative clustering technique to form the clusters. Our approach capitalizes on both a graph structure and a hierarchical construction. Moreover, by using entropy as a proximity measure we are able, with no assumption about the cluster shapes, to capture the local structure of the data, forcing the clustering method to reflect this structure. We present several experiments on artificial and real data sets that provide evidence on the superior performance of this new algorithm when compared with competing ones.