Algorithms for clustering data
Algorithms for clustering data
Normalized Cuts and Image Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Introduction to Algorithms
IEEE Transactions on Pattern Analysis and Machine Intelligence
Path-Based Clustering for Grouping of Smooth Curves and Texture Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Machine Learning
k-Edge Connected Neighborhood Graph for Geodesic Distance Estimation and Nonlinear Data Projection
ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Combining Multiple Clusterings Using Evidence Accumulation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Clustering with a minimum spanning tree of scale-free-like structure
Pattern Recognition Letters
Clustering Ensembles: Models of Consensus and Weak Partitions
IEEE Transactions on Pattern Analysis and Machine Intelligence
Fast Agglomerative Clustering Using a k-Nearest Neighbor Graph
IEEE Transactions on Pattern Analysis and Machine Intelligence
ACM Transactions on Knowledge Discovery from Data (TKDD)
A clustering algorithm based on maximal θ-distant subtrees
Pattern Recognition
Engineering graph clustering: Models and experimental evaluation
Journal of Experimental Algorithmics (JEA)
Robust path-based spectral clustering
Pattern Recognition
Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters
IEEE Transactions on Computers
Cumulative Voting Consensus Method for Partitions with Variable Number of Clusters
IEEE Transactions on Pattern Analysis and Machine Intelligence
An improved algorithm for clustering gene expression data
Bioinformatics
Clustering high dimensional data: A graph-based relaxed optimization approach
Information Sciences: an International Journal
An Evolutionary Approach to Multiobjective Clustering
IEEE Transactions on Evolutionary Computation
New spectral methods for ratio cut partitioning and clustering
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Survey of clustering algorithms
IEEE Transactions on Neural Networks
Scalable parallel minimum spanning forest computation
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Biometric identification based on the eye movements and graph matching techniques
Pattern Recognition Letters
Dense subgraph mining with a mixed graph model
Pattern Recognition Letters
Clustering based on a near neighbor graph and a grid cell graph
Journal of Intelligent Information Systems
How Many Clusters: A Validation Index for Arbitrary-Shaped Clusters
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Enhancing minimum spanning tree-based clustering by removing density-based outliers
Digital Signal Processing
Hi-index | 0.01 |
Many clustering approaches have been proposed in the literature, but most of them are vulnerable to the different cluster sizes, shapes and densities. In this paper, we present a graph-theoretical clustering method which is robust to the difference. Based on the graph composed of two rounds of minimum spanning trees (MST), the proposed method (2-MSTClus) classifies cluster problems into two groups, i.e. separated cluster problems and touching cluster problems, and identifies the two groups of cluster problems automatically. It contains two clustering algorithms which deal with separated clusters and touching clusters in two phases, respectively. In the first phase, two round minimum spanning trees are employed to construct a graph and detect separated clusters which cover distance separated and density separated clusters. In the second phase, touching clusters, which are subgroups produced in the first phase, can be partitioned by comparing cuts, respectively, on the two round minimum spanning trees. The proposed method is robust to the varied cluster sizes, shapes and densities, and can discover the number of clusters. Experimental results on synthetic and real datasets demonstrate the performance of the proposed method.