Kernel k-means: spectral clustering and normalized cuts

  • Authors: Inderjit S. Dhillon, Yuqiang Guan, Brian Kulis
  • Affiliation: University of Texas at Austin, Austin, TX (all authors)

  • Venue: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
  • Year: 2004

Abstract

Kernel k-means and spectral clustering have both been used to identify clusters that are non-linearly separable in input space. Despite significant research, these methods have remained only loosely related. In this paper, we give an explicit theoretical connection between them. We show the generality of the weighted kernel k-means objective function, and derive the spectral clustering objective of normalized cut as a special case. Given a positive definite similarity matrix, our results lead to a novel weighted kernel k-means algorithm that monotonically decreases the normalized cut. This has important implications: a) eigenvector-based algorithms, which can be computationally prohibitive, are not essential for minimizing normalized cuts, b) various techniques, such as local search and acceleration schemes, may be used to improve the quality as well as speed of kernel k-means. Finally, we present results on several interesting data sets, including diametrical clustering of large gene-expression matrices and a handwriting recognition data set.
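
As a concrete illustration of the construction described in the abstract, the sketch below runs weighted kernel k-means directly on a similarity matrix, with point weights and kernel chosen so that the objective corresponds to normalized cut. This is a minimal sketch, not the authors' reference implementation: the names `ncut_kernel` and `weighted_kernel_kmeans`, the diagonal shift `sigma`, and the empty-cluster handling are illustrative assumptions, and the monotone decrease of the normalized cut is only guaranteed when the shifted kernel is positive definite.

```python
# Hedged sketch of weighted kernel k-means for normalized cut.
# Assumptions: A is a symmetric non-negative affinity matrix; sigma is large
# enough to make the kernel positive definite; names are illustrative only.
import numpy as np

def ncut_kernel(A, sigma=1.0):
    """Weights and kernel so that weighted kernel k-means tracks normalized cut.

    Uses degrees d = A @ 1 as point weights and the kernel
    K = sigma * D^{-1} + D^{-1} A D^{-1}.
    """
    d = A.sum(axis=1)                          # node degrees, used as weights
    Dinv = 1.0 / d
    K = sigma * np.diag(Dinv) + (Dinv[:, None] * A) * Dinv[None, :]
    return K, d

def weighted_kernel_kmeans(K, w, k, n_iter=100, seed=0):
    """Assign each point to the cluster whose weighted mean in feature space
    is closest, computing all distances from kernel values only."""
    rng = np.random.default_rng(seed)
    n = K.shape[0]
    labels = rng.integers(k, size=n)
    diagK = np.diag(K)
    for _ in range(n_iter):
        dist = np.empty((n, k))
        for c in range(k):
            idx = labels == c
            wc = w[idx]
            sc = wc.sum()
            if sc == 0:                        # emptied cluster: never chosen
                dist[:, c] = np.inf
                continue
            # ||phi(a) - m_c||^2 = K_aa - 2 * sum_b w_b K_ab / s_c
            #                      + sum_{b,b'} w_b w_b' K_bb' / s_c^2
            second = K[:, idx] @ wc / sc
            third = wc @ K[np.ix_(idx, idx)] @ wc / sc**2
            dist[:, c] = diagK - 2.0 * second + third
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```

Given an affinity matrix `A` and a desired number of clusters `k`, one would call `K, d = ncut_kernel(A)` and then `weighted_kernel_kmeans(K, d, k)`; under the positive-definiteness assumption above, each iteration does not increase the normalized cut objective, without any eigenvector computation.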