Semi-supervised graph clustering: a kernel approach

Authors:
Brian Kulis;Sugato Basu;Inderjit Dhillon;Raymond Mooney
Affiliations:
University of Texas at Austin, Austin, TX;University of Texas at Austin, Austin, TX;University of Texas at Austin, Austin, TX;University of Texas at Austin, Austin, TX
Venue:
ICML '05 Proceedings of the 22nd international conference on Machine learning
Year:
2005

Citing 9
Cited 62

An introduction to support Vector Machines: and other kernel-based learning methods

An introduction to support Vector Machines: and other kernel-based learning methods
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Correlation Clustering

FOCS '02 Proceedings of the 43rd Symposium on Foundations of Computer Science
Constrained K-means Clustering with Background Knowledge

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data Clustering

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Segmentation Given Partial Grouping Constraints

IEEE Transactions on Pattern Analysis and Machine Intelligence
A probabilistic framework for semi-supervised clustering

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Kernel k-means: spectral clustering and normalized cuts

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Spectral learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence

Learning low-rank kernel matrices

ICML '06 Proceedings of the 23rd international conference on Machine learning
Segmenting Customers from Population to Individuals: Does 1-to-1 Keep Your Customers Forever?

IEEE Transactions on Knowledge and Data Engineering
Revisiting probabilistic models for clustering with pair-wise constraints

Proceedings of the 24th international conference on Machine learning
BoostCluster: boosting clustering by pairwise constraints

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A spectral clustering approach to optimally combining numericalvectors with a modular network

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A survey of kernel and spectral methods for clustering

Pattern Recognition
Deriving semantics for image clustering from accumulated user feedbacks

Proceedings of the 15th international conference on Multimedia
A clustering framework based on subjective and objective validity criteria

ACM Transactions on Knowledge Discovery from Data (TKDD)
Repairing self-confident active-transductive learners using systematic exploration

Pattern Recognition Letters
Joint cluster analysis of attribute data and relationship data: The connected k-center problem, algorithms and applications

ACM Transactions on Knowledge Discovery from Data (TKDD)
Towards effective document clustering: A constrained K-means based approach

Information Processing and Management: an International Journal
Pairwise constraint propagation by semidefinite programming for semi-supervised classification

Proceedings of the 25th international conference on Machine learning
Improving supervised learning performance by using fuzzy clustering method to select training data

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Fuzzy theory and technology with applications
Non-negative matrix factorization for semi-supervised data clustering

Knowledge and Information Systems
Semi-supervised graph clustering: a kernel approach

Machine Learning
Clustering with Feature Order Preferences

PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Kernel-Based Transductive Learning with Nearest Neighbors

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
A Learning Algorithm for the Optimum-Path Forest Classifier

GbRPR '09 Proceedings of the 7th IAPR-TC-15 International Workshop on Graph-Based Representations in Pattern Recognition
Semi-supervised Document Clustering with Simultaneous Text Representation and Categorization

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Applying Electromagnetic Field Theory Concepts to Clustering with Constraints

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Semi-supervised learning by mixed label propagation

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Metric learning for semi-supervised clustering using pairwise constraints and the geometrical structure of data

Intelligent Data Analysis
Semi-supervised clustering with metric learning: An adaptive kernel method

Pattern Recognition
A multiobjective simultaneous learning framework for clustering and classification

IEEE Transactions on Neural Networks
Non-linear metric learning using pairwise similarity and dissimilarity constraints and the geometrical structure of data

Pattern Recognition
Clustering with feature order preferences

Intelligent Data Analysis - Artificial Intelligence
Accelerating spectral clustering with partial supervision

Data Mining and Knowledge Discovery
Data clustering with size constraints

Knowledge-Based Systems
Mining networks with shared items

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Constrained spectral clustering via exhaustive and efficient constraint propagation

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Semi-supervised projection clustering with transferred centroid regularization

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part III
Semisupervised kernel matrix learning by kernel propagation

IEEE Transactions on Neural Networks
Graph-based clustering with constraints

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Constraint selection for semi-supervised topological clustering

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Multi-modal constraint propagation for heterogeneous image clustering

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multiple-Instance learning via random walk

ECML'06 Proceedings of the 17th European conference on Machine Learning
Subspace metric ensembles for semi-supervised clustering of high dimensional data

ECML'06 Proceedings of the 17th European conference on Machine Learning
An adaptive kernel method for semi-supervised clustering

ECML'06 Proceedings of the 17th European conference on Machine Learning
Semi-supervised clustering of graph objects: a subgraph mining approach

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Two-stage nonparametric kernel leaning: From label propagation to kernel propagation

Neurocomputing
Semi-supervised learning with mixed knowledge information

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Integrating meta-path selection with user-guided object clustering in heterogeneous information networks

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Active co-analysis of a set of shapes

ACM Transactions on Graphics (TOG) - Proceedings of ACM SIGGRAPH Asia 2012
A general approach for adaptive kernels in semi-supervised clustering

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Semi-supervised fuzzy clustering with metric learning and entropy regularization

Knowledge-Based Systems
Hypergraph based information-theoretic feature selection

Pattern Recognition Letters
Linear semi-supervised projection clustering by transferred centroid regularization

Journal of Intelligent Information Systems
Semi-intrinsic mean shift on riemannian manifolds

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Discovering latent domains for multisource domain adaptation

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Hypergraph learning with hyperedge expansion

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Fuzzy semi-supervised co-clustering for text documents

Fuzzy Sets and Systems
Semi-supervised learning with nuclear norm regularization

Pattern Recognition
Enhancing expression recognition in the wild with unlabeled reference data

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Learning mid-perpendicular hyperplane similarity from cannot-link constraints

Neurocomputing
PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks

ACM Transactions on Knowledge Discovery from Data (TKDD) - Special Issue on ACM SIGKDD 2012
On Knowledge-Enhanced Document Clustering

International Journal of Information Retrieval Research
On constrained spectral clustering and its applications

Data Mining and Knowledge Discovery
Semi-supervised clustering of large data sets with kernel methods

Pattern Recognition Letters
A new interactive semi-supervised clustering model for large image database indexing

Pattern Recognition Letters
Change detection in remotely sensed images using semi-supervised clustering algorithms

International Journal of Knowledge Engineering and Soft Data Paradigms
Exploiting small world property for network clustering

World Wide Web
Pairwise constrained concept factorization for data representation

Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

Semi-supervised clustering algorithms aim to improve clustering results using limited supervision. The supervision is generally given as pairwise constraints; such constraints are natural for graphs, yet most semi-supervised clustering algorithms are designed for data represented as vectors. In this paper, we unify vector-based and graph-based approaches. We show that a recently-proposed objective function for semi-supervised clustering based on Hidden Markov Random Fields, with squared Euclidean distance and a certain class of constraint penalty functions, can be expressed as a special case of the weighted kernel k-means objective. A recent theoretical connection between kernel k-means and several graph clustering objectives enables us to perform semi-supervised clustering of data given either as vectors or as a graph. For vector data, the kernel approach also enables us to find clusters with non-linear boundaries in the input data space. Furthermore, we show that recent work on spectral learning (Kamvar et al., 2003) may be viewed as a special case of our formulation. We empirically show that our algorithm is able to outperform current state-of-the-art semi-supervised algorithms on both vector-based and graph-based data sets.