A probabilistic framework for relational clustering

Authors: Bo Long; Zhongfei Mark Zhang; Philip S. Yu
Affiliations: SUNY Binghamton; SUNY Binghamton; IBM Watson Research Center
Venue: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Year: 2007



Abstract

Relational clustering has attracted growing attention because of its impact on applications that involve multi-type interrelated data objects, such as Web mining, search marketing, bioinformatics, citation analysis, and epidemiology. In this paper, we propose a probabilistic model for relational clustering, which also provides a principled framework to unify various important clustering tasks, including traditional attribute-based clustering, semi-supervised clustering, co-clustering, and graph clustering. The proposed model seeks to identify cluster structures for each type of data object and interaction patterns between different types of objects. Under this model, we propose parametric hard and soft relational clustering algorithms under a large number of exponential family distributions. The algorithms are applicable to relational data of various structures and at the same time unify a number of state-of-the-art clustering algorithms: co-clustering algorithms, k-partite graph clustering, Bregman k-means, and semi-supervised clustering based on hidden Markov random fields.
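
To make the idea of "cluster structures for each type of object plus interaction patterns between types" concrete, the following is a minimal sketch of hard co-clustering on a single relation matrix between two object types. It is not the paper's algorithm. It assumes the Gaussian case (squared Euclidean distance, one member of the exponential family), and the names `block_means`, `hard_coclusters`, `k_r`, and `k_c` are hypothetical, chosen for illustration only. Row and column labels play the role of the cluster structures, and the small matrix `B` of block means plays the role of the interaction pattern.

```python
import numpy as np

def block_means(R, rows, cols, k_r, k_c):
    """Mean of R inside each (row-cluster, column-cluster) block."""
    B = np.zeros((k_r, k_c))
    for p in range(k_r):
        for q in range(k_c):
            block = R[np.ix_(rows == p, cols == q)]
            if block.size:  # empty blocks keep the default value 0
                B[p, q] = block.mean()
    return B

def hard_coclusters(R, k_r, k_c, n_iter=20, seed=0):
    """Alternate between updating block means and hard reassignments."""
    rng = np.random.default_rng(seed)
    rows = rng.integers(k_r, size=R.shape[0])  # random initial row labels
    cols = rng.integers(k_c, size=R.shape[1])  # random initial column labels
    for _ in range(n_iter):
        B = block_means(R, rows, cols, k_r, k_c)
        # Cost of placing each row in each row cluster, columns held fixed.
        row_cost = ((R[:, None, :] - B[:, cols][None, :, :]) ** 2).sum(axis=2)
        rows = row_cost.argmin(axis=1)
        B = block_means(R, rows, cols, k_r, k_c)
        # Cost of placing each column in each column cluster, rows held fixed.
        col_cost = ((R.T[:, None, :] - B[rows, :].T[None, :, :]) ** 2).sum(axis=2)
        cols = col_cost.argmin(axis=1)
    return rows, cols, block_means(R, rows, cols, k_r, k_c)

# Toy relation: 6 objects of one type, 8 of another, two dense blocks.
R = np.kron(np.array([[5.0, 0.0], [0.0, 5.0]]), np.ones((3, 4)))
rows, cols, B = hard_coclusters(R, k_r=2, k_c=2)
approx = B[np.ix_(rows, cols)]  # block-constant approximation of R
```

Each step can only lower the squared reconstruction error, so the procedure converges to a local optimum that depends on the random initialization. Replacing the squared distance with another Bregman divergence, or the hard `argmin` with posterior weights, would be the natural directions toward the other distributions and the soft variants that the abstract mentions.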