A model-based approach to attributed graph clustering

  • Authors:
  • Zhiqiang Xu;Yiping Ke;Yi Wang;Hong Cheng;James Cheng

  • Affiliations:
  • Nanyang Technological University, Singapore, Singapore;Institute of High Performance Computing, Singapore, Singapore;National University of Singapore, Singapore, Singapore;The Chinese University of Hong Kong, Hong Kong, China;Nanyang Technological University, Singapore, Singapore

  • Venue:
  • SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Graph clustering, also known as community detection, is a long-standing problem in data mining. However, with the proliferation of rich attribute information available for objects in real-world graphs, how to leverage structural and attribute information for clustering attributed graphs becomes a new challenge. Most existing works take a distance-based approach. They proposed various distance measures to combine structural and attribute information. In this paper, we consider an alternative view and propose a model-based approach to attributed graph clustering. We develop a Bayesian probabilistic model for attributed graphs. The model provides a principled and natural framework for capturing both structural and attribute aspects of a graph, while avoiding the artificial design of a distance measure. Clustering with the proposed model can be transformed into a probabilistic inference problem, for which we devise an efficient variational algorithm. Experimental results on large real-world datasets demonstrate that our method significantly outperforms the state-of-art distance-based attributed graph clustering method.