Relational clustering by symmetric convex coding

  • Authors:
  • Bo Long;Zhongfei (Mark) Zhang;Xiaoyun Wu;Philip S. Yu

  • Affiliations:
  • Suny Binghamton, Binghamton, NY;Suny Binghamton, Binghamton, NY;Google Inc, Mountain View, CA;IBM Watson Research Center, Hawthorne, NY

  • Venue:
  • Proceedings of the 24th international conference on Machine learning
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Relational data appear frequently in many machine learning applications. Relational data consist of the pairwise relations (similarities or dissimilarities) between each pair of implicit objects, and are usually stored in relation matrices and typically no other knowledge is available. Although relational clustering can be formulated as graph partitioning in some applications, this formulation is not adequate for general relational data. In this paper, we propose a general model for relational clustering based on symmetric convex coding. The model is applicable to all types of relational data and unifies the existing graph partitioning formulation. Under this model, we derive two alternative bound optimization algorithms to solve the symmetric convex coding under two popular distance functions, Euclidean distance and generalized I-divergence. Experimental evaluation and theoretical analysis show the effectiveness and great potential of the proposed model and algorithms.