Applying Electromagnetic Field Theory Concepts to Clustering with Constraints

  • Authors:
  • Huseyin Hakkoymaz;Georgios Chatzimilioudis;Dimitrios Gunopulos;Heikki Mannila

  • Affiliations:
  • Dept. of Computer Science, University of California, Riverside, USA 92521;Dept. of Computer Science, University of California, Riverside, USA 92521;Dept. of Computer Science, University of California, Riverside, USA 92521 and Dept. of Informatics and Telecommunications, Univ. of Athens, Greece;HIIT, Helsinki University of Technology and University of Helsinki, Finland

  • Venue:
  • ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work shows how concepts from the electromagnetic field theory can be efficiently used in clustering with constraints. The proposed framework transforms vector data into a fully connected graph, or just works straight on the given graph data. User constraints are represented by electromagnetic fields that affect the weight of the graph's edges. A clustering algorithm is then applied on the adjusted graph, using k -distinct shortest paths as the distance measure. Our framework provides better accuracy compared to MPCK-Means, SS-Kernel-KMeans and Kmeans+Diagonal Metric even when very few constraints are used, significantly improves clustering performance on some datasets that other methods fail to partition successfully, and can cluster both vector and graph datasets. All these advantages are demonstrated through thorough experimental evaluation.