How to Control Clustering Results? Flexible Clustering Aggregation

  • Authors:
  • Martin Hahmann;Peter B. Volk;Frank Rosenthal;Dirk Habich;Wolfgang Lehner

  • Affiliations:
  • Database Technology Group, Dresden University of Technology, Email: dbinfo@mail.inf.tu-dresden.de (same affiliation for all five authors)

  • Venue:
  • IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
  • Year:
  • 2009

Abstract

One of the most important and challenging questions in clustering is how to choose the best-fitting algorithm and parameterization to obtain an optimal clustering for the data under consideration. The clustering aggregation concept tries to bypass this problem by generating a set of separate, heterogeneous partitionings of the same data set, from which an aggregate clustering is derived. To date, almost every existing aggregation approach combines given crisp clusterings on the basis of pair-wise similarities. In this paper, we consider an input set of soft clusterings and show that it contains additional information that can be used efficiently for the aggregation. Our approach introduces an expansion of the aforementioned pair-wise similarities that allows control and adjustment of the aggregation process and its result. Our experiments show that our flexible approach offers adaptive results, improved identification of structures, and high usability.
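
To make the idea of pair-wise similarities over soft clusterings more concrete, the following is a minimal sketch and not the method of the paper. It assumes that each soft clustering is given as a list of membership vectors (one per object, entries summing to 1), and it uses a common co-association measure: the probability that two objects share a cluster is taken as the dot product of their membership vectors, averaged over all input clusterings. The function names and the toy data are hypothetical.

```python
def co_association(memberships_a, memberships_b):
    """Probability that two objects share a cluster in one soft clustering.

    Assumption: memberships are independent, so the probability is the
    dot product of the two membership vectors.
    """
    return sum(a * b for a, b in zip(memberships_a, memberships_b))


def aggregate_similarity(clusterings, i, j):
    """Average co-association of objects i and j over all input clusterings."""
    scores = [co_association(c[i], c[j]) for c in clusterings]
    return sum(scores) / len(scores)


# Two hypothetical soft clusterings of three objects (each row sums to 1).
clustering_1 = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
clustering_2 = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
clusterings = [clustering_1, clustering_2]

sim_01 = aggregate_similarity(clusterings, 0, 1)  # objects 0 and 1
sim_02 = aggregate_similarity(clusterings, 0, 2)  # objects 0 and 2
print(sim_01, sim_02)
```

For crisp clusterings, where every membership vector has a single entry equal to 1, this measure reduces to the usual 0/1 co-occurrence count. Soft memberships yield graded values instead, which is the kind of additional information the abstract refers to.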