Fuzzy clustering with semantically distinct families of variables: Descriptive and predictive aspects

  • Authors:
  • Witold Pedrycz;Andrzej Bargiela

  • Affiliations:
  • Department of Electrical and Computer Engineering, University of Alberta, Edmonton AB, Canada T6R 2G7 and System Research Institute, Polish Academy of Sciences, Warsaw, Poland;School of Computer Science, The University of Nottingham, Malaysia Campus, Jalan Broga, 43500 Semenyih, Selangor Darul Ehsan, Malaysia

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2010

Quantified Score

Hi-index 0.10

Visualization

Abstract

Fuzzy clustering being focused on the discovery of structure in multivariable data is of relational nature in the sense of not distinguishing between the natures of the individual variables (features) encountered in the problem. In this study, we revisit the generic approach to clustering by studying situations in which there are families of features of descriptive and functional nature whose semantics needs to be incorporated into the clustering algorithm. While the structure is determined on the basis of all features taken en-block, it is anticipated that the topology revealed in this manner would aid the effectiveness of determining values of functional features given the vector of the corresponding descriptive features. We propose an augmented distance in which the families of descriptive and predictive features are distinguished through some weighted version of the distance between patterns. The optimization of this distance is guided by a reconstruction criterion, which helps minimize the reconstruction error between the original vector of functional features and their reconstruction realized by means of descriptive features. Experimental results are offered to demonstrate the performance of the clustering and quantify the effect of reaching balance between semantically distinct families of features.