Data Clustering Algorithms for Information Systems

  • Authors:
  • Sadaaki Miyamoto

  • Affiliations:
  • Department of Risk Engineering, Faculty of Systems and Information Engineering, University of Tsukuba, Ibaraki 305-8573, Japan

  • Venue:
  • RSFDGrC '07 Proceedings of the 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Although the approaches are fundamentally different, the derivation of decision rules from information systems in the form of tables can be compared to supervised classification in pattern recognition; in the latter case classification rules should be derived from the classes of given points in a feature space. We also notice that methods of unsupervised classification (in other words, data clustering) in pattern recognition are closely related to supervised classification techniques. This observation leads us to the discussion of clustering for information systems by investigating relations between the two methods in the pattern classification. We thus discuss a number of methods of data clustering of information tables without decision attributes on the basis of rough set approach in this paper. Current clustering algorithms using rough sets as well as new algorithms motivated from pattern classification techniques are considered. Agglomerative clustering are generalized into a method of poset-valued clustering for discussing structures of information systems using new notations in relational databases. On the other hand K-means algorithms are developed using the kernel function approach. Illustrative examples are given.