A morphology method for determining the number of clusters present in spectral co-clustering documents and words

  • Authors:
  • Na Liu; Mingyu Lu

  • Affiliations:
  • Department of Information Science & Technology, Dalian Maritime University, Dalian, LiaoNing, China;Department of Information Science & Technology, Dalian Maritime University, Dalian, LiaoNing, China

  • Venue:
  • CGGA'10 Proceedings of the 9th international conference on Computational Geometry, Graphs and Applications
  • Year:
  • 2010

Abstract

A new algorithm for clustering documents and words simultaneously has recently been presented. Like most spectral clustering algorithms, it requires prior knowledge of the number of clusters present. In this paper, we explore a morphology-based method for determining the number of clusters present in a given dataset when co-clustering documents and words. The proposed method employs refined feature extraction techniques, chiefly a VAT (Visual Assessment of Cluster Tendency) image representation of the input matrix generated by spectral co-clustering of documents and words, and the texture information obtained by filtering the VAT image. The number of clusters is finally reported by computing the eigengap of the gray-scale matrix of the filtered image. Our experimental results show that the proposed method works well in practice.
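To make the two building blocks named in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It shows the standard VAT reordering of a dissimilarity matrix and a generic eigengap heuristic applied to the reordered matrix. The abstract does not specify the morphological filtering step or how the gray-scale matrix is formed, so filtering is omitted here; the Gaussian affinity, the normalized Laplacian, the toy data, and the function names `vat_order`, `eigengap_k`, and `demo` are all assumptions made for illustration.

```python
import numpy as np


def vat_order(R):
    """Return the VAT ordering for a symmetric dissimilarity matrix R."""
    n = R.shape[0]
    # Start from one endpoint of the largest dissimilarity.
    i, _ = np.unravel_index(np.argmax(R), R.shape)
    order = [int(i)]
    remaining = [k for k in range(n) if k != i]
    while remaining:
        # Prim-style step: pick the unvisited object closest to any visited one.
        sub = R[np.ix_(order, remaining)]
        j = remaining[int(np.argmin(sub.min(axis=0)))]
        order.append(j)
        remaining.remove(j)
    return np.array(order)


def eigengap_k(W, max_k=10):
    """Estimate the cluster count from the largest gap among the smallest
    eigenvalues of the normalized Laplacian of affinity matrix W
    (an assumed, generic eigengap heuristic)."""
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    vals = np.linalg.eigvalsh(L)[: max_k + 1]  # eigenvalues in ascending order
    return int(np.argmax(np.diff(vals))) + 1


def demo():
    """Toy data (assumed): three tight, well-separated 2-D clusters, shuffled."""
    rng = np.random.default_rng(0)
    centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
    labels = rng.permutation(np.repeat(np.arange(3), 10))
    X = centers[labels] + 0.1 * rng.standard_normal((30, 2))
    # Pairwise Euclidean distances serve as the dissimilarity matrix.
    R = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    order = vat_order(R)
    R_vat = R[np.ix_(order, order)]   # reordered matrix, viewable as an image
    W = np.exp(-R_vat**2 / 2.0)       # assumed Gaussian affinity, sigma = 1
    return labels[order], eigengap_k(W)
```

Calling `demo()` returns the cluster labels in VAT order together with the eigengap estimate. Displayed as a gray-scale image, `R_vat` shows dark diagonal blocks where objects form clusters, and this block structure is what a filtering stage would presumably sharpen before the eigengap is computed.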