Graph partition and identification of cluster number in data analysis

  • Authors:
  • Yu-Shiuan Li;Kwang-Cheng Chen

  • Affiliations:
  • National Taiwan University, Taipei, Taiwan;National Taiwan University, Taipei, Taiwan

  • Venue:
  • Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

Modern computing with wide range of applications in different areas such as Internet, biology, and social science, involves large scale of data analysis. The relations of data can be modeled as graphs and graph partitioning problem can be effectively approximated by spectral approaches. A critically important problem in graph partition is determination of the cluster number k. Although eigengap heuristic is a principle for this problem and is supported by theory, it is difficult to be applied for the real-world data and complex graphs. In this paper, by considering the general data analysis scenario, we present an algorithm to determine the cluster number k and perform clustering task simultaneously. The experimental result shows that our algorithm works successfully even for the real world data, which is therefore a promising tool for future data analysis.