Aggregate distance based clustering using fibonacci series-FIBCLUS

  • Authors:
  • Rakesh Rawat;Richi Nayak;Yuefeng Li;Slah Alsaleh

  • Affiliations:
  • Faculty of Science and Technology, Queensland University of University, Brisbane Australia;Faculty of Science and Technology, Queensland University of University, Brisbane Australia;Faculty of Science and Technology, Queensland University of University, Brisbane Australia;Faculty of Science and Technology, Queensland University of University, Brisbane Australia

  • Venue:
  • APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes an innovative instance similarity based evaluation metric that reduces the search map for clustering to be performed. An aggregate global score is calculated for each instance using the novel idea of Fibonacci series. The use of Fibonacci numbers is able to separate the instances effectively and, in hence, the intra-cluster similarity is increased and the intercluster similarity is decreased during clustering. The proposed FIBCLUS algorithm is able to handle datasets with numerical, categorical and a mix of both types of attributes. Results obtained with FIBCLUS are compared with the results of existing algorithms such as k-means, x-means expected maximization and hierarchical algorithms that are widely used to cluster numeric, categorical and mix data types. Empirical analysis shows that FIBCLUS is able to produce better clustering solutions in terms of entropy, purity and F-score in comparison to the above described existing algorithms.