Towards Better Outliers Detection for Gene Expression Datasets

  • Authors:
  • R. Kashef; M. S. Kamel

  • Venue:
  • BIOTECHNO '08 Proceedings of the 2008 International Conference on Biocomputation, Bioinformatics, and Biomedical Technologies
  • Year:
  • 2008

Abstract

This paper compares the performance of three clustering algorithms on the task of outlier detection. The goal is to illustrate that better clustering leads to better detection of outliers. The k-means (KM), bisecting k-means (BKM), and Partitioning Around Medoids (PAM) algorithms are each combined with the cluster-based outlier detection method (Find CBLOF). Experimental results on four gene expression datasets that contain outliers show that the clustering solutions of PAM enable Find CBLOF to discover more outliers than those of either k-means or bisecting k-means. The main reason is that PAM provides better clustering quality than the other two algorithms on the tested datasets, as measured by external and internal quality measures.
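
To make the pipeline in the abstract concrete, the following is a minimal sketch of how a cluster-based outlier score can be computed from the output of any clustering algorithm. It is not the authors' implementation. The function name `cblof_scores`, the Euclidean distance to cluster centroids, the thresholds `alpha` and `beta`, and the optional `weight_by_size` flag are all assumptions made for illustration. Clusters are ordered by size and split into "large" and "small" groups; a point in a large cluster is scored by its distance to its own centroid, and a point in a small cluster by its distance to the nearest large-cluster centroid.

```python
import math
from collections import defaultdict


def cblof_scores(points, labels, alpha=0.9, beta=5.0, weight_by_size=False):
    """Simplified CBLOF-style scores (assumed form): larger means more outlying."""
    # Group point indices by cluster label.
    clusters = defaultdict(list)
    for idx, lab in enumerate(labels):
        clusters[lab].append(idx)

    # Centroid of each cluster (mean of member coordinates).
    dims = len(points[0])
    centroids = {
        lab: [sum(points[i][d] for i in members) / len(members) for d in range(dims)]
        for lab, members in clusters.items()
    }

    # Order clusters by size (largest first) and find the large/small boundary:
    # stop once the large clusters cover alpha of the data, or when the next
    # cluster is at least beta times smaller than the current one.
    ordered = sorted(clusters, key=lambda lab: len(clusters[lab]), reverse=True)
    covered, boundary = 0, len(ordered)
    for pos, lab in enumerate(ordered):
        covered += len(clusters[lab])
        has_next = pos + 1 < len(ordered)
        size_jump = has_next and (
            len(clusters[lab]) / len(clusters[ordered[pos + 1]]) >= beta
        )
        if covered >= alpha * len(points) or size_jump:
            boundary = pos + 1
            break
    large = ordered[:boundary]

    scores = []
    for point, lab in zip(points, labels):
        if lab in large:
            dist = math.dist(point, centroids[lab])
        else:
            dist = min(math.dist(point, centroids[big]) for big in large)
        # Optional weighting by the size of the point's own cluster.
        scores.append(len(clusters[lab]) * dist if weight_by_size else dist)
    return scores
```

Under these assumptions, the `labels` argument can come from k-means, bisecting k-means, or PAM, which is how a clustering algorithm and the outlier scoring step would be combined; the points with the highest scores are the outlier candidates.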