Improved robustness in time series analysis of gene expression data by polynomial model based clustering

  • Authors:
  • Michael Hirsch;Allan Tucker;Stephen Swift;Nigel Martin;Christine Orengo;Paul Kellam;Xiaohui Liu

  • Affiliations:
  • School of Information Systems Computing and Mathematics, Brunel University, Uxbridge, UK;School of Information Systems Computing and Mathematics, Brunel University, Uxbridge, UK;School of Information Systems Computing and Mathematics, Brunel University, Uxbridge, UK;School of Computer Science and Information Systems Birkbeck, University of London, London, UK;Department of Biochemistry and Molecular Biology, University College London, London, UK;Department of Infection, University College London, London, UK;School of Information Systems Computing and Mathematics, Brunel University, Uxbridge, UK

  • Venue:
  • CompLife'06 Proceedings of the Second international conference on Computational Life Sciences
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Microarray experiments produce large data sets that often contain noise and considerable missing data. Typical clustering methods such as hierarchical clustering or partitional algorithms can often be adversely affected by such data. This paper introduces a method to overcome such problems associated with noise and missing data by modelling the time series data with polynomials and using these models to cluster the data. Similarity measures for polynomials are given that comply with commonly used standard measures. The polynomial model based clustering is compared with standard clustering methods under different conditions and applied to a real gene expression data set. It shows significantly better results as noise and missing data are increased.