Adaptive clustering for time series: Application for identifying cell cycle expressed genes

Authors:
Ahlame Douzal-Chouakria;Alpha Diallo;Françoise Giroud
Affiliations:
TIMB TIMC-IMAG (CNRS-UMR 5525), Université Joseph Fourier Grenoble 1, France;RFMQ TIMC-IMAG (CNRS-UMR 5525), Université Joseph Fourier Grenoble 1, France;RFMQ TIMC-IMAG (CNRS-UMR 5525), Université Joseph Fourier Grenoble 1, France
Venue:
Computational Statistics & Data Analysis
Year:
2009

Citing 8
Cited 7

Multiclass sparse logistic regression for classification of multiple cancer types using gene expression data

Computational Statistics & Data Analysis
Use of SVD-based probit transformation in clustering gene expression profiles

Computational Statistics & Data Analysis
Classification of gene functions using support vector machine for time-course gene expression data

Computational Statistics & Data Analysis
Weighted rank aggregation of cluster validation measures

Bioinformatics
Assessing agreement of clustering methods with gene expression microarray data

Computational Statistics & Data Analysis
Cluster analysis using multivariate normal mixture models to detect differential gene expression with microarray data

Computational Statistics & Data Analysis
A periodogram-based metric for time series classification

Computational Statistics & Data Analysis
Clustering of time series data-a survey

Pattern Recognition

Which Distance for the Identification and the Differentiation of Cell-Cycle Expressed Genes?

IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Non-linear time series clustering based on non-parametric forecast densities

Computational Statistics & Data Analysis
Time series labeling algorithms based on the K-nearest neighbors' frequencies

Expert Systems with Applications: An International Journal
Classification trees for time series

Pattern Recognition
Phase and amplitude-based clustering for functional data

Computational Statistics & Data Analysis
A hypothesis test using bias-adjusted AR estimators for classifying time series in small samples

Computational Statistics & Data Analysis
Polarization of forecast densities: A new approach to time series classification

Computational Statistics & Data Analysis

Quantified Score

Hi-index	0.03

Visualization

Abstract

The biological problem of identifying the active genes during the cell division process is addressed. The cell division ensures the proliferation of cells, which is drastically aberrant in cancer cells. The studied genes are described by their expression profiles during the cell division cycle. Commonly, the identification process is a supervised approach based on an a priori set of reference genes, assumed as well-characterizing the cell cycle phases. Each studied gene is then classified by its peak similarity to one pre-specified reference gene. This classical approach suffers from two limitations. On the one hand, there is no consensus between biologists about the set of reference genes to consider for the identification process. On the other hand, the proximity measures used for genes expression profiles are unjustified and mainly based on the expression values regardless of the genes expression behavior. To identify genes expression profiles, a new adaptive clustering approach is proposed which consists of two main points. First, it allows in an unsupervised way the selection of a well-justified set of reference genes, to be compared with the pre-specified ones. Secondly, it enables the users to learn the appropriate proximity measure to use for genes expression data, a measure which will cover both proximity on values and on behavior. The adaptive clustering method is compared to a correlation-based approach through public and simulated genes expression data.