Iterative clustering analysis for grouping missing data in gene expression profiles

Authors:
Dae-Won Kim;Bo-Yeong Kang
Affiliations:
School of Computer Science and Engineering, Chung-Ang University, Seoul, Korea;Center of Healthcare Ontology R&D, Seoul National University, Seoul, Korea
Venue:
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Year:
2006

Citing 4
Cited 1

Fuzzy sets and their application to clustering and training

Fuzzy sets and their application to clustering and training
Gaussian mixture clustering and imputation of microarray data

Bioinformatics
Detecting clusters of different geometrical shapes in microarray gene expression data

Bioinformatics
Fuzzy c-means clustering of incomplete data

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Clustering with Missing Values

Fundamenta Informaticae

Quantified Score

Hi-index	0.00

Visualization

Abstract

Clustering has been used as a popular technique for finding groups of genes that show similar expression patterns under multiple experimental conditions. Because a clustering method requires a complete data matrix as an input, we must estimate the missing values using an imputation method in the preprocessing step of clustering. However, a common limitation of these conventional approach is that once the estimates of missing values are fixed in the preprocessing step, they are not changed during subsequent process of clustering. Badly estimated missing values obtained in data preprocessing are likely to deteriorate the quality and reliability of clustering results. Thus, a new clustering method is required for improving missing values during iterative clustering process.