Algorithms for clustering data
Algorithms for clustering data
Numerical recipes in C (2nd ed.): the art of scientific computing
Numerical recipes in C (2nd ed.): the art of scientific computing
Identifying gene regulatory networks from experimental data
RECOMB '99 Proceedings of the third annual international conference on Computational molecular biology
IEEE Spectrum
Pattern Recognition with Fuzzy Objective Function Algorithms
Pattern Recognition with Fuzzy Objective Function Algorithms
An Introduction to Nonlinear Image Processing
An Introduction to Nonlinear Image Processing
Microarray Gridding by Mathematical Morphology
SIBGRAPI '01 Proceedings of the 14th Brazilian Symposium on Computer Graphics and Image Processing
APBC '03 Proceedings of the First Asia-Pacific bioinformatics conference on Bioinformatics 2003 - Volume 19
Interactive Visualization and Analysis for Gene Expression Data
HICSS '02 Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS'02)-Volume 6 - Volume 6
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
A high-performance VLSI architecture for the histogram peak-climbing data clustering algorithm
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Rough Sets in Oligonucleotide Microarray Data Analysis
RSEISP '07 Proceedings of the international conference on Rough Sets and Intelligent Systems Paradigms
Hi-index | 0.00 |
The rapid advancement of DNA microarray technology has revolutionalized genetic research in bioscience. Due to the enormous amount of gene expression data generated by such technology, computer processing and analysis of such data has become indispensable. In this paper, we present a computational framework for the extraction, analysis and visualization of gene expression data from microarray experiments. A novel, fully automated, spot segmentation algorithm for DNA microarray images, which makes use of adaptive thresholding, morphological processing and statistical intensity modeling, is proposed to: (i) segment the blocks of spots, (ii) generate the grid structure, and (iii) to segment the spot within each subregion. For data analysis, we propose a binary hierarchical clustering (BHC) framework for the clustering of gene expression data. The BHC algorithm involves two major steps. Firstly, the fuzzy C-means algorithm and the average linkage hierarchical clustering algorithm are used to split the data into two classes. Secondly, the Fisher linear discriminant analysis is applied to the two classes to assess whether the split is acceptable. The BHC algorithm is applied to the sub-classes recursively and ends when all clusters cannot be split any further. BHC does not require the number of clusters to be known in advance. It does not place any assumption about the number of samples in each cluster or the class distribution. The hierarchical framework naturally leads to a tree structure representation for effective visualization of gene expressions.