Two-stage classification methods for microarray data

Authors:
Tzu-Tsung Wong; Ching-Han Hsu
Affiliations:
Institute of Information Management, National Cheng Kung University, 1 Ta-Sheuh Road, Tainan City 701, Taiwan, ROC; Institute of Information Management, National Cheng Kung University, 1 Ta-Sheuh Road, Tainan City 701, Taiwan, ROC
Venue:
Expert Systems with Applications: An International Journal
Year:
2008


Abstract

Gene expression data are a key factor in the success of medical diagnosis, and two-stage classification methods have therefore been developed for processing microarray data. The first stage of such a method selects a pre-specified number of genes that are likely to be the most relevant to the occurrence of a disease and passes them to the second stage for classification. In this paper, we combine four gene selection mechanisms with two classification tools to form eight two-stage classification methods, and we test these methods on eight microarray data sets to analyze their performance. The first interesting finding is that the gene sets chosen by different categories of gene selection mechanisms have fewer than half of their genes in common, yet the resulting classification accuracies do not differ significantly. A subset-gene-ranking mechanism can improve classification accuracy, but at a much higher computational cost. Whether the classification tool employed at the second stage should be accompanied by a dimension reduction technique depends on the characteristics of the data set.
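
The two-stage structure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not name the four gene selection mechanisms or the two classification tools, so everything concrete here is an assumption chosen for illustration: a signal-to-noise ranking score for stage one, a nearest-centroid classifier for stage two, the toy data, and all function names (`select_genes`, `fit_centroids`, `predict`, `overlap_fraction`). It is not the authors' method.

```python
from statistics import mean, pstdev


def signal_to_noise(expr, labels, gene):
    """Assumed ranking score: |difference of class means| / sum of class std devs."""
    pos = [row[gene] for row, y in zip(expr, labels) if y == 1]
    neg = [row[gene] for row, y in zip(expr, labels) if y == 0]
    spread = pstdev(pos) + pstdev(neg)
    return abs(mean(pos) - mean(neg)) / (spread + 1e-9)  # small constant avoids division by zero


def select_genes(expr, labels, k):
    """Stage 1: return the indices of the k highest-scoring genes."""
    n_genes = len(expr[0])
    ranked = sorted(range(n_genes),
                    key=lambda g: signal_to_noise(expr, labels, g),
                    reverse=True)
    return ranked[:k]


def fit_centroids(expr, labels, genes):
    """Stage 2 (training): per-class mean expression over the selected genes."""
    centroids = {}
    for cls in set(labels):
        rows = [row for row, y in zip(expr, labels) if y == cls]
        centroids[cls] = [mean(r[g] for r in rows) for g in genes]
    return centroids


def predict(sample, genes, centroids):
    """Stage 2 (prediction): assign the class whose centroid is nearest."""
    def squared_distance(centroid):
        return sum((sample[g] - m) ** 2 for g, m in zip(genes, centroid))
    return min(centroids, key=lambda cls: squared_distance(centroids[cls]))


def overlap_fraction(genes_a, genes_b):
    """Fraction of the genes in genes_a that also appear in genes_b."""
    return len(set(genes_a) & set(genes_b)) / len(set(genes_a))


# Toy data (hypothetical): 6 samples x 4 genes; only gene 2 separates the classes.
expr = [
    [5.0, 1.0, 9.0, 3.0],
    [5.1, 2.0, 9.2, 3.1],
    [4.9, 1.5, 8.8, 2.9],
    [5.0, 1.2, 2.0, 3.0],
    [5.2, 1.8, 2.1, 3.2],
    [4.8, 1.6, 1.9, 2.8],
]
labels = [1, 1, 1, 0, 0, 0]

selected = select_genes(expr, labels, k=1)        # stage 1
centroids = fit_centroids(expr, labels, selected)  # stage 2, training
predicted = predict([5.0, 1.5, 8.5, 3.0], selected, centroids)
```

The `overlap_fraction` helper mirrors the kind of comparison behind the abstract's first finding: two selection mechanisms can be compared by the share of genes they pick in common, independently of the accuracy each selection yields downstream.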