Fuzzy–Rough Supervised Attribute Clustering Algorithm and Classification of Microarray Data

Authors:
P. Maji
Affiliations:
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2011

Citing 0
Cited 4

Entropy measures and granularity measures for set-valued information systems

Information Sciences: an International Journal
On fuzzy-rough attribute selection: Criteria of Max-Dependency, Max-Relevance, Min-Redundancy, and Max-Significance

Applied Soft Computing
Rough-Fuzzy Clustering for Grouping Functionally Similar Genes from Microarray Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
An improved multiple-attractor cellular automata classifier with a tree frame based on CART

Computers & Mathematics with Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

One of the major tasks with gene expression data is to find groups of coregulated genes whose collective expression is strongly associated with sample categories. In this regard, a new clustering algorithm, termed as fuzzy-rough supervised attribute clustering (FRSAC), is proposed to find such groups of genes. The proposed algorithm is based on the theory of fuzzy-rough sets, which directly incorporates the information of sample categories into the gene clustering process. A new quantitative measure is introduced based on fuzzy-rough sets that incorporates the information of sample categories to measure the similarity among genes. The proposed algorithm is based on measuring the similarity between genes using the new quantitative measure, whereby redundancy among the genes is removed. The clusters are refined incrementally based on sample categories. The effectiveness of the proposed FRSAC algorithm, along with a comparison with existing supervised and unsupervised gene selection and clustering algorithms, is demonstrated on six cancer and two arthritis data sets based on the class separability index and predictive accuracy of the naive Bayes' classifier, the K-nearest neighbor rule, and the support vector machine.