Rough set based maximum relevance-maximum significance criterion and Gene selection from microarray data

Authors:
Pradipta Maji;Sushmita Paul
Affiliations:
Machine Intelligence Unit, Indian Statistical Institute, 203, Barrackpore Trunk Road, Kolkata 700 108, India;Machine Intelligence Unit, Indian Statistical Institute, 203, Barrackpore Trunk Road, Kolkata 700 108, India
Venue:
International Journal of Approximate Reasoning
Year:
2011

Citing 44
Cited 10

Attributes and rough properties in information systems

International Journal of Approximate Reasoning
Variable precision rough set model

Journal of Computer and System Sciences
The nature of statistical learning theory

The nature of statistical learning theory
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
Data mining: concepts and techniques

Data mining: concepts and techniques
Using Rough Sets with Heuristics for Feature Selection

Journal of Intelligent Information Systems
Rough Sets: Theoretical Aspects of Reasoning about Data

Rough Sets: Theoretical Aspects of Reasoning about Data
Neuro-Fuzzy Pattern Recognition: Methods in Soft Computing

Neuro-Fuzzy Pattern Recognition: Methods in Soft Computing
Feature Selection Using Rough Sets Theory

ECML '93 Proceedings of the European Conference on Machine Learning
Dynamic Reducts as a Tool for Extracting Laws from Decisions Tables

ISMIS '94 Proceedings of the 8th International Symposium on Methodologies for Intelligent Systems
Minimum Redundancy Feature Selection from Microarray Gene Expression Data

CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Constructive and axiomatic approaches of fuzzy approximation operators

Information Sciences—Informatics and Computer Science: An International Journal - Mining stream data
Cluster Analysis for Gene Expression Data: A Survey

IEEE Transactions on Knowledge and Data Engineering
Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough-Based Approaches

IEEE Transactions on Knowledge and Data Engineering
Feature Selection Based on Mutual Information: Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy

IEEE Transactions on Pattern Analysis and Machine Intelligence
Interactive Gene Clustering--A Case Study of Breast Cancer Microarray Data

Information Systems Frontiers
Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation

Pattern Recognition
Gene selection by sequential search wrapper approaches in microarray cancer class prediction

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - Challenges for future intelligent systems in biomedicine
Clustering and visualization approaches for human cell cycle gene expression data analysis

International Journal of Approximate Reasoning
Logistic regression for disease classification using microarray data

Bioinformatics
Hybrid huberized support vector machines for microarray classification and gene selection

Bioinformatics
Rough Sets and Few-Objects-Many-Attributes Problem: The Case Study of Analysis of Gene Expression Data Sets

FBIT '07 Proceedings of the 2007 Frontiers in the Convergence of Bioscience and Information Technologies
Neighborhood rough set based heterogeneous feature subset selection

Information Sciences: an International Journal
Editorial: Probabilistic rough sets: Approximations, decision-makings, and applications

International Journal of Approximate Reasoning
Probabilistic rough set approximations

International Journal of Approximate Reasoning
Probabilistic approach to rough sets

International Journal of Approximate Reasoning
Variable precision rough set for group decision-making: An application

International Journal of Approximate Reasoning
Exploring the boundary region of tolerance rough sets for feature selection

Pattern Recognition
Knowledge structure, knowledge granulation and knowledge distance in a knowledge base

International Journal of Approximate Reasoning
Gene boosting for cancer classification based on gene expression profiles

Pattern Recognition
Variable-precision dominance-based rough set approach and attribute reduction

International Journal of Approximate Reasoning
A granularity-based framework of deduction, induction, and abduction

International Journal of Approximate Reasoning
Attribute selection with fuzzy decision reducts

Information Sciences: an International Journal
Attribute dependency functions considering data efficiency

International Journal of Approximate Reasoning
The model of fuzzy variable precision rough sets

IEEE Transactions on Fuzzy Systems
Gaussian kernel based fuzzy rough sets: Model, uncertainty measures and applications

International Journal of Approximate Reasoning
Roughfication of numeric decision tables: the case study of gene expression data

RSKT'07 Proceedings of the 2nd international conference on Rough sets and knowledge technology
Feature Selection Using f-Information Measures in Fuzzy Approximation Spaces

IEEE Transactions on Knowledge and Data Engineering
Fuzzy-rough sets for information measures and selection of relevant genes from microarray data

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on game theory
Relevant attribute discovery in high dimensional data: application to breast cancer gene expressions

RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Mining of MicroRNA expression data—a rough set approach

RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology
Approximation spaces and information granulation

Transactions on Rough Sets III
Optimal Search-Based Gene Subset Selection for Gene Array Cancer Classification

IEEE Transactions on Information Technology in Biomedicine
Fuzzy-Rough Sets Assisted Attribute Selection

IEEE Transactions on Fuzzy Systems

An efficient fuzzy rough approach for feature selection

RSKT'11 Proceedings of the 6th international conference on Rough sets and knowledge technology
Classification systems based on rough sets under the belief function framework

International Journal of Approximate Reasoning
A new gene selection method based on random subspace ensemble for microarray cancer classification

PRIB'11 Proceedings of the 6th IAPR international conference on Pattern recognition in bioinformatics
Rough sets for selection of functionally diverse genes from microarray data

SEMCCO'11 Proceedings of the Second international conference on Swarm, Evolutionary, and Memetic Computing - Volume Part I
On fuzzy-rough attribute selection: Criteria of Max-Dependency, Max-Relevance, Min-Redundancy, and Max-Significance

Applied Soft Computing
Neighborhood rough sets based multi-label classification for automatic image annotation

International Journal of Approximate Reasoning
An extension to Rough c-means clustering based on decision-theoretic Rough Sets model

International Journal of Approximate Reasoning
Feature selection with test cost constraint

International Journal of Approximate Reasoning
Review article: Computational intelligence techniques in bioinformatics

Computational Biology and Chemistry
Diverse accurate feature selection for microarray cancer diagnosis

Intelligent Data Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

Among the large amount of genes presented in microarray gene expression data, only a small fraction of them is effective for performing a certain diagnostic test. In this regard, a new feature selection algorithm is presented based on rough set theory. It selects a set of genes from microarray data by maximizing the relevance and significance of the selected genes. A theoretical analysis is presented to justify the use of both relevance and significance criteria for selecting a reduced gene set with high predictive accuracy. The importance of rough set theory for computing both relevance and significance of the genes is also established. The performance of the proposed algorithm, along with a comparison with other related methods, is studied using the predictive accuracy of K-nearest neighbor rule and support vector machine on five cancer and two arthritis microarray data sets. Among seven data sets, the proposed algorithm attains 100% predictive accuracy for three cancer and two arthritis data sets, while the rough set based two existing algorithms attain this accuracy only for one cancer data set.