Fuzzy clustering with biological knowledge for gene selection

Authors:
Sampreeti Ghosh;Sushmita Mitra;Rana Dattagupta
Affiliations:
-;-;-
Venue:
Applied Soft Computing
Year:
2014

Citing 24
Cited 0

Selection of relevant features and examples in machine learning

Artificial Intelligence - Special issue on relevance
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
Pattern Recognition with Fuzzy Objective Function Algorithms

Pattern Recognition with Fuzzy Objective Function Algorithms
Neuro-Fuzzy Pattern Recognition: Methods in Soft Computing

Neuro-Fuzzy Pattern Recognition: Methods in Soft Computing
Random Forests

Machine Learning
CLARANS: A Method for Clustering Objects for Spatial Data Mining

IEEE Transactions on Knowledge and Data Engineering
Unsupervised Feature Selection Applied to Content-Based Retrieval of Lung Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Data Mining: Concepts and Algorithms From Multimedia to Bioinformatics

Data Mining: Concepts and Algorithms From Multimedia to Bioinformatics
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
Microarray data mining: facing the challenges

ACM SIGKDD Explorations Newsletter
Improving classification of microarray data using prototype-based feature selection

ACM SIGKDD Explorations Newsletter
Simultaneous Feature Selection and Clustering Using Mixture Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Cluster Analysis for Gene Expression Data: A Survey

IEEE Transactions on Knowledge and Data Engineering
Toward Integrating Feature Selection Algorithms for Classification and Clustering

IEEE Transactions on Knowledge and Data Engineering
Attribute Clustering for Grouping, Selection, and Classification of Gene Expression Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data

Bioinformatics
Agglomerative Fuzzy K-Means Clustering Algorithm with Selection of Number of Clusters

IEEE Transactions on Knowledge and Data Engineering
Fuzzy c-means clustering with prior biological knowledge

Journal of Biomedical Informatics
Testing significance relative to a fold-change threshold is a TREAT

Bioinformatics
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Next generation software for functional trend analysis

Bioinformatics
Evolutionary Rough Feature Selection in Gene Expression Data

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Some new indexes of cluster validity

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Clustering large data with uncertainty

Applied Soft Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an application of Fuzzy Clustering of Large Applications based on Randomized Search (FCLARANS) for attribute clustering and dimensionality reduction in gene expression data. Domain knowledge based on gene ontology and differential gene expressions are employed in the process. The use of domain knowledge helps in the automated selection of biologically meaningful partitions. Gene ontology (GO) study helps in detecting biologically enriched and statistically significant clusters. Fold-change is measured to select the differentially expressed genes as the representatives of these clusters. Tools like Eisen plot and cluster profiles of these clusters help establish their coherence. Important representative features (or genes) are extracted from each enriched gene partition to form the reduced gene space. While the reduced gene set forms a biologically meaningful attribute space, it simultaneously leads to a decrease in computational burden. External validation of the reduced subspace, using various well-known classifiers, establishes the effectiveness of the proposed methodology on four sets of publicly available microarray gene expression data.