Feature selection via Boolean independent component analysis

Authors:
Bruno Apolloni;Simone Bassis;Andrea Brega
Affiliations:
Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Via Comelico 39/41, 20135 Milano, Italy;Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Via Comelico 39/41, 20135 Milano, Italy;Dipartimento di Matematica "F. Enriques", Università degli Studi di Milano, Via Saldini 50, 20133 Milano, Italy
Venue:
Information Sciences: an International Journal
Year:
2009

Abstract

We devise a feature selection method as a follow-on utility of a special classification procedure. In turn, we root the latter in binary features, which we extract from the input patterns with a wrapper method. The resulting procedure is progressive in two respects. As for the features, we first compute a very compact representation of them in terms of Boolean independent components in order to reduce their entropy. Then we invert the representation mapping to discover the subset of the original features that supports a successful classification. As for the classification, we split it into two easier tasks. In the first, we look for a clustering of the input patterns that satisfies loose consistency constraints and benefits from the conciseness of the binary representation. In the second, we assign labels to the clusters through the combined use of essentially linear separators. We implement the method as a relatively quick numerical procedure by assembling a set of connectionist and symbolic routines. We test these on the benchmark of feature selection for DNA microarray data in cancer diagnosis and on other ancillary datasets.
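
The pipeline outlined in the abstract (binary encoding, clustering on the binary codes, cluster labelling, then mapping back to the original features) can be illustrated with a toy sketch. This is not the authors' BICA algorithm: the hyperplanes here are fixed by hand rather than learned, majority voting stands in for the linear separators used for labelling, and every function name (`binarize`, `cluster_by_code`, `label_clusters`, `selected_features`) is hypothetical.

```python
from collections import Counter, defaultdict


def binarize(x, hyperplanes):
    """Map a real vector to a Boolean code, one bit per hyperplane (w, b).

    Assumption: hyperplanes are given; the paper learns its Boolean components.
    """
    return tuple(
        int(sum(wi * xi for wi, xi in zip(w, x)) + b > 0)
        for w, b in hyperplanes
    )


def cluster_by_code(X, hyperplanes):
    """Group pattern indices that share the same Boolean code."""
    clusters = defaultdict(list)
    for i, x in enumerate(X):
        clusters[binarize(x, hyperplanes)].append(i)
    return dict(clusters)


def label_clusters(clusters, y):
    """Assign each cluster its majority label (a stand-in, not the paper's rule)."""
    return {
        code: Counter(y[i] for i in idx).most_common(1)[0][0]
        for code, idx in clusters.items()
    }


def selected_features(hyperplanes, tol=1e-9):
    """Original features that influence at least one bit (nonzero weight)."""
    return sorted(
        {j for w, _ in hyperplanes for j, wj in enumerate(w) if abs(wj) > tol}
    )


# Toy data: three features, of which the second carries no class information.
X = [[0.2, 5.0, 0.9], [0.1, -3.0, 0.8], [0.9, 4.0, 0.1], [0.8, -2.0, 0.2]]
y = ["a", "a", "b", "b"]
# Hand-picked hyperplanes (assumption): threshold features 0 and 2 at 0.5.
hyperplanes = [([1.0, 0.0, 0.0], -0.5), ([0.0, 0.0, 1.0], -0.5)]

clusters = cluster_by_code(X, hyperplanes)
labels = label_clusters(clusters, y)
print(clusters)
print(labels)
print(selected_features(hyperplanes))
```

In this toy setting the uninformative second feature never enters a hyperplane, so reading the weights back recovers only the features that the binary representation actually depends on.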