Relevance measures for subset variable selection in regression problems based on k-additive mutual information

Authors:
Ivan Kojadinovic
Affiliations:
LINA CNRS FRE 2729, Site école polytechnique de l'université de Nantes, Rue Christian Pauc, 44306 Nantes, France
Venue:
Computational Statistics & Data Analysis
Year:
2005

Citing 12
Cited 8

Introduction to statistical pattern recognition (2nd ed.)

Introduction to statistical pattern recognition (2nd ed.)
Elements of information theory

Elements of information theory
Numerical recipes in C (2nd ed.): the art of scientific computing

Numerical recipes in C (2nd ed.): the art of scientific computing
Bagging predictors

Machine Learning
k-order additive discrete fuzzy measures and their representation

Fuzzy Sets and Systems - Special issue on fuzzy measures and integrals
An estimator of the mutual information based on a criterion for independence

Computational Statistics & Data Analysis
Equivalent Representations of Set Functions

Mathematics of Operations Research
Feature Extraction, Construction and Selection: A Data Mining Perspective

Feature Extraction, Construction and Selection: A Data Mining Perspective
Feature Selection for Knowledge Discovery and Data Mining

Feature Selection for Knowledge Discovery and Data Mining
Modeling interaction phenomena using fuzzy measures: on the notions of interaction and independence

Fuzzy Sets and Systems - Non-additive measures and random processes
Nonparametric multivariate density estimation: a comparative study

IEEE Transactions on Signal Processing
An axiomatic approach of the discrete Choquet integral as a tool to aggregate interacting criteria

IEEE Transactions on Fuzzy Systems

Clustering and symbolic analysis of cardiovascular signals: discovery and visualization of medically relevant patterns in long-term data using limited prior knowledge

EURASIP Journal on Applied Signal Processing
Keyword search for data-centric XML collections with long text fields

Proceedings of the 13th International Conference on Extending Database Technology
Linear projection method based on information theoretic learning

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part III
Using structural information in XML keyword search effectively

ACM Transactions on Database Systems (TODS)
On the use of variable complementarity for feature selection in cancer classification

EuroGP'06 Proceedings of the 2006 international conference on Applications of Evolutionary Computing
Low bias histogram-based estimation of mutual information for feature selection

Pattern Recognition Letters
Approaches to multiple-criteria group decision making based on interval-valued intuitionistic fuzzy Choquet integral with respect to the generalized λ-Shapley index

Knowledge-Based Systems
A new histogram-based estimation technique of entropy and mutual information using mean squared error minimization

Computers and Electrical Engineering

Quantified Score

Hi-index	0.03

Visualization

Abstract

In the framework of subset variable selection for regression, relevance measures based on the notion of mutual information are studied. Results on the estimation of this index of stochastic dependence in a continuous setting are first presented. They are grounded on kernel density estimation which makes the overall estimation of the mutual information quadratic. The behavior of the mutual information as a relevance measure is then empirically studied on several regression problems. The considered problems are artificially generated to contain irrelevant and redundant candidate explanatory variables as well as strongly nonlinear relationships. Next, still in a subset variable selection context, computationally more efficient approximations of the mutual information based on the notion of k-additive truncation are proposed. The 2- and 3-additive truncations appear to be of practical interest as relevance measures. The 2-additive truncation is based on the computation of the approximate relevance of a set of potential predictors from the relevance values of the singletons and pairs it contains. The 3-additive truncation additionally involves the relevance values of the 3-element subsets. The lower the amount of redundancy among the candidate explanatory variables, the better these approximations. The sample behavior of the two resulting relevance measures is finally empirically studied on the previously generated nonlinear artificial regression problems.