Mutual information-based feature selection and partition design in fuzzy rule-based classifiers from vague data

Authors:
Luciano Sánchez;M. Rosario Suárez;J. R. Villar;Inés Couso
Affiliations:
Computer Science Department, University Oviedo, 33071 Gijón, Asturias, Spain;Computer Science Department, University Oviedo, 33071 Gijón, Asturias, Spain;Computer Science Department, University Oviedo, 33071 Gijón, Asturias, Spain;Statistics Department, University Oviedo, 33071 Oviedo, Asturias, Spain
Venue:
International Journal of Approximate Reasoning
Year:
2008

Citing 20
Cited 17

When upper probabilities are possibility measures

Fuzzy Sets and Systems - Special issue dedicated to Professor Claude Ponsard
A practical approach to feature selection

ML92 Proceedings of the ninth international workshop on Machine learning
Genetic algorithms + data structures = evolution programs (3rd ed.)

Genetic algorithms + data structures = evolution programs (3rd ed.)
A random sets-based method for identifying fuzzy models

Fuzzy Sets and Systems
Genetic feature selection in a fuzzy rule-based classification system learning process for high-dimensional problems

Information Sciences: an International Journal - Recent advances in genetic fuzzy systems
Classification and Modeling with Linguistic Information Granules: Advanced Approaches to Linguistic Data Mining (Advanced Information Processing)

Classification and Modeling with Linguistic Information Granules: Advanced Approaches to Linguistic Data Mining (Advanced Information Processing)
Induction of descriptive fuzzy classifiers with the Logitboost algorithm

Soft Computing - A Fusion of Foundations, Methodologies and Applications
Construction of fuzzy knowledge bases incorporating feature selection

Soft Computing - A Fusion of Foundations, Methodologies and Applications
Feature Selection using Fuzzy Support Vector Machines

Fuzzy Optimization and Decision Making
Joint propagation of probability and possibility in risk analysis: Towards a formal framework

International Journal of Approximate Reasoning
Higher order models for fuzzy random variables

Fuzzy Sets and Systems
Obtaining transparent models of chaotic systems with multi-objective simulated annealing algorithms

Information Sciences: an International Journal
Multi-objective optimization of problems with epistemic uncertainty

EMO'05 Proceedings of the Third international conference on Evolutionary Multi-Criterion Optimization
Reducing the Memory Size of a Fuzzy Case-Based Reasoning System Applying Rough Set Techniques

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
A fast and elitist multiobjective genetic algorithm: NSGA-II

IEEE Transactions on Evolutionary Computation
Input features' impact on fuzzy decision processes

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Induction of fuzzy-rule-based classifiers with evolutionary boosting algorithms

IEEE Transactions on Fuzzy Systems
Fuzzy-Rough Sets Assisted Attribute Selection

IEEE Transactions on Fuzzy Systems
Advocating the Use of Imprecisely Observed Data in Genetic Fuzzy Systems

IEEE Transactions on Fuzzy Systems
Using mutual information for selecting features in supervised neural net learning

IEEE Transactions on Neural Networks

A Minimum Risk Wrapper Algorithm for Genetically Selecting Imprecisely Observed Features, Applied to the Early Diagnosis of Dyslexia

HAIS '08 Proceedings of the 3rd international workshop on Hybrid Artificial Intelligence Systems
Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets

International Journal of Approximate Reasoning
A fuzzy random forest

International Journal of Approximate Reasoning
Diagnosis of dyslexia with low quality data with genetic fuzzy systems

International Journal of Approximate Reasoning
On dynamic soft dimension reduction in evolving fuzzy classifiers

IPMU'10 Proceedings of the Computational intelligence for knowledge-based systems design, and 13th international conference on Information processing and management of uncertainty
On-line incremental feature weighting in evolving fuzzy classifiers

Fuzzy Sets and Systems
Upper and lower probabilities induced by a fuzzy random variable

Fuzzy Sets and Systems
Upper and lower probabilities induced by a fuzzy random variable

Fuzzy Sets and Systems
Mark-recapture techniques in statistical tests for imprecise data

International Journal of Approximate Reasoning
Core-generating approximate minimum entropy discretization for rough set feature selection in pattern classification

International Journal of Approximate Reasoning
An study of the tree generation algorithms in equation based model learning with low quality data

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Multi-objective learning of white box models with low quality data

Neurocomputing
Review: A framework for awareness maintenance

Journal of Network and Computer Applications
Analysing the low quality of the data in lighting control systems

HAIS'10 Proceedings of the 5th international conference on Hybrid Artificial Intelligence Systems - Volume Part I
Comparison of fuzzy functions for low quality data GAP algorithms

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part II
Inner and outer fuzzy approximations of confidence intervals

Fuzzy Sets and Systems
Feature subset selection Filter-Wrapper based on low quality data

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Algorithms for preprocessing databases with incomplete and imprecise data are seldom studied. For the most part, we lack numerical tools to quantify the mutual information between fuzzy random variables. Therefore, these algorithms (discretization, instance selection, feature selection, etc.) have to use crisp estimations of the interdependency between continuous variables, whose application to vague datasets is arguable. In particular, when we select features for being used in fuzzy rule-based classifiers, we often use a mutual information-based ranking of the relevance of inputs. But, either with crisp or fuzzy data, fuzzy rule-based systems route the input through a fuzzification interface. The fuzzification process may alter this ranking, as the partition of the input data does not need to be optimal. In our opinion, to discover the most important variables for a fuzzy rule-based system, we want to compute the mutual information between the fuzzified variables, and we should not assume that the ranking between the crisp variables is the best one. In this paper we address these problems, and propose an extended definition of the mutual information between two fuzzified continuous variables. We also introduce a numerical algorithm for estimating the mutual information from a sample of vague data. We will show that this estimation can be included in a feature selection algorithm, and also that, in combination with a genetic optimization, the same definition can be used to obtain the most informative fuzzy partition for the data. Both applications will be exemplified with the help of some benchmark problems.