A practical approach to feature selection
ML92 Proceedings of the ninth international workshop on Machine learning
Selection of relevant features and examples in machine learning
Artificial Intelligence - Special issue on relevance
Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
Modeling obesity using abductive networks
Computers and Biomedical Research
The Random Subspace Method for Constructing Decision Forests
IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature selection for ensembles
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Rough set methods in feature selection and recognition
Pattern Recognition Letters - Special issue: Rough sets, pattern recognition and data mining
Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Ensemble Feature election with the Simple Bayesian Classification in Medical Diagnostics
CBMS '02 Proceedings of the 15th IEEE Symposium on Computer-Based Medical Systems (CBMS'02)
Selection of Voice Features to Diagnose Hearing Impairments of Children
CBMS '01 Proceedings of the Fourteenth IEEE Symposium on Computer-Based Medical Systems
Reduced feature-set based parallel CHMM speech recognition systems
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Spoken language analysis, modeling and recognition-statistical and adaptive connectionist approaches
Lung cancer cell identification based on artificial neural network ensembles
Artificial Intelligence in Medicine
A new methodology of extraction, optimization and application of crisp and fuzzy logical rules
IEEE Transactions on Neural Networks
GMDH-based feature ranking and selection for improved classification of medical data
Journal of Biomedical Informatics
Diversity of ability and cognitive style for group decision processes
Information Sciences: an International Journal
Computational intelligence for heart disease diagnosis: A medical knowledge driven approach
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
This paper demonstrates the use of abductive network classifier committees trained on different features for improving classification accuracy in medical diagnosis. In an earlier publication, committee members were trained on different subsets of the training set to ensure enough diversity for improved committee performance. In situations characterized by high data dimensionality, i.e. a large number of features and a relatively few training examples, it may be more advantageous to split the feature set rather than the training set. We describe a novel approach for tentatively ranking the features and forming subsets of uniform predictive quality for training individual members. The abductive network training algorithm is used to select optimum predictors from the feature set at various levels of model complexity specified by the user. Using the resulting tentative ranking, the features are grouped into mutually exclusive subsets of approximately equal predictive power for training the members. The approach is demonstrated on three standard medical diagnosis datasets (breast cancer, heart disease, and diabetes). Three-member committees trained on different feature subsets and using simple output combination methods reduce classification errors by up to 20% compared to the best single model developed with the full feature set. Results are compared with those reported previously with members trained through splitting the training set. Training abductive committee members on feature subsets of approximately equal predictive power achieves both diversity and quality for improved committee performance. Ensemble feature subset selection can be performed using GMDH-based learning algorithms. The approach should be advantageous in situations characterized by high data dimensionality.