Medical diagnosis with C4.5 rule preceded by artificial neural network ensemble

Authors:
Zhi-Hua Zhou;Yuan Jiang
Affiliations:
National Laboratory for Novel Software Technology, Nanjing University, China
Venue:
IEEE Transactions on Information Technology in Biomedicine
Year:
2003

Abstract

Comprehensibility is very important when machine learning techniques are used in computer-aided medical diagnosis. Because an artificial neural network ensemble is composed of multiple artificial neural networks, it is harder to comprehend than a single artificial neural network. This paper proposes C4.5 Rule-PANE, which combines an artificial neural network ensemble with rule induction by treating the former as a preprocessing step for the latter. First, an artificial neural network ensemble is trained. Then, a new training data set is generated by feeding the feature vectors of the original training instances to the trained ensemble and replacing their expected class labels with the class labels output by the ensemble. Additional training data may also be appended by randomly generating feature vectors and pairing them with the class labels the ensemble outputs for them. Finally, a specific rule induction approach, C4.5 Rule, is used to learn rules from the new training data set. Case studies on diabetes, hepatitis, and breast cancer show that C4.5 Rule-PANE can generate rules with strong generalization ability, which comes from the artificial neural network ensemble, and strong comprehensibility, which comes from rule induction.
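
The data-generation step described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names (`ensemble_predict`, `build_rule_training_set`), majority voting as the combination rule, uniform sampling within per-feature bounds, and the threshold functions standing in for trained networks are all assumptions made here for the example. The resulting data set would then be passed to a rule learner such as C4.5 Rule, which is not shown.

```python
import random
from collections import Counter
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Classifier = Callable[[Vector], int]


def ensemble_predict(members: List[Classifier], x: Vector) -> int:
    """Combine member outputs by majority vote (an assumed combination rule)."""
    votes = Counter(m(x) for m in members)
    return votes.most_common(1)[0][0]


def build_rule_training_set(
    members: List[Classifier],
    train_x: List[Vector],
    n_random: int,
    bounds: List[Tuple[float, float]],
    rng: random.Random,
) -> List[Tuple[Vector, int]]:
    """Relabel the original instances with the ensemble's output, then
    append randomly generated instances labeled the same way."""
    # Step 1: replace the expected labels with the ensemble's labels.
    relabeled = [(x, ensemble_predict(members, x)) for x in train_x]
    # Step 2: optionally add random feature vectors, labeled by the ensemble.
    # Uniform sampling within per-feature bounds is an assumption.
    extra = []
    for _ in range(n_random):
        x = tuple(rng.uniform(lo, hi) for lo, hi in bounds)
        extra.append((x, ensemble_predict(members, x)))
    return relabeled + extra


# Hypothetical stand-ins for trained networks: simple threshold functions.
members = [
    lambda x: int(x[0] > 0.5),
    lambda x: int(x[0] > 0.4),
    lambda x: int(x[1] > 0.9),
]
train_x = [(0.45, 0.95), (0.1, 0.2), (0.8, 0.1)]
data = build_rule_training_set(
    members,
    train_x,
    n_random=5,
    bounds=[(0.0, 1.0), (0.0, 1.0)],
    rng=random.Random(0),
)
```

Here `data` holds the three original feature vectors with ensemble-assigned labels followed by five random instances. Because every label comes from the ensemble, a rule learner trained on `data` approximates the ensemble's decision function rather than the original, possibly noisy, labels.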