Medical diagnosis with C4.5 rule preceded by artificial neural network ensemble

  • Authors:
  • Zhi-Hua Zhou;Yuan Jiang

  • Affiliations:
  • Nat. Lab. for Novel Software Technol., Nanjing Univ., China;-

  • Venue:
  • IEEE Transactions on Information Technology in Biomedicine
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Comprehensibility is very important when machine learning techniques are used in computer-aided medical diagnosis. Since an artificial neural network ensemble is composed of multiple artificial neural networks, its comprehensibility is worse than that of a single artificial neural network. In this paper, C4.5 Rule-PANE, which combines an artificial neural network ensemble with rule induction by regarding the former as a preprocess of the latter, is proposed. At first, an artificial neural network ensemble is trained. Then, a new training data set is generated by feeding the feature vectors of original training instances to the trained ensemble and replacing the expected class labels of original training instances with the class labels output from the ensemble. Additional training data may also be appended by randomly generating feature vectors and combining them with their corresponding class labels output from the ensemble. Finally, a specific rule induction approach, i.e., C4.5 Rule, is used to learn rules from the new training data set. Case studies on diabetes, hepatitis , and breast cancer show that C4.5 Rule-PANE could generate rules with strong generalization ability, which benefits from an artificial neural network ensemble, and strong comprehensibility, which benefits from rule induction.