How an Ensemble Method Can Compute a Comprehensible Model

  • Authors:
  • José L. Triviño-Rodriguez;Amparo Ruiz-Sepúlveda;Rafael Morales-Bueno

  • Affiliations:
  • Department of Computer Science and Artificial Intelligence, University of Málaga, Málaga, (Spain);Department of Computer Science and Artificial Intelligence, University of Málaga, Málaga, (Spain);Department of Computer Science and Artificial Intelligence, University of Málaga, Málaga, (Spain)

  • Venue:
  • DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Ensemble machine learning methods have been developed as an easy way to improve accuracy in theoretical and practical machine learning problems. However, hypotheses computed by these methods are often considered difficult to understand. This could be an important drawback in fields such as data mining and knowledge discovery where comprehensibility is a main criterion. This paper aims to explore the area of trade-offs between accuracy and comprehensibility in ensemble machine learning methods by proposing a learning method that combines the accuracy of boosting algorithms with the comprehensibility of decision trees. The approach described in this paper avoids the voting scheme of boosting by computing simple classification rules from the boosting learning approach while achieving the accuracy of AdaBoost learning algorithm in a set of UCI datasets. The comprehensibility of the hypothesis is thus enhanced without any loss of accuracy.