Improving the prediction accuracy of liver disorder disease with oversampling

  • Authors:
  • Hyontai Sug

  • Affiliations:
  • Division of Computer and Information Engineering, Dongseo University, Busan, Republic of Korea

  • Venue:
  • AMERICAN-MATH'12/CEA'12 Proceedings of the 6th WSEAS international conference on Computer Engineering and Applications, and Proceedings of the 2012 American conference on Applied Mathematics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The complexity of liver makes it easily affected by disease of disorder. So diagnosing liver disorder disease is a high interest to data miners, and decision trees have been useful data mining tools to diagnose the disease, but the accuracy of decision trees has been limited due to insufficient data. In order to generate more accurate decision trees for liver disorder disease this paper suggests a method based on over-sampling in minor classes to compensate the insufficiency of data effectively. Experiments were done with two representative algorithms of decision trees, C4.5 and CART, and a data set, 'BUPA liver disorder', and showed the validity of the method.