A Search for the Best Data Mining Method to Predict Melanoma

  • Authors:
  • Jerzy W. Grzymala-Busse;Zdzislaw S. Hippe

  • Affiliations:
  • -;-

  • Venue:
  • TSCTC '02 Proceedings of the Third International Conference on Rough Sets and Current Trends in Computing
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Our main objective was to decrease the error rate of diagnosis of melanoma, a very dangerous skin cancer. Since diagnosticians routinely use the so-called ABCD formula for melanoma prediction, our main concern was to improve the ABCD formula. In our search for the best coefficients of the ABCD formula we used two different discretization methods, agglomerative and divisive, both based on cluster analysis. In our experiments we used the data mining system LERS (Learning from Examples based on Rough Sets). As a result of more than 30,000 experiments, two optimal ABCD formulas were found, one with the use of the agglomerative method, the other one with divisive. These formulas were evaluated using statistical methods. Our final conclusion is that it is more important to use an appropriate discretization method than to modify the ABCD formula. Also, the divisive method of discretization is better than agglomerative. Finally, diagnosis of melanoma without taking into account results of the ABCD formula is much worse, i.e., the error rate is significantly greater, comparing with any form of the ABCD formula.