Class Noise and Supervised Learning in Medical Domains: The Effect of Feature Extraction

  • Authors:
  • Mykola Pechenizkiy;Alexey Tsymbal;Seppo Puuronen;Oleksandr Pechenizkiy

  • Affiliations:
  • Univ. of Jyväskylä, Finland;Trinity College Dublin, Ireland;Univ. of Jyväskylä, Finland;Univ. of Jyväskylä, Finland

  • Venue:
  • CBMS '06 Proceedings of the 19th IEEE Symposium on Computer-Based Medical Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Inductive learning systems have been successfully applied in a number of medical domains. It is generally accepted that the highest accuracy results that an inductive learning system can achieve depend on the quality of data and on the appropriate selection of a learning algorithm for the data. In this paper we analyze the effect of class noise on supervised learning in medical domains. We review the related work on learning from noisy data and propose to use feature extraction as a pre-processing step to diminish the effect of class noise on the learning process. Our experiments with 8 medical datasets show that feature extraction indeed helps to deal with class noise. It clearly results in higher classification accuracy of learnt models without the separate explicit elimination of noisy instances.