Use of Classification Algorithms in Noise Detection and Elimination

  • Authors:
  • André L. Miranda;Luís Paulo Garcia;André C. Carvalho;Ana C. Lorena

  • Affiliations:
  • Instituto de Ciências Matemáticas e Computação, Universidade de São Paulo USP, São Carlos, Brazil 13560-970;Instituto de Ciências Matemáticas e Computação, Universidade de São Paulo USP, São Carlos, Brazil 13560-970;Instituto de Ciências Matemáticas e Computação, Universidade de São Paulo USP, São Carlos, Brazil 13560-970;Centro de Matemática, Computação e Cognição, Universidade Federal do ABC UFABC, Santo André, Brazil 09090-400

  • Venue:
  • HAIS '09 Proceedings of the 4th International Conference on Hybrid Artificial Intelligence Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data sets in Bioinformatics usually present a high level of noise. Various processes involved in biological data collection and preparation may be responsible for the introduction of this noise, such as the imprecision inherent to laboratory experiments generating these data. Using noisy data in the induction of classifiers through Machine Learning techniques may harm the classifiers prediction performance. Therefore, the predictions of these classifiers may be used for guiding noise detection and removal. This work compares three approaches for the elimination of noisy data from Bioinformatics data sets using Machine Learning classifiers: the first is based in the removal of the detected noisy examples, the second tries to reclassify these data and the third technique, named hybrid, unifies the previous approaches.