Pre-processing Techniques for the QSAR Problem

  • Authors:
  • L. Dumitriu;M. -V. Craciun;A. Cocu;C. Segal

  • Affiliations:
  • Dept. of Computer Sci&Eng, University Dunăărea de Jos of Galaţi, Romania;Dept. of Computer Sci&Eng, University Dunăărea de Jos of Galaţi, Romania;Dept. of Computer Sci&Eng, University Dunăărea de Jos of Galaţi, Romania;Dept. of Computer Sci&Eng, University Dunăărea de Jos of Galaţi, Romania

  • Venue:
  • Proceedings of the 2008 conference on New Trends in Multimedia and Network Information Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Predictive Toxicology (PT) attempts to describe the relationships between the chemical structure of chemical compounds and biological and toxicological processes. The most important issue related to real-world PT problems is the huge number of the chemical descriptors. A secondary issue is the quality of the data since irrelevant, redundant, noisy, and unreliable data have a negative impact on the prediction results. The pre-processing step of Data Mining deals with complexity reduction as well as data quality improvement through feature selection, data cleaning, and noise reduction. In this paper, we present some of the issues that can be taken into account for preparing data before the actual knowledge discovery is performed.