Improving classification for microarray data sets by constructing synthetic data

  • Authors:
  • Shun Bian;Wenjia Wang

  • Affiliations:
  • School of Computing Sciences, University of East Anglia, Norwich, UK;School of Computing Sciences, University of East Anglia, Norwich, UK

  • Venue:
  • CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Microarray technology has been widely used in biological and medical research to observe a large number of gene expressions. However, such experiments are usually carried out with few replica or instances, which may lead to poor modelling and analysis. This paper suggests an approach to improve classification by using synthetic data. A new algorithm is proposed to estimate synthetic data value and the generated data are labelled by ensemble methods. Experiments with artificial data and real world data demonstrate that the proposed algorithm is able to generate synthetic data on uncertain regions of classifiers to improve effectiveness and efficiency of classification on microarray data sets.