Over-Sampling from an auxiliary domain

  • Authors:
  • Samir Al-Stouhi;Abhilash Pandya

  • Affiliations:
  • Department of Electrical and Computer Engineering, Wayne State University, Detroit, MI;Department of Electrical and Computer Engineering, Wayne State University, Detroit, MI

  • Venue:
  • ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The exponential growth of data dimensions presents an obstacle in informatics as data miners try to construct ever greater training sets to overcome the theoretical limitations of statistical learning theory. Machine learning models require a minimum set of samples within each label to develop a representative hypothesis. To overcome these bounds, we developed an algorithm that can extract samples from an auxiliary domain to augment the training set. Our work exploits concepts from the "Transfer Learning" and "Imbalanced Learning" domains to expand the training set and permit standard models to be applied. We present theoretical verification of our method and demonstrate the effectiveness of our framework with experimental results on real-world data.