Transfer Learning with Data Edit

  • Authors:
  • Yong Cheng;Qingyong Li

  • Affiliations:
  • Department of Computer Sciences, Beijing University of Chemical Technology, Beijing;School of Computer and Information Technology, Beijing Jiaotong University, Beijing

  • Venue:
  • ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We often face the situation where very limited labeled data are available to learn an effective classifier in target domain while there exist large amounts of labeled data with similar feature or distribution in certain relevant domains. Transfer learning aims at improving the performance of a learner in target domain given labeled data in one or more source domains. In this paper, we present an algorithm to learn effective classifier without or a few labeled data on target domain, given some labeled data with same features and similar distribution in source domain. Our algorithm uses the data edit technique to approach distribution from the source domain to the target domain by removing "mismatched" examples in source domain and adding "matched" examples in target domain. Experimental results on email classification problem have confirmed the effectiveness of the proposed algorithm.