Unsupervised Classifier Selection Based on Two-Sample Test

  • Authors:
  • Timo Aho;Tapio Elomaa;Jussi Kujala

  • Affiliations:
  • Department of Software Systems, Tampere University of Technology, Tampere, Finland FI-33101;Department of Software Systems, Tampere University of Technology, Tampere, Finland FI-33101;Department of Software Systems, Tampere University of Technology, Tampere, Finland FI-33101

  • Venue:
  • DS '08 Proceedings of the 11th International Conference on Discovery Science
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a well-founded method of ranking a pool of mtrained classifiers by their suitability for the current input of ninstances. It can be used when dynamically selecting a single classifier as well as in weighting the base classifiers in an ensemble. No classifiers are executed during the process. Thus, the ninstances, based on which we select the classifier, can as well be unlabeled. This is rare in previous work. The method works by comparing the training distributions of classifiers with the input distribution. Hence, the feasibility for unsupervised classification comes with a price of maintaining a small sample of the training data for each classifier in the pool.In the general case our method takes time and space , where tis the size of the stored sample from the training distribution for each classifier. However, for commonly used Gaussian and polynomial kernel functions we can execute the method more efficiently. In our experiments the proposed method was found to be accurate.