On-line classification of data streams with missing values based on reinforcement learning

  • Authors:
  • Mónica Millán-Giraldo;Vicente Javier Traver;J. Salvador Sánchez

  • Affiliations:
  • Institute of New Imaging Technologies, Universitat Jaume I, Castellón, Spain;Institute of New Imaging Technologies, Universitat Jaume I, Castellón, Spain;Institute of New Imaging Technologies, Universitat Jaume I, Castellón, Spain

  • Venue:
  • IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
  • Year:
  • 2011

Abstract

In some applications, data arrive sequentially and are not available in batch form, which makes traditional classification systems difficult to use. In addition, some attribute values may be missing because of real-world conditions. In this setting, a number of decisions have to be made about how to handle each incomplete, unlabeled incoming object: how to estimate its missing attribute values, how to classify it, whether to include it in the training set, and when to ask an expert for its class label. Unfortunately, no single decision works well for all data sets. This data dependency motivates our formulation of the problem in terms of elements of reinforcement learning. To the best of our knowledge, the application of this learning paradigm to this problem is novel. The empirical results are encouraging: the proposed framework performs better and more generally than many strategies used in isolation, and it makes efficient use of human effort (requests to an expert for the class label) and computer memory (growth of the training set).
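
To make the idea of learning which handling decision suits a given stream more concrete, the following is a minimal sketch only. It is not the authors' framework: it assumes a simple epsilon-greedy action-value learner, and the class name `StrategySelector`, the action labels, and the reward values are all hypothetical choices made for illustration.

```python
import random


class StrategySelector:
    """Epsilon-greedy selector over hypothetical stream-handling strategies.

    Assumption: each action bundles one way of treating an incoming object
    (e.g. how to impute it, whether to store it, whether to query an expert).
    The abstract does not specify the actual actions or reward signal.
    """

    def __init__(self, actions, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.counts = {a: 0 for a in self.actions}    # times each action was rewarded
        self.values = {a: 0.0 for a in self.actions}  # running mean reward per action

    def choose(self):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.values[a])

    def update(self, action, reward):
        # Incremental mean: Q <- Q + (r - Q) / n
        self.counts[action] += 1
        n = self.counts[action]
        self.values[action] += (reward - self.values[action]) / n


# Hypothetical action labels, for illustration only.
ACTIONS = ["impute_and_store", "impute_and_discard", "ask_expert"]

selector = StrategySelector(ACTIONS, epsilon=0.1)
action = selector.choose()            # decide how to treat the next object
selector.update(action, reward=1.0)   # reward would come from the observed outcome
```

In a sketch like this, the reward could trade classification correctness against the cost of an expert query or of enlarging the training set; how the paper actually defines these quantities is not stated in the abstract.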