High dimensional visual data classification

  • Authors:
  • François Poulet

  • Affiliations:
  • ESIEA, Parc Universitaire de Laval-Changé, Laval, France

  • Venue:
  • VIEW'06 Proceedings of the 1st visual information expert conference on Pixelization paradigm
  • Year:
  • 2006

Abstract

We present new visual data mining algorithms for interactive decision tree construction on large datasets. The volume of stored data is constantly increasing, but current visual data mining (and visualization) methods, even pixelization methods, are well known to be limited in the number of items and dimensions they can handle. One way to improve these methods is to use a higher-level representation of the data, for example a symbolic data representation. Our new interactive decision tree construction algorithms deal with interval and taxonomical data. With such a representation, we can handle potentially very large datasets because we work with the higher-level representation rather than the original data. Interactive algorithms are examples of a new data mining approach that aims to involve the user more intensively in the process. The main advantages of this user-centered approach are the increased confidence in and comprehensibility of the obtained model, because the user was involved in its construction, and the possible use of human pattern recognition capabilities. We present some results obtained on very large datasets.
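
To make the idea of a higher-level, interval-valued representation concrete, the following is a minimal sketch of how raw records might be collapsed into one interval description per group. It is an illustration only, not the paper's algorithm: the function name `to_interval_data`, the grouping by a class-like key, and the sample rows are all assumptions introduced here.

```python
from collections import defaultdict


def to_interval_data(rows, group_key, numeric_keys):
    """Collapse raw rows into one interval-valued description per group.

    Hypothetical helper: each numeric attribute of a group is summarized
    by its (min, max) interval, so later processing can work on a handful
    of symbolic descriptions instead of every original record.
    """
    bounds = defaultdict(dict)
    for row in rows:
        group = row[group_key]
        for key in numeric_keys:
            value = row[key]
            # Start from a degenerate interval, then widen it as needed.
            low, high = bounds[group].get(key, (value, value))
            bounds[group][key] = (min(low, value), max(high, value))
    return dict(bounds)


# Made-up sample records, used only to exercise the sketch.
rows = [
    {"species": "a", "length": 1.0, "width": 0.2},
    {"species": "a", "length": 1.4, "width": 0.3},
    {"species": "b", "length": 4.5, "width": 1.5},
    {"species": "b", "length": 5.1, "width": 1.8},
]

symbolic = to_interval_data(rows, "species", ["length", "width"])
for group, intervals in symbolic.items():
    print(group, intervals)
```

In this sketch the number of symbolic descriptions depends on the number of groups rather than the number of original records, which is the property the abstract relies on for scaling to very large datasets.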