Assisted descriptor selection based on visual comparative data analysis

  • Authors:
  • Sebastian Bremm;Tatiana von Landesberger;Jürgen Bernard;Tobias Schreck

  • Affiliations:
  • Technische Universität Darmstadt, Germany;Technische Universität Darmstadt and 2Fraunhofer Institute for Computer Graphics Research, Darmstadt, Germany;Technische Universität Darmstadt, Germany;Technische Universität Darmstadt, Germany

  • Venue:
  • EuroVis'11 Proceedings of the 13th Eurographics / IEEE - VGTC conference on Visualization
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Exploration and selection of data descriptors representing objects using a set of features are important components in many data analysis tasks. Usually, for a given dataset, an optimal data description does not exist, as the suitable data representation is strongly use case dependent. Many solutions for selecting a suitable data description have been proposed. In most instances, they require data labels and often are black box approaches. Non-expert users have difficulties to comprehend the coherency of input, parameters, and output of these algorithms. Alternative approaches, interactive systems for visual feature selection, overburden the user with an overwhelming set of options and data views. Therefore, it is essential to offer the users a guidance in this analytical process. In this paper, we present a novel system for data description selection, which facilitates the user's access to the data analysis process. As finding of suitable data description consists of several steps, we support the user with guidance. Our system combines automatic data analysis with interactive visualizations. By this, the system provides a recommendation for suitable data descriptor selections. It supports the comparison of data descriptors with differing dimensionality for unlabeled data. We propose specialized scores and interactive views for descriptor comparison. The visualization techniques are scatterplot-based and grid-based. For the latter case, we apply Self-Organizing Maps as adaptive grids which are well suited for large multi-dimensional data sets. As an example, we demonstrate the usability of our system on a real-world biochemical application.