Selecting good views of high-dimensional data using class consistency

  • Authors:
  • Mike Sips;Boris Neubert;John P. Lewis;Pat Hanrahan

  • Affiliations:
  • Max Planck Center for Visual Computing Stanford, Saarbruecken;University of Konstanz;Massey University;Stanford University

  • Venue:
  • EuroVis'09 Proceedings of the 11th Eurographics / IEEE - VGTC conference on Visualization
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many visualization techniques involve mapping high-dimensional data spaces to lower-dimensional views. Unfortunately, mapping a high-dimensional data space into a scatterplot involves a loss of information; or, even worse, it can give a misleading picture of valuable structure in higher dimensions. In this paper, we propose class consistency as a measure of the quality of the mapping. Class consistency enforces the constraint that classes of n-D data are shown clearly in 2-D scatterplots. We propose two quantitative measures of class consistency, one based on the distance to the class's center of gravity, and another based on the entropies of the spatial distributions of classes. We performed an experiment where users choose good views, and show that class consistency has good precision and recall. We also evaluate both consistency measures over a range of data sets and show that these measures are efficient and robust.