Selecting good views of high-dimensional data using class consistency

Authors:
Mike Sips;Boris Neubert;John P. Lewis;Pat Hanrahan
Affiliations:
Max Planck Center for Visual Computing Stanford, Saarbruecken;University of Konstanz;Massey University;Stanford University
Venue:
EuroVis'09 Proceedings of the 11th Eurographics / IEEE - VGTC conference on Visualization
Year:
2009

Citing 6
Cited 7

The grand tour: a tool for viewing multidimensional data

SIAM Journal on Scientific and Statistical Computing
Semiology of graphics

Semiology of graphics
A rank-by-feature framework for interactive exploration of multidimensional data

Information Visualization
High-Dimensional Visual Analytics: Interactive Exploration Guided by Pairwise Views of Point Distributions

IEEE Transactions on Visualization and Computer Graphics
A Projection Pursuit Algorithm for Exploratory Data Analysis

IEEE Transactions on Computers
Robust linear dimensionality reduction

IEEE Transactions on Visualization and Computer Graphics

Visual quality metrics and human perception: an initial study on 2D projections of large multidimensional data

Proceedings of the International Conference on Advanced Visual Interfaces
Techniques for precision-based visual analysis of projected data

Information Visualization - Special issue on selected papers from visualization and data analysis 2010
Multi-objective genetic programming for visual analytics

EuroGP'11 Proceedings of the 14th European conference on Genetic programming
MusiCube: a visual music recommendation system featuring interactive evolutionary computing

Proceedings of the 2011 Visual Information Communication - International Symposium
Dual analysis of DNA microarrays

Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies
Assisted descriptor selection based on visual comparative data analysis

EuroVis'11 Proceedings of the 13th Eurographics / IEEE - VGTC conference on Visualization
Special Section on Visual Analytics: Visualization of cluster structure and separation in multivariate mixed data: A case study of diversity faultlines in work teams

Computers and Graphics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many visualization techniques involve mapping high-dimensional data spaces to lower-dimensional views. Unfortunately, mapping a high-dimensional data space into a scatterplot involves a loss of information; or, even worse, it can give a misleading picture of valuable structure in higher dimensions. In this paper, we propose class consistency as a measure of the quality of the mapping. Class consistency enforces the constraint that classes of n-D data are shown clearly in 2-D scatterplots. We propose two quantitative measures of class consistency, one based on the distance to the class's center of gravity, and another based on the entropies of the spatial distributions of classes. We performed an experiment where users choose good views, and show that class consistency has good precision and recall. We also evaluate both consistency measures over a range of data sets and show that these measures are efficient and robust.