Analyzing the role of dimension arrangement for data visualization in Radviz

  • Authors:
  • Luigi Di Caro; Vanessa Frias-Martinez; Enrique Frias-Martinez

  • Affiliations:
  • Department of Computer Science, Università di Torino, Torino, Italy; Data Mining and User Modeling Group, Telefonica Research, Madrid, Spain; Data Mining and User Modeling Group, Telefonica Research, Madrid, Spain

  • Venue:
  • PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
  • Year:
  • 2010

Visualization

Abstract

The Radial Coordinate Visualization (Radviz) technique has been widely used to effectively evaluate the existence of patterns in highly dimensional data sets. A crucial aspect of this technique lies in the arrangement of the dimensions, which determines the quality of the posterior visualization. Dimension arrangement (DA) has been shown to be an NP-problem, and different heuristics have been proposed to solve it using optimization techniques. However, very little work has focused on understanding the relation between the arrangement of the dimensions and the quality of the visualization. In this paper we first present two variations of the DA problem: (1) a Radviz-independent approach and (2) a Radviz-dependent approach. We then describe the use of the Davies-Bouldin index to automatically evaluate the quality of a visualization, i.e., its visual usefulness. Our empirical evaluation is extensive and uses both real and synthetic data sets in order to evaluate our proposed methods and to fully understand the impact that parameters such as the number of samples, the number of dimensions, or cluster separability have on the relation between the optimization algorithm and the visualization tool.
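
To make the idea in the abstract concrete, here is a minimal sketch, not the authors' implementation, of how a dimension arrangement could be scored with the Davies-Bouldin index after a standard Radviz projection. It assumes the data are already scaled to [0, 1] and that class labels are known. The function names (`radviz_project`, `davies_bouldin`, `best_arrangement`) and the toy data are hypothetical, and the exhaustive search over orderings merely stands in for the optimization heuristics the paper discusses, so it is only practical for a handful of dimensions.

```python
import math
from itertools import permutations


def radviz_project(rows, order):
    """Project rows (values scaled to [0, 1]) onto the Radviz plane.

    `order` lists dimension indices in the sequence in which their anchors
    are placed, equally spaced, around the unit circle.
    """
    n = len(order)
    anchors = [(math.cos(2 * math.pi * k / n), math.sin(2 * math.pi * k / n))
               for k in range(n)]
    points = []
    for row in rows:
        total = sum(row[d] for d in order)
        if total == 0:
            # An all-zero row has no pull from any anchor; place it at the centre.
            points.append((0.0, 0.0))
            continue
        x = sum(row[d] * anchors[k][0] for k, d in enumerate(order)) / total
        y = sum(row[d] * anchors[k][1] for k, d in enumerate(order)) / total
        points.append((x, y))
    return points


def davies_bouldin(points, labels):
    """Davies-Bouldin index of labelled 2D points (lower means better separated)."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    centroids = {lab: (sum(p[0] for p in ps) / len(ps),
                       sum(p[1] for p in ps) / len(ps))
                 for lab, ps in clusters.items()}
    # Scatter: mean distance of a cluster's points to its centroid.
    scatter = {lab: sum(math.dist(p, centroids[lab]) for p in ps) / len(ps)
               for lab, ps in clusters.items()}
    total = 0.0
    for i in clusters:
        worst = 0.0
        for j in clusters:
            if i == j:
                continue
            gap = math.dist(centroids[i], centroids[j])
            ratio = math.inf if gap == 0 else (scatter[i] + scatter[j]) / gap
            worst = max(worst, ratio)
        total += worst
    return total / len(clusters)


def best_arrangement(rows, labels):
    """Exhaustively search orderings; dimension 0 is fixed to skip rotations."""
    n_dims = len(rows[0])
    candidates = [(0,) + rest for rest in permutations(range(1, n_dims))]
    return min(candidates,
               key=lambda order: davies_bouldin(radviz_project(rows, order), labels))


# Toy data (hypothetical): class "a" is high on dimensions 0 and 2,
# class "b" is high on dimensions 1 and 3.
rows = [
    [0.9, 0.1, 0.8, 0.2], [0.8, 0.2, 0.9, 0.1], [1.0, 0.1, 0.9, 0.2],
    [0.1, 0.9, 0.2, 0.8], [0.2, 0.8, 0.1, 0.9], [0.1, 1.0, 0.2, 0.9],
]
labels = ["a", "a", "a", "b", "b", "b"]

identity_order = (0, 1, 2, 3)
identity_score = davies_bouldin(radviz_project(rows, identity_order), labels)
best_order = best_arrangement(rows, labels)
best_score = davies_bouldin(radviz_project(rows, best_order), labels)
```

In this toy case the default ordering places the two dimensions that characterize each class on opposite sides of the circle, so both classes collapse toward the centre and the index is high. An ordering that places those dimensions next to each other pulls the classes apart and lowers the index, which is the kind of dependence between arrangement and visual quality that the abstract describes.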