Comparative Analysis of Multidimensional, Quantitative Data

  • Authors:
  • Alexander Lex;Marc Streit;Christian Partl;Dieter Schmalstieg

  • Affiliations:
  • -;-;-;-

  • Venue:
  • IEEE Transactions on Visualization and Computer Graphics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

When analyzing multidimensional, quantitative data, the comparison of two or more groups of dimensions is a commontask. Typical sources of such data are experiments in biology, physics or engineering, which are conducted in different configurationsand use replicates to ensure statistically significant results. One common way to analyze this data is to filter it using statistical methodsand then run clustering algorithms to group similar values. The clustering results can be visualized using heat maps, which showdifferences between groups as changes in color. However, in cases where groups of dimensions have an a priori meaning, it is notdesirable to cluster all dimensions combined, since a clustering algorithm can fragment continuous blocks of records. Furthermore,identifying relevant elements in heat maps becomes more difficult as the number of dimensions increases. To aid in such situations,we have developed Matchmaker, a visualization technique that allows researchers to arbitrarily arrange and compare multiple groupsof dimensions at the same time. We create separate groups of dimensions which can be clustered individually, and place them in anarrangement of heat maps reminiscent of parallel coordinates. To identify relations, we render bundled curves and ribbons betweenrelated records in different groups. We then allow interactive drill-downs using enlarged detail views of the data, which enable in-depthcomparisons of clusters between groups. To reduce visual clutter, we minimize crossings between the views. This paper concludeswith two case studies. The first demonstrates the value of our technique for the comparison of clustering algorithms. In the second,biologists use our system to investigate why certain strains of mice develop liver disease while others remain healthy, informallyshowing the efficacy of our system when analyzing multidimensional data containing distinct groups of dimensions.