Data Signatures and Visualization of Scientific Data Sets

  • Authors:
  • Pak Chung Wong;Harlan Foote;Ruby Leung;Dan Adams;Jim Thomas

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • IEEE Computer Graphics and Applications
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Today, as data sets used in computations grow in size and complexity, the technologies developed over the years to deal with scientific data sets have become less efficient and effective. Many frequently used operations, such as eigenvector computation, could quickly exhaust our desktop workstations once the data size reaches certain limits. On the other hand, the high-dimensional data sets we collect every day don't relieve the problem. Many conventional metric designs that build on quantitative or categorical data sets cannot be applied directly to heterogeneous data sets with multiple data types. While building new machines with more resources might conquer the data size problems, the complexity of today's computations requires a new breed of projection techniques to support analysis of the data and verification of the results. We introduce the concept of a data signature, which captures the essence of a scientific data set in a compact format, and use it to conduct analysis as if using the original. A time-dependent climate simulation data set demonstrates our approach and presents the results