Visual terrain analysis of high-dimensional datasets

  • Authors:
  • Wenyuan Li; Kok-Leong Ong; Wee-Keong Ng

  • Affiliations:
  • Centre for Advanced Information Systems, Nanyang Technological University, Singapore; School of Information Technology, Deakin University, Waurn Ponds, Victoria, Australia; Centre for Advanced Information Systems, Nanyang Technological University, Singapore

  • Venue:
  • PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
  • Year:
  • 2005

Abstract

Most real-world datasets are, to a certain degree, skewed. When they are also large, they pose one of the most difficult challenges in data analysis. We cannot ignore such datasets, as they arise frequently in a wide variety of applications. Regardless of the analytic technique used, analysis can often be made more effective if the characteristics of the dataset are known in advance. In this paper, we propose a novel technique to preprocess such datasets to obtain this insight. Our work is inspired by the resonance phenomenon, where similar objects resonate to a given response function. The key analytic result of our work is the data terrain, which shows properties of the dataset to enable effective and efficient analysis. We demonstrate our work in the context of various real-world problems. In doing so, we establish it as a tool for preprocessing data before applying computationally expensive algorithms.