Detecting Projected Outliers in High-Dimensional Data Streams

  • Authors:
  • Ji Zhang;Qigang Gao;Hai Wang;Qing Liu;Kai Xu

  • Affiliations:
  • CSIRO ICT Center, Hobart, Australia;Dalhousie University, Halifax, Canada;Saint Mary's University, Halifax, Canada;CSIRO ICT Center, Hobart, Australia;CSIRO ICT Center, Hobart, Australia

  • Venue:
  • DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we study the problem of projected outlier detection in high dimensional data streams and propose a new technique, called Stream Projected Ouliter deTector (SPOT), to identify outliers embedded in subspaces. Sparse Subspace Template (SST), a set of subspaces obtained by unsupervised and/or supervised learning processes, is constructed in SPOT to detect projected outliers effectively. Multi-Objective Genetic Algorithm (MOGA) is employed as an effective search method for finding outlying subspaces from training data to construct SST. SST is able to carry out online self-evolution in the detection stage to cope with dynamics of data streams. The experimental results demonstrate the efficiency and effectiveness of SPOT in detecting outliers in high-dimensional data streams.