ODDC: outlier detection using distance distribution clustering

  • Authors:
  • Kun Niu;Chong Huang;Shubo Zhang;Junliang Chen

  • Affiliations:
  • Dept. of Computer Sci. and Eng., Beijing University of Posts and Telecommunications, Beijing, China;Graduate University, Chinese Academy of Sciences, Beijing, China;Dept. of Strategy Research, China Telecom Beijing Institute, Beijing, China;Dept. of Computer Sci. and Eng., Beijing University of Posts and Telecommunications, Beijing, China

  • Venue:
  • PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Outlier detection is an important issue in many industrial and financial applications. Most outlier detection methods suffer from two problems: First, they need parameter tuning in accord to domain knowledge. Second, they are incapable to scale up to high dimensional space. In this paper, we propose a distance-based outlier definition and a detection algorithm ODDC (Distribution Clustering Outlier Detection). We redefine the problem by clustering in the distribution difference space rather than the original feature space. As a result, the new algorithm is stable regardless of different input and scalable to the dimensionality. Experiments on both synthetic and real datasets show that ODDC outperforms the counterpart both in effectiveness and efficiency.