Efficient outlier detection algorithm for heterogeneous data streams

  • Authors:
  • Jiadong Ren;Qunhui Wu;Jia Zhang;Jiadong Ren;Changzhen Hu

  • Affiliations:
  • College of Information Science and Engineering, Yanshan University, Qinhuangdao City, P.R.China;College of Information Science and Engineering, Yanshan University, Qinhuangdao City, P.R.China;College of Information Science and Engineering, Yanshan University, Qinhuangdao City, P.R.China;School of Computer Science and Technology, Beijing Institute of Technology, Beijing City, P.R.China;School of Computer Science and Technology, Beijing Institute of Technology, Beijing City, P.R.China

  • Venue:
  • FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data streams outlier mining is an important and active research issue in anomaly detection. Most of the existing outlier detection algorithms can only manipulate numeric attributes or categorical attributes. In this paper, we propose an efficient outlier detection algorithm based on heterogeneous data streams, which partitions the stream in chunks. Then each chunk is clustered and the corresponding clustering results are stored in cluster references. The representation degree and the number of adjacent cluster references of each cluster reference are computed to generate the final outlier references, which include potential outliers. Experimental results show that our approach has higher detection precision and better scalability.