An interactive approach to outlier detection

  • Authors:
  • R. M. Konijn;W. Kowalczyk

  • Affiliations:
  • Department of Computer Science, Vrije Universiteit Amsterdam;Department of Computer Science, Vrije Universiteit Amsterdam

  • Venue:
  • RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe an interactive approach for finding outliers in big sets of records, such as collected by banks, insurance companies, web shops. The key idea behind our approach is the usage of an easy-to-compute and easy-to-interpret outlier score function. This function is used to identify a set of potential outliers. The outliers, organized in clusters, are then presented to a domain expert, together with some context information, such as characteristics of clusters and distribution of scores. Consequently, they are analyzed, labelled as non-explainable or explainable, and removed from the data. The whole process is iterated several times, until no more interesting outliers can be found.