A New Approach to Outlier Detection

  • Authors:
  • Lancang Yang;Bing Shi;Xueqin Zhang;Lei Qiao

  • Affiliations:
  • School of Computer Science and Technology, Shandong University, Jinan 250061, P.R. China;School of Computer Science and Technology, Shandong University, Jinan 250061, P.R. China;School of Information Science and Engineering, Jinan University, Jinan 250022, P.R. China;School of Computer Science and Technology, Shandong University, Jinan 250061, P.R. China

  • Venue:
  • ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

For many data mining applications, finding the rare instances or the outliers is more interesting than finding the common patterns. At present, many automated outlier detection methods are available, however, most of those are limited by assumptions of a distribution or require upper and lower predefined boundaries in which the data should exist. Whereas a distribution is often unknown, and enough information may not exist about a set of data to be able to determine reliable upper and lower boundaries. For these cases, a new dissimilarity function was defined, which can be viewed as fitness function of genetic algorithm, and a GA-based outlier detection method was formed in this paper. This method allows for detection of multiple outliers, not just one at a time. The illustrations show that the improved approach can automatically detect outliers, and performs better than GLOF approach.