Unsupervised change analysis using supervised learning

  • Authors:
  • Shohei Hido;Tsuyoshi Idé;Hisashi Kashima;Harunobu Kubo;Hirofumi Matsuzawa

  • Affiliations:
  • IBM Research, Tokyo Research Laboratory, Yamato-shi, Kanagawa, Japan;IBM Research, Tokyo Research Laboratory, Yamato-shi, Kanagawa, Japan;IBM Research, Tokyo Research Laboratory, Yamato-shi, Kanagawa, Japan;IBM Research, Tokyo Research Laboratory, Yamato-shi, Kanagawa, Japan;IBM Research, Tokyo Research Laboratory, Yamato-shi, Kanagawa, Japan

  • Venue:
  • PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a formulation of a new problem, which we call change analysis, and a novel method for solving the problem. In contrast to the existing methods of change (or outlier) detection, the goal of change analysis goes beyond detecting whether or not any changes exist. Its ultimate goal is to find the explanation of the changes.While change analysis falls in the category of unsupervised learning in nature, we propose a novel approach based on supervised learning to achieve the goal. The key idea is to use a supervised classifier for interpreting the changes. A classifier should be able to discriminate between the two data sets if they actually come from two different data sources. In other words, we use a hypothetical label to train the supervised learner, and exploit the learner for interpreting the change. Experimental results using real data show the proposed approach is promising in change analysis as well as concept drift analysis.