Mining flipping correlations from large datasets with taxonomies

  • Authors:
  • Marina Barsky;Sangkyum Kim;Tim Weninger;Jiawei Han

  • Affiliations:
  • Univ. of Victoria, BC, Canada;Univ. of Illinois at Urbana-Champaign;Univ. of Illinois at Urbana-Champaign;Univ. of Illinois at Urbana-Champaign

  • Venue:
  • Proceedings of the VLDB Endowment
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we introduce a new type of pattern -- a flipping correlation pattern. The flipping patterns are obtained from contrasting the correlations between items at different levels of abstraction. They represent surprising correlations, both positive and negative, which are specific for a given abstraction level, and which "flip" from positive to negative and vice versa when items are generalized to a higher level of abstraction. We design an efficient algorithm for finding flipping correlations, the Flipper algorithm, which outperforms naïve pattern mining methods by several orders of magnitude. We apply Flipper to real-life datasets and show that the discovered patterns are non-redundant, surprising and actionable. Flipper finds strong contrasting correlations in itemsets with low-to-medium support, while existing techniques cannot handle the pattern discovery in this frequency range.