Contrasting correlations by an efficient double-clique condition

  • Authors:
  • Aixiang Li;Makoto Haraguchi;Yoshiaki Okubo

  • Affiliations:
  • Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan;Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan;Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Japan

  • Venue:
  • MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Contrast set mining has been well-studied to detect the change between several contrasted databases. In the previous studies, they compared the supports of an itemset and extracted the itemsets with significantly different supports across those databases. Differently, we contrast the correlations of an itemset between two contrasted databases and try to detect potential changes. Any highly correlated itemset is out of our concern in order to focus on implicitly emerging correlation. Therefore, we set correlation constraints (upper bounds) in both databases, and then extract the itemsets consisting of items that are not highly correlated in both databases, but having a significant change of correlations from one database to the other. We regard both of positive and negative correlation. We also consider correlated itemsets under conditioning by third variables. Thus so called partial correlation is also regarded. To cover the correlation notion, we use extended mutual information. In our search procedure for the correlated itemsets, we use double clique condition that is necessary for itemsets to be solutions satisfying the correlation constraints. We show its usefulness by some experiments.