Finding trees from unordered 0–1 data

  • Authors:
  • Hannes Heikinheimo;Heikki Mannila;Jouni K. Seppänen

  • Affiliations:
  • HIIT Basic Research Unit, Lab. Computer and Information Science, Helsinki University of Technology, Finland;HIIT Basic Research Unit, Lab. Computer and Information Science, Helsinki University of Technology, Finland;HIIT Basic Research Unit, Lab. Computer and Information Science, Helsinki University of Technology, Finland

  • Venue:
  • PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Tree structures are a natural way of describing occurrence relationships between attributes in a dataset. We define a new class of tree patterns for unordered 0–1 data and consider the problem of discovering frequently occurring members of this pattern class. Intuitively, a tree T occurs in a row u of the data, if the attributes of T that occur in u form a subtree of T containing the root. We show that this definition has advantageous properties: only shallow trees have a significant probability of occurring in random data, and the definition allows a simple levelwise algorithm for mining all frequently occurring trees. We demonstrate with empirical results that the method is feasible and that it discovers interesting trees in real data.