Attribute-Oriented Induction Using Domain Generalization Graphs

  • Authors:
  • Howard J. Hamilton; Robert J. Hilderman; Nick Cercone

  • Venue:
  • ICTAI '96 Proceedings of the 8th International Conference on Tools with Artificial Intelligence
  • Year:
  • 1996

Abstract

Attribute-oriented induction summarizes the information in a relational database by repeatedly replacing specific attribute values with more general concepts according to user-defined concept hierarchies. We show how domain generalization graphs can be constructed from multiple concept hierarchies associated with an attribute, describe how these graphs can be used to control the generalization of a set of attributes, and present the Multi-Attribute Generalization algorithm for attribute-oriented induction using domain generalization graphs. Based upon a generate-and-test approach, the algorithm generates all possible combinations of nodes from the domain generalization graphs associated with the individual attributes, to produce all possible generalized relations for the set of attributes. We rank the interestingness of the resulting generalized relations using measures based upon relative entropy and variance. Our experiments show that these measures provide a basis for analyzing summary data from relational databases. Variance appears more useful because it tends to rank the less complex generalized relations (i.e., those with few attributes and/or few tuples) as more interesting.
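
The generate-and-test idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Multi-Attribute Generalization algorithm. The toy relation, the attribute names, the hierarchy mappings, and the function names (`generalize`, `variance_score`, `all_generalizations`) are all hypothetical. Each domain generalization graph is flattened here to a plain list of its nodes, which is enough to enumerate node combinations but ignores the graph's edges. The variance measure is assumed to be the variance of the tuple probabilities around a uniform distribution; the abstract does not give the exact formula.

```python
from collections import Counter
from itertools import product


def identity(value):
    return value


def any_value(_):
    return "ANY"


# Hypothetical concept hierarchy for the "city" attribute.
CITY_TO_PROVINCE = {"Regina": "SK", "Saskatoon": "SK", "Calgary": "AB"}

# Each attribute's domain generalization graph, flattened to a list of
# (node name, mapping from raw value to that node's domain) pairs.
DGGS = {
    "city": [
        ("city", identity),
        ("province", CITY_TO_PROVINCE.get),
        ("ANY", any_value),
    ],
    "age": [
        ("age", identity),
        ("decade", lambda a: f"{a // 10 * 10}s"),
        ("ANY", any_value),
    ],
}

# Hypothetical input relation.
RELATION = [
    {"city": "Regina", "age": 23},
    {"city": "Saskatoon", "age": 27},
    {"city": "Calgary", "age": 35},
    {"city": "Regina", "age": 31},
]


def generalize(relation, attrs, combo):
    """Generalize every tuple to the chosen nodes and count merged duplicates."""
    return Counter(
        tuple(fn(row[attr]) for attr, (_, fn) in zip(attrs, combo))
        for row in relation
    )


def variance_score(counts):
    """Assumed measure: variance of tuple probabilities around uniform (1/m)."""
    m = len(counts)
    if m < 2:
        return 0.0
    total = sum(counts.values())
    q = 1 / m
    return sum((c / total - q) ** 2 for c in counts.values()) / (m - 1)


def all_generalizations(relation, dggs):
    """Generate one generalized relation per combination of DGG nodes."""
    attrs = list(dggs)
    results = []
    for combo in product(*(dggs[a] for a in attrs)):
        names = tuple(name for name, _ in combo)
        counts = generalize(relation, attrs, combo)
        results.append((names, counts, variance_score(counts)))
    return results


if __name__ == "__main__":
    for names, counts, score in all_generalizations(RELATION, DGGS):
        print(names, dict(counts), round(score, 4))
```

With three nodes per attribute, the sketch produces 3 × 3 = 9 generalized relations. The number of combinations grows multiplicatively with the number of attributes and nodes, which is the main cost of an exhaustive generate-and-test approach.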