Extracting generalization hierarchies from relational databases: A reverse engineering approach

  • Authors:
  • Nadira Lammari;Isabelle Comyn-Wattiau;Jacky Akoka

  • Affiliations:
  • CEDRIC-CNAM, 292 Rue Saint Martin, 75141 Paris Cedex 03, France;CEDRIC-CNAM and ESSEC, 292 Rue Saint Martin, 75141 Paris Cedex 03, France;CEDRIC-CNAM and INT, 292 Rue Saint Martin, 75141 Paris Cedex 03, France

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Relational Data Base Management Systems (RDBMS) are currently the most popular database management systems. The relational model is a simple and powerful model for representing real world applications. However, it lacks the expressiveness of conceptual models. Unlike the latter, the relational model does not offer the generalization abstraction. Therefore, it does not allow the designer to represent directly a large variety of integrity constraints. Moreover, inclusion dependencies formalizing inter-relational constraints cannot directly be represented in the relational model, due to the fact that its basic construct, the relation, is the unique structure. Finally, relational databases do not enable a natural way to represent inheritances. In this paper we describe a reverse engineering method which particularly deals with the elicitation of inheritance links embedded in a relational database, combining heuristic and algorithmic approaches. We provide rules for detecting intersection constraints and inclusion dependencies. Heuristics are proposed for understanding null value semantics. Finally, we present decision rules for detecting existence dependencies and translating them into IS-A hierarchies among entities. An example is used to illustrate our approach.