Normalisation in the presence of lists

  • Authors:
  • Sebastian Link;Sven Hartmann

  • Affiliations:
  • Massey University, Palmerston North, New Zealand;Massey University, Palmerston North, New Zealand

  • Venue:
  • ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the relational data model (RDM), normal forms are conditions for relation schemata that a database design should satisfy to ensure an absence of processing difficulties with the database. One prime example of such a normal form is the Boyce-Codd normal form guaranteeing the absence of redundancies and update anomalies caused by functional dependencies (FDs).Many different data models have been introduced over the years trying to capture data beyond relational structures. The success of those advanced data models will depend, in particular, on the study of normal forms. In fact, finding a unifying framework and extending achievements of relational databases to deal with advanced database features such as complex object types are currently two major challenges in database design (Biskup 1995, Biskup 1998).Such a unifying approach, capturing various different data models at a time, can be based on the type systems underlying the various data models. In the present paper, we study normalisation in the presence of base, record and finite list types. Nested lists are used as a data structure whenever order matters. List types are therefore supported by many advanced data models such as genomic sequence, deductive and object-oriented data models including XML.On the basis of a finite axiomatisation of FDs in the presence of lists the Nested List Normal Form (NLNF) is proposed as a weaker normal form than BCNF. This proposal is semantically justified by formally proving that NLNF is equivalent to the absence of redundancy. Moreover, NLNF is equivalent to the absence of strong insertion and most forms of replacement anomalies, and sufficient for the absence of all types of update anomalies.