Discovery of Constraints from Data for Information System Reverse Engineering

  • Authors:
  • Wieming Lim

  • Affiliations:
  • -

  • Venue:
  • ASWEC '97 Proceedings of the Australian Software Engineering Conference
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

The extraction of functional dependencies is a fundamental activity in the database design recovery process which is part of on an overall information systems reverse engineering effort. Existing algorithms for this task are computationally expensive and appear to be infeasible if applied to large legacy database instances, e.g., their performance deteriorated when number of attributes or/and instances is large and they cannot tolerate erroneous data that may occur in deployed commercial systems. We propose two algorithms for discovering functional dependencies from data. The collective-FD algorithm, which is based on top-down approach, eliminates redundant specialized functional dependencies to be proposed. The attribute-list algorithm, which is based on bottom-up approach, enables more accurate functional dependency hypotheses to be discovered. In anticipating noisy data, we propose an effective method to discover possible data errors and compute partial functional dependencies. The result is an error-tolerant functional dependencies discovery approach that is more applicable to real world databases for design recovery