Efficient discovery of functional dependencies with degrees of satisfaction: Research Articles

  • Authors:
  • Qiang Wei;Guoqing Chen

  • Affiliations:
  • School of Economics and Management, Tsinghua University Beijing 100084, China;School of Economics and Management, Tsinghua University Beijing 100084, China

  • Venue:
  • International Journal of Intelligent Systems - Intelligent and Soft Computing Techniques for Information Processing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Functional dependency (FD) is an important type of semantic knowledge reflecting integrity constraints in databases, and has nowadays attracted an increasing amount of research attention in data mining. Traditionally, FD is defined in the light of precise or complete data, and can hardly tolerate partial truth due to imprecise or incomplete data (such as noises, nulls, etc.) that may often exist in massive databases, or due to a very tiny insignificance of tuple differences in a huge volume of data. Based on the notion of functional dependencies with degrees of satisfaction (FDs)d, this article presents an efficient approach to discovering all satisfied (FDs)d using some important results obtained from exploration of (FDs)d properties such as extended Armstrong-like axioms and their derivatives. In this way, many dependencies can be inferred from previously discovered ones without scanning databases, and those unsatisfied ones could be filtered out inside (rather than after) the mining process. Fuzzy relation matrix operation is used to infer transitive dependencies in the mining algorithm. Finally, the efficiency is demonstrated with data experiments. © 2004 Wiley Periodicals, Inc. Int J Int Syst 19: 1089–1110, 2004.