Logical analysis of binary data with missing bits

  • Authors:
  • Endre Boros;Toshihide Ibaraki;Kazuhisa Makino

  • Affiliations:
  • RUTCOR, Rutgers University, 640 Bartholomew Road, Piscataway, NJ 08854-8003, USA;Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Kyoto 606-8501, Japan;Department of Systems and Human Science, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka 560-8531, Japan

  • Venue:
  • Artificial Intelligence
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

We model a given pair of sets of positive and negative examples, each of which may contain missing components, as a partially defined Boolean function with missing bits (pBmb) (T~,F~), where T~-@?{0,1*}^n and F~-@?{0,1*}^n, and ''*'' stands for a missing bit. Then we consider the problem of establishing a Boolean function (an extension) f : 0, 1^n - 0, 1 belonging to a given function class C, such that f is true (respectively, false) for every vector in T~ (respectively, in F~. This is a fundamental problem, encountered in many areas such as learning theory, pattern recognition, example-based knowledge bases, logical analysis of data, knowledge discovery and data mining. In this paper, depending upon how to deal with missing bits, we formulate three types of extensions called robust, consistent and most robust extensions, for various classes of Boolean functions such as general, positive, Horn, threshold, decomposable and k-DNF. The complexity of the associated problems are then clarified; some of them are solvable in polynomial time while the others are NP-hard.