More efficient PAC-learning of DNF with membership queries under the uniform distribution

  • Authors:
  • Nader H. Bshouty;Jeffrey C. Jackson;Christino Tamon

  • Affiliations:
  • Computer Science Department, Technion Israel Institute of Technology, Haifa 32000, Israel;Mathematics and Computer Science Department, Duquesne University, 600 Forbes Avenue, Pittsburgh, PA;Department of Mathematics and Computer Science, Clarkson University, Potsdam, NY

  • Venue:
  • Journal of Computer and System Sciences
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

An efficient algorithm exists for learning disjunctive normal form (DNF) expressions in the uniformdistribution PAC learning model with membership queries (J. Comput. System Sci. 55 (1997) 414), but in practice the algorithm can only be applied to small problems. We present several modifications to the algorithm that substantially improve its asymptotic efficiency. First, we show how to significantly improve the time and sample complexity of a key subprogram, resulting in similar improvements in the bounds on the overall DNF algorithm. We also apply known methods to convert the resulting algorithm to an attribute efficient algorithm. Furthermore, we develop a technique for lower bounding the sample size required for PAC learning with membership queries under a fixed distribution and apply this technique to produce a lower bound on the number of membership queries needed for the uniform-distribution DNF learning problem. Finally, we present a learning algorithm for DNF that is attribute efficient in its use of random bits.