PAC learning of probability distributions over a discrete domain

  • Authors:
  • Letizia Magnoni;Massimo Mirolli;Franco Montagna;Giulia Simi

  • Affiliations:
  • Dipartimento di Matematica, Via del Capitano 15, 53100 Siena, Italy;Dipartimento di Matematica, Via del Capitano 15, 53100 Siena, Italy;Dipartimento di Matematica, Via del Capitano 15, 53100 Siena, Italy;Dipartimento di Matematica, Via del Capitano 15, 53100 Siena, Italy

  • Venue:
  • Theoretical Computer Science
  • Year:
  • 2003

Quantified Score

Hi-index 5.23

Visualization

Abstract

We investigate learning of classes of distributions over a discrete domain in a PAC context. We introduce two paradigms of PAC learning, namely absolute PAC learning, which is independent of the representation of the class of hypotheses, and PAC learning wrt the indexes, which heavily depends on such representations. We characterize non-computable learnability in both contexts. Then we investigate efficient learning strategies which are simulated by a polynomial-time Turing machine. One strategy is the frequentist one. According to this strategy, the learner conjectures a hypothesis which is as close as possible to the distribution given by the frequency relative to the examples. We characterize the classes of distributions which are absolutely PAC learnable by means of this strategy, and we relate frequentist learning wrt the indexes to the NP = RP problem. Finally, we present another strategy for learning wrt the indexes, namely learning by tests.