Density estimation by stochastic complexity

  • Authors:
  • J. Rissanen;T. P. Speed;B. Yu

  • Affiliations:
  • IBM Almaden Res. Center, San Jose, CA;-;-

  • Venue:
  • IEEE Transactions on Information Theory - Part 2
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The results by P. Hall and E.J. Hannan (1988) on optimization of histogram density estimators with equal bin widths by minimization of the stochastic complexity are extended and sharpened in two separate ways. As the first contribution, two generalized histogram estimators are constructed. The first has unequal bin widths which, together with the number of the bins, are determined by minimization of the stochastic complexity using dynamic programming. The other estimator consists of a mixture of equal bin width estimators, each of which is defined by the associated stochastic complexity. As the main contribution in the present work, two theorems are proved, which together extend the universal coding theorems to a large class of data generating densities. The first gives an asymptotic upper bound for the code redundancy in the order of magnitude, achieved with a special predictive type of histogram estimator, which sharpens a related bound. The second theorem states that this bound cannot be improved upon by any code whatsoever