An identification problem in an urn and ball model with heavy tailed distributions

  • Authors:
  • Christine Fricker;Fabrice Guillemin;Philippe Robert

  • Affiliations:
  • Inria paris–rocquencourt domaine de voluceau 78153 le chesnay, france e-mail: christine.fricker@inria.fr;Orange labs f-22300 lannion, france e-mail: fabrice.guillemin@orange-ftgroup.com;Nria paris–rocquencourt domaine de voluceau 78153 le chesnay, france e-mail: philippe.robert@inria.fr

  • Venue:
  • Probability in the Engineering and Informational Sciences
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider in this article an urn and ball problem with replacement, where balls are with different colors and are drawn uniformly from a unique urn. The numbers of balls with a given color are independent and identically distributed random variables with a heavy tailed probability distribution—for instance a Pareto or a Weibull distribution. We draw a small fraction p≪1 of the total number of balls. The basic problem addressed in this article is to know to which extent we can infer the total number of colors and the distribution of the number of balls with a given color. By means of Le Cam's inequality and the Chen–Stein method, bounds for the total variation norm between the distribution of the number of balls drawn with a given color and the Poisson distribution with the same mean are obtained. We then show that the distribution of the number of balls drawn with a given color has the same tail as that of the original number of balls. Finally, we establish explicit bounds between the two distributions when each ball is drawn with fixed probability p.