Frequency of symbol occurrences in simple non-primitive stochastic models

  • Authors:
  • Diego de Falco;Massimiliano Goldwurm;Violetta Lonati

  • Affiliations:
  • Università degli Studi di Milano, Dipartimento di Scienze dell'Informazione, Milano, Italy;Università degli Studi di Milano, Dipartimento di Scienze dell'Informazione, Milano, Italy;Università degli Studi di Milano, Dipartimento di Scienze dell'Informazione, Milano, Italy

  • Venue:
  • DLT'03 Proceedings of the 7th international conference on Developments in language theory
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study the random variable Yn representing the number of occurrences of a given symbol in a word of length n generated at random. The stochastic model we assume is a simple non-ergodic model defined by the product of two primitive rational formal series, which form two distinct ergodic components. We obtain asymptotic evaluations for the mean and the variance of Yn and its limit distribution. It turns out that there are two main cases: if one component is dominant and nondegenerate we get a Gaussian limit distribution; if the two components are equipotent and have different leading terms of the mean, we get a uniform limit distribution. Other particular limit distributions are obtained in the case of a degenerate dominant component and in the equipotent case when the leading terms of the expectation values are equal.