Optimal error exponents in hidden Markov models order estimation

Authors:
E. Gassiat;S. Boucheron
Affiliations:
Dept. of Math., Univ. Paris-Sud, Orsay, France;-
Venue:
IEEE Transactions on Information Theory
Year:
2006

Citing 0
Cited 3

Consistency of feature Markov processes

ALT'10 Proceedings of the 21st international conference on Algorithmic learning theory
Learning High-Dimensional Markov Forest Distributions: Analysis of Error Rates

The Journal of Machine Learning Research
Free energy of stochastic context free grammar on variational bayes

ICONIP'06 Proceedings of the 13 international conference on Neural Information Processing - Volume Part I

Quantified Score

Hi-index	754.84

Visualization

Abstract

We consider the estimation of the number of hidden states (the order) of a discrete-time finite-alphabet hidden Markov model (HMM). The estimators we investigate are related to code-based order estimators: penalized maximum-likelihood (ML) estimators and penalized versions of the mixture estimator introduced by Liu and Narayan (1994). We prove strong consistency of those estimators without assuming any a priori upper bound on the order and smaller penalties than previous works. We prove a version of Stein's lemma for HMM order estimation and derive an upper bound on underestimation exponents. Then we prove that this upper bound can be achieved by the penalized ML estimator and by the penalized mixture estimator. The proof of the latter result gets around the elusive nature of the ML in HMM by resorting to large-deviation techniques for empirical processes. Finally, we prove that for any consistent HMM order estimator, for most HMM, the overestimation exponent is .