Norm-observable operator models

  • Authors:
  • Ming-Jie Zhao; Herbert Jaeger

  • Venue:
  • Neural Computation
  • Year:
  • 2010

Abstract

Hidden Markov models (HMMs) are among the most popular and successful statistical models for time series. Observable operator models (OOMs) are generalizations of HMMs that offer several attractive advantages. In particular, a variety of highly efficient, constructive, and asymptotically correct learning algorithms are available for OOMs. However, OOM theory suffers from the negative probability problem (NPP): a learned OOM may sometimes predict negative probabilities for certain events. It was recently shown that it is undecidable whether a given OOM will eventually produce such negative values. We propose a novel variant of OOMs, called norm-observable operator models (NOOMs), which avoids the NPP by design. Like OOMs, NOOMs use a set of linear operators to update system states. Unlike OOMs, however, they represent probabilities by the squared norm of system states, which precludes negative probability values. While free of the NPP, NOOMs retain most advantages of OOMs. For example, NOOMs also capture some processes that cannot be modeled by HMMs. More important, NOOMs can in principle be learned from data in a constructive way, and the learned models are asymptotically correct. We also prove that NOOMs capture all processes describable by Markov chains (MCs). This letter presents the mathematical foundations of NOOMs, discusses the expressiveness of the model class, and explains how a NOOM can be estimated from data constructively.
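
The abstract names the core mechanism: linear operators update the state, and the squared norm of the state yields the probability, which is nonnegative by construction. The NumPy sketch below is only an illustration of that idea, not the authors' implementation; the operators tau_a, the normalization constraint sum_a tau_a^T tau_a = I (assumed here so that symbol probabilities from a unit-norm state sum to one), and the toy parameter values are all assumptions made for the example.

```python
import numpy as np

def sequence_probability(operators, w0, sequence):
    """P(a1...an) = || tau_{an} ... tau_{a1} w0 ||^2:
    apply one linear operator per symbol, then take the
    squared norm of the resulting state (never negative)."""
    w = w0
    for symbol in sequence:
        w = operators[symbol] @ w
    return float(w @ w)

# Toy two-symbol model. The operators are scaled so that
# tau_a^T tau_a + tau_b^T tau_b = I, which makes the
# probabilities of all sequences of a given length sum to 1.
theta = 0.3
tau = {
    "a": np.cos(theta) * np.eye(2),
    "b": np.sin(theta) * np.array([[0.0, -1.0], [1.0, 0.0]]),
}
w0 = np.array([1.0, 0.0])  # unit-norm initial state

print(sequence_probability(tau, w0, "ab"))
# Check: probabilities over all length-2 sequences sum to 1.
print(sum(sequence_probability(tau, w0, s)
          for s in ("aa", "ab", "ba", "bb")))
```

Because the output is a squared norm, no choice of operators can produce a negative value, which is exactly how this model class sidesteps the NPP that affects ordinary OOMs.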