Parametric models for sequential data, such as hidden Markov models, stochastic context-free grammars, and linear dynamical systems, are widely used in time-series analysis and structured-data analysis. Computation of the likelihood function is a primary consideration in many learning methods. Although efficient algorithms based on dynamic programming exist, the repeated likelihood evaluations required by tasks such as model selection remain time-consuming. The present paper studies parameter learning in a simplified feature space in order to reduce this computational cost. Simplifying the data is a common technique in feature selection and dimension reduction, but an oversimplified space leads to adverse learning results. We therefore mathematically investigate the conditions under which a feature map preserves the asymptotic convergence point of the estimated parameters; a map satisfying these conditions is referred to as a vicarious map. As a demonstration of how to find vicarious maps, we consider a feature space that limits the length of the data and derive the length necessary for parameter learning in hidden Markov models.
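As a minimal sketch of the quantities the abstract refers to, the code below evaluates an HMM likelihood with the standard forward algorithm (the dynamic-programming routine alluded to above) and then re-evaluates it after a simple length-limiting feature map. The 2-state parameters, the `truncate` helper, and the truncation length are illustrative assumptions for this sketch, not values or constructions taken from the paper.

```python
import numpy as np

# Illustrative 2-state, 2-symbol HMM parameters (assumed for this sketch).
pi = np.array([0.6, 0.4])        # initial state distribution
A = np.array([[0.7, 0.3],        # state transition probabilities
              [0.2, 0.8]])
B = np.array([[0.9, 0.1],        # emission probabilities per state
              [0.3, 0.7]])

def forward_likelihood(obs):
    """Likelihood p(obs) via the forward algorithm (dynamic programming)."""
    alpha = pi * B[:, obs[0]]              # initialization
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]      # recursion over time steps
    return alpha.sum()                     # termination: sum over final states

def truncate(obs, T):
    """A hypothetical length-limiting feature map: keep the first T symbols."""
    return obs[:T]

obs = [0, 1, 1, 0, 1, 0, 0, 1]
print(forward_likelihood(obs))               # likelihood on the full sequence
print(forward_likelihood(truncate(obs, 4)))  # likelihood in the simplified space
```

Whether such a truncation qualifies as a vicarious map, i.e. whether parameter estimates in the simplified space converge to the same point asymptotically, is exactly the question the paper addresses; the sketch only illustrates the two likelihood computations being compared.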