Speech recognition with dynamic Bayesian networks

  • Authors:
  • Geoffrey Zweig; Stuart Russell

  • Venue:
  • AAAI '98/IAAI '98: Proceedings of the Fifteenth National Conference on Artificial Intelligence / Tenth Conference on Innovative Applications of Artificial Intelligence
  • Year:
  • 1998

Abstract

Dynamic Bayesian networks (DBNs) are a useful tool for representing complex stochastic processes. Recent developments in inference and learning in DBNs allow their use in real-world applications. In this paper, we apply DBNs to the problem of speech recognition. The factored state representation enabled by DBNs allows us to explicitly represent long-term articulatory and acoustic context in addition to the phonetic-state information maintained by hidden Markov models (HMMs). Furthermore, it enables us to model the short-term correlations among multiple observation streams within single time-frames. Given a DBN structure capable of representing these long- and short-term correlations, we applied the EM algorithm to learn models with up to 500,000 parameters. The use of structured DBN models decreased the error rate by 12 to 29% on a large-vocabulary isolated-word recognition task, compared to a discrete HMM; it also improved significantly on other published results for the same task. This is the first successful application of DBNs to a large-scale speech recognition problem. Investigation of the learned models indicates that the hidden state variables are strongly correlated with acoustic properties of the speech signal.
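
To make the factored-state idea concrete, the sketch below shows the kind of forward computation such a DBN structure implies: the hidden state splits into a phonetic state Q and an auxiliary context variable C, and each frame's observation depends on both factors. This is only an illustrative toy, not the authors' actual model; all variable names, dimensions, and (randomly initialized) parameter tables are hypothetical.

```python
import numpy as np

# Illustrative sketch of a factored-state DBN forward pass (hypothetical
# sizes and random parameters; not the paper's model). Hidden state factors
# into a phonetic state Q and a context variable C; the observation O_t
# depends on both Q_t and C_t.

rng = np.random.default_rng(0)
n_q, n_c, n_obs, T = 4, 3, 5, 10   # phonetic states, context states, symbols, frames

def normalize(a, axis=-1):
    return a / a.sum(axis=axis, keepdims=True)

A_q  = normalize(rng.random((n_q, n_q)))        # P(q_t | q_{t-1})
A_c  = normalize(rng.random((n_q, n_c, n_c)))   # P(c_t | q_t, c_{t-1})
B    = normalize(rng.random((n_q, n_c, n_obs))) # P(o_t | q_t, c_t)
pi_q = normalize(rng.random(n_q), axis=0)       # P(q_1)
pi_c = normalize(rng.random(n_c), axis=0)       # P(c_1)

obs = rng.integers(0, n_obs, size=T)            # dummy discrete observation stream

# Forward algorithm over the joint (Q, C) state, exploiting the factored transitions.
alpha = pi_q[:, None] * pi_c[None, :] * B[:, :, obs[0]]
scale = alpha.sum()
loglik = np.log(scale)
alpha /= scale
for t in range(1, T):
    # pred[q, c] = sum_{q', c'} alpha[q', c'] * P(q | q') * P(c | q, c')
    pred = np.einsum('pk,pq,qkc->qc', alpha, A_q, A_c)
    alpha = pred * B[:, :, obs[t]]
    scale = alpha.sum()
    loglik += np.log(scale)
    alpha /= scale

print(f"log P(observations) = {loglik:.3f}")
```

The same forward (and a matching backward) pass supplies the expected sufficient statistics that an EM-style learning procedure would accumulate to re-estimate the conditional probability tables, which is the role EM plays in the paper's training of models with up to 500,000 parameters.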