Neural Networks - 2005 Special issue: IJCNN 2005
Learning to Forget: Continual Prediction with LSTM
Neural Computation
ICML '06 Proceedings of the 23rd international conference on Machine learning
Neural Computation
EURASIP Journal on Audio, Speech, and Music Processing
An application of recurrent neural networks to discriminative keyword spotting
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Opensmile: the munich versatile and fast open-source audio feature extractor
Proceedings of the international conference on Multimedia
IEEE Transactions on Affective Computing
Computer Speech and Language
Tandem connectionist feature extraction for conversational speech recognition
MLMI'04 Proceedings of the First international conference on Machine Learning for Multimodal Interaction
Bidirectional recurrent neural networks
IEEE Transactions on Signal Processing
Online Driver Distraction Detection Using Long Short-Term Memory
IEEE Transactions on Intelligent Transportation Systems
Hi-index | 0.01 |
We introduce a novel context-sensitive feature extraction approach for spontaneous speech recognition. As bidirectional Long Short-Term Memory (BLSTM) networks are known to enable improved phoneme recognition accuracies by incorporating long-range contextual information into speech decoding, we integrate the BLSTM principle into a Tandem front-end for probabilistic feature extraction. Unlike the previously proposed approaches which exploit BLSTM modeling by generating a discrete phoneme prediction feature, our feature extractor merges continuous high-level probabilistic BLSTM features with low-level features. By combining BLSTM modeling and Bottleneck (BN) feature generation, we propose a novel front-end that allows us to produce context-sensitive probabilistic feature vectors of arbitrary size, independent of the network training targets. Evaluations on challenging spontaneous, conversational speech recognition tasks show that this concept prevails over recently published architectures for feature-level context modeling.