Machine Learning
An overview of audio information retrieval
Multimedia Systems - Special issue on audio and multimedia
Relevance of time-frequency features for phonetic and speaker-channel classification
Speech Communication
Introduction to Bayesian Networks
Introduction to Bayesian Networks
Connectionist Speech Recognition: A Hybrid Approach
Connectionist Speech Recognition: A Hybrid Approach
Dynamic bayesian networks: representation, inference and learning
Dynamic bayesian networks: representation, inference and learning
Neural Networks - 2005 Special issue: IJCNN 2005
ICML '06 Proceedings of the 23rd international conference on Machine learning
Neural Computation
Vocabulary independent spoken term detection
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Hidden Conditional Random Fields
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Novel Connectionist System for Unconstrained Handwriting Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
Emotion recognition from speech: Putting ASR in the loop
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Image and Vision Computing
Sequence labelling in structured domains with hierarchical recurrent neural networks
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
An application of recurrent neural networks to discriminative keyword spotting
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Computer Speech and Language
On the impact of children's emotional speech on acoustic and language models
EURASIP Journal on Audio, Speech, and Music Processing - Special issue on atypical speech
Bidirectional LSTM networks for improved phoneme classification and recognition
ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Bidirectional recurrent neural networks
IEEE Transactions on Signal Processing
Learning long-term dependencies in NARX recurrent neural networks
IEEE Transactions on Neural Networks
Paralinguistics in speech and language-State-of-the-art and the challenge
Computer Speech and Language
Hi-index | 0.00 |
In this article, we focus on keyword detection in children's speech as it is needed in voice command systems. We use the FAU Aibo Emotion Corpus which contains emotionally colored spontaneous children's speech recorded in a child-robot interaction scenario and investigate various recent keyword spotting techniques. As the principle of bidirectional Long Short-Term Memory (BLSTM) is known to be well-suited for context-sensitive phoneme prediction, we incorporate a BLSTM network into a Tandem model for flexible coarticulation modeling in children's speech. Our experiments reveal that the Tandem model prevails over a triphone-based Hidden Markov Model approach.