This paper investigates feature extraction for meeting recognition. Three robust variants of the popular HLDA transform are examined, and the influence of adding posterior features to the PLP feature stream is studied. Experimental results are reported on two data sets: CTS (conversational telephone speech) and meeting data from the NIST RT'05 evaluations. Silence-reduced HLDA and LCRC phoneme-state posterior features are found to be suitable for both recognition tasks.