Acoustical and Environmental Robustness in Automatic Speech Recognition
Acoustical and Environmental Robustness in Automatic Speech Recognition
PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
HTIMIT and LLHDB: Speech Corpora for the Study of Handset Transducer Effects
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Speaker recognition using G.729 speech codec parameters
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
IEEE Transactions on Neural Networks
Cluster-dependent feature transformation for telephone-based speaker verification
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Speaker recognition from coded speech using support vector machines
TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Hi-index | 0.00 |
A handset compensation technique for speaker verification from coded telephone speech is proposed. The proposed technique combines handset selectors with stochastic feature transformation to reduce the acoustic mismatch between different handsets and different speech coders. Coder-dependent GMM-based handset selectors are trained to identify the most likely handset used by the claimants. Stochastic feature transformations are then applied to remove the acoustic distortion introduced by the coder and the handset. Experimental results show that the proposed technique outperforms the CMS approach and significantly reduces the error rates under six different coders with bit rates ranging from 2.4 kb/s to 64 kb/s. Strong correlation between speech quality and verification performance is also observed.