Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features

Authors:
Ning Wang;P. C. Ching;Nengheng Zheng;Tan Lee
Affiliations:
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2011

Citing 0
Cited 3

Speaker verification under degraded condition: a perceptual study
International Journal of Speech Technology

Spectral histogram of oriented gradients (SHOGs) for Tamil language male/female speaker classification
International Journal of Speech Technology

Optimization of the parameters characterizing sigmoidal rate-level functions based on acoustic features
Speech Communication

Abstract

Speaker recognition performance degrades severely in noisy environments because the speaker-discriminative information extracted from noisy speech is inadequate and inaccurate. To alleviate this problem, a robust feature estimation method is proposed that captures both vocal source- and vocal tract-related characteristics from noisy speech utterances. Spectral subtraction, a simple yet useful speech enhancement technique, is employed to remove the noise-specific components prior to feature extraction. Analytical derivation and simulation results both show that the proposed feature estimation method leads to robust recognition performance, especially at low signal-to-noise ratios (SNRs). For Gaussian mixture model-based speaker recognition in the presence of additive white Gaussian noise, the new approach consistently reduces both the identification error rate and the equal error rate at SNRs ranging from 0 to 15 dB.
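
The denoising step named in the abstract can be illustrated with a minimal sketch of magnitude spectral subtraction applied to a signal corrupted by additive white Gaussian noise at a chosen SNR. This is not the authors' implementation: the function names, the 256-sample rectangular frames without overlap, the spectral floor of 0.02, the noise estimate taken from leading silent frames, and the sine tone standing in for speech are all assumptions made for illustration. The vocal source and vocal tract feature extraction and the Gaussian mixture model back end are not shown.

```python
import numpy as np


def add_awgn(clean, target_snr_db, rng):
    """Add white Gaussian noise scaled so the mixture has the target SNR."""
    noise = rng.standard_normal(clean.shape)
    signal_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    # Choose the scale so 10*log10(signal_power / scaled_noise_power) hits the target.
    scale = np.sqrt(signal_power / (noise_power * 10.0 ** (target_snr_db / 10.0)))
    return clean + scale * noise


def estimate_noise_magnitude(noise_only, frame_len):
    """Average magnitude spectrum over frames assumed to contain only noise."""
    n_frames = len(noise_only) // frame_len
    frames = noise_only[: n_frames * frame_len].reshape(n_frames, frame_len)
    return np.mean(np.abs(np.fft.rfft(frames, axis=1)), axis=0)


def spectral_subtract(noisy, noise_mag, frame_len, floor=0.02):
    """Magnitude spectral subtraction on non-overlapping rectangular frames."""
    n_frames = len(noisy) // frame_len
    frames = noisy[: n_frames * frame_len].reshape(n_frames, frame_len)
    spectrum = np.fft.rfft(frames, axis=1)
    magnitude = np.abs(spectrum)
    phase = np.angle(spectrum)
    # Subtract the noise estimate; the floor keeps magnitudes non-negative.
    cleaned = np.maximum(magnitude - noise_mag, floor * magnitude)
    # Resynthesize each frame using the noisy phase.
    enhanced = np.fft.irfft(cleaned * np.exp(1j * phase), n=frame_len, axis=1)
    return enhanced.reshape(-1)


def measure_snr_db(reference, estimate):
    """SNR of an estimate relative to the clean reference, in dB."""
    error = estimate - reference
    return 10.0 * np.log10(np.sum(reference ** 2) / np.sum(error ** 2))


# Hypothetical test signal: a tone with 8 leading silent frames (not real speech).
rng = np.random.default_rng(0)
frame_len = 256
silent_frames = 8
t = np.arange(frame_len * 40)
clean = np.sin(2.0 * np.pi * 8.0 * t / frame_len)
clean[: frame_len * silent_frames] = 0.0

noisy = add_awgn(clean, 5.0, rng)
noise_mag = estimate_noise_magnitude(noisy[: frame_len * silent_frames], frame_len)
enhanced = spectral_subtract(noisy, noise_mag, frame_len)

snr_in = measure_snr_db(clean, noisy)
snr_out = measure_snr_db(clean, enhanced)
print(f"SNR before: {snr_in:.2f} dB, after: {snr_out:.2f} dB")
```

In a practical front end, overlapping windowed frames with overlap-add resynthesis would normally replace the non-overlapping rectangular frames used here, and the noise estimate would come from detected non-speech regions rather than a fixed leading segment.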