Speaker verification using complementary information from vocal source and vocal tract

Authors:
Nengheng Zheng;Ning Wang;Tan Lee;P. C. Ching
Affiliations:
Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong
Venue:
ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Year:
2006

Citing 2
Cited 0

Ten lectures on wavelets

Ten lectures on wavelets
Speaker identification and verification using Gaussian mixture speaker models

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a speaker verification system which uses two complementary acoustic features: Mel-frequency cepstral coefficients (MFCC) and wavelet octave coefficients of residues (WOCOR). While MFCC characterizes mainly the spectral envelope, or the formant structure of the vocal tract system, WOCOR aims at representing the spectro-temporal characteristics of the vocal source excitation. Speaker verification experiments carried out on the ISCSLP 2006 SRE database demonstrate the complementary contributions of MFCC and WOCOR to speaker verification. Particularly, WOCOR performs even better than MFCC in single channel speaker verification task. Combining MFCC and WOCOR achieves higher performance than using MFCC only in both single and cross channel speaker verification tasks.