Speaker verification using complementary information from vocal source and vocal tract

  • Authors:
  • Nengheng Zheng;Ning Wang;Tan Lee;P. C. Ching

  • Affiliations:
  • Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong;Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong

  • Venue:
  • ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a speaker verification system which uses two complementary acoustic features: Mel-frequency cepstral coefficients (MFCC) and wavelet octave coefficients of residues (WOCOR). While MFCC characterizes mainly the spectral envelope, or the formant structure of the vocal tract system, WOCOR aims at representing the spectro-temporal characteristics of the vocal source excitation. Speaker verification experiments carried out on the ISCSLP 2006 SRE database demonstrate the complementary contributions of MFCC and WOCOR to speaker verification. Particularly, WOCOR performs even better than MFCC in single channel speaker verification task. Combining MFCC and WOCOR achieves higher performance than using MFCC only in both single and cross channel speaker verification tasks.