Speaker verification with adaptive spectral subband centroids

  • Authors:
  • Tomi Kinnunen;Bingjun Zhang;Jia Zhu;Ye Wang

  • Affiliations:
  • Speech and Dialogue Processing Lab, Institute for Infocomm Research, Singapore;Department of Computer Science, School of Computing, National University of Singapore, Singapore;Department of Computer Science, School of Computing, National University of Singapore, Singapore;Department of Computer Science, School of Computing, National University of Singapore, Singapore

  • Venue:
  • ICB'07 Proceedings of the 2007 international conference on Advances in Biometrics
  • Year:
  • 2007

Abstract

Spectral subband centroids (SSC) have been used as an additional feature to cepstral coefficients in speech and speaker recognition. SSCs are computed as the centroid frequencies of subbands and they capture the dominant frequencies of the short-term spectrum. In the baseline SSC method, the subband filters are pre-specified. To allow better adaptation to formant movements and other dynamic phenomena, we propose to adapt the subband filter boundaries on a frame-by-frame basis using a globally optimal scalar quantization scheme. The method has only one control parameter, the number of subbands. Speaker verification results on the NIST 2001 task indicate that the selection of the parameter is not critical and that the method does not require additional feature normalization.
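
The two ideas in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes rectangular subband filters and the plain power spectrum as the weighting, neither of which the abstract specifies. The function names `subband_centroids` and `adaptive_edges` are hypothetical. The adaptive step is one plausible reading of "globally optimal scalar quantization": a dynamic program that splits the frequency bins of a single frame into contiguous subbands so that the power-weighted squared distance of each bin to its subband centroid is minimal.

```python
def subband_centroids(freqs, power, edges):
    """Centroid frequency of each subband; band m covers bins edges[m]..edges[m+1]-1."""
    centroids = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        num = sum(f * p for f, p in zip(freqs[lo:hi], power[lo:hi]))
        den = sum(power[lo:hi])
        # Assumption: fall back to the band midpoint when the band has no energy.
        centroids.append(num / den if den > 0 else 0.5 * (freqs[lo] + freqs[hi - 1]))
    return centroids


def adaptive_edges(freqs, power, n_bands):
    """Per-frame band edges minimising the power-weighted squared error (dynamic programming)."""
    n = len(freqs)
    # Prefix sums of p, p*f and p*f^2 give each segment cost in constant time.
    s0, s1, s2 = [0.0], [0.0], [0.0]
    for f, p in zip(freqs, power):
        s0.append(s0[-1] + p)
        s1.append(s1[-1] + p * f)
        s2.append(s2[-1] + p * f * f)

    def cost(i, j):
        # Weighted squared error of bins i..j-1 around their own centroid.
        w = s0[j] - s0[i]
        if w <= 0:
            return 0.0
        m1 = s1[j] - s1[i]
        return (s2[j] - s2[i]) - m1 * m1 / w

    inf = float("inf")
    best = [[inf] * (n + 1) for _ in range(n_bands + 1)]
    back = [[0] * (n + 1) for _ in range(n_bands + 1)]
    best[0][0] = 0.0
    for k in range(1, n_bands + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                c = best[k - 1][i] + cost(i, j)
                if c < best[k][j]:
                    best[k][j] = c
                    back[k][j] = i

    # Walk the back-pointers from the last bin to recover the band edges.
    edges = [n]
    for k in range(n_bands, 0, -1):
        edges.append(back[k][edges[-1]])
    return edges[::-1]


# Toy frame (made-up values): eight bins, energy concentrated near 150 Hz and 650 Hz.
freqs = [0.0, 100.0, 200.0, 300.0, 400.0, 500.0, 600.0, 700.0]
power = [0.0, 5.0, 5.0, 0.0, 0.0, 0.0, 5.0, 5.0]
fixed = subband_centroids(freqs, power, [0, 4, 8])
edges = adaptive_edges(freqs, power, 2)
adapted = subband_centroids(freqs, power, edges)
```

In this sketch the only free choice is `n_bands`, mirroring the single control parameter the abstract mentions. The triple loop costs on the order of `n_bands * n**2` operations per frame, which is fine for illustration but would need optimising for real feature extraction.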