Is masking a relevant aspect lacking in MFCC? A speaker verification perspective

  • Authors:
  • Jugurta MontalvãO;Marcos Renato Rodrigues Araujo

  • Affiliations:
  • Universidade Federal de Sergipe, 49100-000 São Cristóvão, Brazil;Griaule Biometrics, Campinas, São Paulo, Brazil

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2012

Quantified Score

Hi-index 0.10

Visualization

Abstract

We hypothesize that spectral masking may account for most of the gains in robustness against noise using ensemble interval histogram (EIH) and zero crossing with peak amplitude (ZCPA) compared to Mel-frequency cepstral coefficients (MFCCs). To test this hypothesis, we focus on this issue by comparing two MFCC implementations for which the only difference is spectral masking. The comparison involved biometric speaker verification tasks using two publicly available databases. The results confirm the superiority of MFCC with masking, thus corroborating our hypotheses that masking is a key aspect for improved robustness in feature extraction.