Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition

  • Authors:
  • Eduardo Pavez;Jorge F. Silva

  • Affiliations:
  • University of Chile, Department of Electrical Engineering, Av. Tupper 2007, Santiago 412-3, Chile;University of Chile, Department of Electrical Engineering, Av. Tupper 2007, Santiago 412-3, Chile

  • Venue:
  • Speech Communication
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work proposes using Wavelet-Packet Cepstral coefficients (WPPCs) as an alternative way to do filter-bank energy-based feature extraction (FE) for automatic speech recognition (ASR). The rich coverage of time-frequency properties of Wavelet Packets (WPs) is used to obtain new sets of acoustic features, in which competitive and better performances are obtained with respect to the widely adopted Mel-Frequency Cepstral coefficients (MFCCs) in the TIMIT corpus. In the analysis, concrete filter-bank design considerations are stipulated to obtain most of the phone-discriminating information embedded in the speech signal, where the filter-bank frequency selectivity, and better discrimination in the lower frequency range [200Hz-1kHz] of the acoustic spectrum are important aspects to consider.