Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition

Authors:
Eduardo Pavez;Jorge F. Silva
Affiliations:
University of Chile, Department of Electrical Engineering, Av. Tupper 2007, Santiago 412-3, Chile;University of Chile, Department of Electrical Engineering, Av. Tupper 2007, Santiago 412-3, Chile
Venue:
Speech Communication
Year:
2012

Citing 19
Cited 0

A Theory for Multiresolution Signal Decomposition: The Wavelet Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Introduction to algorithms

Introduction to algorithms
Ten lectures on wavelets

Ten lectures on wavelets
Multirate systems and filter banks

Multirate systems and filter banks
Trading Accuracy for Simplicity in Decision Trees

Machine Learning
Review: On the statistical decorrelation of the wavelet packet coefficients of a band-limited wide-sense stationary random process

Signal Processing
Discrete-time speech signal processing: principles and practice

Discrete-time speech signal processing: principles and practice
Discriminative wavelet packet filter bank selection for pattern recognition

IEEE Transactions on Signal Processing
On signal representations within the Bayes decision framework

Pattern Recognition
Tree pruning with subadditive penalties

IEEE Transactions on Signal Processing
Wavelet-based statistical signal processing using hidden Markovmodels

IEEE Transactions on Signal Processing
TEMPLAR: a wavelet-based framework for pattern learning and analysis

IEEE Transactions on Signal Processing
Minimum probability of error image retrieval

IEEE Transactions on Signal Processing
An Implementation of Rational Wavelets and Filter Design for Phonetic Classification

IEEE Transactions on Audio, Speech, and Language Processing
Optimal pruning with applications to tree-structured source coding and modeling

IEEE Transactions on Information Theory
Entropy-based algorithms for best basis selection

IEEE Transactions on Information Theory - Part 2
A sampling theorem for wavelet subspaces

IEEE Transactions on Information Theory - Part 2
Separability-based multiscale basis selection and feature extraction for signal and image classification

IEEE Transactions on Image Processing
Texture analysis and classification with tree-structured wavelet transform

IEEE Transactions on Image Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work proposes using Wavelet-Packet Cepstral coefficients (WPPCs) as an alternative way to do filter-bank energy-based feature extraction (FE) for automatic speech recognition (ASR). The rich coverage of time-frequency properties of Wavelet Packets (WPs) is used to obtain new sets of acoustic features, in which competitive and better performances are obtained with respect to the widely adopted Mel-Frequency Cepstral coefficients (MFCCs) in the TIMIT corpus. In the analysis, concrete filter-bank design considerations are stipulated to obtain most of the phone-discriminating information embedded in the speech signal, where the filter-bank frequency selectivity, and better discrimination in the lower frequency range [200Hz-1kHz] of the acoustic spectrum are important aspects to consider.