Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding

Authors:
Alexey Petrovsky;Elias Azarov;Alexander Petrovsky
Affiliations:
Belarusian State University of Informatics and Radioelectronics, Computer Engineering Department, 6, P. Brovky str., 220013 Minsk, Belarus (all authors)
Venue:
Signal Processing
Year:
2011

Abstract

The paper presents a complete framework for the hybrid representation of audio and speech signals for use in coding applications. The parameterization is based on a three-part model (sinusoids, transients and noise). The main contributions are: (i) a precise mathematical solution to the problem of estimating instantaneous harmonic parameters that applies to nonstationary (amplitude- and frequency-modulated) signals. The instantaneous harmonic parameters (magnitude, frequency and phase) are obtained by narrow-band filtering of the signal. A method is proposed for synthesizing frequency-modulated filters with a closed-form impulse response. The filter frequency bounds can be determined while tracking the component frequencies and adjusted to follow the modulations of the fundamental frequency; (ii) a practical technique for instantaneous harmonic analysis, together with a numerical evaluation of its performance; (iii) a new transient parameterization scheme based on matching pursuit with a frame-based, psychoacoustically optimized wavelet packet dictionary. The most relevant coefficients are chosen by maximizing the match between the auditory excitation scalograms of the original and modeled signals; (iv) the application of the resulting hybrid analysis system to speech and audio signals in order to validate the proposed methods.
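
The idea of reading instantaneous magnitude, frequency and phase off a narrow-band filtered signal can be illustrated with a minimal sketch. It is not the paper's method: instead of the frequency-modulated filter with a closed-form impulse response, it assumes a fixed-band, Blackman-windowed sinc filter shifted to the band centre with a complex exponential, so the output is approximately analytic. The function name `narrowband_harmonic_params` and all of its parameters are hypothetical and chosen only for illustration.

```python
import numpy as np

def narrowband_harmonic_params(x, fs, f_lo, f_hi, num_taps=511):
    """Estimate instantaneous magnitude, frequency (Hz) and phase (rad)
    of the component of the real signal x inside the band [f_lo, f_hi] Hz.

    Illustrative assumption: a stationary (fixed-band) complex FIR filter,
    not a frequency-modulated one. num_taps should be odd.
    """
    fc = 0.5 * (f_lo + f_hi)        # band centre in Hz
    half_bw = 0.5 * (f_hi - f_lo)   # half bandwidth in Hz
    n = np.arange(num_taps) - (num_taps - 1) / 2  # symmetric tap index

    # Windowed-sinc low-pass prototype with cutoff half_bw, unit DC gain.
    lowpass = np.sinc(2.0 * half_bw / fs * n) * np.blackman(num_taps)
    lowpass /= lowpass.sum()

    # Shift the prototype to +fc only; the negative-frequency image of a
    # real input then falls in the stopband and is suppressed.
    h = lowpass * np.exp(2j * np.pi * fc / fs * n)

    # mode="same" centres the output, so the symmetric filter adds no delay.
    y = np.convolve(x, h, mode="same")

    magnitude = 2.0 * np.abs(y)     # factor 2: only half the spectrum passes
    phase = np.unwrap(np.angle(y))
    frequency = np.gradient(phase) * fs / (2.0 * np.pi)
    return magnitude, frequency, phase

if __name__ == "__main__":
    fs = 8000.0
    t = np.arange(8000) / fs
    x = 0.5 * np.cos(2 * np.pi * 440.0 * t + 0.3)
    mag, freq, ph = narrowband_harmonic_params(x, fs, 400.0, 480.0)
    mid = slice(1000, 7000)         # skip filter edge effects
    print(mag[mid].mean(), freq[mid].mean())
```

Because the band is fixed, this sketch only tracks a component while it stays inside [f_lo, f_hi]; the estimates near the start and end of the signal are unreliable for roughly half the filter length.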