Residual modeling in music analysis-synthesis

Authors:
M. Goodwin
Affiliations:
CNMAT, California Univ., Berkeley, CA, USA
Venue:
ICASSP '96: Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 02
Year:
1996

Citing 0
Cited 10

Extending Spectral Modeling Synthesis with Transient Modeling Synthesis
Computer Music Journal

Multirate Additive Synthesis
Computer Music Journal

Quality enhancement of compressed audio based on statistical conversion
EURASIP Journal on Audio, Speech, and Music Processing - Scalable Audio-Content Analysis

A multichannel sinusoidal model applied to spot microphone signals for immersive audio
IEEE Transactions on Audio, Speech, and Language Processing

Adaptive signal modeling based on sparse approximations for scalable parametric audio coding
IEEE Transactions on Audio, Speech, and Language Processing

A 3-D immersive synthesizer for environmental sounds
IEEE Transactions on Audio, Speech, and Language Processing

Time-frequency synthesis of noisy sounds with narrow spectral components
IEEE Transactions on Audio, Speech, and Language Processing

Adaptive signal models for wide-band speech and audio compression
IbPRIA'05 Proceedings of the Second Iberian conference on Pattern Recognition and Image Analysis - Volume Part II

Spatialized synthesis of noisy environmental sounds
CMMR/ICAD'09 Proceedings of the 6th international conference on Auditory Display

Design space exploration in many-core processors for sound synthesis of plucked string instruments
Journal of Parallel and Distributed Computing

Abstract

In analysis-synthesis of musical sounds based on a sinusoidal model, the difference between the original signal and the synthesized signal, termed the residual, is typically a broadband noise process. It contains such musical phenomena as flute breath noise or violin bow noise. Synthesis without such "noise" tends to sound artificial; it is desirable to improve the synthesis realism by modeling the residual in such a way that it can be reinjected in the synthesized signal. This paper deals with a model of noise perception based on the equivalent rectangular bands (ERBs) of the auditory system. Since a broadband noise is perceptually well-represented by the time-varying energy in each of these frequency bands, the residual is parametrized in terms of these energies in the proposed model. An application of the model to music synthesis based on the inverse fast Fourier transform (FFT) is described in detail.
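
As a rough illustration of the parametrization the abstract describes, the sketch below reduces one frame of a residual signal to a short list of per-band energies. It is not the paper's implementation. The abstract does not say which ERB formula is used, so the Glasberg and Moore ERB-rate approximation is assumed here; the function names, the band count, and the frame length are all hypothetical, and a naive DFT stands in for the FFT to keep the example dependency-free.

```python
import cmath
import math
import random


def hz_to_erb_rate(f_hz):
    """ERB-rate of a frequency (Glasberg-Moore approximation, an assumed choice)."""
    return 21.4 * math.log10(4.37 * f_hz / 1000.0 + 1.0)


def erb_rate_to_hz(e):
    """Inverse of hz_to_erb_rate."""
    return (10.0 ** (e / 21.4) - 1.0) * 1000.0 / 4.37


def erb_band_edges(f_max_hz, n_bands):
    """Edges in Hz of n_bands bands that are equally wide on the ERB-rate scale."""
    e_max = hz_to_erb_rate(f_max_hz)
    return [erb_rate_to_hz(e_max * k / n_bands) for k in range(n_bands + 1)]


def power_spectrum(frame):
    """Naive O(N^2) DFT power for bins 0..N/2; adequate for a short frame."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        spec.append(abs(acc) ** 2)
    return spec


def erb_band_energies(frame, sample_rate, n_bands):
    """Sum the frame's spectral power into ERB-spaced bands."""
    n = len(frame)
    edges = erb_band_edges(sample_rate / 2.0, n_bands)
    energies = [0.0] * n_bands
    band = 0
    for k, p in enumerate(power_spectrum(frame)):
        f = k * sample_rate / n
        # Advance to the band containing f; the Nyquist bin stays in the last band.
        while band < n_bands - 1 and f >= edges[band + 1]:
            band += 1
        energies[band] += p
    return energies


# Usage: a frame of white noise stands in for a real residual.
random.seed(0)
noise_frame = [random.gauss(0.0, 1.0) for _ in range(256)]
band_energies = erb_band_energies(noise_frame, sample_rate=16000, n_bands=20)
print(len(band_energies), "band energies for one frame")
```

Repeating this for successive frames gives the time-varying energy per band that the abstract refers to. A synthesizer along the lines described would then shape noise so that each band carries the stored energy, for example by scaling random-phase spectral bins before an inverse FFT.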