In this paper, we address analysis-synthesis consistency in sinusoidal coding. The analysis is based on windowed sinusoids and uses the same amplitude-complementary window as the overlap-add synthesis. When forming a given analysis segment, the reconstructions of the neighboring segments are taken into account. Sinusoidal estimation is based on a perceptual criterion. In the proposed procedure, the analysis of the current segment exploits the forward masking produced by sinusoids already estimated in previous segments (which may overlap the current segment). Experimental results show that this temporal masking model significantly reduces the number of sinusoids without introducing perceptual artifacts in the reconstructed signal.
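The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions: a periodic Hann window with 50% overlap stands in for the amplitude-complementary window, the previous segment's reconstruction is taken as given, and the forward-masking rule is a hypothetical linear decay in dB whose parameters (`drop_db`, `decay_db_per_ms`) are illustrative values, not figures from the paper. All function names are invented for this example.

```python
import math


def hann_complementary(n_len):
    """Periodic Hann window; copies shifted by n_len // 2 sum to 1."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / n_len)
            for n in range(n_len)]


def sinusoid(amp, freq, phase, start, n_len, fs):
    """Samples of amp * cos(2*pi*freq*t + phase) at absolute sample indices."""
    return [amp * math.cos(2.0 * math.pi * freq * (start + n) / fs + phase)
            for n in range(n_len)]


def analysis_target(x, start, n_len, window, prev_recon):
    """Signal that the windowed sinusoids of the current segment should match.

    The first half of the current segment overlaps the second half of the
    previous one, so the previous segment's windowed reconstruction is
    subtracted there (assumption: only the previous segment is known yet).
    """
    hop = n_len // 2
    target = list(x[start:start + n_len])
    if prev_recon is not None:
        for n in range(hop):
            target[n] -= window[n + hop] * prev_recon[n + hop]
    return target


def forward_masking_threshold(prev_level_db, delay_ms,
                              drop_db=10.0, decay_db_per_ms=0.5):
    """Hypothetical forward-masking threshold (dB) left by an earlier sinusoid.

    Assumption: the threshold starts drop_db below the masker level and then
    decays linearly in dB with the delay. Both parameters are illustrative.
    """
    return prev_level_db - drop_db - decay_db_per_ms * delay_ms


def is_forward_masked(level_db, prev_level_db, delay_ms):
    """True if a candidate sinusoid lies below the hypothetical threshold."""
    return level_db < forward_masking_threshold(prev_level_db, delay_ms)
```

If the previous segment reconstructs a stationary sinusoid exactly, the target in the overlap region reduces to the window times the original signal, which is what a windowed sinusoid in the current segment can match. This is the sense in which analysis and overlap-add synthesis are consistent in the sketch. A candidate sinusoid for which `is_forward_masked` returns `True` would simply be skipped, which is how a temporal masking rule could lower the sinusoid count.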