Speech analysis and synthesis using an AM-FM modulation model
Speech Communication
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Speech nonlinearities, modulations, and energy operators
ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Oracle estimators for the benchmarking of source separation algorithms
Signal Processing
Analysis and resynthesis of musical instrument sounds using energy separation
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Separation of harmonic sound sources using sinusoidal modeling
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Multiband perceptual modulation analysis, processing and synthesis of audio signals
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
On amplitude and frequency demodulation using energy operators
IEEE Transactions on Signal Processing
Blind separation of speech mixtures via time-frequency masking
IEEE Transactions on Signal Processing
Energy separation in signal modulations with application to speechanalysis
IEEE Transactions on Signal Processing
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.08 |
In this paper, we address the problem of monaural source separation of a mixed signal containing speech and music components. We use Discrete Energy Separation Algorithm (DESA) to estimate frequency-modulating (FM) signal energy. The FM signal energy is used to design a time-varying filter in the time-frequency domain for rejecting the interfering signal. The FM signal energy was chosen due to its good ability to differentiate between speech and music signals using localized information both in time and frequency. We present experimental results which demonstrate the advantages and limitations of the proposed method using synthetic data and real audio signals.