Monaural speech/music source separation using discrete energy separation algorithm

Authors:
Yevgeni Litvin;Israel Cohen;Dan Chazan
Affiliations:
Department of Electrical Engineering, Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel;Department of Electrical Engineering, Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel;IBM Research Laboratory in Haifa, Israel
Venue:
Signal Processing
Year:
2010

Abstract

In this paper, we address the problem of monaural source separation of a mixed signal containing speech and music components. We use the discrete energy separation algorithm (DESA) to estimate the energy of the frequency-modulating (FM) signal. This FM signal energy is then used to design a time-varying filter in the time-frequency domain that rejects the interfering signal. The FM signal energy is chosen because it differentiates well between speech and music signals using information that is localized in both time and frequency. We present experimental results on synthetic data and real audio signals that demonstrate the advantages and limitations of the proposed method.
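The abstract does not say which DESA variant the authors use or how the FM signal energy is computed from its output. As a rough illustration of the underlying idea only, the sketch below implements DESA-2, one common variant built on the Teager-Kaiser energy operator, and estimates the instantaneous frequency and amplitude envelope of a single narrowband component. The function names `teager_kaiser` and `desa2` and the `eps` guard are assumptions made for this example, not part of the paper.

```python
import math


def teager_kaiser(x):
    """Discrete Teager-Kaiser energy: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Returns len(x) - 2 values, aligned to samples 1 .. len(x) - 2.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def desa2(x, eps=1e-12):
    """DESA-2 estimates of instantaneous frequency and amplitude.

    Returns two lists (frequency in radians/sample, amplitude envelope),
    aligned to samples 2 .. len(x) - 3 of the input. Samples where the
    energy operator is too small or negative yield NaN (illustrative
    choice for this sketch).
    """
    psi_x = teager_kaiser(x)                                  # samples 1 .. N-2
    y = [x[n + 1] - x[n - 1] for n in range(1, len(x) - 1)]   # symmetric difference
    psi_y = teager_kaiser(y)                                  # samples 2 .. N-3

    freqs, amps = [], []
    for k, py in enumerate(psi_y):
        px = psi_x[k + 1]  # energy of x at the same sample as psi_y[k]
        if px <= eps or py <= eps:
            freqs.append(float("nan"))
            amps.append(float("nan"))
            continue
        # Clamp guards against rounding pushing the argument outside [-1, 1].
        arg = max(-1.0, min(1.0, 1.0 - py / (2.0 * px)))
        freqs.append(0.5 * math.acos(arg))
        amps.append(2.0 * px / math.sqrt(py))
    return freqs, amps


# Usage: a pure tone with amplitude 1.5 and frequency 0.3 rad/sample.
tone = [1.5 * math.cos(0.3 * n + 0.2) for n in range(64)]
tone_freqs, tone_amps = desa2(tone)
```

For a pure sinusoid the estimates recover the frequency and amplitude up to rounding error. For a speech and music mixture, an estimator of this kind would presumably be applied per frequency band, since energy operators assume a single narrowband AM-FM component. That per-band arrangement is an assumption here, not something stated in the abstract.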