Normalized Cuts for Predominant Melodic Source Separation

Authors:
M. Lagrange;L. G. Martins;J. Murdoch;G. Tzanetakis
Affiliations:
Univ. of Victoria, Victoria, BC;-;-;-
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2008

Citing 0
Cited 6

Vocal melody extraction in the presence of pitched accompaniment in polyphonic music

IEEE Transactions on Audio, Speech, and Language Processing
Single-channel speech separation based on long-short frame associated harmonic model

Digital Signal Processing
Machine Recognition of Music Emotion: A Review

ACM Transactions on Intelligent Systems and Technology (TIST)
A VNS heuristic for escaping local extrema entrapment in normalized cut clustering

Pattern Recognition
Segregating event streams and noise with a Markov renewal process model

The Journal of Machine Learning Research
Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

The predominant melodic source, frequently the singing voice, is an important component of musical signals. In this paper, we describe a method for extracting the predominant source and corresponding melody from ldquoreal-worldrdquo polyphonic music. The proposed method is inspired by ideas from computational auditory scene analysis. We formulate predominant melodic source tracking and formation as a graph partitioning problem and solve it using the normalized cut which is a global criterion for segmenting graphs that has been used in computer vision. Sinusoidal modeling is used as the underlying representation. A novel harmonicity cue which we term harmonically wrapped peak similarity is introduced. Experimental results supporting the use of this cue are presented. In addition, we show results for automatic melody extraction using the proposed approach.