Vocal melody extraction in the presence of pitched accompaniment in polyphonic music

Authors:
Vishweshwara Rao;Preeti Rao
Affiliations:
Department of Electrical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, India;Department of Electrical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2010

Citing 10
Cited 3

Pitch, periodiciy, and noise in the voice

Music, cognition, and computerized sound
Melody Detection in Polyphonic Musical Signals: Exploiting Perceptual Rules, Note Salience, and Melodic Smoothness

Computer Music Journal
Application-Specific Music Transcription for Tutoring

IEEE MultiMedia
On the improvement of singing voice separation for monaural recordings using the MIR-1K dataset

IEEE Transactions on Audio, Speech, and Language Processing
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model

IEEE Transactions on Audio, Speech, and Language Processing
Normalized Cuts for Predominant Melodic Source Separation

IEEE Transactions on Audio, Speech, and Language Processing
Separation of Singing Voice From Music Accompaniment for Monaural Recordings

IEEE Transactions on Audio, Speech, and Language Processing
Melody Transcription From Music Audio: Approaches and Evaluation

IEEE Transactions on Audio, Speech, and Language Processing
Enhancing the Tracking of Partials for the Sinusoidal Modeling of Polyphonic Sounds

IEEE Transactions on Audio, Speech, and Language Processing
Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling

IEEE Transactions on Audio, Speech, and Language Processing

A behavioral study of emotions in south indian classical music andits implications in music recommendation systems

Proceedings of the 2010 ACM workshop on Social, adaptive and personalized multimedia interaction and access
Context-Aware features for singing voice detection in polyphonic music

AMR'11 Proceedings of the 9th international conference on Adaptive Multimedia Retrieval: large-scale multimedia retrieval and evaluation
Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Melody extraction algorithms for single-channel polyphonic music typically rely on the salience of the lead melodic instrument, considered here to be the singing voice. However the simultaneous presence of one or more pitched instruments in the polyphony can cause such a predominant-F0 tracker to switch between tracking the pitch of the voice and that of an instrument of comparable strength, resulting in reduced voice-pitch detection accuracy. We propose a system that, in addition to biasing the salience measure in favor of singing voice characteristics, acknowledges that the voice may not dominate the polyphony at all instants and therefore tracks an additional pitch to better deal with the potential presence of locally dominant pitched accompaniment. A feature based on the temporal instability of voice harmonics is used to finally identify the voice pitch. The proposed system is evaluated on test data that is representative of polyphonic music with strong pitched accompaniment. Results show that the proposed system is indeed able to recover melodic information lost to its single-pitch tracking counterpart, and also outperforms another state-of-the-art melody extraction system designed for polyphonic music.