We propose a probabilistic method for joint segmentation and melody extraction of musical audio signals that arise from a monophonic score. The method operates on framewise short-time Fourier transform (STFT) peaks, enabling computationally efficient inference of note onset, duration, and pitch attributes while retaining sufficient information for pitch determination and spectral change detection. The system explicitly models note events in terms of transient and steady-state regions, as well as possible gaps between note events. In this way, the system readily distinguishes abrupt spectral changes associated with musical onsets from other abrupt change events. Additionally, the method may incorporate melodic context by modeling note-to-note dependences. The method is successfully applied to a variety of piano and violin recordings containing reverberation, effective polyphony due to legato playing style, expressive pitch variations, and background voices. While the method does not provide a sample-accurate segmentation, it facilitates such segmentation in subsequent processing by isolating musical onsets to frame neighborhoods and identifying possible pitch content before and after the true onset sample location.
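To make the front end concrete, the following is a minimal sketch of framewise STFT peak extraction, the kind of observation the abstract says the method operates on. It is not the authors' implementation: the function name `stft_peaks`, the Hann window, the frame length, the hop size, and the local-maximum peak rule are all assumptions chosen for illustration, and the probabilistic inference of onsets, durations, and pitches is not shown.

```python
import numpy as np

def stft_peaks(signal, sample_rate, frame_len=1024, hop=512, max_peaks=5):
    """Return, per frame, the strongest spectral peaks as (frequency_hz, magnitude).

    Hypothetical helper: window, frame_len, hop and max_peaks are illustrative
    defaults, not values taken from the paper.
    """
    window = np.hanning(frame_len)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    peaks_per_frame = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len] * window
        mag = np.abs(np.fft.rfft(frame))
        # A bin is a peak if it exceeds its left neighbour and is at least
        # as large as its right neighbour (edge bins are never peaks).
        interior = mag[1:-1]
        is_peak = (interior > mag[:-2]) & (interior >= mag[2:])
        idx = np.nonzero(is_peak)[0] + 1
        # Keep only the max_peaks strongest peaks, strongest first.
        idx = idx[np.argsort(mag[idx])[::-1][:max_peaks]]
        peaks_per_frame.append([(float(freqs[i]), float(mag[i])) for i in idx])
    return peaks_per_frame
```

Reducing each frame to a handful of peaks is what keeps later inference cheap: a downstream model would see a few (frequency, magnitude) pairs per frame rather than the full spectrum, while still retaining the information needed to judge pitch and detect spectral change between frames.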