Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model

Authors:
A. Klapuri
Affiliations:
Inst. of Signal Process., Tampere Univ. of Technol., Tampere
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2008

Citing 0
Cited 15

Fragmentation and Frontier Evolution for Genetic Algorithms Optimization in Music Transcription

IBERAMIA '08 Proceedings of the 11th Ibero-American conference on AI: Advances in Artificial Intelligence
Fundamental frequency estimation for music signals with modified Hilbert-Huang transform (HHT)

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Periodic signal modeling for the octave problem in music transcription

DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
Towards timbre-invariant audio features for harmony-based music

IEEE Transactions on Audio, Speech, and Language Processing
Multiple fundamental frequency estimation and polyphony inference of polyphonic music signals

IEEE Transactions on Audio, Speech, and Language Processing
Vocal melody extraction in the presence of pitched accompaniment in polyphonic music

IEEE Transactions on Audio, Speech, and Language Processing
PIC detector for piano chords

EURASIP Journal on Advances in Signal Processing - Special issue on digital audio effects
Single-channel speech separation based on long-short frame associated harmonic model

Digital Signal Processing
Low-complexity F0-based speech/nonspeech discrimination approach for digital hearing aids

Multimedia Tools and Applications
Evolutionary algorithms and automatic transcription of music

Proceedings of the 14th annual conference companion on Genetic and evolutionary computation
Improved music similarity computation based on tone objects

Proceedings of the 7th Audio Mostly Conference: A Conference on Interaction with Sound
Musical pitch estimation using a supervised single hidden layer feed-forward neural network

Expert Systems with Applications: An International Journal
Multiple fundamental frequency estimation based on sparse representations in a structured dictionary

Digital Signal Processing
Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a computational model of the human auditory periphery, followed by a periodicity analysis mechanism where fundamental frequencies are iteratively detected and canceled from the mixture signal. The auditory model needs to be computed only once, and a computationally efficient strategy is proposed for implementing it. Simulation experiments were made using mixtures of musical sounds and mixed speech utterances. The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference. Different system configurations were studied to identify the conditions where pitch analysis using an auditory model is advantageous over conventional time or frequency domain approaches.