Adaptive harmonic spectral decomposition for multiple pitch estimation

Authors:
Emmanuel Vincent;Nancy Bertin;Roland Badeau
Affiliations:
INRIA, Centre Inria Rennes-Bretagne Atlantique, Rennes Cedex, France;INRIA, Centre Inria Rennes-Bretagne Atlantique, Rennes Cedex, France;Institut Télécom, Télécom ParisTech, LTCI, CNRS, Paris, France
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2010

Citing 16
Cited 7

Information Retrieval

Information Retrieval
Comparison of different implementations of MFCC

Journal of Computer Science and Technology
Signal Processing Methods for Music Transcription

Signal Processing Methods for Music Transcription
Psychoacoustics: Facts and Models

Psychoacoustics: Facts and Models
Initialization enhancer for non-negative matrix factorization

Engineering Applications of Artificial Intelligence
A Generalized Divergence Measure for Nonnegative Matrix Factorization

Neural Computation
A discriminative model for polyphonic piano transcription

EURASIP Journal on Applied Signal Processing
Automatic transcription of melody, bass line, and chords in polyphonic music

Computer Music Journal
Nonnegative matrix factorization with the itakura-saito divergence: With application to music analysis

Neural Computation
Monaural music source separation: nonnegativity, sparseness, and shift-invariance

ICA'06 Proceedings of the 6th international conference on Independent Component Analysis and Blind Signal Separation
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria

IEEE Transactions on Audio, Speech, and Language Processing
Musical source separation using time-frequency source priors

IEEE Transactions on Audio, Speech, and Language Processing
Separation of synchronous pitched notes by spectral filtering of harmonics

IEEE Transactions on Audio, Speech, and Language Processing
Automatic Piano Transcription Using Frequency and Time-Domain Information

IEEE Transactions on Audio, Speech, and Language Processing
A connectionist approach to automatic transcription of polyphonic piano music

IEEE Transactions on Multimedia
Unsupervised analysis of polyphonic music by sparse coding

IEEE Transactions on Neural Networks

A general modular framework for audio source separation

LVA/ICA'10 Proceedings of the 9th international conference on Latent variable analysis and signal separation
Stability analysis of multiplicative update algorithms and application to nonnegative matrix factorization

IEEE Transactions on Neural Networks
Algorithms for nonnegative matrix factorization with the β-divergence

Neural Computation
Multiple instrument mixtures source separation evaluation using instrument-dependent NMF models

LVA/ICA'12 Proceedings of the 10th international conference on Latent Variable Analysis and Signal Separation
Multiple fundamental frequency estimation based on sparse representations in a structured dictionary

Digital Signal Processing
Instrument identification and pitch estimation in multi-timbre polyphonic musical signals based on probabilistic mixture model decomposition

Journal of Intelligent Information Systems
Automatic music transcription: challenges and future directions

Journal of Intelligent Information Systems

Quantified Score

Hi-index	0.02

Visualization

Abstract

Multiple pitch estimation consists of estimating the fundamental frequencies and saliences of pitched sounds over short time frames of an audio signal. This task forms the basis of several applications in the particular context of musical audio. One approach is to decompose the short-term magnitude spectrum of the signal into a sum of basis spectra representing individual pitches scaled by time-varying amplitudes, using algorithms such as nonnegative matrix factorization (NMF). Prior training of the basis spectra is often infeasible due to the wide range of possible musical instruments. Appropriate spectra must then be adaptively estimated from the data, which may result in limited performance due to overfitting issues. In this paper, we model each basis spectrum as a weighted sum of narrowband spectra representing a few adjacent harmonic partials, thus enforcing harmonicity and spectral smoothness while adapting the spectral envelope to each instrument. We derive a NMF-like algorithm to estimate the model parameters and evaluate it on a database of piano recordings, considering several choices for the narrowband spectra. The proposed algorithm performs similarly to supervised NMF using pre-trained piano spectra but improves pitch estimation performance by 6% to 10% compared to alternative unsupervised NMF algorithms.