Time-frequency analysis: theory and applications
Time-frequency analysis: theory and applications
Evolution and Optimum Seeking: The Sixth Generation
Evolution and Optimum Seeking: The Sixth Generation
A discriminative model for polyphonic piano transcription
EURASIP Journal on Applied Signal Processing
Adaptive threshold determination for spectral peak classification
Computer Music Journal
The expected amplitude of overlapping partials of harmonic sounds
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Spectrogram segmentation by means of statistical features for non-stationary signal interpretation
IEEE Transactions on Signal Processing
Specmurt Analysis of Polyphonic Music Signals
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing
A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering
IEEE Transactions on Audio, Speech, and Language Processing
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
IEEE Transactions on Audio, Speech, and Language Processing
Instrument-Specific Harmonic Atoms for Mid-Level Music Representation
IEEE Transactions on Audio, Speech, and Language Processing
Low-Latency instrument separation in polyphonic audio using timbre models
LVA/ICA'12 Proceedings of the 10th international conference on Latent Variable Analysis and Signal Separation
Dissonance Reduction In Polyphonic Audio Using Harmonic Reorganization
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Hi-index | 0.00 |
This paper presents a frame-based system for estimating multiple fundamental frequencies (F0s) of polyphonic music signals based on the short-time Fourier transform (STFT) representation. To estimate the number of sources along with their F0s, it is proposed to estimate the noise level beforehand and then jointly evaluate all the possible combinations among pre-selected F0 candidates. Given a set of F0 hypotheses, their hypothetical partial sequences are derived, taking into account where partial overlap may occur. A score function is used to select the plausible sets of F0 hypotheses. To infer the best combination, hypothetical sources are progressively combined and iteratively verified. A hypothetical source is considered valid if it either explains more energy than the noise, or improves significantly the envelope smoothness once the overlapping partials are treated. The proposed system has been submitted to Music Information Retrieval Evaluation eXchange (MIREX) 2007 and 2008 contests where the accuracy has been evaluated with respect to the number of sources inferred and the precision of the F0s estimated. The encouraging results demonstrate its competitive performance among the state-of-the-art methods.