In this paper, we present a graphical model for polyphonic music transcription. Our model, formulated as a dynamic Bayesian network, embodies a transparent and computationally tractable approach to this acoustic analysis problem. An advantage of our approach is that it places emphasis on explicitly modeling the sound generation procedure. It provides a clear framework in which high-level (cognitive) prior information on music structure can be coupled with low-level (acoustic-physical) information in a principled manner to perform the analysis. The model is a special case of the generally intractable switching Kalman filter model. Where possible, we derive exact polynomial-time inference procedures, and otherwise efficient approximations. We argue that our generative-model-based approach is computationally feasible for many music applications and is readily extensible to more general auditory scene analysis scenarios.
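The switching Kalman filter structure referred to above can be made concrete with a small sketch. The following Python fragment is an illustrative assumption, not the paper's implementation: the parameter values, the GPB(1) collapsing approximation, and names such as `filter_gpb1` are all ours. It filters a single damped oscillator whose decay rate is controlled by a binary mute/sound switch, which is the kind of special case of a switching linear dynamical system the abstract describes.

```python
import numpy as np

# --- Hypothetical parameters (illustrative, not from the paper) ---
omega = 2 * np.pi * 440 / 8000          # one note at 440 Hz, 8 kHz sampling
rot = np.array([[np.cos(omega), -np.sin(omega)],
                [np.sin(omega),  np.cos(omega)]])
A = {0: 0.30 * rot,                     # s_t = 0: "mute", fast decay
     1: 0.999 * rot}                    # s_t = 1: "sound", slow decay
C = np.array([[1.0, 0.0]])              # observe one phase component
Q, R = 1e-4 * np.eye(2), 1e-2           # transition / observation noise
P = np.array([[0.9, 0.1],               # switch transition: P[i, j] is
              [0.1, 0.9]])              # p(s_t = i | s_{t-1} = j)

def kalman_step(mu, V, y, A_s):
    """One Kalman predict/update; returns posterior and log p(y | past)."""
    mu_p, V_p = A_s @ mu, A_s @ V @ A_s.T + Q
    S = (C @ V_p @ C.T).item() + R      # innovation variance
    K = (V_p @ C.T) / S                 # Kalman gain
    e = y - (C @ mu_p).item()           # innovation
    ll = -0.5 * (np.log(2 * np.pi * S) + e * e / S)
    return mu_p + (K * e).ravel(), (np.eye(2) - K @ C) @ V_p, ll

def filter_gpb1(ys):
    """Assumed-density filtering: after each step, collapse the 2-component
    Gaussian mixture over s_t back to one Gaussian (GPB(1) approximation)."""
    mu, V = np.zeros(2), np.eye(2)
    w = np.array([0.5, 0.5])            # p(s_t | y_{1:t})
    path = []
    for y in ys:
        prior = P @ w                   # predicted switch distribution
        logw, stats = np.zeros(2), {}
        for s in (0, 1):
            m, cov, ll = kalman_step(mu, V, y, A[s])
            logw[s] = np.log(prior[s] + 1e-300) + ll
            stats[s] = (m, cov)
        w = np.exp(logw - logw.max()); w /= w.sum()
        # moment-match the mixture to a single Gaussian
        mu = sum(w[s] * stats[s][0] for s in (0, 1))
        V = sum(w[s] * (stats[s][1] +
                        np.outer(stats[s][0] - mu, stats[s][0] - mu))
                for s in (0, 1))
        path.append(int(w.argmax()))
    return path

# Quick check on a synthetic note: silence, onset, decay, silence.
t = np.arange(400)
y = np.where((t > 100) & (t < 300),
             np.exp(-0.005 * (t - 100)) * np.cos(omega * t), 0.0)
y = y + 0.05 * np.random.default_rng(0).normal(size=t.size)
print(filter_gpb1(y)[::50])             # estimated switch states, subsampled
```

The sketch makes the tractability issue visible: exact filtering would have to track all 2^T switch paths, since each mute/sound choice spawns a new Gaussian component, so some collapsing or pruning scheme is needed in general. The GPB(1) collapse used here is one standard generic approximation; the polynomial-time exact procedures and approximations mentioned in the abstract are the paper's own, potentially different, schemes.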