Signal-to-score music transcription using graphical models

Authors:
Emir Kapanci;Avi Pfeffer
Affiliations:
Harvard University, Division of Engineering and Applied Sciences, Cambridge, MA;Harvard University, Division of Engineering and Applied Sciences, Cambridge, MA
Venue:
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Year:
2005

Citing 4
Cited 2

A hybrid graphical model for rhythmic parsing

Artificial Intelligence
Monte Carlo methods for tempo tracking and rhythm quantization

Journal of Artificial Intelligence Research
Approximate inference for first-order probabilistic languages

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
A connectionist approach to automatic transcription of polyphonic piano music

IEEE Transactions on Multimedia

Algorithms for Computing Geometric Measures of Melodic Similarity

Computer Music Journal
Aligning music audio with symbolic scores using a hybrid graphical model

Machine Learning

Quantified Score

Hi-index	0.02

Visualization

Abstract

We present a transcription system that takes a music signal as input and returns its musical score. Two stages of processing are used. The first employs a fundamental frequency detector and an onset detector to transform input signals into a sequence of sound events. The onset detection is inherently noisy. This paper focuses on the second stage, going from sound events to a notated score. We use a family of graphical models for this task. We allow the results of onset detection to be noisy, necessitating a search over possible segmentations of the sound events. We use a large corpus of monophonic vocal music to evaluate our system. Our results show that our approach is well-suited to the problem of music transcription. The initial onset detection reduces the number of observations and makes the system less instrument specific. The search over segmentations corrects the errors in the onset detection. Without such reasoning, these errors are magnified in subsequent rhythm transcription.