Bootstrap learning for accurate onset detection

Authors:
Ning Hu;Roger B. Dannenberg
Affiliations:
Google Inc. New York Office, New York, United States 10018;Computer Science Department, Carnegie Mellon University, Pittsburgh, United States 15213
Venue:
Machine Learning
Year:
2006

Citing 4
Cited 2

Kolmogorov's theorem and multilayer neural networks

Neural Networks
Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
A comparison of melodic database retrieval techniques using sung queries

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Bootstrap learning for place recognition

Eighteenth national conference on Artificial intelligence

New interfaces for popular music performance

NIME '07 Proceedings of the 7th international conference on New interfaces for musical expression
The intelligent music editor: towards an automated platform for music analysis and editing

ICIC'10 Proceedings of the Advanced intelligent computing theories and applications, and 6th international conference on Intelligent computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Supervised learning models have been applied to create good onset detection systems for musical audio signals. However, this always requires a large set of labeled training examples, and hand-labeling is quite tedious and time consuming. In this paper, we present a bootstrap learning approach to train an accurate note onset detection model. Audio alignment techniques are first used to find the correspondence between a symbolic music representation (such as MIDI data) and an acoustic recording. This alignment provides an initial estimate of note boundaries which can be used to train an onset detector. Once trained, the detector can be used to refine the initial set of note boundaries and training can be repeated. This iterative training process eliminates the need for hand-labeled audio. Tests show that this training method can improve an onset detector initially trained on synthetic data.