In this work we propose a "copy and scale" method, based on the 1-NN paradigm, to estimate time-localized parameters, and we apply it to the problem of beat-tracking. The 1-NN algorithm assigns to an unknown target the information attached to the closest item of a pre-annotated database; it can be viewed as a "copy and paste" method. The "copy and scale" method we propose additionally "scales" this information to adapt it to the properties of the unknown target. To do so, we first represent the content of an audio signal using a sampled and tempo-normalized complex DFT. These representations are the vectors over which the 1-NN search is performed. Alongside each vector of the 1-NN space, we store the corresponding annotated beat-marker positions in a normalized form. Once the closest vector is found, its tempo is assigned to the unknown item and the normalized beat-markers are scaled to this tempo to estimate the beat-markers of the unknown item. We perform a preliminary evaluation of this method and show that, despite its simplicity, it achieves results comparable to those obtained with sophisticated approaches.
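The copy-and-scale step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. It rests on several assumptions: feature vectors are plain lists of floats standing in for the sampled, tempo-normalized complex DFT; the distance is Euclidean; normalized beat positions are expressed in units of one beat period; and the names `AnnotatedItem`, `nearest_item`, and `copy_and_scale` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class AnnotatedItem:
    """One entry of the pre-annotated database (hypothetical layout)."""
    features: List[float]          # stand-in for the tempo-normalized representation
    tempo_bpm: float               # annotated tempo in beats per minute
    normalized_beats: List[float]  # beat positions in units of one beat period


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance; the choice of metric is an assumption."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_item(target: Sequence[float],
                 database: Sequence[AnnotatedItem]) -> AnnotatedItem:
    """1-NN search: return the database item closest to the target vector."""
    if not database:
        raise ValueError("database is empty")
    return min(database, key=lambda item: euclidean(target, item.features))


def copy_and_scale(target: Sequence[float],
                   database: Sequence[AnnotatedItem]) -> Tuple[float, List[float]]:
    """Copy the tempo of the nearest item, then scale its normalized beats to seconds."""
    best = nearest_item(target, database)
    beat_period = 60.0 / best.tempo_bpm  # seconds per beat
    beats_sec = [b * beat_period for b in best.normalized_beats]
    return best.tempo_bpm, beats_sec


# Toy database with made-up values, for illustration only.
database = [
    AnnotatedItem([1.0, 0.0, 0.5], 120.0, [0.5, 1.5, 2.5, 3.5]),
    AnnotatedItem([0.0, 1.0, 0.2], 90.0, [0.0, 1.0, 2.0, 3.0]),
]

tempo, beats = copy_and_scale([0.9, 0.1, 0.4], database)
print(tempo, beats)
```

In this toy case the target is closest to the first item, so its tempo (120 BPM) is copied and its normalized beat positions are multiplied by the 0.5 s beat period. How the target's own representation is tempo-normalized before the search is left out of the sketch, since the abstract does not specify it.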