Three dimensions of pitched instrument onset detection

  • Authors:
  • André Holzapfel;Yannis Stylianou;Ali C. Gedik;Bariş Bozkurt

  • Affiliations:
  • Institute of Computer Science, FORTH, Heraklion, Crete, Greece and Computer Science Department Multimedia Informatics Laboratory, University of Crete, Heraklion, Crete, Greece;Electrical and Electronics Engineering Department, Izmir Institute of Technology, Urla, Izmir, Turkey;Electrical and Electronics Engineering Department, Izmir Institute of Technology, Urla, Izmir, Turkey;Electrical and Electronics Engineering Department, Izmir Institute of Technology, Urla, Izmir, Turkey

  • Venue:
  • IEEE Transactions on Audio, Speech, and Language Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we suggest a novel group delay based method for the onset detection of pitched instruments. It is proposed to approach the problem of onset detection by examining three dimensions separately: phase (i.e., group delay), magnitude and pitch. The evaluation of the suggested onset detectors for phase, pitch and magnitude is performed using a new publicly available and fully onset annotated database of monophonic recordings which is balanced in terms of included instruments and onset samples per instrument, while it contains different performance styles. Results show that the accuracy of onset detection depends on the type of instruments as well as on the style of performance. Combining the information contained in the three dimensions by means of a fusion at decision level leads to an improvement of onset detection by about 8% in terms of F-measure, compared to the best single dimension.