Musical pitch estimation using a supervised single hidden layer feed-forward neural network

  • Authors:
  • Pat Taweewat;Chai Wutiwiwatchai

  • Affiliations:
  • School of Electrical and Information Engineering, The University of Sydney, Australia;National Electronics and Computer Technology Center, Thailand

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2013

Quantified Score

Hi-index 12.05

Visualization

Abstract

Musical pitch estimation is used to find musical note pitch or the fundamental frequency (F0) of audio signal which can be applied to a pre-processing part of many applications such as sound separation, musical note transcription, etc. In this work, a method for the pitch estimation based on classification framework has been designed using a supervised single hidden layer feed-forward neural network. To make this method have good performances in terms of generalization, high-speed training and small network size, two main investigations have been done. First, we find the suitable feature vector by comparing different performances of feature generation methods using extreme learning machine (ELM) framework for training the network. Second, different input-weight fine tuning methods have been compared for reducing the network size. We evaluated the method using multiple-pitch multi-instrument signals generated from datasets of real musical instrument recordings. For feature generation method, the feature vector generated from combining pitch histogram and pitch-frequency scaled spectrum shows the best performance in the experiment. For the fine tuning method, we compare ELM framework with Cuckoo search and sign-based propagation tunings. After the network size is further reduced to 40%, we found that the network trained with sign-based propagation tuning shows a better performance than that trained by ELM framework for the unseen dataset.