Polyphonic transcription: exploring a hybrid of tone models and particle swarm optimisation

Authors:
Somnuk Phon-Amnuaisuk
Affiliations:
Music Informatics Research Group, Faculty of Creative Industries, Universiti Tunku Abdul Rahman, Selangor Darul Ehsan, Malaysia
Venue:
EvoMUSART'12 Proceedings of the First international conference on Evolutionary and Biologically Inspired Music, Sound, Art and Design
Year:
2012

Abstract

Polyphonic transcription could be formulated as a supervised classification task if classifiers for all possible polyphonic combinations could be learned beforehand. In practice, however, this is infeasible because the number of possible polyphonic combinations grows exponentially. Here, we describe a novel polyphonic transcription approach that applies a hybrid of Particle Swarm Optimisation (PSO) and Tone-model techniques. This hybrid approach exploits the strengths of both the heuristic search and the model-based approaches. In our work, only the monophonic Tone-models of all pitches are learned; they are employed to calculate the first-pass output of polyphonic transcription, which is then refined in the second pass by PSO. The experimental results show that the proposed hybrid approach outperforms the competing Non-negative Matrix Factorisation (NMF) approach. This paper presents and discusses the design and the experimental results of this novel approach.
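
The two-pass idea in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the abstract does not specify the tone-model form, the fitness function, or the PSO settings, so everything below is an assumption chosen for illustration. The tone models are synthetic harmonic templates (tone_model), the first pass is a simple projection of the observed spectrum onto each template (first_pass), and the second pass is a textbook global-best PSO over per-pitch activations that minimises squared reconstruction error (pso_refine). All names, the bin layout in FUNDAMENTALS, and the inertia and acceleration constants are hypothetical.

```python
import random

N_BINS = 64
FUNDAMENTALS = [4, 5, 6, 7, 9]  # hypothetical fundamental bins, one per pitch


def tone_model(f0, n_bins=N_BINS):
    """Synthetic monophonic template: harmonics of f0 with 1/k amplitudes."""
    spectrum = [0.0] * n_bins
    k = 1
    while k * f0 < n_bins:
        spectrum[k * f0] += 1.0 / k
        k += 1
    return spectrum


def mix(weights, models):
    """Linear mixture of tone models weighted by per-pitch activations."""
    out = [0.0] * len(models[0])
    for w, model in zip(weights, models):
        for i, v in enumerate(model):
            out[i] += w * v
    return out


def error(weights, models, observed):
    """Squared reconstruction error between the mixture and the observation."""
    return sum((m - o) ** 2 for m, o in zip(mix(weights, models), observed))


def first_pass(models, observed):
    """Estimate each activation by projecting the observation onto one model."""
    scores = []
    for model in models:
        dot = sum(m * o for m, o in zip(model, observed))
        norm = sum(m * m for m in model)
        scores.append(min(1.0, max(0.0, dot / norm)))  # clip to [0, 1]
    return scores


def pso_refine(seed, models, observed, n_particles=20, n_iters=100, rng=None):
    """Global-best PSO over activations in [0, 1], seeded with the first pass."""
    rng = rng or random.Random(0)
    dim = len(seed)
    # One particle starts at the first-pass estimate; the rest start at random.
    positions = [list(seed)] + [
        [rng.random() for _ in range(dim)] for _ in range(n_particles - 1)
    ]
    velocities = [[0.0] * dim for _ in range(n_particles)]
    pbest = [list(p) for p in positions]
    pbest_err = [error(p, models, observed) for p in positions]
    g = min(range(n_particles), key=pbest_err.__getitem__)
    gbest, gbest_err = list(pbest[g]), pbest_err[g]
    inertia, c1, c2 = 0.7, 1.5, 1.5  # assumed, commonly used PSO constants
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                velocities[i][d] = (
                    inertia * velocities[i][d]
                    + c1 * rng.random() * (pbest[i][d] - positions[i][d])
                    + c2 * rng.random() * (gbest[d] - positions[i][d])
                )
                moved = positions[i][d] + velocities[i][d]
                positions[i][d] = min(1.0, max(0.0, moved))
            err = error(positions[i], models, observed)
            if err < pbest_err[i]:
                pbest[i], pbest_err[i] = list(positions[i]), err
                if err < gbest_err:
                    gbest, gbest_err = list(positions[i]), err
    return gbest, gbest_err


models = [tone_model(f0) for f0 in FUNDAMENTALS]
true_weights = [1.0, 0.0, 1.0, 0.0, 0.0]  # a synthetic two-note chord
observed = mix(true_weights, models)

first = first_pass(models, observed)
first_err = error(first, models, observed)
refined, refined_err = pso_refine(first, models, observed)
print("first-pass error:", round(first_err, 4))
print("refined error:   ", round(refined_err, 4))
```

Because one particle is seeded with the first-pass estimate and the global best is only ever replaced by a lower-error position, the refined error in this sketch can never exceed the first-pass error. The templates for bins 4 and 6 share harmonics, which is the kind of overlap that makes a per-pitch first pass overestimate and leaves room for a search-based refinement.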