Music and timbre segmentation by recursive constrained K-means clustering

  • Authors:
  • Sebastian Krey;Uwe Ligges;Friedrich Leisch

  • Affiliations:
  • Fakultät Statistik, Technische Universität Dortmund, Dortmund, Germany 44221;Fakultät Statistik, Technische Universität Dortmund, Dortmund, Germany 44221;Institut für angewandte Statistik und EDV, Universität für Bodenkultur Wien, Wien, Austria 1190

  • Venue:
  • Computational Statistics
  • Year:
  • 2014

Abstract

Clustering of features generated from musical sound recordings has proved beneficial for subsequent classification tasks such as instrument recognition (Ligges and Krey in Comput Stat 26(2):279-291, 2011). We propose using order-constrained solutions in K-means clustering to stabilize the results and improve the interpretability of the clustering. With this method, a further reduction of the misclassification error in the aforementioned instrument recognition task is possible. Using order-constrained K-means, the musical structure of a whole piece of popular music can be extracted automatically. Visualizing the distances between the feature vectors in a self-distance matrix allows for easy visual verification of the result. To estimate the right number of clusters, we propose calculating the adjusted Rand indices of bootstrap samples of the data and basing the decision on the minimum of a robust version of the coefficient of variation. In addition to the average stability (measured by the adjusted Rand index), this approach takes the variation between the different bootstrap samples into account.
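
To make the idea of an order constraint concrete, the following is a minimal sketch, not the authors' implementation. It assumes that "order constrained" means that cluster labels may only stay the same or increase along the time axis, so that each cluster is one contiguous run of frames. Instead of an iterative K-means update, it uses a plain dynamic program that finds the contiguous partition with the smallest within-cluster sum of squares. The function name order_constrained_segments and the toy feature values are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def order_constrained_segments(features: Sequence[Sequence[float]], k: int) -> List[int]:
    """Split a time-ordered list of feature vectors into k contiguous clusters.

    Returns one label per frame; labels are non-decreasing in time.
    Assumption: this dynamic program stands in for order-constrained K-means.
    """
    n = len(features)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of frames")
    d = len(features[0])

    # Prefix sums of the vectors and of their squared norms, so that the
    # sum of squares of any segment can be computed without re-scanning it.
    prefix_sum = [[0.0] * d]
    prefix_sq = [0.0]
    for v in features:
        prefix_sum.append([a + b for a, b in zip(prefix_sum[-1], v)])
        prefix_sq.append(prefix_sq[-1] + sum(c * c for c in v))

    def sse(i: int, j: int) -> float:
        """Within-segment sum of squares for frames i .. j-1."""
        m = j - i
        total_sq = sum((prefix_sum[j][t] - prefix_sum[i][t]) ** 2 for t in range(d))
        return (prefix_sq[j] - prefix_sq[i]) - total_sq / m

    inf = float("inf")
    # best[c][j]: minimal cost of splitting the first j frames into c segments.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    cut = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for c in range(1, k + 1):
        for j in range(c, n + 1):
            for i in range(c - 1, j):
                cand = best[c - 1][i] + sse(i, j)
                if cand < best[c][j]:
                    best[c][j] = cand
                    cut[c][j] = i

    # Walk the stored cut points backwards to recover the labels.
    labels = [0] * n
    j = n
    for c in range(k, 0, -1):
        i = cut[c][j]
        for t in range(i, j):
            labels[t] = c - 1
        j = i
    return labels


# Toy one-dimensional "features" with three obvious phases.
toy = [[0.0], [0.1], [0.2], [5.0], [5.1], [9.9], [10.0]]
print(order_constrained_segments(toy, 3))  # prints [0, 0, 0, 1, 1, 2, 2]
```

Under this assumption a sound that recurs later in the recording receives a new label rather than its earlier one, because labels can never decrease. The exact search costs on the order of k times n squared segment evaluations, which is acceptable for a sketch but may be too slow for long recordings.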