Prediction of Time-Varying Musical Mood Distributions Using Kalman Filtering

  • Authors:
  • Erik M. Schmidt;Youngmoo E. Kim

  • Affiliations:
  • -;-

  • Venue:
  • ICMLA '10 Proceedings of the 2010 Ninth International Conference on Machine Learning and Applications
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The medium of music has evolved specifically for the expression of emotions, and it is natural for us to organize music in terms of its emotional associations. In previous work, we have modeled human response labels to music in the arousal-valence (A-V) representation of affect as a time-varying, stochastic distribution reflecting the ambiguous nature of the perception of mood. These distributions are used to predict A-V responses from acoustic features of the music alone via multi-variate regression. In this paper, we extend our framework to account for multiple regression mappings contingent upon a general location in A-V space. Furthermore, we model A-V state as the latent variable of a linear dynamical system, more explicitly capturing the dynamics of musical mood. We validate this extension using a "genie-bounded" approach, in which we assume that a piece of music is correctly clustered in A-V space a priori, demonstrating significantly higher theoretical performance than the previous single-regressor approach.