Fundamentals of statistical signal processing: estimation theory
Fundamentals of statistical signal processing: estimation theory
Fundamentals of speech recognition
Fundamentals of speech recognition
N4SID: subspace algorithms for the identification of combined deterministic-stochastic systems
Automatica (Journal of IFAC) - Special issue on statistical signal processing and control
Visualizing music and audio using self-similarity
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
International Journal of Computer Vision
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Music thumbnailing via structural analysis
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Probabilistic Kernels for the Classification of Auto-Regressive Visual Processes
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Variational Learning for Switching State-Space Models
Neural Computation
Using duration models to reduce fragmentation in audio segmentation
Machine Learning
Music summarization using key phrases
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
A music search engine built upon audio-based and web-based similarity measures
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Modeling, Clustering, and Segmenting Video with Mixtures of Dynamic Textures
IEEE Transactions on Pattern Analysis and Machine Intelligence
Dynamic texture models of music
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
IEEE Transactions on Pattern Analysis and Machine Intelligence
Structural Segmentation of Musical Audio by Constrained Clustering
IEEE Transactions on Audio, Speech, and Language Processing
Semantic Annotation and Retrieval of Music and Sound Effects
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing
An experimental comparison of audio tempo induction algorithms
IEEE Transactions on Audio, Speech, and Language Processing
Discovering nontrivial repeating patterns in music data
IEEE Transactions on Multimedia
"The way it Sounds": timbre models for analysis and retrieval of music signals
IEEE Transactions on Multimedia
MUSIZ: a generic framework for music resizing with stretching and cropping
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Cross matching of music and image
Proceedings of the 20th ACM international conference on Multimedia
Dynamic texture analysis and segmentation using deterministic partially self-avoiding walks
Expert Systems with Applications: An International Journal
Dynamic texture segmentation based on deterministic partially self-avoiding walks
Computer Vision and Image Understanding
Hi-index | 0.00 |
We consider representing a short temporal fragment of musical audio as a dynamic texture, a model of both the timbral and rhythmical qualities of sound, two of the important aspects required for automatic music analysis. The dynamic texture model treats a sequence of audio feature vectors as a sample from a linear dynamical system. We apply this new representation to the task of automatic song segmentation. In particular, we cluster audio fragments, extracted from a song, as samples from a dynamic texture mixture (DTM) model. We show that the DTM model can both accurately cluster coherent segments in music and detect transition boundaries. Moreover, the generative character of the proposed model of music makes it amenable for a wide range of applications besides segmentation. As examples, we use DTM models of songs to suggest possible improvements in other music information retrieval applications such as music annotation and similarity.