With the growing prevalence of large databases of multimedia content, methods for facilitating rapid browsing of such databases or the results of a database search are becoming increasingly important. However, these methods are necessarily media dependent. We present a system for producing short, representative samples (or "audio thumbnails") of selections of popular music. The system searches for structural redundancy within a given song with the aim of identifying something like a chorus or refrain. To isolate a useful class of features for performing such structure-based pattern recognition, we present a development of the chromagram, a variation on traditional time-frequency distributions that seeks to represent the cyclic attribute of pitch perception, known as chroma. The pattern recognition system itself employs a quantized chromagram that represents the spectral energy at each of the 12 pitch classes. We evaluate the system on a database of popular music and score its performance against a set of "ideal" thumbnail locations. Overall performance is found to be quite good, with the majority of errors resulting from songs that do not meet our structural assumptions.
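
The two ideas in the abstract, folding spectral energy into 12 pitch classes and searching a song for its most repeated passage, can be illustrated with a short sketch. This is not the authors' implementation: the function names, the frequency limits, the 440 Hz reference (which makes class 0 correspond to A), the cosine similarity measure, and the diagonal-averaging search are all assumptions chosen for illustration, and the abstract does not specify any of them.

```python
import numpy as np


def chroma_from_frame(frame, sample_rate, f_ref=440.0, f_min=55.0, f_max=4000.0):
    """Fold one audio frame's power spectrum into 12 pitch-class bins.

    Hypothetical helper: class 0 is the pitch class of f_ref (A for 440 Hz).
    """
    windowed = frame * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    keep = (freqs >= f_min) & (freqs <= f_max)
    # Nearest semitone relative to f_ref, wrapped into a single octave.
    semitones = np.round(12.0 * np.log2(freqs[keep] / f_ref)).astype(int)
    pitch_class = semitones % 12
    return np.bincount(pitch_class, weights=power[keep], minlength=12)


def find_thumbnail(chroma_seq, length):
    """Return (start_frame, score) of the `length`-frame segment that is most
    similar to a later, non-overlapping segment of the same song.

    chroma_seq has shape (n_frames, 12). Similarity is the cosine between
    mean-removed chroma vectors, averaged along diagonals of the
    self-similarity matrix (an assumed, simplified search strategy).
    """
    c = chroma_seq - chroma_seq.mean(axis=1, keepdims=True)
    c = c / np.maximum(np.linalg.norm(c, axis=1, keepdims=True), 1e-12)
    sim = c @ c.T
    n = len(c)
    best_start, best_score = 0, -np.inf
    for lag in range(length, n - length + 1):
        diag = np.diagonal(sim, offset=lag)  # sim[i, i + lag] for each i
        csum = np.concatenate(([0.0], np.cumsum(diag)))
        window_means = (csum[length:] - csum[:-length]) / length
        i = int(np.argmax(window_means))
        if window_means[i] > best_score:
            best_start, best_score = i, float(window_means[i])
    return best_start, best_score


if __name__ == "__main__":
    # Toy "song": an 8-frame chorus appears twice, separated by filler frames.
    chorus = [0, 4, 7, 5, 9, 2, 11, 6]
    classes = [1, 3, 8, 10, 1] + chorus + [3, 10, 8, 1, 10, 3, 8] + chorus + [1, 8, 3, 10]
    song = np.eye(12)[classes]
    print(find_thumbnail(song, length=8))
```

In the toy example each frame is a one-hot chroma vector, so the repeated chorus is the only segment whose averaged similarity reaches 1. A song with no repeated section gives this kind of search nothing to find, which is consistent with the abstract's remark that most errors come from songs that do not meet the structural assumptions.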