General sound classification and similarity in MPEG-7

Authors:
Michael Casey
Affiliations:
MERL Cambridge Research Laboratory, Cambridge, USA E-mail: casey@merl.com
Venue:
Organised Sound
Year:
2001

Abstract

We introduce a system for generalised sound classification and similarity using a machine-learning framework. Applications of the system include automatic classification of environmental sounds, musical instruments, music genre and human speakers. In addition to classification, the system may also be used for computing similarity metrics between a target sound and other sounds in a database. We discuss the use of hidden Markov models for representing the temporal evolution of audio spectra and present results of testing the system on classification and retrieval tasks. The system has been incorporated into the MPEG-7 international standard for multimedia content description and is therefore publicly available in the form of a set of standardised interfaces and software reference tools for developers and researchers.
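The classification step described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each sound class has its own discrete-observation hidden Markov model and that an audio spectrum sequence has already been quantised into integer symbols; a sound is then assigned to the class whose model gives the highest likelihood. The function names, the two toy classes ("steady", "alternating") and all parameter values are hypothetical and are not taken from the paper or from the MPEG-7 reference software.

```python
import numpy as np


def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM.

    obs   : sequence of integer symbols (assumed: quantised spectral frames)
    start : (S,) initial state probabilities
    trans : (S, S) transitions, trans[i, j] = P(next state j | state i)
    emit  : (S, K) emissions, emit[i, k] = P(symbol k | state i)
    """
    alpha = start * emit[:, obs[0]]
    scale = alpha.sum()
    log_p = np.log(scale)
    alpha = alpha / scale  # rescale each step to avoid numerical underflow
    for symbol in obs[1:]:
        alpha = (alpha @ trans) * emit[:, symbol]
        scale = alpha.sum()
        log_p += np.log(scale)
        alpha = alpha / scale
    return log_p


def classify(obs, models):
    """Return the best-scoring class name and the per-class log-likelihoods."""
    scores = {name: log_likelihood(obs, *params) for name, params in models.items()}
    return max(scores, key=scores.get), scores


# Hypothetical toy models: two states, two symbols (0 = low energy, 1 = high energy).
start = np.array([0.5, 0.5])
emit = np.array([[0.9, 0.1],
                 [0.1, 0.9]])
models = {
    # States tend to persist, so symbols come in long runs.
    "steady": (start, np.array([[0.9, 0.1], [0.1, 0.9]]), emit),
    # States tend to flip at every frame, so symbols alternate.
    "alternating": (start, np.array([[0.1, 0.9], [0.9, 0.1]]), emit),
}

steady_obs = [0, 0, 0, 0, 1, 1, 1, 1]
alternating_obs = [0, 1, 0, 1, 0, 1, 0, 1]

if __name__ == "__main__":
    for obs in (steady_obs, alternating_obs):
        label, scores = classify(obs, models)
        print(obs, "->", label, scores)
```

Under the same assumptions, the per-class log-likelihoods returned by `classify` could also serve as a crude basis for comparing a target sound with other sounds, although this sketch does not reproduce the similarity metrics used in the system itself.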