Audio Content Description in Sound Databases

Authors:
Alicja Wieczorkowska;Zbigniew W. Ras
Affiliations:
-;-
Venue:
WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
Year:
2001

Citing 8
Cited 4

Music, signals, and representations: a survey

Representations of musical signals
Qualitative aspects of signal processing through dynamic neural networks

Representations of musical signals
Principles of multimedia database systems

Principles of multimedia database systems
Multimedia information networking

Multimedia information networking
Multimedia: Concepts and Practice

Multimedia: Concepts and Practice
Towards Musical Data Classification via Wavelet Analysis

ISMIS '00 Proceedings of the 12th International Symposium on Foundations of Intelligent Systems
Optimizing Self-Organizing Timbre Maps: Two Approaches

Music, Gestalt, and Computing - Studies in Cognitive and Systematic Musicology
Karl Erich Schumann's Principles of Timbre as a Helpful Tool in Stream Segregation Research

Music, Gestalt, and Computing - Studies in Cognitive and Systematic Musicology

KDD-Based Approach to Musical Instrument Sound Recognition

ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
Application of Temporal Descriptors to Musical Instrument Sound Recognition

Journal of Intelligent Information Systems
Towards extracting emotions from music

IMTCI'04 Proceedings of the Second international conference on Intelligent Media Technology for Communicative Intelligence
Do we need automatic indexing of musical instruments?

IMTCI'04 Proceedings of the Second international conference on Intelligent Media Technology for Communicative Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Sound database indexing requires metadata to represent audio content of the data. If the metadata are not attached to the database by its creator, content information has to be extracted directly from sounds, using descriptors based on sound analysis. In this paper, authors present a number of sound descriptors based on various forms of signal analysis. Telescope Vector trees (TV-trees) and Frame Segment trees (FS-trees) are applied to represent audio content on the basis of the extracted sound descriptors and metadata provided by the database creator (if only available). Such a representation of audio content of the database is used to speed up the search of the audio material in multimedia databases.