Content-Based Classification, Search, and Retrieval of Audio
IEEE MultiMedia
A Study on Content-Based Classification and Retrieval of Audio Database
IDEAS '01 Proceedings of the International Database Engineering & Applications Symposium
Similarity-Based Operators and Query Optimization for Multimedia Database Systems
IDEAS '01 Proceedings of the International Database Engineering & Applications Symposium
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
MARSYAS: a framework for audio analysis
Organised Sound
MARSYAS: a framework for audio analysis
Organised Sound
Learning word meanings and descriptive parameter spaces from music
HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6
Hierarchical classification of audio data for archiving and retrieving
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
Content-based methods for the management of digital music
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 04
An integrated approach for content-based video object segmentation and retrieval
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.00 |
The amount of available audio data is increasing rapidly in consequence of advancements in media creation, storage and compression technologies. This rapid increase imposes new demands in audio data management and retrieval. In this work, we proposed an audio data model and repository model to fulfill user requirements in retrieving audio data from large collections. The proposed audio data repository model facilitates a multi-criteria query formulation and audio data retrieval where by audio can be queried both by its low- and high-level features. In the proposed model, a generic audio repository model that can handle a general audio as well as a sub-repository model that can manipulate speech through its constituent units is discussed. Finally, the viability of the proposed model is demonstrated by a prototype system developed for an application in the medical domain.