Sound ontology for computational auditory scence analysis

Authors:
Tomohiro Nakatani;Hiroshi G. Okuno
Affiliations:
-;-
Venue:
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Year:
1998

Citing 7
Cited 4

Auditory stream segregation in auditory scene analysis with a multi-agent system

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Computational auditory scene analysis

Computational auditory scene analysis
Adaptation method based on HMM composition and EM algorithm

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Understanding three simultaneous speeches

IJCAI'97 Proceedings of the 15th international joint conference on Artifical intelligence - Volume 1
Organization of hierarchical perceptual sounds: music scene analysis with autonomous processing modules and a quantitative information integration mechanism

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Residue-driven architecture for computational auditory scene analysis

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Interfacing sound stream segregation to automatic speech recognition: preliminary results on listening to several sounds simultaneously

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2

Pitch-dependent musical instrument identification and its application to musical sound ontology

IEA/AIE'2003 Proceedings of the 16th international conference on Developments in applied artificial intelligence
Lexical and perceptual grounding of a sound ontology

TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
First steps to an audio ontology-based classifier for telemedicine

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Ontology-Based classifier for audio scenes in telemedicine

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes that sound ontology should be used both as a common vocabulary for sound representation and as a common terminology for integrating various sound stream segregation systems. Since research on computational auditory scene analysis (CASA) focuses on recognizing and understanding various kinds of sounds, sound stream segregation which extracts each sound stream from a mixture of sounds is essential for CASA. Even if sound stream segregation systems use a harmonic structure of sound as a cue of segregation, it is not easy to integrate such systems because the definition of a harmonic structure differs or the precision of extracted harmonic structures differs. Therefore, sound ontology is needed as a common knowledge representation of sounds.Another problem is to interface sound stream segregation systems with applications such as automatic speech recognition systems. Since the requirement of the quality of segregated sound streams depends on applications, sound stream segregation systems must provide a flexible interface. Therefore, sound ontology is needed to fulfill the requirements imposed by them. In addition, the hierarchical structure of sound ontology provides a means of controlling top-down and bottom-up processing of sound stream segregation.