Residue-driven architecture for computational auditory scene analysis

Authors:
Tomohiro Nakatani;Hiroshi G. Okuno;Takeshi Kawabata
Affiliations:
NTT Basic Research Laboratories, Nippon Telegraph and Telephone Corporation, Atsugi, Kanagawa, Japan;NTT Basic Research Laboratories, Nippon Telegraph and Telephone Corporation, Atsugi, Kanagawa, Japan;NTT Basic Research Laboratories, Nippon Telegraph and Telephone Corporation, Atsugi, Kanagawa, Japan
Venue:
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Year:
1995

Citing 4
Cited 9

The society of mind

The society of mind
Auditory stream segregation in auditory scene analysis with a multi-agent system

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Multiagent based binaural sound stream segregation

Computational auditory scene analysis
The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty

ACM Computing Surveys (CSUR)

The role of data reprocessing in complex acoustic environments

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Sound ontology for computational auditory scence analysis

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Sound and Visual Tracking for Humanoid Robot

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Computing auditory perception

Organised Sound
Sound and Visual Tracking for Humanoid Robot

Applied Intelligence
A music stream segregation system based on adaptive multi-agents

IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
Understanding three simultaneous speeches

IJCAI'97 Proceedings of the 15th international joint conference on Artifical intelligence - Volume 1
Humanoid active audition system improved by the cover acoustics

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Interfacing sound stream segregation to automatic speech recognition: preliminary results on listening to several sounds simultaneously

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Residue-Driven Architecture presented here is a model of auditory stream segregation from input sounds. A subsystem to extract auditory streams by using some sound attributes is called an agency and the design of each agency is based on the residue-driven architecture. This architecture consists of three kinds of agents: an event-detector, a tracer-generator, and tracers. The event-detector calculates a residue by subtracting the predicted input from the actual input. When a residue exceeds a threshold value, tracer-generator generates a tracerthat extracts an auditory stream from the residue and returns a predicted input of the next time frame to the event-detector. This aproach improves the performance of segregation and the resulting system can segregate a woman's voiced stream, a man's voiced stream, and a noise stream from a mixture of these sounds. Binaural segregation is also designed by the architecture.