Understanding three simultaneous speeches

  • Authors:
  • Hiroshi G. Okuno; Tomohiro Nakatani; Takeshi Kawabata

  • Affiliations:
  • NTT Basic Research Laboratories, Atsugi, Kanagawa, Japan (all authors)

  • Venue:
  • IJCAI'97 Proceedings of the 15th International Joint Conference on Artificial Intelligence - Volume 1
  • Year:
  • 1997

Abstract

Understanding three simultaneous speeches is proposed as a challenge problem to foster research in artificial intelligence, speech and sound understanding or recognition, and computational auditory scene analysis. Automatic speech recognition in noisy environments is usually addressed with speech enhancement techniques such as noise reduction and speaker adaptation. However, the signal-to-noise ratio of a single voice within two simultaneous speeches is too poor for these techniques to apply, so novel techniques need to be developed. One candidate is to use speech stream segregation as a front end for automatic speech recognition systems. Preliminary experiments on understanding two simultaneous speeches show that the proposed challenge problem should be feasible with speech stream segregation. A detailed research plan and benchmark sounds for the proposed challenge problem are also presented.
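
The abstract's signal-to-noise argument can be illustrated numerically: when two talkers speak at comparable levels, each talker's "noise" is the other talker, so the per-speaker SNR sits near 0 dB, far below the operating range of typical noise-reduction front ends. The sketch below is not from the paper; it uses synthetic stand-in signals (sine tones with random amplitude variation) rather than real speech, purely to show the SNR computation.

```python
# Minimal sketch (not from the paper): per-speaker SNR in a two-talker mixture.
# All signals are synthetic placeholders for speech, at roughly equal power.
import numpy as np

rng = np.random.default_rng(0)
fs = 16000                               # assumed sampling rate
t = np.arange(fs) / fs                   # one second of signal

# Stand-ins for two simultaneous utterances at comparable loudness.
speaker_a = np.sin(2 * np.pi * 220 * t) * rng.uniform(0.5, 1.0, t.size)
speaker_b = np.sin(2 * np.pi * 330 * t) * rng.uniform(0.5, 1.0, t.size)
mixture = speaker_a + speaker_b

def snr_db(target, interference):
    """SNR of `target` relative to `interference`, in decibels."""
    return 10 * np.log10(np.sum(target**2) / np.sum(interference**2))

# For speaker A, speaker B is the "noise" (and vice versa); both values
# come out near 0 dB, which is why the paper argues for stream segregation
# instead of conventional enhancement.
print(f"SNR of speaker A in the mixture: {snr_db(speaker_a, speaker_b):.1f} dB")
print(f"SNR of speaker B in the mixture: {snr_db(speaker_b, speaker_a):.1f} dB")
```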