The effect of voice cuing on releasing Chinese speech from informational masking

  • Authors:
  • Zhigang Yang;Jing Chen;Qiang Huang;Xihong Wu;Yanhong Wu;Bruce A. Schneider;Liang Li

  • Affiliations:
  • Department of Psychology, National Key Laboratory on Machine Perception, Speech and Hearing Research Center, Peking University, Beijing 100871, China;Department of Psychology, National Key Laboratory on Machine Perception, Speech and Hearing Research Center, Peking University, Beijing 100871, China;Department of Psychology, National Key Laboratory on Machine Perception, Speech and Hearing Research Center, Peking University, Beijing 100871, China;Department of Psychology, National Key Laboratory on Machine Perception, Speech and Hearing Research Center, Peking University, Beijing 100871, China;Department of Psychology, National Key Laboratory on Machine Perception, Speech and Hearing Research Center, Peking University, Beijing 100871, China;Department of Psychology, Centre for Research on Biological Communication Systems, University of Toronto at Mississauga, Mississauga, Ontario, Canada L5L 1C6;Department of Psychology, National Key Laboratory on Machine Perception, Speech and Hearing Research Center, Peking University, Beijing 100871, China and Department of Psychology, Centre for Resea ...

  • Venue:
  • Speech Communication
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a cocktail-party environment, human listeners are able to use perceptual-level and cognitive-level cues to segregate the attended target speech from other background conversations. At the cognitive level, priming the listener with part of the target speech in quiet can markedly improve the recognition of the remaining parts when the target speech and competing speech are presented at the same time. Hence, knowledge of content (content cuing) improves speech recognition when other people are talking. In addition, familiarity or knowledge of the voice characteristics of the target talker could also help the listener attend to the target talker when other talkers are present. The present study investigated the extent to which a cognitive-level cue (content cuing) and a perceptual-level cue (voice cuing) can improve word identification for speech masked by noise or by other speech in Chinese listeners. Specifically, listeners were primed with part of a sentence in quiet before a sentence was repeated in the presence of either noise or speech. The priming sentence was always in the same voice as the target sentence. Two kinds of primes were investigated: same-sentence primes, and different-sentence primes. Under speech-masking conditions, each of the two prime types significantly improved recognition of the last key word in the full-length target sentence. Under noise-masking conditions, same-sentence primes had a weak but significant releasing effect, but different-sentence primes had only a negligible releasing effect. These results suggest that in addition to content cues, voice cues can be used by Chinese listeners to release speech from masking by other talkers.