A robust statistical-based speaker's location detection algorithm in a vehicular environment

Authors:
Jwu-Sheng Hu;Chieh-Cheng Cheng;Wei-Han Liu
Affiliations:
Department of Electrical and Control Engineering, National Chiao Tung University, Hsinchu, Taiwan;Department of Electrical and Control Engineering, National Chiao Tung University, Hsinchu, Taiwan;Department of Electrical and Control Engineering, National Chiao Tung University, Hsinchu, Taiwan
Venue:
EURASIP Journal on Applied Signal Processing
Year:
2007

Citing 11
Cited 1

A Robust Method for Speech Signal Time-Delay Estimation in Reverberant Rooms

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), Volume 1
Near-Field Beamforming for Microphone Arrays

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), Volume 1
Frequency Domain Microphone Array Calibration and Beamforming for Automatic Speech Recognition

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Acoustic source location in noisy and reverberant environment using CSP analysis

ICASSP '96 Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, Volume 2
Classification of time delay estimates for robust speaker localization

ICASSP '99 Proceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, Volume 6
Tracking of multiple moving sources using recursive EM algorithm

EURASIP Journal on Applied Signal Processing
Signal enhancement using beamforming and nonstationarity with applications to speech

IEEE Transactions on Signal Processing
Modular fuzzy-neuro controller driven by spoken language commands

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Phase-based dual-microphone robust speech enhancement

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Robust speaker's location detection in a vehicle environment using GMM models

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Background noise reduction via dual-channel scheme for speech recognition in vehicular environment

IEEE Transactions on Consumer Electronics

Detecting driver phone use leveraging car speakers

MobiCom '11 Proceedings of the 17th annual international conference on Mobile computing and networking

Abstract

This work presents a robust speaker's location detection algorithm that uses a single linear microphone array and can detect multiple speech sources, under the assumption that nonoverlapped speech segments exist among the sources. That is, overlapped speech segments are treated as uncertain and are not used for detection. The location detection algorithm is derived from a previous work (2006), in which Gaussian mixture models (GMMs) are used to model phase difference distributions that are location-dependent but content- and speaker-independent. The proposed algorithm is shown to be robust against complex vehicular acoustics, including noise, reverberation, near-field, far-field, line-of-sight, and non-line-of-sight conditions, as well as microphone mismatch. An adaptive system architecture is developed to adjust the Gaussian mixture (GM) location model to environmental noise. To deal with unmodeled speech sources and overlapped speech signals, a threshold adaptation scheme is also proposed. Experimental results demonstrate high detection accuracy in a noisy vehicular environment.
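
The idea in the abstract of modeling phase differences per location with a GMM and rejecting poorly explained input with a threshold can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a single scalar phase difference per frame (one microphone pair, one frequency bin), ignores phase wrap-around, uses synthetic data, and replaces the paper's threshold adaptation scheme with a fixed, hand-picked threshold. All names (`fit_gmm_1d`, `avg_log_likelihood`, `detect_location`, `THRESHOLD`) and the location labels are hypothetical.

```python
import math
import random


def gauss_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(samples, n_components=2, n_iter=50, seed=0):
    """Fit a 1-D Gaussian mixture with plain EM; returns (weights, means, variances)."""
    rng = random.Random(seed)
    n = len(samples)
    means = rng.sample(samples, n_components)  # initialise means from the data
    grand_mean = sum(samples) / n
    grand_var = sum((x - grand_mean) ** 2 for x in samples) / n
    variances = [max(grand_var, 1e-4)] * n_components
    weights = [1.0 / n_components] * n_components
    for _ in range(n_iter):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in samples:
            p = [w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(p) or 1e-300
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weights, means, and (floored) variances.
        for k in range(n_components):
            nk = sum(r[k] for r in resp) + 1e-12
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, samples)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, samples)) / nk
            variances[k] = max(var_k, 1e-4)
    return weights, means, variances


def avg_log_likelihood(frames, gmm):
    """Average per-frame log-likelihood of the frames under one location model."""
    weights, means, variances = gmm
    total = 0.0
    for x in frames:
        p = sum(w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances))
        total += math.log(max(p, 1e-300))
    return total / len(frames)


def detect_location(frames, models, threshold):
    """Return the best-scoring location, or None if no model explains the frames."""
    scores = {name: avg_log_likelihood(frames, gmm) for name, gmm in models.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else None


# Synthetic phase differences (radians) for two hypothetical seats.
rng = random.Random(1)
train = {
    "driver": [rng.gauss(-1.0, 0.2) for _ in range(300)],
    "passenger": [rng.gauss(1.0, 0.2) for _ in range(300)],
}
models = {name: fit_gmm_1d(data) for name, data in train.items()}

THRESHOLD = -3.0  # assumption: fixed by hand, not the paper's adaptive scheme
driver_frames = [rng.gauss(-1.0, 0.2) for _ in range(50)]
unknown_frames = [rng.gauss(0.0, 0.2) for _ in range(50)]  # unmodeled source

print(detect_location(driver_frames, models, THRESHOLD))
print(detect_location(unknown_frames, models, THRESHOLD))
```

In this toy setup, frames from a modeled seat score well under that seat's mixture, while frames from an unmodeled position score poorly under every mixture and are rejected. A practical system would use multidimensional phase-difference features across frequency bands and microphone pairs, and would adapt both the models and the threshold to the noise conditions, as the abstract describes.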