A Robust Method for Speech Signal Time-Delay Estimation in Reverberant Rooms
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Near-Field Beamforming for Microphone Arrays
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Frequency Domain Microphone Array Calibration and Beamforming for Automatic Speech Recognition
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Acoustic source location in noisy and reverberant environment using CSP analysis
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Classification of time delay estimates for robust speaker localization
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 06
Tracking of multiple moving sources using recursive EM algorithm
EURASIP Journal on Applied Signal Processing
Signal enhancement using beamforming and nonstationarity withapplications to speech
IEEE Transactions on Signal Processing
Modular fuzzy-neuro controller driven by spoken language commands
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Phase-based dual-microphone robust speech enhancement
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Robust speaker's location detection in a vehicle environment using GMM models
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Background noise reduction via dual-channel scheme for speech recognition in vehicular environment
IEEE Transactions on Consumer Electronics
Detecting driver phone use leveraging car speakers
MobiCom '11 Proceedings of the 17th annual international conference on Mobile computing and networking
Hi-index | 0.00 |
This work presents a robust speaker's location detection algorithm using a single linear microphone array that is capable of detecting multiple speech sources under the assumption that there exist nonoverlapped speech segments among sources. Namely, the overlapped speech segments are treated as uncertainty and are not used for detection. The location detection algorithm is derived from a previous work (2006), where Gaussian mixture models (GMMs) are used to model location-dependent and content and speaker-independent phase difference distributions. The proposed algorithm is proven to be robust against the complex vehicular acoustics including noise, reverberation, near-filed, far-field, line-of-sight, and non-line-of-sight conditions, and microphones' mismatch. An adaptive system architecture is developed to adjust the Gaussian mixture (GM) location model to environmental noises. To deal with unmodeled speech sources as well as overlapped speech signals, a threshold adaptation scheme is proposed in this work. Experimental results demonstrate high detection accuracy in a noisy vehicular environment.