Babble noise: modeling, analysis, and applications

Authors:
Nitish Krishnamurthy;John H. L. Hansen
Affiliations:
Department of Electrical Engineering, Erik Johnson School of Engineering and Computer Science, Center for Robust Speech Systems, University of Texas at Dallas, Richardson, TX;Department of Electrical Engineering, Erik Johnson School of Engineering and Computer Science, Center for Robust Speech Systems, University of Texas at Dallas, Richardson, TX
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2009

Citing 6
Cited 3

Speech recognition in noisy environments: a survey

Speech Communication
ICARUS: source generator based real-time recognition of speech in noisy stressful and Lombard effect environments

Speech Communication
Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition

Speech Communication - Special issue on speech under stress
Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition

IEEE Transactions on Audio, Speech, and Language Processing
In-Set/Out-of-Set Speaker Recognition Under Sparse Enrollment

IEEE Transactions on Audio, Speech, and Language Processing
Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems

IEEE Transactions on Audio, Speech, and Language Processing

Robust speaker identification in babble noise

Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Robust emotional speech classification in the presence of babble noise

International Journal of Speech Technology
A coherence-based noise reduction algorithm for binaural hearing aids

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech babble is one of the most challenging noise interference for all speech systems. Here, a systematic approach to model its underlying structure is proposed to further the existing knowledge of speech processing in noisy environments. This paper establishes a working foundation for the analysis and modeling of babble speech. We first address the underlying model for multiple speaker babble speech--considering the number of conversations versus the number of speakers contributing to babble. Next, based on this model, we develop an algorithm to detect the range of the number of speakers within an unknown babble speech sequence. Evaluation is performed using 110 h of data from the Switchboard corpus. The number of simultaneous conversations ranges from one to nine, or one to 18 subjects speaking. A speaker conversation stream detection rate in excess of 80% is achieved with a speaker window size of ±1 speakers. Finally, the problem of in-set/out-of-set speaker recognition is considered in the context of interfering babble speech noise. Results are shown for test durations from 2-8 s, with babble speaker groups ranging from two to nine subjects. It is shown that by choosing the correct number of speakers in the background babble an overall average performance gain of 6.44% equal error rate can be obtained. This study represents effectively the first effort in developing an overall model for speech babble, and with this, contributions are made for speech system robustness in noise.