Voice activity detection based on multiple statistical models
IEEE Transactions on Signal Processing - Part I
Evaluation of Objective Quality Measures for Speech Enhancement
IEEE Transactions on Audio, Speech, and Language Processing
Convex Combination of Multiple Statistical Models With Application to VAD
IEEE Transactions on Audio, Speech, and Language Processing
Wavelet based speech presence probability estimator for speech enhancement
Digital Signal Processing
Hi-index | 0.00 |
In this paper, we propose a novel speech enhancement technique based on an improved minimum statistics (MS) approach incorporating acoustic environmental noise awareness. A relevant noise estimation approach, known as MS, tracks the minimal values if a smoothed power estimate of the noisy signal is within a finite search window. From an investigation of previous MS-based methods, it is discovered that a fixed size of the minimum search window is assumed regardless of the environmental conditions. To overcome this limitation, we initially determine the optimal window sizes in terms of the perceived speech quality according to a variety of noise types. We then assign a different search window size according to the determined noise type, for which we use a real-time noise classification algorithm based on the Gaussian mixture model (GMM). The performance of the proposed approach is evaluated by a quantitative comparison method and by objective tests under various noise environments. It was found to yield better results compared to the previous MS method.