Objective Intelligibility Measures Based on Mutual Information for Speech Subjected to Speech Enhancement Processing

Authors:
Jalal Taghia;Rainer Martin
Affiliations:
Inst. ofCommunication Acoust., Ruhr-Univ. Bochum, Bochum, Germany;Inst. ofCommunication Acoust., Ruhr-Univ. Bochum, Bochum, Germany
Venue:
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Year:
2014

Citing 8
Cited 0

Mutual Information Theory for Adaptive Mixture Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing)

Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing)
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Estimation of Mutual Information: A Survey

RSKT '09 Proceedings of the 4th International Conference on Rough Sets and Knowledge Technology
Prediction of speech intelligibility based on an auditory preprocessing model

Speech Communication
Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio

Speech Communication
Minimum Mean-Square Error Estimation of Discrete Fourier Coefficients With Generalized Gamma Priors

IEEE Transactions on Audio, Speech, and Language Processing
An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a novel method for objective speech intelligibility prediction which can be useful in many application domains such as hearing instruments and forensics. Most objective intelligibility measures available in the literature employ some kind of signal-to-noise ratio (SNR) or a correlation-based comparison between the spectro-temporal representations of clean and processed speech. In this paper, we investigate the speech intelligibility prediction from the viewpoint of information theory and introduce novel objective intelligibility measures based on the estimated mutual information between the temporal envelopes of clean speech and processed speech in the subband domain. Mutual information allows to account for higher order statistics and hence to consider dependencies beyond the conventional second order statistics. Using data from three different listening tests it is shown that the proposed objective intelligibility measures provide promising results for speech intelligibility prediction in different scenarios of speech enhancement where speech is processed by non-linear modification strategies.