Perceptual speech enhancement exploiting temporal masking properties of human auditory system

Authors:
Teddy Surya Gunawan;Eliathamby Ambikairajah;Julien Epps
Affiliations:
Department of Electrical and Computer Engineering, International Islamic University Malaysia, Gombak, 53100 Kuala Lumpur, Malaysia;School of Electrical Engineering and Telecommunications, The University of New South Wales, NSW 2052, Australia;School of Electrical Engineering and Telecommunications, The University of New South Wales, NSW 2052, Australia
Venue:
Speech Communication
Year:
2010

Citing 8
Cited 2

Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars

Speech Communication - Eurospeech '91
Enhancement of noisy speech signals: application to mobile radio communications

Speech Communication
Psychoacoustics: Facts and Models

Psychoacoustics: Facts and Models
Speech enhancement based on a priori signal to noise estimation

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Perceptual speech coding and enhancement using frame-synchronizedfast wavelet packet transform algorithms

IEEE Transactions on Signal Processing
Evaluation of Objective Quality Measures for Speech Enhancement

IEEE Transactions on Audio, Speech, and Language Processing
Enhancing speech degrated by additive noise or interfering speakers

IEEE Communications Magazine
Transform coding of audio signals using perceptual noise criteria

IEEE Journal on Selected Areas in Communications

The restoration of low-quality audio recordings based on non-negative matrix factorization and perceptual assessment by means of the ebu mushra test method

Proceedings of the second workshop on eHeritage and digital art preservation
An efficient solution to improve the spectral noise suppression rules

Digital Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The use of simultaneous masking in speech enhancement has shown promise for a range of noise types. In this paper, a new speech enhancement algorithm based on a short-term temporal masking threshold to noise ratio (MNR) is presented. A novel functional model for forward masking based on three parameters is incorporated into a speech enhancement framework based on speech boosting. The performance of the speech enhancement algorithm using the proposed forward masking model was compared with seven other speech enhancement methods over 12 different noise types and four SNRs. Objective evaluation using PESQ revealed that using the proposed forward masking model, the speech enhancement algorithm outperforms the other algorithms by 6-20% depending on the SNR. Moreover, subjective evaluation using 16 listeners confirmed the objective test results.