Mapping speech spectra from throat microphone to close-speaking microphone: a neural network approach

Authors:
A. Shahina;B. Yegnanarayana
Affiliations:
Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India;International Institute of Information Technology, Gachibowli, Hyderabad, India
Venue:
EURASIP Journal on Advances in Signal Processing
Year:
2007

Citing 6
Cited 0

Neural Networks: A Comprehensive Foundation

Neural Networks: A Comprehensive Foundation
Techniques for the regeneration of wideband speech from narrowband speech

EURASIP Journal on Applied Signal Processing - Nonlinear signal and image processing - part I
Discrete Time Processing of Speech Signals

Discrete Time Processing of Speech Signals
Speaker-specific mapping for text-independent speaker recognition

Speech Communication
Artificial Neural Networks

Artificial Neural Networks
Frequency recovery of narrow-band speech using adaptive spline neural networks

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 02

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech recorded from a throat microphone is robust to the surrounding noise, but sounds unnatural unlike the speech recorded from a close-speaking microphone. This paper addresses the issue of improving the perceptual quality of the throat microphone speech by mapping the speech spectra from the throat microphone to the close-speaking microphone. A neural network model is used to capture the speaker-dependent functional relationship between the feature vectors (cepstral coefficients) of the two speech signals. A method is proposed to ensure the stability of the all-pole synthesis filter. Objective evaluations indicate the effectiveness of the proposed mapping scheme. The advantage of this method is that the model gives a smooth estimate of the spectra of the close-speaking microphone speech. No distortions are perceived in the reconstructed speech. This mapping technique is also used for bandwidth extension of telephone speech.