Mapping speech spectra from throat microphone to close-speaking microphone: a neural network approach

  • Authors: A. Shahina; B. Yegnanarayana

  • Affiliations: Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India; International Institute of Information Technology, Gachibowli, Hyderabad, India

  • Venue: EURASIP Journal on Advances in Signal Processing
  • Year: 2007

Abstract

Speech recorded from a throat microphone is robust to surrounding noise but sounds unnatural, unlike speech recorded from a close-speaking microphone. This paper addresses the problem of improving the perceptual quality of throat microphone speech by mapping the speech spectra from the throat microphone to those of the close-speaking microphone. A neural network model is used to capture the speaker-dependent functional relationship between the feature vectors (cepstral coefficients) of the two speech signals. A method is proposed to ensure the stability of the all-pole synthesis filter. Objective evaluations indicate the effectiveness of the proposed mapping scheme. An advantage of this method is that the model gives a smooth estimate of the spectra of the close-speaking microphone speech. No distortions are perceived in the reconstructed speech. This mapping technique is also used for bandwidth extension of telephone speech.
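
The abstract does not say how the stability of the all-pole synthesis filter is ensured. As background only, the sketch below shows what the stability condition means and one generic textbook remedy (pole reflection), which is not necessarily the authors' method. It assumes the filter is 1/A(z) with A(z) = 1 + a1 z^-1 + ... + ap z^-p; the function names `is_stable` and `stabilize` are hypothetical.

```python
import numpy as np

def is_stable(a):
    """Return True if the all-pole filter 1/A(z) is stable.

    `a` holds the denominator coefficients [1, a1, ..., ap].
    The filter is stable when every pole lies strictly inside the unit circle.
    """
    poles = np.roots(a)  # roots of z^p + a1 z^(p-1) + ... + ap
    return bool(np.all(np.abs(poles) < 1.0))

def stabilize(a):
    """Reflect poles outside the unit circle to their conjugate reciprocals.

    Generic technique, assumed here for illustration only. Poles lying
    exactly on the unit circle are left unchanged.
    """
    poles = np.roots(a)
    outside = np.abs(poles) > 1.0
    poles[outside] = 1.0 / np.conj(poles[outside])
    # Rebuild the polynomial; conjugate pairs give real coefficients.
    return np.real(np.poly(poles))

# Example: poles at z = 2 and z = 0.5, so the filter is unstable.
a_unstable = [1.0, -2.5, 1.0]
a_fixed = stabilize(a_unstable)
print(is_stable(a_unstable), is_stable(a_fixed))
```

Reflecting a pole to its conjugate reciprocal leaves the shape of the magnitude spectrum unchanged up to a gain factor, which is why it is a common choice when only the spectral envelope matters.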