Speech Processing for Audio Indexing

  • Authors:
  • Lori Lamel;Jean-Luc Gauvain

  • Affiliations:
  • LIMSI-CNRS, Orsay Cedex, France 91403;LIMSI-CNRS, Orsay Cedex, France 91403

  • Venue:
  • GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses some of the recent trends in speech processing, with a focus on speech-to-text transcription as a means to facilitate access to multimedia information in a multilingual context. A brief overview of automatic speech recognition is given along with indicative performance measures for a range of tasks. Enriched transcriptions, that is enhancing the automatic word transcripts with meta-data derived from the audio data is discussed, followed by some hightlights of recent progress and remaining challenges in speech recognition.