The LIMSI RT06s lecture transcription system

  • Authors:
  • Lori Lamel; Eric Bilinski; Gilles Adda; Jean-Luc Gauvain; Holger Schwenk

  • Affiliations:
  • LIMSI-CNRS, France; LIMSI-CNRS, France; LIMSI-CNRS, France; LIMSI-CNRS, France; LIMSI-CNRS, France

  • Venue:
  • MLMI'06: Proceedings of the Third International Conference on Machine Learning for Multimodal Interaction
  • Year:
  • 2006

Abstract

This paper describes recent research carried out in the context of the FP6 Integrated Project CHIL in developing a system to automatically transcribe lectures and presentations. Widely available corpora were used to train both the acoustic and language models, since only a small amount of CHIL data was available for system development. Acoustic model training made use of the transcribed portion of the TED corpus of Eurospeech recordings, as well as the ICSI, ISL, and NIST meeting corpora. For language model training, text materials were extracted from a variety of on-line conference proceedings. Experimental results are reported for close-talking and far-field microphones on development and evaluation data.