The LIMSI RT07 Lecture Transcription System

  • Authors:
  • L. Lamel;E. Bilinski;J. L. Gauvain;G. Adda;C. Barras;X. Zhu

  • Affiliations:
  • LIMSI-CNRS, Orsay Cedex, France 91403;LIMSI-CNRS, Orsay Cedex, France 91403;LIMSI-CNRS, Orsay Cedex, France 91403;LIMSI-CNRS, Orsay Cedex, France 91403;LIMSI-CNRS, Orsay Cedex, France 91403 and Also with Univ Paris-Sud, Orsay, France F-91405;LIMSI-CNRS, Orsay Cedex, France 91403 and Also with Univ Paris-Sud, Orsay, France F-91405

  • Venue:
  • Multimodal Technologies for Perception of Humans
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

A system to automatically transcribe lectures and presentations has been developed in the context of the FP6 Integrated Project Chil. In addition to the seminar data recorded by the Chilpartners, widely available corpora were used to train both the acoustic and language models. Acoustic model training made use of the transcribed portion of the TED corpus of Eurospeech recordings, as well as the ICSI, ISL, and NIST meeting corpora. For language model training, text materials were extracted from a variety of on-line conference proceedings. Experimental results are reported for close-talking and far-field microphones on development and evaluation data.