Annotation of heterogeneous multimedia content using automatic speech recognition

  • Authors:
  • Marijn Huijbregts;Roeland Ordelman;Franciska de Jong

  • Affiliations:
  • University of Twente, Dept. of Electrical Engineering, Mathematics and Computer Science, Enschede, The Netherlands;University of Twente, Dept. of Electrical Engineering, Mathematics and Computer Science, Enschede, The Netherlands;University of Twente, Dept. of Electrical Engineering, Mathematics and Computer Science, Enschede, The Netherlands

  • Venue:
  • SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.