Speech recognition in the Informedia Digital Video Library: uses and limitations

  • Authors: A. G. Hauptmann
  • Venue: TAI '95 Proceedings of the Seventh International Conference on Tools with Artificial Intelligence
  • Year: 1995

Abstract

In principle, speech recognition technology can make any spoken data useful for library indexing and retrieval. This paper describes the Informedia Digital Video Library project and discusses how speech recognition is used for transcript creation from video, alignment with hand-generated transcripts, the query interface, and audio paragraph segmentation. The results show that speech recognition accuracy varies dramatically depending on the quality and type of data used. Our information retrieval experiments also show that reasonable recall and precision can be obtained with moderate speech recognition accuracy. Finally, we discuss some active areas of speech research relevant to the digital video library problem.
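
The abstract relates speech recognition accuracy to retrieval recall and precision. As a rough illustration of how these quantities are commonly computed, here is a minimal sketch using the standard textbook definitions. It is not taken from the paper: the function names, the whitespace tokenization, and the sample data are all assumptions made for this example.

```python
from typing import Set, Tuple


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def precision_recall(retrieved: Set[str], relevant: Set[str]) -> Tuple[float, float]:
    """Return (precision, recall) for a set of retrieved document ids."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical data: one substituted word in a six-word reference.
wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")

# Hypothetical retrieval run: 3 documents returned, 4 are truly relevant.
precision, recall = precision_recall({"a", "b", "c"}, {"b", "c", "d", "e"})
```

Word accuracy is often reported as one minus the word error rate, although conventions differ on whether insertions are counted.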