Automated speech and audio analysis for semantic access to multimedia

  • Authors:
  • Franciska de Jong;Roeland Ordelman;Marijn Huijbregts

  • Affiliations:
  • Dept. of Computer Science, University of Twente, Enschede, AE, The Netherlands;Dept. of Computer Science, University of Twente, Enschede, AE, The Netherlands;Dept. of Computer Science, University of Twente, Enschede, AE, The Netherlands

  • Venue:
  • SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques will be presented, including the alignment of speech and text resources, large vocabulary speech recognition, key word spotting and speaker classification. The applicability of techniques will be discussed from a media crossing perspective. The added value of the techniques and their potential contribution to the content value chain will be illustrated by the description of two (complementary) demonstrators for browsing broadcast news archives.