An overview of audio information retrieval

  • Authors:
  • Jonathan Foote

  • Affiliations:
  • National Univ. of Singapore, Singapore

  • Venue:
  • Multimedia Systems - Special issue on audio and multimedia
  • Year:
  • 1999

Quantified Score

Hi-index 0.01

Visualization

Abstract

The problem of audio information retrieval is familiar to anyone who has returned from vacation to find ananswering machine full of messages. While there is not yetan "AltaVista" for the audio data type, many workers arefinding ways to automatically locate, index, and browse audio using recent advances in speech recognition and machinelistening. This paper reviews the state of the art in audioinformation retrieval, and presents recent advances in automatic speech recognition, word spotting, speaker and music identification, and audio similarity with a view towardsmaking audio less "opaque". A special section addresses intelligent interfaces for navigating and browsing audio andmultimedia documents, using automatically derived information to go beyond the tape recorder metaphor.