A system for searching and browsing spoken communications

  • Authors:
  • Lee Begeja;Bernard Renger;Murat Saraclar;David Gibbon;Zhu Liu;Behzad Shahraray

  • Affiliations:
  • AT&T Labs -- Research, Florham Park, NJ;AT&T Labs -- Research, Florham Park, NJ;AT&T Labs -- Research, Florham Park, NJ;AT&T Labs -- Research, Middletown, NJ;AT&T Labs -- Research, Middletown, NJ;AT&T Labs -- Research, Middletown, NJ

  • Venue:
  • SpeechIR '04 Proceedings of the Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval at HLT-NAACL 2004
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

As the amount of spoken communications accessible by computers increases, searching and browsing is becoming crucial for utilizing such material for gathering information. It is desirable for multimedia content analysis systems to handle various formats of data and to serve varying user needs while presenting a simple and consistent user interface. In this paper, we present a research system for searching and browsing spoken communications. The system uses core technologies such as speaker segmentation, automatic speech recognition, transcription alignment, keyword extraction and speech indexing and retrieval to make spoken communications easy to navigate. The main focus is on telephone conversations and teleconferences with comparisons to broadcast news.