Processing broadcast audio for information access

Authors:
Jean-Luc Gauvain;Lori Lamel;Gilles Adda;Martine Adda-Decker;Claude Barras;Langzhou Chen;Yannick de Kercadio
Affiliations:
Spoken Language Processing Group, LIMSI-CNRS, Orsay cedex, France (all authors)
Venue:
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Year:
2001

Abstract

This paper addresses recent progress in speaker-independent, large-vocabulary, continuous speech recognition, which has opened up a wide range of near- and mid-term applications. One rapidly expanding application area is the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have been developed for English, French, German, Mandarin and Portuguese, and systems for other languages are under development. Audio indexation must take into account the specificities of audio data, such as the need to deal with a continuous data stream and an imperfect word transcription. Some near-term application areas are audio data mining, selective dissemination of information and media monitoring.