Structuring broadcast audio for information access

Authors:
Jean-Luc Gauvain;Lori Lamel
Affiliations:
Spoken Language Processing Group, Orsay Cedex, France;Spoken Language Processing Group, Orsay Cedex, France
Venue:
EURASIP Journal on Applied Signal Processing
Year:
2003

Citing 13
Cited 0

Informedia: news-on-demand multimedia information acquisition and retrieval

Intelligent multimedia information retrieval
News on demand: introduction

Communications of the ACM
The role of the national institute of standards and technology in DARPA's broadcast news continuous speech recognition research program

Speech Communication - Special issue on automatic transcription of broadcast news data
Connectionist speech recognition of Broadcast News

Speech Communication - Special issue on automatic transcription of broadcast news data
The development of the HTK Broadcast News transcription system: an overview

Speech Communication - Special issue on automatic transcription of broadcast news data
Automatic transcription of Broadcast News

Speech Communication - Special issue on automatic transcription of broadcast news data
The LIMSI Broadcast News transcription system

Speech Communication - Special issue on automatic transcription of broadcast news data
Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach

Speech Communication - Special issue on automatic transcription of broadcast news data
Improved modeling and efficiency for automatic transcription of Broadcast News

Speech Communication - Special issue on automatic transcription of broadcast news data
Language-independent and language-adaptive acoustic modeling for speech recognition

Speech Communication
Guest Editorial: Content-Based Multimedia Indexing and Retrieval

Multimedia Tools and Applications
Progress in transcription of broadcast News using Byblos

Speech Communication
Progress in Broadcast News transcription at Dragon Systems

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01

Quantified Score

Hi-index	0.00

Visualization

Abstract

One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the linguistic information is found in the audio channel, speech recognition is a key enabling technology which, when combined with information retrieval techniques, can be used for searching large audiovisual document collections. Audio indexing must take into account the specificities of audio data such as needing to deal with the continuous data stream and an imperfect word transcription. Other important considerations are dealing with language specificities and facilitating language portability. At Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), broadcast news transcription systems have been developed for seven languages: English, French, German, Mandarin, Portuguese, Spanish, and Arabic. The transcription systems have been integrated into prototype demonstrators for several application areas such as audio data mining, structuring audiovisual archives, selective dissemination of information, and topic tracking for media monitoring. As examples, this paper addresses the spoken document retrieval and topic tracking tasks.