VoxaleadNews: robust automatic segmentation of video into browsable content
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Hi-index | 0.00 |
Video is poised to largely replace both text and images as the media for transmitting information in the coming years. The challenge of the Information Processing community is how to index the information found in this voluminous and dynamic media stream. Most of the linguistic information is encoded in the audio channel of video data, which, once transcribed, can be accessed using text-based tools. This talk will describe our current research in providing an index into the content of video and audio streams, using LIMSI's state-of-the-art automatic speech transcription system for French, English, Mandarin and Arabic languages. I will also describe and demonstrate the Voxalead News system, and other results of the French-German Quaero project, that integrate results from industry and research, for the next generation of video-based information searching