Transcriber: Development and use of a tool for assisting speech corpora production
Speech Communication - Special issue on speech annotation and corpus tools
Speaker Identification Based Text to Audio Alignment for an Audio Retrieval System
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Knowledge-based derivation of document logical structure
ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
Multi-paragraph segmentation of expository text
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Thematic segmentation of meetings through document/speech alignment
Proceedings of the 12th annual ACM international conference on Multimedia
Thematic alignment of documents with meeting dialogs
Proceedings of the 12th annual ACM international conference on Multimedia
Using bi-modal alignment and clustering techniques for documents and speech thematic segmentations
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Exploring media correlation and synchronization for navigated hypermedia documents
Proceedings of the 13th annual ACM international conference on Multimedia
From Searching to Browsing through Multimodal Documents Linking
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
ARCHIVUS: a system for accessing the content of recorded multimodal meetings
MLMI'04 Proceedings of the First international conference on Machine Learning for Multimodal Interaction
Hi-index | 0.00 |
We present in this article a method for detecting similarity links between documents' content and speech recordings' content. This process, further called thematic alignment, is a novel research area that combines both document and speech analysis. This alignment will a) provide temporal indexes to documents, which are non-temporal data, and b) help discovering hidden thematic structures. This article first introduces a multi-layered document structure and quickly introduces the traditional speech structure. Further, it presents a simple similarity measure and various multi-level simple alignments between those two structures. Later, the meeting corpus is presented, as well as an evaluation of the implemented alignments. Finally, we present our future works on multi-alignments and thematic structure discovery.