Name-It: Naming and Detecting Faces in News Videos
IEEE MultiMedia
Speaker Identification Based Text to Audio Alignment for an Audio Retrieval System
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Telop and Flip Frame Detection and Character Extraction from TV News Articles
ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Hi-index | 0.00 |
Because of the media digitization, a large amount of information such as speech, audio and video data is produced everyday. In order to retrieve data quickly and precisely from these databases, multimedia technologies for organizing and retrieving of speech, audio and video data are strongly required. In this paper, we overview the multimedia technologies such as organization and retrieval of speech, audio and video data, speaker indexing, audio summarization and cross media retrieval existing today. The main purpose of the organization is to produce tables of contents and indices from audio and video data automatically. In order to make these technologies feasible, first, processing units such as words on audio data and shots on video data are extracted. On a second step, they are meaningfully integrated into topics. Furthermore, the units extracted from different types of media are integrated for higher functions.