The amount of digital video being shot, captured, and stored is growing faster than ever before. Such large video archives cannot be navigated without efficient indexing, retrieval, and browsing technology. Most prior work in the field falls roughly into two classes. The first class is based on image processing techniques, often called content-based image and video retrieval, in which video frames are indexed and searched by their visual content. The second class is based on spoken document retrieval, which relies on automatic speech recognition and text queries. Both approaches have major limitations: semantic queries pose a great challenge for the first, image-based approach, while the second, speech-based approach does not support efficient video browsing. This paper describes a system that uses speech for efficient searching and visual data for efficient browsing, a combination that takes advantage of both approaches. A fully automatic indexing and retrieval system has been developed and tested. Automatic speech recognition and phonetic speech indexing support text queries against the spoken content. New browsable views are generated from the original video, and a special synchronized browser allows instantaneous, context-preserving switching from one view to another. The system was successfully used to produce searchable, browsable video proceedings for three local conferences.