Fundamentals of speech recognition
Fundamentals of speech recognition
VisualSEEk: a fully automated content-based image query system
MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
NeTra: a toolbox for navigating large image databases
Multimedia Systems - Special issue on video content based retrieval
Multimedia Systems - Special section on video libraries
A Factor Graph Framework for Semantic Indexing and Retrieval in Video
CBAIVL '00 Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'00)
Efficient matching and clustering of video shots
ICIP '95 Proceedings of the 1995 International Conference on Image Processing (Vol. 1)-Volume 1 - Volume 1
Spatio-temporal video search using the object based video representation
ICIP '97 Proceedings of the 1997 International Conference on Image Processing (ICIP '97) 3-Volume Set-Volume 1 - Volume 1
Content-based video retrieval and compression: a unified solution
ICIP '97 Proceedings of the 1997 International Conference on Image Processing (ICIP '97) 3-Volume Set-Volume 1 - Volume 1
Semantic Video Indexing Using a Probabilistic Framework
ICPR '00 Proceedings of the International Conference on Pattern Recognition - Volume 3
"What is in that video anyway?": In Search of Better Browsing
ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2
A probabilistic framework for semantic video indexing, filtering,and retrieval
IEEE Transactions on Multimedia
Relevance feedback: a power tool for interactive content-based image retrieval
IEEE Transactions on Circuits and Systems for Video Technology
Detection of video sequences using compact signatures
ACM Transactions on Information Systems (TOIS)
A framework for aligning and indexing movies with their script
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Classification of video events using 4-dimensional time-compressed motion features
Proceedings of the 6th ACM international conference on Image and video retrieval
Multimodal video copy detection applied to social media
WSM '09 Proceedings of the first SIGMM workshop on Social media
On supervision and statistical learning for semantic multimedia analysis
Journal of Visual Communication and Image Representation
Hi-index | 0.00 |
A necessary capability for content-based retrieval is to support the paradigm of query by example. Most systems for video retrieval support queries using image sequences only. We present an algorithm for matching multimodal (audio-visual) patterns for the purpose of content-based video retrieval. The novel ability of our approach to use the information content in multiple media coupled with a strong emphasis on temporal similarity differentiates it from the state-of-the-art in content-based retrieval. At the core of the pattern matching scheme is a dynamic programming algorithm, which leads to a significant improvement in performance. Coupling the use of audio with video this algorithm can be applied to grouping of shots based on audio-visual similarity. We also support relevance feedback. The user can provide feedback to the system, by choosing clips, which are closer to the user's desired target. The system then automatically adjusts the relative weights or relevance of the media and fetches different sets of target clips accordingly. It is our observation that a few iterations of such feedback are generally sufficient, for retrieving the desired video clips.