The nature of statistical learning theory
The nature of statistical learning theory
Video Manga: generating semantically meaningful video summaries
MULTIMEDIA '99 Proceedings of the seventh ACM international conference on Multimedia (Part 1)
Multimodal Video Indexing: A Review of the State-of-the-art
Multimedia Tools and Applications
Information Processing and Management: an International Journal
Content aware video presentation on high-resolution displays
AVI '08 Proceedings of the working conference on Advanced visual interfaces
VSUMM: An Approach for Automatic Video Summarization and Quantitative Evaluation
SIBGRAPI '08 Proceedings of the 2008 XXI Brazilian Symposium on Computer Graphics and Image Processing
Effective content-based video retrieval using pattern-indexing and matching techniques
Expert Systems with Applications: An International Journal
Scalable object-based video retrieval in HD video databases
Image Communication
Evaluating Color Descriptors for Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
The ACM Multimedia Grand Challenge 2011 in a nutshell
ACM SIGMultimedia Records
Hi-index | 0.00 |
This paper reports a system developed for video browsing based on multimodal analysis. Our multimodal approach performs audio transcription for shot categorization (sports, weather, politics and economy) combining audio and visual information for theme categorization. Its main features include static and dynamic summaries, segmentation using face detection, classification into Indoor/Outdoor scenes based on Support Vector Machine (SVM) and audio transcription for theme keyword search. Keywords are selected to represent the subjects, followed by a simple text search. We conduct a set of experiments for evaluating the effectiveness of the shot subject categorization using audio transcription information.