A maximum entropy approach to natural language processing
Computational Linguistics
New enhancements to cut, fade, and dissolve detection processes in video segmentation
MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Combined-media video tracking for summarization
MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
A user attention model for video summarization
Proceedings of the tenth ACM international conference on Multimedia
X-means: Extending K-means with Efficient Estimation of the Number of Clusters
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Constructing table-of-content for videos
Multimedia Systems - Special section on video libraries
RIDE '98 Proceedings of the Workshop on Research Issues in Database Engineering
Exploring Video Structure Beyond The Shots
ICMCS '98 Proceedings of the IEEE International Conference on Multimedia Computing and Systems
Retrieval effectiveness of an ontology-based model for information selection
The VLDB Journal — The International Journal on Very Large Data Bases
MMSS: Multi-Modal Story-Oriented Video Summarization
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Annotation-based multimedia summarization and translation
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Designing an intelligent user interface for instructional video indexing and browsing
Proceedings of the 11th international conference on Intelligent user interfaces
Keyword-based document clustering
AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
Discovering important nodes through graph entropy the case of Enron email database
Proceedings of the 3rd international workshop on Link discovery
Automatic Video Annotation by Mining Speech Transcripts
CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
2006 Special Issue: Modeling attention to salient proto-objects
Neural Networks
A study on automatically extracted keywords in text categorization
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
SenseRelate::TargetWord: a generalized framework for word sense disambiguation
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Adaptive estimated maximum-entropy distribution model
Information Sciences: an International Journal
Modeling personal and social network context for event annotation in images
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
The effect of text in storyboards for video navigation
ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
Visual Mining of Multi-Modal Social Networks at Different Abstraction Levels
IV '07 Proceedings of the 11th International Conference Information Visualization
VAST MM: multimedia browser for presentation video
Proceedings of the 6th ACM international conference on Image and video retrieval
Generating comprehensible summaries of rushes sequences based on robust feature matching
Proceedings of the international workshop on TRECVID video summarization
Video rushes summarization by adaptive acceleration and stacking of shots
Proceedings of the international workshop on TRECVID video summarization
An integrated statistical model for multimedia evidence combination
Proceedings of the 15th international conference on Multimedia
Supporting video library exploratory search: when storyboards are not enough
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Introduction to Information Retrieval
Introduction to Information Retrieval
Graph connectivity measures for unsupervised word sense disambiguation
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Spectral structuring of home videos
CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval
Unpacking meaning from words: a context-centered approach to computational lexicon design
CONTEXT'03 Proceedings of the 4th international and interdisciplinary conference on Modeling and using context
A generic framework of user attention model and its application in video summarization
IEEE Transactions on Multimedia
Automated video program summarization using speech transcripts
IEEE Transactions on Multimedia
Video summarization and scene detection by graph modeling
IEEE Transactions on Circuits and Systems for Video Technology
MINMAX optimal video summarization
IEEE Transactions on Circuits and Systems for Video Technology
Information theory-based shot cut/fade detection and video summarization
IEEE Transactions on Circuits and Systems for Video Technology
Clip-based similarity measure for query-dependent clip retrieval and video summarization
IEEE Transactions on Circuits and Systems for Video Technology
A Multiple Visual Models Based Perceptive Analysis Framework for Multilevel Video Summarization
IEEE Transactions on Circuits and Systems for Video Technology
VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method
Pattern Recognition Letters
A novel video thumbnail extraction method using spatiotemporal vector quantization
Proceedings of the 3rd international workshop on Automated information extraction in media production
Proceedings of the 2010 ACM workshop on Social, adaptive and personalized multimedia interaction and access
Video summarization with visual and semantic features
PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
EURASIP Journal on Advances in Signal Processing
Video summarization via transferrable structured learning
Proceedings of the 20th international conference on World wide web
Beyond search: Event-driven summarization for web videos
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
A smart video player with content-based fast-forward playback
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Dynamic social network for narrative video analysis
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Finding the game flow from sports video
J-MRE '11 Proceedings of the 2011 joint ACM workshop on Modeling and representing events
Video summarization with semantic concept preservation
Proceedings of the 10th International Conference on Mobile and Ubiquitous Multimedia
Using eye-tracking data for automatic film comic creation
Proceedings of the Symposium on Eye Tracking Research and Applications
Video search and indexing with reinforcement agent for interactive multimedia services
ACM Transactions on Embedded Computing Systems (TECS) - Special issue on embedded systems for interactive multimedia services (ES-IMS)
Content-Based Keyframe Clustering Using Near Duplicate Keyframe Identification
International Journal of Multimedia Data Engineering & Management
Medical Video Summarization using Central Tendency-Based Shot Boundary Detection
International Journal of Computer Vision and Image Processing
Hi-index | 0.00 |
Video summarization techniques have been proposed for years to offer people comprehensive understanding of the whole story in the video. Roughly speaking, existing approaches can be classified into the two types: one is static storyboard, and the other is dynamic skimming. However, despite that these traditional methods give brief summaries for users, they still do not provide with a concept-organized and systematic view. In this paper, we present a structural video content browsing system and a novel summarization method by utilizing the four kinds of entities: who, what, where, and when to establish the framework of the video contents. With the assistance of the above-mentioned indexed information, the structure of the story can be built up according to the characters, the things, the places, and the time. Therefore, users can not only browse the video efficiently but also focus on what they are interested in via the browsing interface. In order to construct the fundamental system, we employ maximum entropy criterion to integrate visual and text features extracted from video frames and speech transcripts, generating high-level concept entities. A novel concept expansion method is introduced to explore the associations among these entities. After constructing the relational graph, we exploit graph entropy model to detect meaningful shots and relations, which serve as the indices for users. The results demonstrate that our system can achieve better performance and information coverage.