Transition region determination based thresholding
Pattern Recognition Letters
Content-Based Video Indexing and Retrieval
IEEE MultiMedia
Medium knowledge-based macro-segmentation of video into sequences
Intelligent multimedia information retrieval
The Application of Video Semantics and Theme Representation in Automated Video Editing
Multimedia Tools and Applications
Framework for Synthesizing Semantic-Level Indices
Multimedia Tools and Applications
Audio-Visual Speaker Detection Using Dynamic Bayesian Networks
FG '00 Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition 2000
Towards semantically meaningful feature spaces for the characterization of video content
ICIP '97 Proceedings of the 1997 International Conference on Image Processing (ICIP '97) 3-Volume Set-Volume 1 - Volume 1
Motion and Color-Based Video Indexing and Retrieval
ICPR '96 Proceedings of the International Conference on Pattern Recognition (ICPR '96) Volume III-Volume 7276 - Volume 7276
Speech recognition with dynamic bayesian networks
Speech recognition with dynamic bayesian networks
Dynamic bayesian networks for information fusion with applications to human-computer interfaces
Dynamic bayesian networks for information fusion with applications to human-computer interfaces
JACOB: just a content-based query system for video databases
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
IEEE Transactions on Information Theory
Rapid scene analysis on compressed video
IEEE Transactions on Circuits and Systems for Video Technology
On the optimization of Hierarchical Temporal Memory
Pattern Recognition Letters
Hi-index | 0.10 |
Specific domains in video data contain rich temporal structures that help in classification process. In this paper, we exploit the temporal structure to characterize video sequence data into different classes. We propose the following perceptual features: Time-to-Collision, shot length and transition, and temporal motion activity. Using these perceptual features, several video classes are characterized leading to formation of high-level sequence classification. Resulting high-level queries are more easily mapped onto the perceptual features enabling better accessibility of content-based retrieval systems. Temporal fusion of the perceptual features forms higher-level structures, which can be effectively tackled using the Dynamic Bayesian Networks. The Networks allow the power of statistical inference and learning to be combined with the temporal and contextual knowledge of the problem. The modeling and experimental results are presented for a number of key applications, like sequence identification, extracting highlights for sports, and parsing a news program.