Monitoring human behavior in an assistive environment using multiple views
Proceedings of the 1st international conference on PErvasive Technologies Related to Assistive Environments
Correlative multilabel video annotation with temporal kernels
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Human Behavior Classification Using Multiple Views
SETN '08 Proceedings of the 5th Hellenic conference on Artificial Intelligence: Theories, Models and Applications
Automated sip detection in naturally-evoked video
ICMI '08 Proceedings of the 10th international conference on Multimodal interfaces
Detecting abnormal human behaviour using multiple cameras
Signal Processing
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Recognition of Semantic Basketball Events Based on Optical Flow Patterns
ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part II
Learning to recognize complex actions using conditional random fields
ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Soccer video event detection by fusing middle level visual semantics of an event clip
PCM'10 Proceedings of the Advances in multimedia information processing, and 11th Pacific Rim conference on Multimedia: Part II
Highlight events detection in soccer video using HCRF
ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
International Journal of Computer Vision
Finding the game flow from sports video
J-MRE '11 Proceedings of the 2011 joint ACM workshop on Modeling and representing events
A Generic Approach for Systematic Analysis of Sports Videos
ACM Transactions on Intelligent Systems and Technology (TIST)
HMM based soccer video event detection using enhanced mid-level semantic
Multimedia Tools and Applications
Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Hi-index | 0.00 |
Semantic event detection is an active research field of video mining in recent years. One of the challenging problems is how to effectively model temporal and multi-modality characteristics of video. In this paper, we employ Conditional Random Fields (CRFs) to fuse temporal multi-modality cues for event detection. CRFs are undirected probabilistic models designed for segmenting and labeling sequence data. Compared with traditional SVM and Hidden Markov Models (HMMs), CRFs based event detection offers several particular advantages including the abilities to relax strong independence assumptions in the state transition and avoid a fundamental limitation of directed graphical models. To detect event, we use a three-level framework based on multi-modality fusion and mid-level keywords. The first level extracts audiovisual features, the mid-level detects semantic keywords, and the high-level infers semantic events from multiple keyword sequences. The experimental results from soccer highlights detection demonstrate that CRFs achieves better performance particularly in slice level measure.