This paper presents a method for representing two-person interactions at a semantic level with a natural-language description. A human interaction is composed of two single-person actions, which in turn are made up of torso and arm/leg motions. We adopt the 'verb argument structure' from linguistics to represent each human action as a triplet. Two-person interactions are represented in detail by multiple triplets aligned along a timeline according to the spatial and temporal constraints of the interaction. The method provides a user-friendly natural-language description of human interactions and properly describes positive, neutral, and negative interactions between two persons.
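As a rough illustration of the representation described above, the following Python sketch stores each action as a timed triplet and orders the triplets along a timeline to produce a simple description. The field names (`agent`, `motion`, `target`), the time units, the `overlaps` and `describe` helpers, and the "Then"/"Meanwhile" wording are all assumptions made for this example; the abstract does not specify the triplet fields or how the description is generated.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Triplet:
    """One action as a timed triplet; the field names are assumptions."""
    agent: str    # who or what moves, e.g. "person A's arm"
    motion: str   # verb phrase, e.g. "pushes"
    target: str   # what the motion is directed at
    start: float  # start time in seconds (hypothetical unit)
    end: float    # end time in seconds


def overlaps(a: Triplet, b: Triplet) -> bool:
    """True if the two time intervals share more than a single instant."""
    return a.start < b.end and b.start < a.end


def describe(triplets: List[Triplet]) -> str:
    """Order triplets along the timeline and join them into sentences."""
    ordered = sorted(triplets, key=lambda t: (t.start, t.end))
    sentences = []
    for i, t in enumerate(ordered):
        clause = f"{t.agent} {t.motion} {t.target}"
        if i == 0:
            prefix = ""
        elif overlaps(ordered[i - 1], t):
            prefix = "Meanwhile, "  # concurrent with the previous action
        else:
            prefix = "Then, "       # strictly after the previous action
        sentence = prefix + clause + "."
        sentences.append(sentence[0].upper() + sentence[1:])
    return " ".join(sentences)


# Hypothetical "pushing" interaction between two persons.
interaction = [
    Triplet("person B", "moves away from", "person A", 2.5, 4.0),
    Triplet("person A", "approaches", "person B", 0.0, 2.0),
    Triplet("person A's arm", "pushes", "person B's torso", 2.0, 3.0),
]

print(describe(interaction))
```

The sketch only compares each triplet with its immediate predecessor on the timeline; a fuller treatment of the spatial and temporal constraints mentioned in the abstract would need a richer model.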