Actions speak louder than words: searching human action video based on body movement

Authors:
Yan-Ching Lin;Min-Chun Hu;Wen-Huang Cheng;Yung-Huan Hsieh;Hong-Ming Chen
Affiliations:
Academia Sinica, Taipei, Taiwan Roc;Academia Sinica, Taipei, Taiwan Roc;Academia Sinica, Taipei, Taiwan Roc;Academia Sinica, Taipei, Taiwan Roc;Academia Sinica, Taipei, Taiwan Roc
Venue:
Proceedings of the 20th ACM international conference on Multimedia
Year:
2012

Citing 3
Cited 0

Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns

IEEE Transactions on Pattern Analysis and Machine Intelligence
Actions as Space-Time Shapes

IEEE Transactions on Pattern Analysis and Machine Intelligence
A string matching approach for visual retrieval and classification

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

Human action video search is a frequent demand in multimedia applications, and conventional video search schemes based on keywords usually fail to correctly find relevant videos due to noisy video tags. Observing the widespread use of Kinect-like depth cameras, we propose to search human action videos by directly performing the target action with body movements. Human actions are captured by Kinect and the recorded depth information is utilized to measure the similarity between the query action and each human action video in the database. We use representative depth descriptors without learning optimization to achieve real-time and promising performance as compatible as those of the leading methods based on color images and videos. Meanwhile, a large Depth-included Human Action video dataset, namely DHA, is collected to prove the effectiveness of the proposed video search system.