Mid-level features and spatio-temporal context for activity recognition
Pattern Recognition
Action recognition using canonical correlation kernels
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Human action recognition with salient trajectories
Signal Processing
Editor's Choice Article: Human activity recognition in videos using a single example
Image and Vision Computing
Hi-index | 0.00 |
The popular bag of words approach for action recognition is based on the classifying quantized local features density. This approach focuses excessively on the local features but discards all information about the interactions among them. Local features themselves may not be discriminative enough, but combined with their contexts, they can be very useful for the recognition of some actions. In this paper, we present a novel representation that captures contextual interactions between interest points, based on the density of all features observed in each interest point's mutliscale spatio-temporal contextual domain. We demonstrate that augmenting local features with our contextual feature significantly improves the recognition performance.