Multi-channel shape-flow kernel descriptors for robust video event detection and retrieval

  • Authors:
  • Pradeep Natarajan;Shuang Wu;Shiv Vitaladevuni;Xiaodan Zhuang;Unsang Park;Rohit Prasad;Premkumar Natarajan

  • Affiliations:
  • Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA;Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA;Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA;Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA;Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA;Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA;Speech, Language and Multimedia Business Unit, Raytheon BBN Technologies, Cambridge, MA

  • Venue:
  • ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Despite the success of spatio-temporal visual features, they are hand-designed and aggregate image or flow gradients using a pre-specified, uniform set of orientation bins. Kernel descriptors [1] generalize such orientation histograms by defining match kernels over image patches, and have shown superior performance for visual object and scene recognition. In our work, we make two contributions: first, we extend kernel descriptors to the spatio-temporal domain to model salient flow, gradient and texture patterns in video. Further, we apply our kernel descriptors to extract features from different color channels. Second, we present a fast algorithm for kernel descriptor computation of O(1) complexity for each pixel in each video patch, producing two orders of magnitude speedup over conventional kernel descriptors and other popular motion features. Our evaluation results on TRECVID MED 2011 dataset indicate that the proposed multi-channel shape-flow kernel descriptors outperform several other features including SIFT, SURF, STIP and Color SIFT.