Fisher kernel based relevance feedback for multimodal video retrieval

  • Authors:
  • Ionut Mironica;Bogdan Ionescu;Jasper Uijlings;Nicu Sebe

  • Affiliations:
  • LAPI, University Politehnica of Bucharest, Bucharest, Romania;LAPI, University Politehnica of Bucharest, Bucharest, Romania;DISI, University of Trento, Trento, Italy;DISI, University of Trento, Trento, Italy

  • Venue:
  • Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
  • Year:
  • 2013

Abstract

This paper proposes a novel approach to relevance feedback based on the Fisher Kernel representation in the context of multimodal video retrieval. The Fisher Kernel representation describes a set of features as the gradient of the log-likelihood of the generative probability distribution that models the feature distribution, taken with respect to the distribution's parameters. In the context of relevance feedback, instead of learning the generative probability distribution over all features of the data, we learn it only over the top retrieved results. Hence, during relevance feedback we create a new Fisher Kernel representation based on the most relevant examples. In addition, we propose to use the Fisher Kernel to capture temporal information by cutting a video into smaller segments, extracting a feature vector from each segment, and representing the resulting feature set with the Fisher Kernel representation. We evaluate our method on the MediaEval 2012 Video Genre Tagging Task, a large dataset containing 26 categories over 15,000 videos totalling up to 2,000 hours of footage. Results show that our method significantly improves over existing state-of-the-art relevance feedback techniques. Furthermore, we show significant improvements by using the Fisher Kernel to capture temporal information, and we demonstrate that Fisher kernels are well suited for this task.
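The core representation in the abstract can be sketched as follows. This is a minimal, hypothetical illustration (not the authors' implementation): it computes a Fisher vector for a set of descriptors under a single diagonal Gaussian, i.e. the average gradient of the log-likelihood with respect to the mean and standard deviation. The paper uses a full generative model fit on the top-retrieved results; in the temporal variant, each descriptor would be a per-segment feature vector from one video.

```python
def fisher_vector(descriptors, mu, sigma):
    """Fisher vector of a descriptor set under one diagonal Gaussian.

    Returns the concatenated average gradients of log N(x; mu, sigma^2)
    with respect to mu and sigma. (A single-component sketch; real
    Fisher vectors typically use a GMM and a normalization step.)
    """
    d = len(mu)
    n = len(descriptors)
    g_mu = [0.0] * d      # gradient w.r.t. the mean
    g_sigma = [0.0] * d   # gradient w.r.t. the standard deviation
    for x in descriptors:
        for j in range(d):
            diff = x[j] - mu[j]
            g_mu[j] += diff / sigma[j] ** 2
            g_sigma[j] += diff ** 2 / sigma[j] ** 3 - 1.0 / sigma[j]
    # Average over descriptors so the vector is invariant to set size
    # (number of segments in a video).
    return [v / n for v in g_mu] + [v / n for v in g_sigma]
```

If the Gaussian already fits the descriptor set (matching mean and variance), both gradient blocks vanish; descriptors that deviate from the model push the representation away from zero, which is what makes the encoding discriminative.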