This paper proposes a novel approach to relevance feedback based on the Fisher Kernel representation in the context of multimodal video retrieval. The Fisher Kernel representation describes a set of features as the derivative of the log-likelihood of those features with respect to the parameters of the generative probability distribution that models the feature distribution. In the context of relevance feedback, instead of learning the generative probability distribution over all features of the data, we learn it only over the top retrieved results. Hence, during relevance feedback, we create a new Fisher Kernel representation based on the most relevant examples. In addition, we propose to use the Fisher Kernel to capture temporal information by cutting a video into smaller segments, extracting a feature vector from each segment, and representing the resulting feature set using the Fisher Kernel representation. We evaluate our method on the MediaEval 2012 Video Genre Tagging Task, a large dataset of 15,000 videos in 26 categories, totalling up to 2,000 hours of footage. Results show that our method significantly improves over existing state-of-the-art relevance feedback techniques. Furthermore, we show significant improvements by using the Fisher Kernel to capture temporal information, and we demonstrate that Fisher Kernels are well suited for this task.
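To make the representation concrete, the following is a minimal sketch of a Fisher Kernel encoding, not the authors' implementation. It assumes the generative model is a diagonal-covariance Gaussian mixture (the abstract does not say which model is used), takes the gradient with respect to the component means only, and applies a commonly used 1/(T sqrt(w_k)) scaling. The function name `fisher_vector_means` and its parameters are hypothetical. In the relevance feedback setting described above, the mixture parameters would be estimated from the top retrieved results rather than from the whole collection, and the rows of `X` would be the per-segment feature vectors of one video.

```python
import numpy as np


def fisher_vector_means(X, weights, means, variances):
    """Encode a feature set as the gradient of its average log-likelihood
    with respect to the means of a diagonal-covariance GMM.

    X         : (T, D) feature vectors, e.g. one per video segment
    weights   : (K,)   mixture weights (assumed to sum to 1)
    means     : (K, D) component means
    variances : (K, D) per-dimension component variances
    Returns a vector of length K * D.
    """
    X = np.atleast_2d(X)
    T = X.shape[0]

    # Difference of every feature vector to every component mean: (T, K, D).
    diff = X[:, None, :] - means[None, :, :]

    # Log density of each vector under each diagonal Gaussian: (T, K).
    log_prob = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )

    # Posterior responsibility of each component for each vector,
    # computed in the log domain for numerical stability.
    log_post = np.log(weights) + log_prob
    log_post -= log_post.max(axis=1, keepdims=True)
    gamma = np.exp(log_post)
    gamma /= gamma.sum(axis=1, keepdims=True)

    # Responsibility-weighted, variance-normalized deviations from the means.
    grad = np.einsum("tk,tkd->kd", gamma, diff / np.sqrt(variances))
    grad /= T * np.sqrt(weights)[:, None]
    return grad.ravel()


# Hypothetical usage: 5 segments of a video, 3-dimensional features,
# and a 2-component mixture with made-up parameters.
rng = np.random.default_rng(0)
segments = rng.normal(size=(5, 3))
fv = fisher_vector_means(
    segments,
    weights=np.array([0.5, 0.5]),
    means=np.array([[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]]),
    variances=np.ones((2, 3)),
)
```

The output length depends only on the mixture size and feature dimensionality, so videos with different numbers of segments map to vectors of the same size and can be compared with a standard kernel or classifier.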