Revisiting fisher kernels for document similarities

  • Authors:
  • Martin Nyffenegger;Jean-Cédric Chappelier;Éric Gaussier

  • Affiliations:
  • Ecole Polytechnique Fédérale de Lausanne, Switzerland;Ecole Polytechnique Fédérale de Lausanne, Switzerland;Xerox Research Center Europe, Meylan, France

  • Venue:
  • ECML'06 Proceedings of the 17th European conference on Machine Learning
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a new metric to compute similarities between textual documents, based on the Fisher information kernel as proposed by T. Hofmann. By considering a new point-of-view on the embedding vector space and proposing a more appropriate way of handling the Fisher information matrix, we derive a new form of the kernel that yields significant improvements on an information retrieval task. We apply our approach to two different models: Naive Bayes and PLSI.