Nonextensive entropic kernels

  • Authors and affiliations:
  • André F. T. Martins (Carnegie Mellon University, Pittsburgh, PA, and Instituto de Telecomunicações)
  • Mário A. T. Figueiredo (Instituto de Telecomunicações)
  • Pedro M. Q. Aguiar (Instituto Superior Técnico, Lisboa, Portugal)
  • Noah A. Smith (Carnegie Mellon University, Pittsburgh, PA)
  • Eric P. Xing (Carnegie Mellon University, Pittsburgh, PA)

  • Venue:
  • Proceedings of the 25th International Conference on Machine Learning (ICML)
  • Year:
  • 2008

Abstract

Positive definite kernels on probability measures have recently been applied to structured data classification problems. Some of these kernels are related to classic information-theoretic quantities, such as mutual information and the Jensen-Shannon divergence. Meanwhile, driven by recent advances in Tsallis statistics, nonextensive generalizations of Shannon's information theory have been proposed. This paper bridges these two trends. We introduce the Jensen-Tsallis q-difference, a generalization of the Jensen-Shannon divergence. We then define a new family of nonextensive mutual information kernels, which allows weights to be assigned to the kernel arguments and includes the Boolean, Jensen-Shannon, and linear kernels as particular cases. We illustrate the performance of these kernels on text categorization tasks.
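
As a concrete illustration of the quantity named in the abstract, below is a minimal NumPy sketch of the Tsallis entropy and a weighted two-argument Jensen-Tsallis q-difference. The function names and toy distributions are ours, not the paper's; the formula follows the standard weighted form T_q = S_q(w1*p1 + w2*p2) - (w1^q * S_q(p1) + w2^q * S_q(p2)), which reduces to the Jensen-Shannon divergence at q = 1.

```python
import numpy as np

def tsallis_entropy(p, q):
    """Tsallis entropy S_q(p) = (1 - sum_i p_i^q) / (q - 1).

    Recovers the Shannon entropy (in nats) in the limit q -> 1.
    """
    p = np.asarray(p, dtype=float)
    p = p[p > 0]  # restrict to the support so small/zero q is handled sensibly
    if np.isclose(q, 1.0):
        return -np.sum(p * np.log(p))
    return (1.0 - np.sum(p ** q)) / (q - 1.0)

def jensen_tsallis_q_difference(p1, p2, q, w=(0.5, 0.5)):
    """Weighted Jensen-Tsallis q-difference between two distributions:

        T_q(p1, p2) = S_q(w1*p1 + w2*p2) - (w1**q * S_q(p1) + w2**q * S_q(p2))

    At q = 1 this reduces to the (weighted) Jensen-Shannon divergence.
    """
    p1, p2 = np.asarray(p1, dtype=float), np.asarray(p2, dtype=float)
    w1, w2 = w
    mixture = w1 * p1 + w2 * p2
    return (tsallis_entropy(mixture, q)
            - w1 ** q * tsallis_entropy(p1, q)
            - w2 ** q * tsallis_entropy(p2, q))

# Example: the q-difference between two toy term distributions at several q values.
p1 = np.array([0.5, 0.3, 0.2, 0.0])
p2 = np.array([0.1, 0.1, 0.4, 0.4])
for q in (0.0, 1.0, 2.0):
    print(f"q = {q}: T_q = {jensen_tsallis_q_difference(p1, p2, q):.4f}")
```

A quick check of the special cases the abstract mentions: with uniform weights, the q = 2 expression simplifies algebraically to (1 - <p1, p2>) / 2, which is how the linear kernel arises as a particular case, and q = 1 yields the Jensen-Shannon divergence underlying the Jensen-Shannon kernel.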