Mixtures of Probabilistic PCAs and Fisher Kernels for Word and Document Modeling

  • Authors:
  • George Siolas;Florence d'Alché-Buc

  • Affiliations:
  • -;-

  • Venue:
  • ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a generative model for constructing continuous word representations using mixtures of probabilistic PCAs. Applied to co-occurrence data, the model performs word clustering and allows the visualization of each cluster in a reduced space. In combination with a simple document model, it permits the definition of low-dimensional Fisher scores which are used as document features. We investigate the models' potential through kernel-based methods using the corresponding Fisher kernels.