An extension of PLSA for document clustering

  • Authors:
  • Young-Min Kim;Jean-François Pessiot;Massih Reza Amini;Patrick Gallinari

  • Affiliations:
  • Université Pierre et Marie Curie, Paris, France;Université Pierre et Marie Curie, Paris, France;Université Pierre et Marie Curie, Paris, France;Université Pierre et Marie Curie, Paris, France

  • Venue:
  • Proceedings of the 17th ACM conference on Information and knowledge management
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to co-cluster documents and terms simultaneously. We show on three datasets that our extended model produces statistically significant improvements with respect to two clustering measures over the original PLSA and the multinomial mixture MM models.