Interpretable likelihood for vector representable topic

  • Authors:
  • Ken-Ichi Fukui;Kazumi Saito;Masahiro Kimura;Masayuki Numao

  • Affiliations:
  • The Institute of Scientific and Industrial Research, Osaka University, Japan;NTT Communication Science Laboratories, Japan;Department of Electronics and Informatics, Ryukoku University, Japan;The Institute of Scientific and Industrial Research, Osaka University, Japan

  • Venue:
  • KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part III
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic topic extraction from a large number of documents is useful to capture an entire picture of the documents or to classify the documents. Here, it is an important issue to evaluate how much the extracted topics, which are set of documents, are interpretable for human. As the objective is vector representable topic extractions, e.g., Latent Semantic Analysis, we tried to formulate the interpretable likelihood of the extracted topic using the manually derived topics. We evaluated this likelihood of topics on English news articles using LSA, PCA and Spherical k-means for topic extraction. The results show that this likelihood can be applied as a filter to select meaningful topics.