Hierarchical topic integration through semi-supervised hierarchical topic modeling

  • Authors:
  • Xian-Ling Mao;Jing He;Hongfei Yan;Xiaoming Li

  • Affiliations:
  • Peking University, Beijing, China;Université de Montréal, Montréal, Canada;Peking University, Beijing, China;Peking University, Bejing, China

  • Venue:
  • Proceedings of the 21st ACM international conference on Information and knowledge management
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Lots of document collections are well organized in hierarchical structure, and such structure can help users browse and understand these collections. Meanwhile, there are a large number of plain document collections loosely organized, and it is difficult for users to understand them effectively. In this paper we study how to automatically integrate latent topics in a plain collection with the topics in a hierarchical structured collection. We propose to use semi-supervised topic modeling to solve the problem in a principled way. The experiments show that the proposed method can generate both meaningful latent topics and expand high quality hierarchical topic structures.