Topic and keyword re-ranking for LDA-based topic modeling

  • Authors:
  • Yangqiu Song;Shimei Pan;Shixia Liu;Michelle X. Zhou;Weihong Qian

  • Affiliations:
  • IBM China Research Lab, Beijing, China;IBM T. J. Watson Research Center, Hawthorne, NY, USA;IBM China Research Lab, Beijing, China;IBM China Research Lab, Beijing, China and IBM T. J. Watson Research Center, Hawthorne, NY, USA;IBM China Research Lab, Beijing, China

  • Venue:
  • Proceedings of the 18th ACM conference on Information and knowledge management
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Topic-based text summaries promise to help average users quickly understand a text collection and derive insights. Recent research has shown that the Latent Dirichlet Allocation (LDA) model is one of the most effective approaches to topic analysis. However, the LDA-based results may not be ideal for human understanding and consumption. In this paper, we present several topic and keyword re-ranking approaches that can help users better understand and consume the LDA-derived topics in their text analysis. Our methods process the LDA output based on a set of criteria that model a user's information needs. Our evaluation demonstrates the usefulness of the methods in summarizing several large-scale, real world data sets.