Topic and keyword re-ranking for LDA-based topic modeling

Authors:
Yangqiu Song;Shimei Pan;Shixia Liu;Michelle X. Zhou;Weihong Qian
Affiliations:
IBM China Research Lab, Beijing, China;IBM T. J. Watson Research Center, Hawthorne, NY, USA;IBM China Research Lab, Beijing, China;IBM China Research Lab, Beijing, China and IBM T. J. Watson Research Center, Hawthorne, NY, USA;IBM China Research Lab, Beijing, China
Venue:
Proceedings of the 18th ACM conference on Information and knowledge management
Year:
2009

Citing 9
Cited 5

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Unsupervised Feature Selection Using Feature Similarity

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Using machine learning to improve information access

Using machine learning to improve information access
Latent dirichlet allocation

The Journal of Machine Learning Research
A comprehensive comparative study on term weighting schemes for text categorization with support vector machines

WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Interactive, topic-based visual text summarization and analysis

Proceedings of the 18th ACM conference on Information and knowledge management
Probabilistic latent semantic analysis

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence

TIARA: a visual exploratory text analytic system

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Who is talking about what: social map-based recommendation for content-centric social websites

Proceedings of the fourth ACM conference on Recommender systems
Ranking related news predictions

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Who is Doing What and When: Social Map-Based Recommendation for Content-Centric Social Web Sites

ACM Transactions on Intelligent Systems and Technology (TIST)
PhC: Multiresolution Visualization and Exploration of Text Corpora with Parallel Hierarchical Coordinates

ACM Transactions on Intelligent Systems and Technology (TIST)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Topic-based text summaries promise to help average users quickly understand a text collection and derive insights. Recent research has shown that the Latent Dirichlet Allocation (LDA) model is one of the most effective approaches to topic analysis. However, the LDA-based results may not be ideal for human understanding and consumption. In this paper, we present several topic and keyword re-ranking approaches that can help users better understand and consume the LDA-derived topics in their text analysis. Our methods process the LDA output based on a set of criteria that model a user's information needs. Our evaluation demonstrates the usefulness of the methods in summarizing several large-scale, real world data sets.