Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Unsupervised Feature Selection Using Feature Similarity
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Using machine learning to improve information access
Using machine learning to improve information access
The Journal of Machine Learning Research
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Pattern Recognition and Machine Learning (Information Science and Statistics)
Pattern Recognition and Machine Learning (Information Science and Statistics)
Interactive, topic-based visual text summarization and analysis
Proceedings of the 18th ACM conference on Information and knowledge management
Probabilistic latent semantic analysis
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
TIARA: a visual exploratory text analytic system
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Who is talking about what: social map-based recommendation for content-centric social websites
Proceedings of the fourth ACM conference on Recommender systems
Ranking related news predictions
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Who is Doing What and When: Social Map-Based Recommendation for Content-Centric Social Web Sites
ACM Transactions on Intelligent Systems and Technology (TIST)
ACM Transactions on Intelligent Systems and Technology (TIST)
Hi-index | 0.00 |
Topic-based text summaries promise to help average users quickly understand a text collection and derive insights. Recent research has shown that the Latent Dirichlet Allocation (LDA) model is one of the most effective approaches to topic analysis. However, the LDA-based results may not be ideal for human understanding and consumption. In this paper, we present several topic and keyword re-ranking approaches that can help users better understand and consume the LDA-derived topics in their text analysis. Our methods process the LDA output based on a set of criteria that model a user's information needs. Our evaluation demonstrates the usefulness of the methods in summarizing several large-scale, real world data sets.