Discovering different types of topics: factored topic models

Authors:
Yun Jiang;Ashutosh Saxena
Affiliations:
Department of Computer Science, Cornell University;Department of Computer Science, Cornell University
Venue:
IJCAI '13: Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence
Year:
2013

Abstract

In traditional topic models such as LDA, each word is generated by choosing a single topic from a collection of topics. However, existing topic models do not distinguish between different types of topics in a document, such as topics that represent content and topics that represent sentiment. In this paper, our goal is to discover such different types of topics, if they exist. We represent our model as several parallel topic models (called topic factors), where each word is generated jointly from topics drawn from these factors. Since the latent membership of a word is now a vector rather than a single topic, learning becomes challenging. We show that a variational approximation keeps the algorithm tractable. Our experiments on several datasets show that our approach consistently outperforms many classic topic models while also discovering fewer, more meaningful topics.
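
The generative idea in the abstract can be illustrated with a small sketch. The abstract does not say how topics from different factors are combined, so the code below rests on assumptions: each word receives exactly one topic per factor, and each combination of topics indexes its own distribution over the vocabulary. The function names (`make_word_dists`, `generate_document`), the Dirichlet priors, and the hyperparameter values are hypothetical and chosen only for illustration. This is not the authors' model, and it omits the variational learning step entirely.

```python
import itertools
import random


def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet sample by normalizing independent Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def sample_categorical(probs, rng):
    """Return an index drawn according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def make_word_dists(topics_per_factor, vocab_size, rng, eta=0.5):
    """ASSUMPTION: one word distribution per combination of topics,
    e.g. per (content topic, sentiment topic) pair."""
    combos = itertools.product(*(range(k) for k in topics_per_factor))
    return {z: sample_dirichlet([eta] * vocab_size, rng) for z in combos}


def generate_document(num_words, topics_per_factor, word_dists, rng, alpha=0.5):
    """Generate (topic_vector, word_id) pairs for one document."""
    # One topic mixture per factor, as in several parallel topic models.
    mixtures = [sample_dirichlet([alpha] * k, rng) for k in topics_per_factor]
    doc = []
    for _ in range(num_words):
        # The latent membership of a word is a vector: one topic per factor.
        z = tuple(sample_categorical(theta, rng) for theta in mixtures)
        # The word is drawn jointly from the chosen topics.
        w = sample_categorical(word_dists[z], rng)
        doc.append((z, w))
    return doc


if __name__ == "__main__":
    rng = random.Random(0)
    factors = [3, 2]  # hypothetical: 3 content topics, 2 sentiment topics
    dists = make_word_dists(factors, vocab_size=10, rng=rng)
    for z, w in generate_document(5, factors, dists, rng):
        print(z, w)
```

With two factors of sizes 3 and 2, every word carries a pair such as `(1, 0)` instead of a single topic index. Inferring that vector for every word is what makes learning harder than in a single-factor model, and under this particular assumption the number of word distributions grows with the product of the factor sizes.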