Joint latent topic models for text and citations

Authors:
Ramesh M. Nallapati;Amr Ahmed;Eric P. Xing;William W. Cohen
Affiliations:
Stanford University, Stanford, CA, USA;Carnegie Mellon University, Pittsburgh, PA, USA;Carnegie Mellon University, Pittsburgh, PA, USA;Carnegie Mellon University, Pittsburgh, PA, USA
Venue:
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2008

Citing 10

Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)

Latent dirichlet allocation
The Journal of Machine Learning Research

The link prediction problem for social networks
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management

Dynamic topic models
ICML '06 Proceedings of the 23rd international conference on Machine learning

Pachinko allocation: DAG-structured mixture models of topic correlations
ICML '06 Proceedings of the 23rd international conference on Machine learning

Pattern Recognition and Machine Learning (Information Science and Statistics)

Unsupervised prediction of citation influences
Proceedings of the 24th international conference on Machine learning

Recommending citations for academic papers
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval

Multiscale topic tomography
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining

Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining

Cited 62

Linked latent Dirichlet allocation in web spam filtering
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web

Connections between the lines: augmenting social networks with text
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining

Combining link and content for community detection: a discriminative approach
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining

Detecting topic evolution in scientific literature: how can citations help?
Proceedings of the 18th ACM conference on Information and knowledge management

Estimating Likelihoods for Topic Models
ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning

Context-aware citation recommendation
Proceedings of the 19th international conference on World wide web

A Bayesian framework for community detection integrating content and link
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence

An efficient block model for clustering sparse graphs
Proceedings of the Eighth Workshop on Mining and Learning with Graphs

Modeling the evolution of associated data
Data & Knowledge Engineering

Topic models with power-law using Pitman-Yor process
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining

Mining topic-level influence in heterogeneous networks
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management

Who should I cite: learning literature search models from citation behavior
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management

PTM: probabilistic topic mapping model for mining parallel document collections
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management

Citation recommendation without author supervision
Proceedings of the fourth ACM international conference on Web search and data mining

Towards automated related work summarization
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters

The web of topics: discovering the topology of topic evolution in a corpus
Proceedings of the 20th international conference on World wide web

Empirical study of topic modeling in Twitter
Proceedings of the First Workshop on Social Media Analytics

Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA
Information Retrieval

Mining tags using social endorsement networks
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval

Probabilistic topic models with biased propagation on heterogeneous information networks
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

Locally discriminative topic modeling
Pattern Recognition

Group Profiling for Understanding Social Structures
ACM Transactions on Intelligent Systems and Technology (TIST)

MEI: mutual enhanced infinite generative model for simultaneous community and topic detection
DS'11 Proceedings of the 14th international conference on Discovery science

Detecting health events on the social web to enable epidemic intelligence
SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval

Recommending citations with translation model
Proceedings of the 20th ACM international conference on Information and knowledge management

Indices of novelty for emerging topic detection
Information Processing and Management: an International Journal

Document hierarchies from text and links
Proceedings of the 21st international conference on World Wide Web

To better stand on the shoulder of giants
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries

Plink-LDA: using link as prior information in topic modeling
DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I

Context sensitive topic models for author influence in document networks
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Three

A probabilistic graphical model for topic and preference discovery on social media
Neurocomputing

The contextual focused topic model
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

Practical collapsed variational bayes inference for hierarchical dirichlet process
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

ComSoc: adaptive transfer of user behaviors over composite social network
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

Mining contentions from discussions and debates
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

Latent association analysis of document pairs
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining

Multiple location profiling for users and relationships from social network and content
Proceedings of the VLDB Endowment

Discovering factions in the computational linguistics community
ACL '12 Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries

Evaluating joint modeling of yeast biology literature and protein-protein interaction networks
BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing

Topic evolution prediction of user generated contents considering enterprise generated contents
Proceedings of the First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks Research

Document-topic hierarchies from document graphs
Proceedings of the 21st ACM international conference on Information and knowledge management

Extraction of topic evolutions from references in scientific articles and its GPU acceleration
Proceedings of the 21st ACM international conference on Information and knowledge management

Recommending citations: translating papers into references
Proceedings of the 21st ACM international conference on Information and knowledge management

Like-Minded communities: bringing the familiarity and similarity together
WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering

Transforming graph data for statistical relational learning
Journal of Artificial Intelligence Research

Intuitive Topic Discovery by Incorporating Word-Pair's Connection Into LDA
WI-IAT '12 Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01

Can't see the forest for the trees?: a citation recommendation system
Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries

Incorporating popularity in topic models for social network analysis
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval

Scalable text and link analysis with mixed-topic link models
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Understanding evolution of research themes: a probabilistic generative model for citations
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Efficient community detection in large networks using content and links
Proceedings of the 22nd international conference on World Wide Web

Community detection by popularity based models for authored networked data
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Academic network analysis: a joint topic modeling approach
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Community detection in content-sharing social networks
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Enriching employee ontology for enterprises with knowledge discovery from social networks
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

On handling textual errors in latent document modeling
Proceedings of the 22nd ACM international conference on information & knowledge management

A unified graph model for personalized query-oriented reference paper recommendation
Proceedings of the 22nd ACM international conference on information & knowledge management

Research paper recommender system evaluation: a quantitative literature survey
Proceedings of the International Workshop on Reproducibility and Replication in Recommender Systems Evaluation

Discovering different types of topics: factored topic models
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Spatial compactness meets topical consistency: jointly modeling links and content for community detection
Proceedings of the 7th ACM international conference on Web search and data mining

User behavior learning and transfer in composite social networks
ACM Transactions on Knowledge Discovery from Data (TKDD) - Casin special issue

Activity-based topic discovery
Web Intelligence and Agent Systems

Abstract

In this work, we address the problem of jointly modeling text and citations in the topic modeling framework. We present two models, Pairwise-Link-LDA and Link-PLSA-LDA. The Pairwise-Link-LDA model combines the ideas of LDA [4] and Mixed Membership Stochastic Block Models [1] and can model arbitrary link structure. However, it is computationally expensive, since it models the presence or absence of a citation (link) between every pair of documents. The second model addresses this cost by assuming that the link structure is a bipartite graph. As its name indicates, the Link-PLSA-LDA model combines the LDA and PLSA models into a single graphical model. Our experiments on a subset of Citeseer data show that both models predict unseen data better than the baseline model of Erosheva and Lafferty [8], because they capture the notion of topical similarity between the contents of the cited and citing documents. Our experiments on two different data sets show that the Link-PLSA-LDA model performs best on the citation prediction task while remaining highly scalable. We also present some interesting visualizations generated by each of the models.
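
To make the cost argument in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a toy generative process in the spirit of the block model cited as [1]: each document gets a topic mixture, and one Bernoulli link variable is drawn for every ordered pair of documents. The function names, the parameter `eta` (a hypothetical topic-by-topic citation probability table), and the helper `to_bipartite` are illustrative assumptions that do not come from the abstract.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw a probability vector from Dirichlet(alpha) via normalized Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def generate_pairwise_links(num_docs, num_topics, alpha, eta, seed=0):
    """Toy pairwise link generation (illustrative assumption, not the paper's model).

    eta[k][l] is a hypothetical probability that a document acting under
    topic k cites a document acting under topic l.
    """
    rng = random.Random(seed)
    topics = list(range(num_topics))
    # One topic mixture per document, as in LDA.
    thetas = [sample_dirichlet([alpha] * num_topics, rng) for _ in range(num_docs)]
    links = {}
    for citing in range(num_docs):
        for cited in range(num_docs):
            if citing == cited:
                continue
            # Each ordered pair gets its own topic draws and its own link variable.
            z_citing = rng.choices(topics, weights=thetas[citing])[0]
            z_cited = rng.choices(topics, weights=thetas[cited])[0]
            links[(citing, cited)] = int(rng.random() < eta[z_citing][z_cited])
    return thetas, links


def to_bipartite(citations):
    """Split observed (citing, cited) pairs into two node sets.

    A paper that both cites and is cited appears once on each side
    (an assumption about how the bipartite view is built).
    """
    citing = sorted({src for src, _ in citations})
    cited = sorted({dst for _, dst in citations})
    return citing, cited


if __name__ == "__main__":
    eta = [[0.9, 0.05], [0.05, 0.9]]
    thetas, links = generate_pairwise_links(5, 2, 0.5, eta, seed=1)
    print(len(links))  # 5 * 4 = 20 ordered pairs, present or absent
    print(to_bipartite([("a", "b"), ("b", "c")]))
```

The pairwise sketch creates `num_docs * (num_docs - 1)` link variables regardless of how many citations actually exist, which is the quadratic cost the abstract refers to. The bipartite helper only touches the observed citation pairs, which illustrates why restricting the structure in this way can scale better.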