Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
The Journal of Machine Learning Research
TSSP: A Reinforcement Algorithm to Find Related Papers
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Recommending citations for academic papers
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Joint latent topic models for text and citations
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
A Discriminative Approach to Topic-Based Citation Recommendation
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Topic-link LDA: joint models of topic and author community
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
A latent topic model for linked documents
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Context-aware citation recommendation
Proceedings of the 19th international conference on World wide web
Citation recommendation without author supervision
Proceedings of the fourth ACM international conference on Web search and data mining
Panorama: a semantic-aware application search framework
Proceedings of the 16th International Conference on Extending Database Technology
Research paper recommender system evaluation: a quantitative literature survey
Proceedings of the International Workshop on Reproducibility and Replication in Recommender Systems Evaluation
Hi-index | 0.00 |
Citations are highly valuable for analyzing documents and have been widely studied in recent years. Among the document modeling, the citations are treated as documents' attributes just like the words in the documents; or as the degrees in graph theory. These methods add citations into word sampling process to reform the document representation but they miss the impact of the citations in the generation of content. In this paper, we view the citations as the prior information which authors have had. In the generation of document, content of the document is split into two parts: the idea of the author and the knowledge from the cited papers. We proposed a prior information enabled topic model-PLDA. In the modeling, both the document and its citations play the important role of generating the topic layer. Our experiments on two linked datasets show that our model greatly outperforms basic LDA procedures on a clustering task while also maintaining the dependencies among documents. In addition, we also show the feasibility by the task of citation recommendation.