A language model approach for tag recommendation

Authors:
Ke Sun;Xiaolong Wang;Chengjie Sun;Lei Lin
Affiliations:
School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China and Department of Control Science and Engineering, Harbin Institute of Technology, Harbin 150001, Ch ...
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 17
Cited 1

An algorithm for suffix stripping

Readings in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Learning Algorithms for Keyphrase Extraction

Information Retrieval
A study of smoothing methods for language models applied to information retrieval

ACM Transactions on Information Systems (TOIS)
Finding similar questions in large question and answer archives

Proceedings of the 14th ACM international conference on Information and knowledge management
Usage patterns of collaborative tagging systems

Journal of Information Science
Improved annotation of the blogosphere via autotagging and hierarchical clustering

Proceedings of the 15th international conference on World Wide Web
AutoTag: a collaborative approach to automated tag assignment for weblog posts

Proceedings of the 15th international conference on World Wide Web
Why we tag: motivations for annotation in mobile and online media

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
P-TAG: large scale automatic generation of personalized annotation tags for the web

Proceedings of the 16th international conference on World Wide Web
Query expansion using probabilistic local feedback with application to multimedia retrieval

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Flickr tag recommendation based on collective knowledge

Proceedings of the 17th international conference on World Wide Web
Real-time automatic tag recommendation

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Social tag prediction

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Tag ranking

Proceedings of the 18th international conference on World wide web
Learning to tag

Proceedings of the 18th international conference on World wide web
Domain-specific keyphrase extraction

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2

Reorganizing clouds: A study on tag clustering and evaluation

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	12.05

Visualization

Abstract

Tags are user-generated keywords for entities. Recently tags have been used as a popular way to allow users to contribute metadata to large corpora on the web. However, tagging style websites lack the function of guaranteeing the quality of tags for other usages, like collaboration/community, clustering, and search, etc. Thus, as a remedy function, automatic tag recommendation which recommends a set of candidate tags for user to choice while tagging a certain document has recently drawn many attentions. In this paper, we introduce the statistical language model theory into tag recommendation problem named as language model for tag recommendation (LMTR), by converting the tag recommendation problem into a ranking problem and then modeling the correlation between tag and document with the language model framework. Furthermore, we leverage two different methods based on both keywords extraction and keywords expansion to collect candidate tag before ranking with LMTR to improve the performance of LMTR. Experiments on large-scale tagging datasets of both scientific and web documents indicate that our proposals are capable of making tag recommendation efficiently and effectively.