Utilising document content for tag recommendation in folksonomies

  • Authors:
  • Nikolas Landia

  • Affiliations:
  • University of Warwick, Coventry, United Kingdom

  • Venue:
  • Proceedings of the sixth ACM conference on Recommender systems
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Real-world tagging datasets have a large proportion of new/unseen documents. Few approaches for recommending tags to a user for a document address this new item problem, concentrating instead on artificially created post-core datasets where it is guaranteed that the user as well as the document of each test post is known to the system and already has some tags assigned to it. In order to recommend tags for unseen documents, approaches are required which model documents not only based on the tags assigned to it in the past (if any), but also the content. The focus of my research is on utilising the content of documents in order to address the new item problem in tag recommendation. I apply this methodology first to simple baseline tag recommenders and then the more advanced tag recommendation algorithm FolkRank. One of my main contributions is a novel adaptation to the FolkRank graph model to use multiple word nodes instead of a single document node to represent each document. This enables FolkRank to recommend tags for unseen documents and makes it applicable to full real-world tagging datasets, addressing the new item problem in tag recommendation.