An analysis of the use of tags in a blog recommender system

Authors:
Conor Hayes;Paolo Avesani;Sriharsha Veeramachaneni
Affiliations:
ITC-IRST, Povo, Trento, Italy;ITC-IRST, Povo, Trento, Italy;ITC-IRST, Povo, Trento, Italy
Venue:
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Year:
2007

Citing 5
Cited 10

Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering

Machine Learning
Semantic blogging and decentralized knowledge management

Communications of the ACM - The Blogosphere
Conversations in the Blogosphere: An Analysis "From the Bottom Up"

HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 4 - Volume 04
Usage patterns of collaborative tagging systems

Journal of Information Science
What would it mean to blog on the semantic web?

Web Semantics: Science, Services and Agents on the World Wide Web

An Analysis of Bloggers, Topics and Tags for a Blog Recommender System

From Web to Social Web: Discovering and Deploying User and Content Profiles
Tag-based filtering for personalized bookmark recommendations

Proceedings of the 17th ACM conference on Information and knowledge management
Semantic Modelling of User Interests Based on Cross-Folksonomy Analysis

ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
An online blog reading system by topic clustering and personalized ranking

ACM Transactions on Internet Technology (TOIT)
WisColl: Collective wisdom based blog clustering

Information Sciences: an International Journal
Live Social Semantics

ISWC '09 Proceedings of the 8th International Semantic Web Conference
Blog classification using tags: an empirical study

ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Semantics, sensors, and the social web: the live social semantics experiments

ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Clustering tagged documents with labeled and unlabeled documents

Information Processing and Management: an International Journal
A smart TV system with body-gesture control, tag-based rating and context-aware recommendation

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Web is experiencing an exponential growth in the use of weblogs or blogs, websites containing dated journal-style entries. Blog entries are generally organised using informally defined labels known as tags. Increasingly, tags are being proposed as a 'grassroots' alternative to Semantic Web standards. We demonstrate that tags by themselves are weak at partitioning blog data. We then show how tags may contribute useful, discriminating information. Using content-based clustering, we observe that frequently occurring tags in each cluster are usually good meta-labels for the cluster concept. We then introduce the Tr score, a score based on the proportion of high-frequency tags in a cluster, and demonstrate that it is strongly correlated with cluster strength. We demonstrate how the Tr score enables the detection and removal of weak clusters. As such, the Tr score can be used as an independent means of verifying topic integrity in a cluster-based recommender system.