A Tag Clustering Method to Deal with Syntactic Variations on Collaborative Social Networks

  • Authors:
  • José Javier Astrain;Francisco Echarte;Alberto Córdoba;Jesús Villadangos

  • Affiliations:
  • Dpt. de Ingeniería Matemática e Informática, Universidad Pública de Navarra, Pamplona, Spain 31006;Dpt. de Ingeniería Matemática e Informática, Universidad Pública de Navarra, Pamplona, Spain 31006;Dpt. de Ingeniería Matemática e Informática, Universidad Pública de Navarra, Pamplona, Spain 31006;Dpt. de Ingeniería Matemática e Informática, Universidad Pública de Navarra, Pamplona, Spain 31006

  • Venue:
  • ICWE '9 Proceedings of the 9th International Conference on Web Engineering
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Folksonomies have emerged as a common way of annotating and categorizing content using a set of tags that are created and managed in a collaborative way. Tags carry the semantic information within a folksonomy, and provide thus the link to ontologies. The appeal of folksonomies comes from the fact that they require a low effort for creation and maintenance since they are community-generated. However they present important drawbacks regarding their limited navigation and searching capabilities, in contrast with other methods as taxonomies, thesauruses and ontologies. One of these drawbacks is an effect of its flexibility for tagging, producing frequently multiple syntactic variations of a same tag. Similarity measures allow the correct identification of tag variations when tag lengths are greater than five symbols. In this paper we propose the use of cosine relatedness measures in order to cluster tags with lengths lower or equal than five symbols. We build a discriminator based on the combination of a fuzzy similarity and a cosine measures and we analyze the results obtained.