Reorganizing clouds: A study on tag clustering and evaluation

  • Authors:
  • Alberto Pérez García-Plaza;Arkaitz Zubiaga;Víctor Fresno;Raquel Martínez

  • Affiliations:
  • NLP&IR Group, UNED Madrid, Spain;Queens College and Graduate Center, City University of New York, New York, NY, USA;NLP&IR Group, UNED Madrid, Spain;NLP&IR Group, UNED Madrid, Spain

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

Finding and visualizing semantic relations among tags within a tag cloud enhances user experience, particularly regarding access to and retrieval of web pages on social tagging systems. Several approaches have been proposed to visualize tag relations in these systems. However, results of previous research rely on qualitative evaluation methods, and do not provide robust and sound comparison criteria. In order to allow quantitative evaluation we present a benchmark social tagging dataset, where a subset of 140 tags from a well-known social bookmarking site, delicious, have been manually categorized according to the open directory project (ODP). The manual categorization is utilized as a ground truth that enables quantitative evaluation providing a way of inferring the best of different clustering approaches. With this dataset we also explore different tag representation approaches to present a reorganized tag cloud by using self organizing maps. In addition, we present an approach to enrich the resultant tag cloud with the most characteristic terms for each tag and group of tags, making possible a further filtered navigation, both by tag and document content, and easing a deeper qualitative evaluation of the clusters.