Mining large query induced graphs towards a hierarchical query folksonomy

  • Authors:
  • Alexandre P. Francisco;Ricardo Baeza-Yates;Arlindo L. Oliveira

  • Affiliations:
  • INESC-ID, CSE Dept., IST, Tech Univ of Lisbon, Portugal;Yahoo! Research Barcelona, Spain & Santiago, Chile;INESC-ID, CSE Dept., IST, Tech Univ of Lisbon, Portugal

  • Venue:
  • SPIRE'10 Proceedings of the 17th international conference on String processing and information retrieval
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The human interaction through the web generates both implicit and explicit knowledge. An example of an implicit contribution is searching, as people contribute with their knowledge by clicking on retrieved documents. Thus, an important and interesting challenge is to extract semantic relations among queries and their terms from query logs. In this paper we present and discuss results on mining large query log induced graphs, and how they contribute to query classification and to understand user intent and interest. Our approach consists on efficiently obtaining a hierarchical clustering for such graphs and, then, a hierarchical query folksonomy. Results obtained with real data provide interesting insights on semantic relations among queries and are compared with conventional taxonomies, namely the ODP categorization.