"Tell me more": finding related items from user provided feedback

  • Authors:
  • Jeroen De Knijf;Anthony Liekens;Bart Goethals

  • Affiliations:
  • Department of Mathematics and Computer Science, Antwerp University;VIB Department of Molecular Genetics, Antwerp University;Department of Mathematics and Computer Science, Antwerp University

  • Venue:
  • DS'11 Proceedings of the 14th international conference on Discovery science
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The results returned by a search, datamining or database engine often contains an overload of potentially interesting information. A daunting and challenging problem for a user is to pick out the useful information. In this paper we propose an interactive framework to efficiently explore and (re)rank the objects retrieved by such an engine, according to feedback provided on part of the initially retrieved objects. In particular, given a set of objects, a similarity measure applicable to the objects and an initial set of objects that are of interest to the user, our algorithm computes the k most similar objects. This problem, previously coined as 'cluster on demand' [10], is solved by transforming the data into a weighted graph. On this weighted graph we compute a relevance score between the initial set of nodes and the remaining nodes based upon random walks with restart in graphs.We apply our algorithm "Tell Me More" (TMM) on text, numerical and zero/one data. The results show that TMM for almost every experiment significantly outperforms a k-nearest neighbor approach.