Exploring the semantics behind a collection to improve automated image annotation

  • Authors:
  • Ainhoa Llorente;Enrico Motta;Stefan Rüger

  • Affiliations:
  • Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom;Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom;Knowledge Media Institute, The Open University, Milton Keynes, United Kingdom

  • Venue:
  • CLEF'09 Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The goal of this research is to explore several semantic relatedness measures that help to refine annotations generated by a base-line non-parametric density estimation algorithm. Thus, we analyse the benefits of performing a statistical correlation using the training set or using the World Wide Web versus approaches based on a thesaurus like WordNet or Wikipedia (considered as a hyperlink structure). Experiments are carried out using the dataset provided by the 2009 edition of the ImageCLEF competition, a subset of the MIR-Flickr 25k collection. Best results correspond to approaches based on statistical correlation as they do not depend on a prior disambiguation phase like WordNet and Wikipedia. Further work needs to be done to assess whether proper disambiguation schemas might improve their performance.