Classification and annotation in social corpora using multiple relations

  • Authors:
  • Yann Jacob;Ludovic Denoyer;Patrick Gallinari

  • Affiliations:
  • Laboratoire d'informatique de Paris 6 University Pierre et Marie Curie, Paris, France;Laboratoire d'informatique de Paris 6 University Pierre et Marie Curie, Paris, France;Laboratoire d'informatique de Paris 6 University Pierre et Marie Curie, Paris, France

  • Venue:
  • Proceedings of the 20th ACM international conference on Information and knowledge management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the problem of learning to annotate documents with concepts or keywords in content information networks, where the documents may share multiple relations. The concepts associated to a document will depend both on its content and on its neighbors in the network through the different relations. We formalize this problem as single and multi-label classification in a multi-graph, the nodes being the documents and the edges representing the different relations. The proposed algorithm learns to weight the different relations according to their importance for the annotation task. We perform experiments on different corpora corresponding to different annotation tasks on scientific articles, emails and Flickr images and show how the model may take advantage of the rich relational information.