Using bags of symbols for automatic indexing of graphical document image databases

  • Authors:
  • Eugen Barbu;Pierre Héroux;Sébastien Adam;Éric Trupin

  • Affiliations:
  • LITIS, Université de Rouen, Saint-Etienne du Rouvray, France;LITIS, Université de Rouen, Saint-Etienne du Rouvray, France;LITIS, Université de Rouen, Saint-Etienne du Rouvray, France;LITIS, Université de Rouen, Saint-Etienne du Rouvray, France

  • Venue:
  • GREC'05 Proceedings of the 6th international conference on Graphics Recognition: ten Years Review and Future Perspectives
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

A database is only usefull if it is associated a set of procedures allowing to retrieve relevant elements for the users’ needs. A lot of IR techniques have been developed for automatic indexing and retrieval in document databases. Most of these use indexes depending on the textual content of documents, and very few are able to handle graphical or image content without human annotation. This paper describes an approach similar to the bag of words technique for automatic indexing of graphical document image databases and different ways to consequently query these databases. In an unsupervised manner, this approach proposes a set of automatically discovered symbols that can be combined with logical operators to build queries.