Representing Context Information for Document Retrieval

  • Authors:
  • Maya Carrillo; Esaú Villatoro-Tello; A. López-López; Chris Eliasmith; Manuel Montes-Y-Gómez; Luis Villaseñor-Pineda

  • Affiliations:
  • Coordinación de Ciencias Computacionales, INAOE, Puebla, Mexico 72840 and Facultad de Ciencias de la Computación, BUAP, Puebla, Mexico 72570; Coordinación de Ciencias Computacionales, INAOE, Puebla, Mexico 72840; Coordinación de Ciencias Computacionales, INAOE, Puebla, Mexico 72840; Department of Philosophy, Department of Systems Design Engineering, Centre for Theoretical Neuroscience, University of Waterloo, Canada; Coordinación de Ciencias Computacionales, INAOE, Puebla, Mexico 72840; Coordinación de Ciencias Computacionales, INAOE, Puebla, Mexico 72840

  • Venue:
  • FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
  • Year:
  • 2009

Abstract

The bag-of-words (BoW) representation, widely used in information retrieval (IR), represents documents and queries as word lists that express nothing about context. When we look for information, not everything is explicitly stated in a document, so context information is needed to understand its content. This paper proposes the use of bag of concepts (BoC) and holographic reduced representations (HRR) in IR. These representations go beyond BoW by incorporating context information into document representations. Both HRR and BoC are produced using a vector space methodology known as Random Indexing, and both allow additional knowledge from different sources to be expressed. Our experiments show that the representations are feasible and that they improve mean average precision by up to 7% compared with the traditional vector space model.
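
To make the abstract's ingredients concrete, the following is a minimal sketch, not the authors' implementation, of how Random Indexing could yield BoC document vectors and how an HRR could bind two vectors by circular convolution. The toy corpus, the dimensionality, the number of nonzero entries, the absence of term weighting, and all names (`index_vector`, `bind`, `context`, `boc`, `role`) are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM, NONZEROS = 512, 8  # illustrative sizes, not taken from the paper


def index_vector():
    """Sparse ternary random vector: a few +1/-1 entries, zeros elsewhere."""
    v = np.zeros(DIM)
    positions = rng.choice(DIM, size=NONZEROS, replace=False)
    v[positions] = rng.choice([-1.0, 1.0], size=NONZEROS)
    return v


def bind(a, b):
    """Circular convolution (the HRR binding operator), computed via the FFT."""
    return np.fft.irfft(np.fft.rfft(a) * np.fft.rfft(b), n=len(a))


def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


# Hypothetical toy corpus: two related documents and one unrelated document.
docs = [
    ["retrieval", "document", "query"],
    ["document", "retrieval", "index"],
    ["neuron", "spike", "cortex"],
]

# Random Indexing: give each document a random index vector, then build each
# term's context vector as the sum of the index vectors of the documents in
# which the term occurs.
doc_index = [index_vector() for _ in docs]
context = {}
for d, terms in enumerate(docs):
    for t in set(terms):
        context[t] = context.get(t, np.zeros(DIM)) + doc_index[d]

# Bag of concepts: a document is the (unweighted) sum of its terms' context vectors.
boc = [np.sum([context[t] for t in terms], axis=0) for terms in docs]

# HRR-style enrichment: bind a hypothetical role vector to one term's context
# vector and superpose the result on the BoC vector of the first document.
role = index_vector()
hrr_doc0 = boc[0] + bind(role, context["retrieval"])

sim_related = cosine(boc[0], boc[1])
sim_unrelated = cosine(boc[0], boc[2])
print(f"related: {sim_related:.2f}, unrelated: {sim_unrelated:.2f}")
```

In this sketch the two documents that share vocabulary end up with similar BoC vectors, while the unrelated document stays close to orthogonal. A real system would add term weighting and a principled choice of the roles that are bound.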