Query reformulation using automatically generated query concepts from a document space

  • Authors:
  • Youjin Chang; Iadh Ounis; Minkoo Kim

  • Affiliations:
  • Graduate School of Information and Communication, Ajou University, San 5 Wonchon-Dong Youngtong-Gu, Suwon 443-749, Republic of Korea; Department of Computing Science, University of Glasgow, 17 Lilybank Gardens, Glasgow, G12 8QQ, UK; Department of Computer Engineering, Ajou University, San 5 Wonchon-Dong Youngtong-Gu, Suwon, 443-749, Republic of Korea

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2006

Abstract

We propose a new query reformulation approach that uses a set of query concepts, introduced to denote the user's information need precisely. Treating a document collection as a domain that contains latent primitive concepts, we identify those concepts with data mining techniques, combining local pattern discovery with global modeling. For a new query, we select the primitive concepts most strongly associated with it and choose the most probable interpretations as query concepts. We discuss whether the primitive concepts should be constructed from the whole corpus or from the set of retrieved documents. Our experiments are performed on the TREC8 collection. The experimental evaluation shows that our approach performs as well as current query reformulation approaches, while being particularly effective for poorly performing queries. Moreover, we find that generating the primitive concepts from the set of retrieved documents leads to the most effective performance.
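
The abstract does not give the mining or selection procedure, so the following is only a rough sketch of the general idea and not the authors' method. It assumes, purely for illustration, that a "primitive concept" can be approximated by a pair of terms that co-occur in at least `min_support` retrieved documents, and that a query is reformulated by appending the terms most strongly associated with it. The function names `mine_term_pairs` and `reformulate`, the parameters `min_support` and `top_k`, and the toy documents are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def mine_term_pairs(docs, min_support=2):
    """Return term pairs that co-occur in at least `min_support` documents.

    Assumption: frequent term pairs stand in for the paper's primitive
    concepts; the paper's actual pattern discovery is not specified here.
    """
    counts = Counter()
    for doc in docs:
        # Sort the unique terms so each pair has one canonical ordering.
        terms = sorted(set(doc.lower().split()))
        counts.update(combinations(terms, 2))
    return {pair: c for pair, c in counts.items() if c >= min_support}


def reformulate(query, docs, min_support=2, top_k=2):
    """Append the `top_k` terms most associated with the query terms."""
    q_terms = query.lower().split()
    q_set = set(q_terms)
    scores = Counter()
    for (a, b), support in mine_term_pairs(docs, min_support).items():
        # Keep pairs that link exactly one query term to one new term.
        if (a in q_set) != (b in q_set):
            new_term = b if a in q_set else a
            scores[new_term] += support
    # Rank by support, breaking ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return q_terms + [term for term, _ in ranked[:top_k]]


# Toy stand-in for a set of retrieved documents (hypothetical data).
retrieved = [
    "solar panel energy cost",
    "solar energy storage battery",
    "wind energy cost",
    "solar panel installation",
]
expanded_query = reformulate("solar", retrieved)
```

Passing only the retrieved documents, as here, mirrors the setting the abstract reports as most effective; passing the whole corpus instead would correspond to the alternative the authors discuss.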