Managing Word Mismatch Problems in Information Retrieval: A Topic-Based Query Expansion Approach

  • Authors:
  • Chih-Ping Wei;Paul Hu;Chia-Hung Tai;Chun-Neng Huang;Chin-Sheng Yang

  • Affiliations:
  • The Institute of Technology Management, National Tsing Hua University, Taiwan;The David Eccles School of Business, University of Utah;The Institute of Information Science, Academia Sinica, Taiwan;Project Manager Engineer of Asiatek Taiwan;The Republic of China Army

  • Venue:
  • Journal of Management Information Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Word mismatch represents a fundamental information retrieval challenge that has become increasingly important as electronic document repositories (e.g., Web resources, digital libraries) grow in number and sheer volume. In general, word mismatch refers to the phenomenon in which a concept is described by different terms in user queries and in source documents. Query expansion represents a promising avenue to address such problems. Previous research predominantly approaches query expansion on the basis of global or local analysis. However, these approaches emphasize a global perspective rather than taking a topic-specific view of term associations. As a consequence, their effectiveness can be severely constrained when the document corpus spans a diverse set of topics. In this study, we propose a topic-based approach for query expansion and develop and empirically evaluate two novel methods-namely, nonfuzzy and fuzzy topic-based query expansion-to address word mismatch problems. According to our evaluation results, the proposed topic-based approach is more effective than a benchmark global analysis method, particularly when user queries consist of multiple query terms.