High-performance FAQ retrieval using an automatic clustering method of query logs

  • Authors:
  • Harksoo Kim;Jungyun Seo

  • Affiliations:
  • CIIR, Department of Computer Science, University of Massachusetts, Amherst, MA and Diquest Inc., Seocho-dong, Seocho-gu, Seoul, Korea;Department of Computer Science and Program of Integrated Biotechnology, Sogang University, Sinsu-dong, Mapo-gu, Seoul, Korea

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

To resolve some of lexical disagreement problems between queries and FAQs, we propose a reliable FAQ retrieval system using query log clustering. On indexing time, the proposed system clusters the logs of users' queries into predefined FAQ categories. To increase the precision and the recall rate of clustering, the proposed system adopts a new similarity measure using a machine readable dictionary. On searching time, the proposed system calculates the similarities between users' queries and each cluster in order to smooth FAQs. By virtue of the cluster-based retrieval technique, the proposed system could partially bridge lexical chasms between queries and FAQs. In addition, the proposed system outperforms the traditional information retrieval systems in FAQ retrieval.