A reliable FAQ retrieval system using a query log classification technique based on latent semantic analysis

Authors:
Harksoo Kim;Hyunjung Lee;Jungyun Seo
Affiliations:
Program of Computer and Communications Engineering, College of Information Technology, Kangwon National University, Hyoja, Republic of Korea;Natural Language Processing Laboratory, Department of Computer Science, Sogang University, Seoul, Republic of Korea;Department of Computer Science and Interdisciplinary Program of Integrated Biotechnology, Sogang University, Seoul, Republic of Korea
Venue:
Information Processing and Management: an International Journal - Special issue: AIRS2005: Information retrieval research in Asia
Year:
2007

Citing 13
Cited 3

Recent trends in hierarchic document clustering: a critical review

Information Processing and Management: an International Journal
Comparison of hierarchic agglomerative clustering methods for document retrieval

The Computer Journal
An Information Retrieval Approach for Automatically Constructing Software Libraries

IEEE Transactions on Software Engineering
Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Auto-FAQ: an experiment in cyberspace leveraging

Computer Networks and ISDN Systems
Reexamining the cluster hypothesis: scatter/gather on retrieval results

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
The cluster hypothesis revisited

SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Information Retrieval

Information Retrieval
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
The effectiveness of query-specific hierarchic clustering in information retrieval

Information Processing and Management: an International Journal
FAQ finder: a case-based approach to knowledge navigation

CAIA '95 Proceedings of the 11th Conference on Artificial Intelligence for Applications
Cluster-based retrieval using language models

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval

A machine-translation method for normalization of SMS

MCPR'12 Proceedings of the 4th Mexican conference on Pattern Recognition
A high-performance FAQ retrieval method using minimal differentiator expressions

Knowledge-Based Systems
A cloud of FAQ: A highly-precise FAQ retrieval system for the Web 2.0

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

To obtain high performances, previous works on FAQ retrieval used high-level knowledge bases or handcrafted rules. However, it is a time and effort consuming job to construct these knowledge bases and rules whenever application domains are changed. To overcome this problem, we propose a high-performance FAQ retrieval system only using users' query logs as knowledge sources. During indexing time, the proposed system efficiently clusters users' query logs using classification techniques based on latent semantic analysis. During retrieval time, the proposed system smoothes FAQs using the query log clusters. In the experiment, the proposed system outperformed the conventional information retrieval systems in FAQ retrieval. Based on various experiments, we found that the proposed system could alleviate critical lexical disagreement problems in short document retrieval. In addition, we believe that the proposed system is more practical and reliable than the previous FAQ retrieval systems because it uses only data-driven methods without high-level knowledge sources.