Integration of multiple evidences based on a query type for web search

Authors:
In-Ho Kang;Gil Chang Kim
Affiliations:
Division of Computer Science, Department of EECS, KAIST, 373-1 Guseong-dong, Yuseong-gu, Daejon 305-701, South Korea;Division of Computer Science, Department of EECS, KAIST, 373-1 Guseong-dong, Yuseong-gu, Daejon 305-701, South Korea
Venue:
Information Processing and Management: an International Journal
Year:
2004

Citing 10
Cited 6

Analyses of multiple evidence combination

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
Document language models, query models, and risk minimization for information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Models for metasearch

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Modern Information Retrieval

Modern Information Retrieval
A taxonomy of web search

ACM SIGIR Forum
Engineering a multi-purpose test collection for web retrieval experiments

Information Processing and Management: an International Journal

The influence of task and gender on search and evaluation behavior using Google

Information Processing and Management: an International Journal
Getting work done on the web: supporting transactional queries

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Navigating the intranet with high precision

Proceedings of the 16th international conference on World Wide Web
Task Effects on Interactive Search: The Query Factor

Focused Access to XML Documents
An evolutionary approach for combining different sources of evidence in search engines

Information Systems
Exploring features for the automatic identification of user goals in web search

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

The massive and heterogeneous Web exacerbates IR problems and short user queries make them worse. The contents of web pages are not enough to find answer pages. PageRank compensates for the insufficiencies of content information. The content information and PageRank are combined to get better results. However, static combination of multiple evidences may lower the retrieval performance. We have to use different strategies to meet the need of a user. We can classify user queries as three categories according to users' intent, the topic relevance task, the homepage finding task, and the service finding task. In this paper, we present a user query classification method. The difference of distribution, mutual information, the usage rate as anchor texts and the POS information are used for the classification. After we classified a user query, we apply different algorithms and information for the better results. For the topic relevance task, we emphasize the content information, on the other hand, for the homepage finding task, we emphasize the Link information and the URL information. We could get the best performance when our proposed classification method with the OKAPI scoring algorithm was used.