On a combination of probabilistic and boolean ir models for WWW document retrieval

Authors:
Masaharu Yoshioka;Makoto Haraguchi
Affiliations:
Hokkaido University, Hokkaido, Japan;Hokkaido University, Hokkaido, Japan
Venue:
ACM Transactions on Asian Language Information Processing (TALIP)
Year:
2005

Citing 12
Cited 3

A direct manipulation interface for boolean information retrieval via natural language query

SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Scatter/Gather: a cluster-based approach to browsing large document collections

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
A graphical filter/flow representation of Boolean queries: a prototype implementation and evaluation

Journal of the American Society for Information Science
A case for interaction: a study of interactive information retrieval behavior and effectiveness

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Query expansion using local and global document analysis

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Graphical query specification and dynamic result previews for a digital library

Proceedings of the 11th annual ACM symposium on User interface software and technology
The impact of query structure and query expansion on retrieval performance

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Extended Boolean information retrieval

Communications of the ACM
Searching the Web: the public and their queries

Journal of the American Society for Information Science and Technology
Modern Information Retrieval

Modern Information Retrieval
Coverage, relevance, and ranking: The impact of query operators on Web search engine results

ACM Transactions on Information Systems (TOIS)

Query refinement based on topical term clustering

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Evaluating topic difficulties from the viewpoint of query term expansion

AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
A new measure for query disambiguation using term co-occurrences

IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Even though a Boolean query can express the information need precisely enough to select relevant documents, it is not easy to construct an appropriate Boolean query that covers all relevant documents. To utilize a Boolean query effectively, a mechanism to retrieve as many as possible relevant documents is therefore required. In accordance with this requirement, we propose a method for modifying a given Boolean query by using information from a relevant document set. The retrieval results, however, may deteriorate if some important query terms are removed by this reformulation. A further mechanism is thus required in order to use other query terms that are useful for finding more relevant documents, but are not strictly required in relevant documents. To meet this requirement, we propose a new method that combines the probabilistic IR and the Boolean IR models. We also introduce a new IR system---called appropriate Boolean query reformulation for information retrieval (ABRIR)---based on these two methods and the Okapi system. ABRIR uses both a word index and a phrase index formed from combinations of two adjacent noun words. The effectiveness of these two methods was confirmed according to the NTCIR-4 Web test collection.