Some research problems in automatic information retrieval
SIGIR '83 Proceedings of the 6th annual international ACM SIGIR conference on Research and development in information retrieval
A Rough Set-Aided System for Sorting WWW Bookmarks
WI '01 Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development
Effective Retrieval of Information in Tables on the Internet
IEA/AIE '02 Proceedings of the 15th international conference on Industrial and engineering applications of artificial intelligence and expert systems: developments in applied artificial intelligence
On the representation of query term relations by soft Boolean operators
EACL '85 Proceedings of the second conference on European chapter of the Association for Computational Linguistics
Some characteristics of future information systems
ACM SIGIR Forum
A blueprint for automatic Boolean query processing
ACM SIGIR Forum
Hi-index | 0.00 |
In conventional information retrieval Boolean combinations of index terms are used to formulate the users'' information requests. While any document is in principle retrievable by a Boolean query, the amount of output obtainable by Boolean processing is difficult to control, and the retrieved items are not ranked in any presumed order of importance to the user population. In the vector processing model of retrieval, the retrieved items are easily ranked in decreasing order of the query-record similarity, but the queries themselves are unstructured and expressed as simple sets of weighted index terms. A new, extended Boolean information retrieval system is introduced which is intermediate between the Boolean system of query processing and the vector processing model. The query structure inherent in the Boolean system is preserved, while at the same time weighted terms may be incorporated into both queries and stored documents; the retrieved output can also be ranked in strict similarity order with the user queries. A conventional retrieval system can be modified to make use of the extended system. Laboratory tests indicate that the extended system produces better retrieval output than either the Boolean or the vector processing systems.