RUBRIC: A System for Rule-Based Information Retrieval
IEEE Transactions on Software Engineering - Special issue on COMPSAC 1982 and 1983
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Automatic structuring and retrieval of large text files
Communications of the ACM
On the reuse of past optimal queries
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Experiments on the determination of the relationships between terms
ACM Transactions on Database Systems (TODS)
Extended Boolean information retrieval
Communications of the ACM
Concept Based Retrieval by Minimal Term Sets
ISMIS '99 Proceedings of the 11th International Symposium on Foundations of Intelligent Systems
Set-based model: a new approach for information retrieval
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Toward High-Precision Service Retrieval
IEEE Internet Computing
Set-based vector model: An efficient approach for correlation-based ranking
ACM Transactions on Information Systems (TOIS)
Concept Based Retrieval Using Generalized Retrieval Functions
Fundamenta Informaticae - Intelligent Systems
On enhancing the performance of spam mail filtering system using semantic enrichment
AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Concept Based Retrieval Using Generalized Retrieval Functions
Fundamenta Informaticae - Intelligent Systems
Hi-index | 0.00 |
There is considerable interest in bridging theterminological gap that exists between the way users prefer tospecify their information needs and the way queries are expressed interms of keywords or text expressions that occur in documents. One ofthe approaches proposed for bridging this gap is based ontechnologies for expert systems. The central idea of such anapproach was introduced in the context of a system called Rule BasedInformation Retrieval by Computer (RUBRIC). In RUBRIC, user querytopics (or concepts) are captured in a rule base represented by anAND/OR tree. The evaluation of AND/OR tree is essentially based onminimum and maximum weights of query terms for conjunctions anddisjunctions, respectively. The time to generate the retrieval outputof AND/OR tree for a given query topic is exponential in number ofconjunctions in the DNF expression associated with the query topic.In this paper, we propose a new approach for computing the retrievaloutput. The proposed approach involves preprocessing of the rule baseto generate Minimal Term Sets (MTSs) that speed up the retrievalprocess. The computational complexity of the on-line query evaluationfollowing the preprocessing is polynomial in m. We show that thecomputation and use of MTSs allows a user to choose query topics thatbest suit their needs and to use retrieval functions that yield amore refined and controlled retrieval output than is possible withthe AND/OR tree when document terms are binary. We incorporatep-Norm model into the process of evaluating MTSs to handle the casewhere weights of both documents and query terms are non-binary.