A full-text retrieval system was designed for the responsa literature, a large corpus of Hebrew legal cases. The unique problems of the database (a mixture of Hebrew, Aramaic and vernaculars; lack of vowels and punctuation; extreme inflection; homographs; and thousands of grammatical variants of any given keyword) dictated the development of new methods. Among them are "grammatical synthesis", which synthesizes all grammatical variants of a given keyword; "Compact KWIC", which gives the user a glimpse of the nature of a search before it is performed; an effective citation index embedded in full-text searches; and, in general, extensive use of both positive and negative feedback within a single search run. A number of searches performed on a relatively small database each gave a recall of 100%; the average precision was 34%. A KWIC of strategic portions of the retrieved documents usually enables quick disposal of non-relevant material.
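As a rough illustration of two ideas mentioned above, the following Python sketch shows a plain keyword-in-context (KWIC) listing and the standard precision and recall measures. It is not the system's "Compact KWIC" or its grammatical synthesis; the function names `kwic` and `precision_recall`, the whitespace tokenization, and the English sample text are all assumptions made for the example.

```python
def kwic(text, keyword, width=3):
    """List each occurrence of `keyword` with up to `width` words of context per side.

    Hypothetical helper: plain KWIC on whitespace-separated words, not the
    paper's "Compact KWIC".
    """
    words = text.split()
    target = keyword.lower()
    lines = []
    for i, word in enumerate(words):
        # Match case-insensitively, ignoring punctuation attached to the word.
        if word.strip('.,;:!?"()').lower() == target:
            left = " ".join(words[max(0, i - width):i])
            right = " ".join(words[i + 1:i + 1 + width])
            lines.append(f"{left} [{word}] {right}".strip())
    return lines


def precision_recall(retrieved, relevant):
    """Return (precision, recall) for two sets of document identifiers."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    sample = "the court ruled that the contract was void"
    for line in kwic(sample, "contract", width=2):
        print(line)
    # Four documents retrieved, of which the two relevant ones are both found.
    print(precision_recall({1, 2, 3, 4}, {1, 2}))
```

A recall of 100% with low precision, as reported above, corresponds to the case where every relevant document is retrieved along with many non-relevant ones; a context listing of this kind is one way a user might discard the latter quickly.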