Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Information Retrieval
Introduction to Information Retrieval
A light-weight summarizer based on language model with relative entropy
Proceedings of the 2009 ACM symposium on Applied Computing
Hi-index | 0.00 |
This article discusses a new document indexing scheme for information retrieval. For a structured (e.g., scientific) document, Pasi et al. proposed varying weights to different sections according to their importance in the document. This concept is extended here to unstructured documents. Each sentence in a document is initially assigned weight (significance in the document) with the help of a summarization technique. Accordingly, the term frequency of a term is decided as the sum of weights of the sentences the term belongs. The method is verified on a real life dataset using leading existing information retrieval models, and its performance has been found to be superior to conventional indexing schemes.