Traditionally, information retrieval systems rank documents according to the query terms they contain. However, a document that contains all the query terms is not necessarily relevant to the query: the terms may occur in the same document but in different contexts, expressing separate topics. Lexical cohesion is a characteristic of natural language texts that can be used to determine whether the query terms are used in the same context in a document. In this paper we use a graph-based approach to capture term contexts and estimate the level of lexical cohesion in a document. To evaluate the performance of our system, we compare it against two benchmark systems on three TREC document collections.
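The abstract does not specify how the graph is built or how cohesion is scored, so the following is only a minimal illustrative sketch and not the paper's method. It assumes a simple co-occurrence graph in which each term is linked to the terms within a fixed window around it, and it scores cohesion as the mean Jaccard overlap between the graph neighbourhoods of each pair of query terms. The function names, the `window` parameter, and the Jaccard measure are all assumptions introduced for illustration.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(tokens, window=2):
    """Link each token to the distinct tokens within `window` positions of it.

    Hypothetical helper: the windowed co-occurrence graph is an assumption,
    not the graph construction used in the paper.
    """
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if tokens[j] != term:
                graph[term].add(tokens[j])
    return graph


def cohesion_score(tokens, query_terms, window=2):
    """Mean Jaccard overlap of the neighbourhoods of every query-term pair.

    Returns 0.0 when fewer than two distinct query terms are given.
    No stopword removal or stemming is applied in this sketch.
    """
    graph = build_cooccurrence_graph(tokens, window)
    pairs = list(combinations(sorted(set(query_terms)), 2))
    if not pairs:
        return 0.0
    total = 0.0
    for a, b in pairs:
        na, nb = graph.get(a, set()), graph.get(b, set())
        union = na | nb
        total += len(na & nb) / len(union) if union else 0.0
    return total / len(pairs)


# Both documents contain both query terms, but only the first uses them
# in a shared context.
same_context = "the bank approved the loan".split()
different_context = "bank river water flows fast students need loan money".split()
query = ["bank", "loan"]

print(cohesion_score(same_context, query))
print(cohesion_score(different_context, query))
```

Under these assumptions the first document scores higher than the second, since "bank" and "loan" share their surrounding terms only in the first. A ranking function could combine such a score with a conventional term-matching score, though how the paper does so is not stated in the abstract.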