A graph based approach to estimating lexical cohesion

Authors:
Hayrettin Gürkök;Murat Karamuftuoglu;Markus Schaal
Affiliations:
Bilkent University, Ankara, Turkey;Bilkent University, Ankara, Turkey;Bilkent University, Ankara, Turkey
Venue:
Proceedings of the second international symposium on Information interaction in context
Year:
2008

Abstract

Traditionally, information retrieval systems rank documents according to the query terms they contain. However, even if a document contains all of the query terms, this does not guarantee that it is relevant to the query: the terms may occur in the same document but in different contexts, expressing separate topics. Lexical cohesion, a characteristic of natural language texts, can be used to determine whether the query terms are used in the same context in a document. In this paper we use a graph-based approach to capture term contexts and estimate the level of lexical cohesion in a document. To evaluate the performance of our system, we compare it against two benchmark systems using three TREC document collections.
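
The abstract does not say how the graph is built or scored, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes that terms are linked when they co-occur within a fixed token window, and that cohesion between query terms can be approximated by the Jaccard overlap of their graph neighbourhoods. The names build_cooccurrence_graph, cohesion_score and the window parameter are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(tokens, window=3):
    """Link each pair of distinct terms occurring within `window` tokens (assumed context model)."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        # Look ahead at the next `window` tokens only; edges are undirected.
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph


def cohesion_score(tokens, query_terms, window=3):
    """Mean Jaccard overlap of the neighbourhoods of every query-term pair (assumed measure)."""
    graph = build_cooccurrence_graph(tokens, window)
    # Keep only the distinct query terms that actually appear in the graph.
    present = [t for t in dict.fromkeys(query_terms) if t in graph]
    pairs = list(combinations(present, 2))
    if not pairs:
        return 0.0
    total = 0.0
    for a, b in pairs:
        # Ignore the direct edge between the pair; compare shared context terms only.
        neighbours_a = graph[a] - {b}
        neighbours_b = graph[b] - {a}
        union = neighbours_a | neighbours_b
        total += len(neighbours_a & neighbours_b) / len(union) if union else 0.0
    return total / len(pairs)


if __name__ == "__main__":
    document = "interest rates rose after the central bank changed its lending rates".split()
    print(cohesion_score(document, ["bank", "rates"]))
```

Under these assumptions, a document in which the query terms share many surrounding terms scores close to 1, while one in which they appear in unrelated passages scores close to 0. Such a score could be combined with a conventional term-matching rank, although the abstract does not state how the paper does this.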