Using the Co-occurrence of Words for Retrieval Weighting

  • Authors:
  • Elke Mittendorf;Bojidar Mateev;Peter Schäuble

  • Affiliations:
  • Systor A6, CH-8048 Zürich, Switzerland. elke.mittendrof@systor.com;Eurospider Information Technology AG, CH-8006 Zürich, Switzerland. mateev@eurospider.ch;Eurospider Information Technology AG, CH-8006 Zürich, Switzerland. schauble@eurospider.ch

  • Venue:
  • Information Retrieval
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

We have applied the well-known Robertson-Sparck Jones weighting to sets of indexing features that are different from word-based features. Our features describe the co-occurrences of words in a window range of predefined size. The experiments have been designed to analyse the value of features that are beyond word-based features but all used retrieval methods can be motivated strictly in the probabilistic framework. Among the several implications of our experiments for weighted retrieval is the surprising result that features that describe the co-occurrences of words in sentence-size or paragraph-size windows are significantly better descriptors than purely word-based indexing features.