Complete statistical indexing of text by overlapping word fragments

  • Authors:
  • Clinton P. Mah;Raymond J. D'Amore

  • Affiliations:
  • PAR Technology Corporation, New Hartford, NY;PAR Technology Corporation, New Hartford, NY

  • Venue:
  • ACM SIGIR Forum
  • Year:
  • 1983

Quantified Score

Hi-index 0.02

Visualization

Abstract

By using overlapping word fragments to index text, we can combine the best features of the keyword and the full text approaches to document retrieval so as to facilitate searches on any content word. The characteristics of a retrieval system based on word fragment indexing can be precisely predicted from a multinomial model of text. Controlled experiments with two different text collections indicate that such a system can be highly effective under quite general conditions.