A performance study of four index structures for set-valued attributes of low cardinality

  • Authors:
  • Sven Helmer;Guido Moerkotte

  • Affiliations:
  • Lehrstuhl für Praktische Informatik III, Universität Mannheim, 68131, Mannheim, Germany;Lehrstuhl für Praktische Informatik III, Universität Mannheim, 68131, Mannheim, Germany

  • Venue:
  • The VLDB Journal — The International Journal on Very Large Data Bases
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abstract.The efficient retrieval of data items on set-valued attributes is an important research topic that has attracted little attention so far. We studied and modified four index structures (sequential signature files, signature trees, extendible signature hashing, and inverted files) for a fast retrieval of sets with low cardinality. We compared the index structures by implementing them and subjecting them to extensive experiments, investigating the influence of query set size, database size, domain size, and data distribution (synthetic and real). The results of the experiments clearly indicate that inverted files exhibit the best overall behavior of all tested index structures.