A combination of trie-trees and inverted files for the indexing of set-valued attributes

  • Authors:
  • Manolis Terrovitis;Spyros Passas;Panos Vassiliadis;Timos Sellis

  • Affiliations:
  • Nat. Technical Univ. Athens;Nat. Technical Univ. Athens;Univ. of Ioannina;Nat. Technical Univ. Athens

  • Venue:
  • CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Set-valued attributes frequently occur in contexts like market-basked analysis and stock market trends. Late research literature has mainly focused on set containment joins and data mining without considering simple queries on set valued attributes. In this paper we address superset, subset and equality queries and we propose a novel indexing scheme for answering them on set-valued attributes. The proposed index superimposes a trie-tree on top of an inverted file that indexes a relation with set-valued data. We show that we can efficiently answer the aforementioned queries by indexing only a subset of the most frequent of the items that occur in the indexed relation. Finally, we show through extensive experiments that our approach outperforms the state of the art mechanisms and scales gracefully as database size grows.