Opportunistic data structures with applications

  • Authors:
  • P. Ferragina;G. Manzini

  • Affiliations:
  • -;-

  • Venue:
  • FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
  • Year:
  • 2000

Quantified Score

Hi-index 0.01

Visualization

Abstract

We address the issue of compressing and indexing data. We devise a data structure whose space occupancy is a function of the entropy of the underlying data set. We call the data structure opportunistic since its space occupancy is decreased when the input is compressible and this space reduction is achieved at no significant slowdown in the query performance. More precisely, its space occupancy is optimal in an information-content sense because text T[1,u] is stored using O(H/sub k/(T))+o(1) bits per input symbol in the worst case, where H/sub k/(T) is the kth order empirical entropy of T (the bound holds for any fixed k). Given an arbitrary string P[1,p], the opportunistic data structure allows to search for the occurrences of P in T in O(p+occlog/sup /spl epsiv//u) time (for any fixed /spl epsiv/