Fast prefix search in little space, with applications

  • Authors:
  • Djamal Belazzougui;Paolo Boldi;Rasmus Pagh;Sebastiano Vigna

  • Affiliations:
  • Université Paris Diderot, Paris 7, France;Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Italy;IT University of Copenhagen, Denmark;Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Italy

  • Venue:
  • ESA'10 Proceedings of the 18th annual European conference on Algorithms: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A prefix search returns the strings out of a given collection S that start with a given prefix. Traditionally, prefix search is solved by data structures that are also dictionaries, that is, they actually contain the strings in S. For very large collections stored in slow-access memory, we propose extremely compact data structures that solve weak prefix searches--they return the correct result only if some string in S starts with the given prefix. Our data structures for weak prefix search use O(|S| log l) bits in the worst case, where l is the average string length, as opposed to O(|S|l) bits for a dictionary. We show a lower bound implying that this space usage is optimal.