Search data structures for skewed strings

  • Authors:
  • Pilu Crescenzi; Roberto Grossi; Giuseppe F. Italiano

  • Affiliations:
  • Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Italy; Dipartimento di Informatica, Università di Pisa, Italy; Dipartimento di Informatica, Sistemi e Produzione, Università di Roma "Tor Vergata", Italy

  • Venue:
  • WEA'03: Proceedings of the 2nd International Conference on Experimental and Efficient Algorithms
  • Year:
  • 2003

Abstract

We consider skewed distributions of strings, in which any two strings share a common prefix much longer than would be expected for uniformly distributed (random) strings. This is the case, for instance, for URL addresses, IP addresses, and XML path strings, all of which represent paths in some hierarchical order. Because strings that share a portion of the path have a long common prefix, we need to avoid the time-consuming repeated examination of these common prefixes while handling the linked data structures that store them. For this purpose, we show how to implement search data structures that can operate on strings with long prefixes in common. Despite the simplicity and generality of the method, our experimental study shows that it is quite competitive with several optimized and tuned implementations currently available in the literature.
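
To make the cost of re-examining common prefixes concrete, here is a minimal Python sketch of one well-known way to skip them during a search. It is an illustration only and not the authors' implementation: it uses a sorted array and binary search rather than the linked data structures the abstract refers to, and the names `lcp_from` and `search_sorted` are hypothetical. The idea is that if the pattern is known to share `lo_lcp` characters with a key below the current range and `hi_lcp` characters with a key above it, then every key in between shares at least `min(lo_lcp, hi_lcp)` characters with the pattern, so each comparison can start from that offset.

```python
def lcp_from(a, b, start):
    """Length of the common prefix of a and b, assuming the first
    `start` characters are already known to match."""
    i = start
    n = min(len(a), len(b))
    while i < n and a[i] == b[i]:
        i += 1
    return i


def search_sorted(keys, pattern):
    """Return the index of `pattern` in the sorted list `keys`, or -1.

    Hypothetical helper: tracks how many characters of `pattern` match
    the keys just outside the current range, so comparisons can skip
    the prefix that is already known to be shared.
    """
    lo, hi = 0, len(keys) - 1
    lo_lcp = 0  # chars matched against the key just below `lo` (0 if none)
    hi_lcp = 0  # chars matched against the key just above `hi` (0 if none)
    while lo <= hi:
        mid = (lo + hi) // 2
        key = keys[mid]
        # Every key in [lo, hi] shares at least this many chars with pattern.
        skip = min(lo_lcp, hi_lcp)
        k = lcp_from(key, pattern, skip)
        if k == len(key) and k == len(pattern):
            return mid  # exact match
        if k == len(key) or (k < len(pattern) and key[k] < pattern[k]):
            # key < pattern: continue in the upper half
            lo, lo_lcp = mid + 1, k
        else:
            # key > pattern: continue in the lower half
            hi, hi_lcp = mid - 1, k
    return -1


# Example with keys that share long prefixes (made-up URLs).
urls = sorted([
    "http://example.org/a/b/c/index.html",
    "http://example.org/a/b/c/page1.html",
    "http://example.org/a/b/d/page2.html",
    "http://example.org/a/x/page3.html",
    "http://example.org/z/page4.html",
])
found = search_sorted(urls, "http://example.org/a/b/d/page2.html")
missing = search_sorted(urls, "http://example.org/a/b/c/page9.html")
```

On keys like these, most comparisons after the first few start well past the shared `http://example.org/a/` prefix instead of rescanning it, which is the kind of saving the abstract is concerned with.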