Compressed Index for Dynamic Text

  • Authors:
  • Wing-Kai Hon;Tak-Wah Lam;Kunihiko Sadakane;Wing-Kin Sung;Siu-Ming Yiu

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • DCC '04 Proceedings of the Conference on Data Compression
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper investigates how to index a text which is subject to updates. Thebest solution in the literature is based on suffix tree using O(n log n) bits ofstorage, where n is the length of the text. It supports finding all occurrences ofa pattern P in O(|P| + occ) time, where occ is the number of occurrences.Eachtext update consists of inserting or deleting a substring of length y and can besupported in O(y + \sqrt n) time. In this paper, we initiate the study of compressedindex using only O(n log |\Sigma|) bits of space, where \Sigma denotes the alphabet. Oursolution supports finding all occurrences of a pattern P in O(|P| log2 n(log\epsilon n + log |\Sigma|) + occlog1+\epsilon n) time, while insertion or deletion of a substring of length y can be done inO((y + \sqrt n) log2+\epsilon n) amortized time, where0 \lt \epsilon \le 1.The corepart of our data structure is based on the recent work on Compressed Suffix Trees(CST) and Compressed Suffix Arrays (CSA).