Improving evolved alphabet using tabu set

  • Authors:
  • Jan Platos;Pavel Kromer

  • Affiliations:
  • Department of Computer Science, FEECS, VSB-Technical University of Ostrava, Ostrava Poruba, Czech Republic;Department of Computer Science, FEECS, VSB-Technical University of Ostrava, Ostrava Poruba, Czech Republic

  • Venue:
  • HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part I
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data compression is very important today and it will be even more important in the future. Textual data use only limited alphabet - total number of used symbols (letters, numbers, diacritics, dots, spaces, etc.). In most languages, letters are joined into syllables and words. Both these approaches has pros and cons, but none of them is the best for any file. This paper describes a variant of algorithm for evolving alphabet from characters and 2-grams, which is optimal for compressed text files. The efficiency of the new variant will be tested on three compression algorithms and a new compression algorithm based on LZ77 will be also used with this new approach.