A General Approach to Compression of Hierarchical Indexes

Authors:
Jukka Teuhola
Affiliations:
-
Venue:
DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
Year:
2001

Citing 15
Cited 0

Text compression

Text compression
The R*-tree: an efficient and robust access method for points and rectangles

SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Compression of correlated bit-vectors

Information Systems
S-tree: a dynamic balanced signature index for office retrieval

Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Managing gigabytes (2nd ed.): compressing and indexing documents and images

Managing gigabytes (2nd ed.): compressing and indexing documents and images
The K-D-B-tree: a search structure for large multidimensional dynamic indexes

SIGMOD '81 Proceedings of the 1981 ACM SIGMOD international conference on Management of data
A class of data structures for associative searching

PODS '84 Proceedings of the 3rd ACM SIGACT-SIGMOD symposium on Principles of database systems
R-trees: a dynamic index structure for spatial searching

SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Compressing Relations and Indexes

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Generalized Search Trees for Database Systems

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Exploiting clustering in inverted file Compression

DCC '96 Proceedings of the Conference on Data Compression
Independent Quantization: An Index Compression Technique for High-Dimensional Data Spaces

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Multidimensional Binary Search Trees in Database Applications

IEEE Transactions on Software Engineering
Arithmetic coding

IBM Journal of Research and Development

Quantified Score

Hi-index	0.00

Visualization

Abstract

Tree-structured indexes typically restrict the search domain level by level, which means that the search information can be encoded more and more compactly on the way down. This simple observation is here formulated as a general principle of index compression. Saving storage space is one advantage, but more important is reduction of disk accesses, because more entries can be packed into a page. The index fan-out can be increased, reducing the average height of the tree. The applicability of compression is studied for several popular one-and multi-dimensional indexes. Experiments with the well-known spatial index, R*-tree, show that with modest assumptions and simple coding, 30-40% reduction of disk accesses is obtainable for intersection queries. Compression of index entries can be used together with other index compaction techniques, such as quantization and pointer list compression.