Organization and maintenance of large ordered indexes

Authors:
Rudolf Bayer;E. McCreight
Affiliations:
Dept. of Computer Science, Purdue University, Lafayette, Ind.;Palo Alto Research Center, 3180 Porter Drive, Palo Alto, Calif.
Venue:
Software pioneers
Year:
2002

Citing 2
Cited 5

Use of tree structures for processing files

Communications of the ACM
Information retrieval: information storage and retrieval using AVL trees

ACM '65 Proceedings of the 1965 20th national conference

Elastic Cloud Caches for Accelerating Service-Oriented Computations

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
KISS-Tree: smart latch-free in-memory indexing on modern architectures

DaMoN '12 Proceedings of the Eighth International Workshop on Data Management on New Hardware
Accelerating physics in large, continuous virtual environments

Concurrency and Computation: Practice & Experience
A high-throughput in-memory index, durable on flash-based SSD: insights into the winning solution of the SIGMOD programming contest 2011

ACM SIGMOD Record
Stochastically Balancing Trees for File and Database Systems

International Journal of Green Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

Organization and maintenance of an index for a dynamic random access file is considered. It is assumed that the index must be kept on some pseudo random access backup store like a disc or a drum. The index organization described allows retrieval, insertion, and deletion of keys in time proportional to logk I where I is the size of the index and k is a device dependent natural number such that the performance of the scheme becomes near optimal. Storage utilization is at least 50% but generally much higher. The pages of the index are organized in a special data-structure, so-called B-trees. The scheme is analyzed, performance bounds are obtained, and a near optimal k is computed. Experiments have been performed with indexes up to 100000 keys. An index of size 15000 (100000) can be maintained with an average of 9 (at least 4) transactions per second on an IBM 360/44 with a 2311 disc.