Efficient Organization of Large Multidimensional

  • Authors:
  • Sunita Sarawagi;Michael Stonebraker

  • Affiliations:
  • -;-

  • Venue:
  • Efficient Organization of Large Multidimensional
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

Large multidimensional arrays are widely used in scientific and engineering database applications. In this paper, we present methods of organizing arrays to make their access on secondary and tertiary memory devices fast and efficient. We have developed four techniques for doing this: (1) storing the array in multidimensional "chunks" to minimize the number of blocks fetched, (2) reordering the chunked array to minimize seek distance between accessed blocks, (3) maintaining redundant copies of the array, each organized for a different chunk size and ordering and (4) partitioning the array onto platters of a tertiary memory device so as to minimize the number of platter switches. Our measurements on real data sets obtained from global change scientists demonstrate that accesses on arrays organized using the above techniques are often an order of magnitude faster than on the original unoptimized data.