A Comparative Study of Spatial Indexing Techniques for Multidimensional Scientific Datasets

  • Authors:
  • Beomseok Nam;Alan Sussman

  • Affiliations:
  • University of Maryland, College Park;University of Maryland, College Park

  • Venue:
  • SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Scientific applications that query into very large multi-dimensionaldatasets are becoming more common. Thesedatasets are growing in size every day, and are becomingtruly enormous, making it infeasible to index individualdata elements. We have instead been experimenting withchunking the datasets to index them, grouping data elementsinto small chunks of a fixed, but dataset-specific, sizeto take advantage of spatial locality. While spatial indexingstructures based on R-trees perform reasonably well forthe rectangular bounding boxes of such chunked datasets,other indexing structures based on KDB-trees, such as Hybridtrees, have been shown to perform very well for pointdata. In this paper, we investigate how all these indexingstructures perform for multidimensional scientific datasets,and compare their features and performance with that ofSH-trees, an extension of Hybrid trees, for indexing multi-dimensionalrectangles. Our experimental results show thatthe algorithms for building and searching SH-trees outperformthose for R-trees, R*-trees, and X-trees for both realapplication and synthetic datasets and queries. We showthat the SH-tree algorithms perform well for both low andhigh dimensional data, and that they scale well to high dimensionsboth for building and searching the trees.