Cache Conscious Clustering C3

  • Authors:
  • Zhen He;Alonso Marquez

  • Affiliations:
  • -;-

  • Venue:
  • DEXA '01 Proceedings of the 12th International Conference on Database and Expert Systems Applications
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Despite over 10 years of research into OODBMS design, performance remains as one of the major problems. I/O reduction has proven to be one of the most effective ways of enhancing performance. Two techniques of improving I/O performance of OODBMS are clustering and buffer replacement. Clustering places objects that are likely to be accessed together onto the same disk page and thus reduces I/O. Buffer replacement involves the selection of a page to be evicted, when the buffer is full. The page evictedid eally shouldb e the page needed furthest in the future. Selection of the correct page for eviction results in a reduction in the total I/O generatedb y the system. These two techniques effect the likelihood of a requested object being memory resident in an interdependent way. This fact has been acknowledged in the existing literature. However despite this acknowledgement no existing work investigates this interdependency. This paper makes the first investigation into this interdependency by exploring the effects of ten different buffer replacement algorithms on the performance of two different clustering algorithms. We developed a new family of clustering algorithms that incorporate cache behavior when performing clustering. We term this new family of clustering algorithms, cache conscious clustering (C3). A particular C3 algorithm (GPC3) was tested against a well known and highly competitive clustering algorithm GGP, andthe results show GPC3 out-performs GGP by upto 40% for popular buffer replacement algorithms such as LRU and CLOCK. These results show for the first time clustering should be approached from a cache conscious point of view.