Can SSDs help reduce random i/os in hash joins?

  • Authors:
  • Liang Huai Yang;Mingchao Liu;Yifan Pan;Weihua Gong;Simon Stannus

  • Affiliations:
  • School of Computer Science & Technology, Zhejiang Univ. of Technology, Hangzhou, China;School of Computer Science & Technology, Zhejiang Univ. of Technology, Hangzhou, China;School of Computer Science & Technology, Zhejiang Univ. of Technology, Hangzhou, China;School of Computer Science & Technology, Zhejiang Univ. of Technology, Hangzhou, China;School of Computing and Information Systems, University of Tasmania, Tasmania, Australia

  • Venue:
  • WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

A hybrid drive combines the features of SSDs and HDDs in the same unit by using SSD as the intermediate cache. In light of its promising features, we propose a new algorithm called CGHJ(Cached Grace Hash Join) for hybrid drives, which reduces hard disk random I/Os that occur in the partitioning phase of traditional Grace hash joins by caching the segments of buckets in the SSD and migrating these segments of each partition contiguously to the HDD. Experiment results show that CGHJ can greatly reduce random disk I/Os in the case of small joining working spaces or larger relations and improve hash join performance.