Fast Computation of Iceberg Dwarf

  • Authors:
  • Xiang Longgang;Feng Yucai

  • Affiliations:
  • Huazhong Univ. of Sci. and Technol., Wuhan, China;Huazhong Univ. of Sci. and Technol., Wuhan, China

  • Venue:
  • SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Iceberg Dwarf (IceDwarf for short) combines thestrength of Iceberg-Cube and Dwarf. It exploits the elegantDwarf structure for cube tuple store and eliminates thoseunsatisfied sub-dwarfs. By only storing nontrivial cubetuples, IceDwarf reduces the size of a dwarf significantly;even Dwarf itself compresses the data cube effectively.We studied how to efficiently compute icedwarfs, anddeveloped a straightforward algorithm (PAC). To furtherimprove the performance, we explored the structure ofDwarf and presented four nice lemmas. Based on theseobservations, we proposed a new algorithm called PWC. Itbuilds the IceDwarf by bottom-up computing all thepartitions of a fact table and inserting them into the Dwarfstructure, enabling Apriori-like pruning and single tuplepartition optimization, and facilitating the detection ofsuffix redundancies. Our performance study showed thatPWC is highly efficient and runs much faster than PAC foricedwarfs, even for computing full dwarfs.