The Grid File: An Adaptable, Symmetric Multikey File Structure
ACM Transactions on Database Systems (TODS)
Hadoop++: making a yellow elephant run like a cheetah (without it even noticing)
Proceedings of the VLDB Endowment
Eagle-eyed elephant: split-oriented indexing in Hadoop
Proceedings of the 16th International Conference on Extending Database Technology
Hi-index | 0.00 |
In Smart Grid, High-performance analysis of massive meter data is very crucial for electric companies to make decisions. With our observation, these data analysis applications typically involve multidimensional range queries (MDRQ) on meter data. While popular data warehouses for big data, like Hive, can perform complex analysis, but lack efficient index for MDRQ. In this paper, we propose DGFIndex, a dedicated index structure that effectively support MDRQ for massive meter data. Our preliminary experiments show that DGFIndex can save significant disk space than Compact Index in Hive, and almost keeps the same data IO.