Order preserving linear hashing using dynamic key statistics
PODS '86 Proceedings of the fifth ACM SIGACT-SIGMOD symposium on Principles of database systems
Balanced multidimensional extendible hash tree
PODS '86 Proceedings of the fifth ACM SIGACT-SIGMOD symposium on Principles of database systems
The BANG file: A new kind of grid file
SIGMOD '87 Proceedings of the 1987 ACM SIGMOD international conference on Management of data
Equi-depth multidimensional histograms
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
File organization for database design
File organization for database design
Statistical profile estimation in database systems
ACM Computing Surveys (CSUR)
Query optimization in a memory-resident domain relational calculus database system
ACM Transactions on Database Systems (TODS)
A linear-time probabilistic counting algorithm for database applications
ACM Transactions on Database Systems (TODS)
On the propagation of errors in the size of join results
SIGMOD '91 Proceedings of the 1991 ACM SIGMOD international conference on Management of data
The Grid File: An Adaptable, Symmetric Multikey File Structure
ACM Transactions on Database Systems (TODS)
Extendible hashing—a fast access method for dynamic files
ACM Transactions on Database Systems (TODS)
System R: relational approach to database management
ACM Transactions on Database Systems (TODS)
A detailed statistical model for relational query optimization
ACM '85 Proceedings of the 1985 ACM annual conference on The range of computing : mid-80's perspective: mid-80's perspective
Database Design
Access path selection in a relational database management system
SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
The K-D-B-tree: a search structure for large multidimensional dynamic indexes
SIGMOD '81 Proceedings of the 1981 ACM SIGMOD international conference on Management of data
Accurate estimation of the number of tuples satisfying a condition
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Selectivity Estimation Using Homogeneity Measurement
Proceedings of the Sixth International Conference on Data Engineering
A Mapping Function for the Directory of a Multidimensional Extendible Hashing
VLDB '84 Proceedings of the 10th International Conference on Very Large Data Bases
Estimating Block Accessses when Attributes are Correlated
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
The Multilevel Grid File - A Dynamic Hierarchical Multidimensional File Structure
Proceedings of the Second International Symposium on Database Systems for Advanced Applications
Multi-dimensional selectivity estimation using compressed histogram information
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Transformation-based spatial join
Proceedings of the eighth international conference on Information and knowledge management
Estimating nested selectivity in object-oriented databases
Proceedings of the ninth international conference on Information and knowledge management
A New Indexing Scheme for Content-Based Image Retrieval
Multimedia Tools and Applications
Spatial Join Processing Using Corner Transformation
IEEE Transactions on Knowledge and Data Engineering
A Region Splitting Strategy for Physical Database Design of Multidimensional File Organizations
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Integrating the UB-Tree into a Database System Kernel
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
2D-CHI: A Tunable Two-Dimensional Class Hierarchy Index for Object-Oriented Databases
COMPSAC '00 24th International Computer Software and Applications Conference
Wavelet-Based Cost Estimation for Spatial Queries
SSTD '01 Proceedings of the 7th International Symposium on Advances in Spatial and Temporal Databases
An aggregation algorithm using a multidimensional file in multidimensional OLAP
Information Sciences: an International Journal
Estimating nested selectivity in object-oriented and object-relational databases
Information and Software Technology
A one-pass aggregation algorithm with the optimal buffer size in multidimensional OLAP
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Performance of TPR*-Trees for Predicting Future Positions of Moving Objects in U-Cities
KES-AMSTA '07 Proceedings of the 1st KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Approximate indexing in road network databases
Proceedings of the 2009 ACM symposium on Applied Computing
Dimension transform based efficient event filtering for symmetric publish/subscribe system
DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
Music plagiarism detection using melody databases
KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part III
An efficient phantom protection method for multi-dimensional index structures
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Hi-index | 0.00 |
We propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Good estimation of selectivity is important for query optimization and physical database design. Our method employs the multilevel grid file (MLGF) for accurate estimation of multidimensional data distribution. The MLGF is a dynamic, hierarchical, balanced, multidimensional file structure that gracefully adapts to nonuniform and correlated distributions. We show that the MLGF directory naturally represents a multidimensional data distribution. We then extend it for further refinement and present the selectivity estimation method based on the MLGF. Extensive experiments have been performed to test the accuracy of selectivity estimation. The results show that estimation errors are very small independent of distributions, even with correlated and/or highly skewed ones. Finally, we analyze the cause of errors in estimation and investigate the effects of various parameters on the accuracy of estimation.