Optimization of query evaluation algorithms
ACM Transactions on Database Systems (TODS)
On the estimation of the number of desired records with respect to a given query
ACM Transactions on Database Systems (TODS)
Estimating block accesses in database organizations: a closed noniterative formula
Communications of the ACM
On estimating block accesses in database organizations
Communications of the ACM
Approximating block accesses in database organizations
Communications of the ACM
Access path selection in a relational database management system
SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
Estimating block transfers and join sizes
SIGMOD '83 Proceedings of the 1983 ACM SIGMOD international conference on Management of data
Top-down statistical estimation on a database
SIGMOD '83 Proceedings of the 1983 ACM SIGMOD international conference on Management of data
Estimating selectivities in data bases
Estimating selectivities in data bases
A self-organizing database system - a different approach to query optimization
A self-organizing database system - a different approach to query optimization
Join processing in database systems with large main memories
ACM Transactions on Database Systems (TODS)
The effect of join selectives on optimal nesting order
ACM SIGMOD Record
Equi-depth multidimensional histograms
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Absolute Bounds on Set Intersection and Union Sizes from Distribution Information
IEEE Transactions on Software Engineering
Statistical profile estimation in database systems
ACM Computing Surveys (CSUR)
Processing aggregate relational queries with hard time constraints
SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
Rule-based query optimization in IRIS
CSC '89 Proceedings of the 17th conference on ACM Annual Computer Science Conference
Estimating the size of generalized transitive closures
VLDB '89 Proceedings of the 15th international conference on Very large data bases
Practical selectivity estimation through adaptive sampling
SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
A Contingency Approach to Estimating Record Selectivities
IEEE Transactions on Software Engineering
Join processing in relational databases
ACM Computing Surveys (CSUR)
Processing time-constrained aggregate queries in CASE-DB
ACM Transactions on Database Systems (TODS)
Multiple join size estimation by virtual domains (extended abstract)
PODS '93 Proceedings of the twelfth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Optimal histograms for limiting worst-case error propagation in the size of join results
ACM Transactions on Database Systems (TODS)
Quick and incomplete responses: the semantic approach
CIKM '93 Proceedings of the second international conference on Information and knowledge management
Using statistical sampling for query optimization in heterogeneous library information systems
CSC '93 Proceedings of the 1993 ACM conference on Computer science
Adaptive selectivity estimation using query feedback
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Balancing histogram optimality and practicality for query result size estimation
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Improved histograms for selectivity estimation of range predicates
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
ACM Computing Surveys (CSUR)
An overview of query optimization in relational systems
PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Approximate medians and other quantiles in one pass and with limited memory
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Random sampling for histogram construction: how much is enough?
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Wavelet-based histograms for selectivity estimation
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Query size estimation by adaptive sampling (extended abstract)
PODS '90 Proceedings of the ninth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Solving Local Cost Estimation Problem for Global Query Optimization in Multidatabase Systems
Distributed and Parallel Databases
Selectivity estimation in spatial databases
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A comparison of selectivity estimators for range queries on metric attributes
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A model of data distribution based on texture analysis
SIGMOD '85 Proceedings of the 1985 ACM SIGMOD international conference on Management of data
Performance evaluation of functional disk system with nonuniform data distribution
DPDS '90 Proceedings of the second international symposium on Databases in parallel and distributed systems
Sampling from databases using B+-trees
Proceedings of the ninth international conference on Information and knowledge management
Optimal and approximate computation of summary statistics for range aggregates
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
STHoles: a multidimensional workload-aware histogram
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Global optimization of histograms
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Executing SQL over encrypted data in the database-service-provider model
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Exploiting statistics on query expressions for optimization
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Top-k selection queries over relational databases: Mapping strategies and performance evaluation
ACM Transactions on Database Systems (TODS)
Cost models for overlapping and multiversion structures
ACM Transactions on Database Systems (TODS)
RHist: adaptive summarization over continuous data streams
Proceedings of the eleventh international conference on Information and knowledge management
Effective Query Size Estimation Using Neural Networks
Applied Intelligence
On Issues of Instance Selection
Data Mining and Knowledge Discovery
Dynamic maintenance of data distribution for selectivity estimation
The VLDB Journal — The International Journal on Very Large Data Bases
Heuristic approach for early separated filter and refinement strategy in spatial query optimization
Journal of Systems and Software
Toward an Accurate Analysis of Range Queries on Spatial Data
IEEE Transactions on Knowledge and Data Engineering
Supporting Efficient Parametric Search of E-Commerce Data: A Loosely-Coupled Solution
EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
On Rectangular Partitionings in Two Dimensions: Algorithms, Complexity, and Applications
ICDT '99 Proceedings of the 7th International Conference on Database Theory
Reducing the Braking Distance of an SQL Query Engine
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Optimal Histograms with Quality Guarantees
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Estimating Block Accessses when Attributes are Correlated
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
Histogram-Based Approximation of Set-Valued Query-Answers
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
VLDB '88 Proceedings of the 14th International Conference on Very Large Data Bases
Dynamic Maintenance of Wavelet-Based Histograms
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Approximate Query Processing: Taming the TeraBytes
Proceedings of the 27th International Conference on Very Large Data Bases
Optimizing Boolean Expressions in Object-Bases
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Random Sampling from Pseudo-Ranked B+ Trees
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
A Blackboard Architecture for Query Optimization in Object Bases
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Universality of Serial Histograms
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
A Cost Model for Clustered Object-Oriented Databases
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Estimation of Query-Result Distribution and its Application in Parallel-Join Load Balancing
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Selectivity Estimation Without the Attribute Value Independence Assumption
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
A One-Pass Algorithm for Accurately Estimating Quantiles for Disk-Resident Data
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
On Linear-Spline Based Histograms
WAIM '02 Proceedings of the Third International Conference on Advances in Web-Age Information Management
Adaptive XML Shredding: Architecture, Implementation, and Challenges
Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
Limiting Result Cardinalities for Multidatabase Queries Using Histograms
BNCOD 18 Proceedings of the 18th British National Conference on Databases: Advances in Databases
Approximate Query Answering In Numerical Databases
SSDBM '96 Proceedings of the Eighth International Conference on Scientific and Statistical Database Management
Wavelet-Based Cost Estimation for Spatial Queries
SSTD '01 Proceedings of the 7th International Symposium on Advances in Spatial and Temporal Databases
On Benchmarking Attribute Cardinality Maps for Database Systems Using the TPC-D Specification
DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
Dynamic Querying of Streaming Data with the dQUOB System
IEEE Transactions on Parallel and Distributed Systems
Efficient Algorithms for Large-Scale Temporal Aggregation
IEEE Transactions on Knowledge and Data Engineering
Multiple-granularity interleaving for piggyback query processing
CASCON '99 Proceedings of the 1999 conference of the Centre for Advanced Studies on Collaborative research
Utilizing histogram information
CASCON '01 Proceedings of the 2001 conference of the Centre for Advanced Studies on Collaborative research
A piggyback method to collect statistics for query optimization in database management systems
CASCON '98 Proceedings of the 1998 conference of the Centre for Advanced Studies on Collaborative research
IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
A multi-dimensional histogram for selectivity estimation and fast approximate query answering
CASCON '03 Proceedings of the 2003 conference of the Centre for Advanced Studies on Collaborative research
An integrated method for estimating selectivities in a multidatabase system
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
A new histogram method for sparse attributes: the averaged rectangular attribute cardinality map
ISICT '03 Proceedings of the 1st international symposium on Information and communication technologies
Query Size Estimation for Joins Using Systematic Sampling
Distributed and Parallel Databases
Evaluating holistic aggregators efficiently for very large datasets
The VLDB Journal — The International Journal on Very Large Data Bases
Effective use of block-level sampling in statistics estimation
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Conditional selectivity for statistics on query expressions
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Journal of Intelligent Information Systems
Medians and beyond: new aggregation techniques for sensor networks
SenSys '04 Proceedings of the 2nd international conference on Embedded networked sensor systems
Structure choices for two-dimensional histogram construction
CASCON '04 Proceedings of the 2004 conference of the Centre for Advanced Studies on Collaborative research
Fast range query estimation by N-level tree histograms
Data & Knowledge Engineering
IMAX: Incremental Maintenance of Schema-Based XML Statistics
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Synopses for query optimization: a space-complexity perspective
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Approximation algorithms for array partitioning problems
Journal of Algorithms
Histograms revisited: when are histograms the best approximation method for aggregates over joins?
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A formal analysis of why heuristic functions work
Artificial Intelligence
Consistently estimating the selectivity of conjuncts of predicates
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Synopses for query optimization: A space-complexity perspective
ACM Transactions on Database Systems (TODS) - Special Issue: SIGMOD/PODS 2004
A framework for query optimization in temporal databases
SSDBM'1990 Proceedings of the 5th international conference on Statistical and Scientific Database Management
Answering top-k queries with multi-dimensional selections: the ranking cube approach
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Consistent selectivity estimation via maximum entropy
The VLDB Journal — The International Journal on Very Large Data Bases
Heuristic design of property maps
DOLAP '06 Proceedings of the 9th ACM international workshop on Data warehousing and OLAP
Query result ranking over e-commerce web databases
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Error minimization in approximate range aggregates
Data & Knowledge Engineering
A random walk approach to sampling hidden databases
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Local and global query optimization mechanisms for relational databases
VLDB '85 Proceedings of the 11th international conference on Very Large Data Bases - Volume 11
A practical approach for efficiently answering top-k relational queries
Decision Support Systems
Sampling from databases using B$^+$-Trees
Intelligent Data Analysis
The history of histograms (abridged)
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Rk-hist: an r-tree based histogram for multi-dimensional selectivity estimation
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Ad-hoc top-k query answering for data streams
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Accurate histogram-based XML summarization
Proceedings of the 2008 ACM symposium on Applied computing
SPARQL basic graph pattern optimization using selectivity estimation
Proceedings of the 17th international conference on World Wide Web
DAWN: an efficient framework of DCT for data with error estimation
The VLDB Journal — The International Journal on Very Large Data Bases
Enhancing histograms by tree-like bucket indices
The VLDB Journal — The International Journal on Very Large Data Bases
H-IQTS: a semantics-aware histogram for compressing categorical OLAP data
IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
A new approach to building histogram for selectivity estimation in query processing optimization
Computers & Mathematics with Applications
Multiplicative synopses for relative-error metrics
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
AMID: Approximation of MultI-measured Data using SVD
Information Sciences: an International Journal
by chance enhancing interaction with large data sets through statistical sampling
Proceedings of the Working Conference on Advanced Visual Interfaces
Enabling OLAP in mobile environments via intelligent data cube compression techniques
Journal of Intelligent Information Systems
A formal analysis of why heuristic functions work
Artificial Intelligence
Fast and effective histogram construction
Proceedings of the 18th ACM conference on Information and knowledge management
Statistical structures for Internet-scale data management
The VLDB Journal — The International Journal on Very Large Data Bases
Warm cache costing: a feedback optimization technique for buffer pool aware costing
Proceedings of the 13th International Conference on Extending Database Technology
Journal of Intelligent Information Systems
Histograms reloaded: the merits of bucket diversity
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
SQL query space and time complexity estimation for multidimensional queries
International Journal of Intelligent Information and Database Systems
Result-size estimation for information-retrieval subqueries
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Parallelizing join computations of SPARQL queries for large semantic web databases
Proceedings of the 2011 ACM Symposium on Applied Computing
Join selectivity re-estimation for repetitive queries in databases
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
Intelligent statistics management in sybase ASE 15.0
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
OTM'05 Proceedings of the 2005 Confederated international conference on On the Move to Meaningful Internet Systems - Volume >Part I
Accelerating large semantic web databases by parallel join computations of SPARQL queries
ACM SIGAPP Applied Computing Review
Selectivity estimation of high dimensional window queries via clustering
SSTD'05 Proceedings of the 9th international conference on Advances in Spatial and Temporal Databases
SSTD'05 Proceedings of the 9th international conference on Advances in Spatial and Temporal Databases
Processing count queries over event streams at multiple time granularities
Information Sciences: an International Journal
Dynamic optimization of queries in pivot-based indexing
Multimedia Tools and Applications
Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches
Foundations and Trends in Databases
Castor: a constraint-based SPARQL engine with active filter processing
ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
Adaptive differentially private histogram of low-dimensional data
PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies
Executing SQL queries over encrypted character strings in the Database-As-Service model
Knowledge-Based Systems
HEDC: a histogram estimator for data in the cloud
Proceedings of the fourth international workshop on Cloud data management
Histograms as statistical estimators for aggregate queries
Information Systems
Fast computation of approximate biased histograms on sliding windows over data streams
Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Entropy-based histograms for selectivity estimation
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Index selection: a query pattern mining based approach
Proceedings of the 2013 Research in Adaptive and Convergent Systems
A sampling algebra for aggregate estimation
Proceedings of the VLDB Endowment
Data & Knowledge Engineering
Range query estimation with data skewness for top-k retrieval
Decision Support Systems
Hi-index | 0.00 |
We present a new method for estimating the number of tuples satisfying a condition of the type attribute rel constant, where rel is one of "=", "", "distribution steps (histograms where buckets, instead of having equal width, have equal height). These distribution steps provide an upper bound on the error when estimating the number of tuples satisfying a condition. The estimation error can be arbitrarily reduced by increasing the number of steps. We analyze desirable conditions that such estimates should satisfy. Based on the distribution steps, we derive a set of estimation formulas which minimize the worst-case error. We also present another set of formulas which reduce the average-case error. Finally, we show how to use sampling to compute a close approximation of the distribution steps very quickly. The major applications of our method are in query optimization and in answering statistical queries.