Distributed databases principles and systems
Distributed databases principles and systems
Antisampling for Estimation: An Overview
IEEE Transactions on Software Engineering
Database performance evaluation in an indexed file environment
ACM Transactions on Database Systems (TODS)
R* optimizer validation and performance evaluation for local queries
SIGMOD '86 Proceedings of the 1986 ACM SIGMOD international conference on Management of data
Panel: Extensible database systems
SIGMOD '86 Proceedings of the 1986 ACM SIGMOD international conference on Management of data
The effect of join selectives on optimal nesting order
ACM SIGMOD Record
Physical database design for relational databases
ACM Transactions on Database Systems (TODS)
Equi-depth multidimensional histograms
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
An extensible model of selectivity estimation
Information Sciences: an International Journal
Implications of certain assumptions in database performance evauation
ACM Transactions on Database Systems (TODS)
A model of data distribution based on texture analysis
SIGMOD '85 Proceedings of the 1985 ACM SIGMOD international conference on Management of data
Support for repetitive transactions and ad hoc queries in System R
ACM Transactions on Database Systems (TODS)
Query processing in a system for distributed databases (SDD-1)
ACM Transactions on Database Systems (TODS)
Query optimization in star computer networks
ACM Transactions on Database Systems (TODS)
Query Optimization in Database Systems
ACM Computing Surveys (CSUR)
Estimating block accesses in database organizations: a closed noniterative formula
Communications of the ACM
On estimating block accesses in database organizations
Communications of the ACM
Estimating block accesses and number of records in file management
Communications of the ACM
Approximating block accesses in database organizations
Communications of the ACM
Analysis and performance of inverted data base structures
Communications of the ACM
Design of Database Structures
Access path selection in a relational database management system
SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
Estimating block transfers and join sizes
SIGMOD '83 Proceedings of the 1983 ACM SIGMOD international conference on Management of data
Implementation techniques for main memory database systems
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Database evaluation using multiple regression techniques
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Accurate estimation of the number of tuples satisfying a condition
SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Estimating Bucket Accesses: A Practical Approach
Proceedings of the Second International Conference on Data Engineering
Buffering Schemes for Permanent Data
Proceedings of the Second International Conference on Data Engineering
Estimating Block Accessses when Attributes are Correlated
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
R* Optimizer Validation and Performance Evaluation for Distributed Queries
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
The Size of Projections of Relations Satisfying a Functional Dependency
VLDB '82 Proceedings of the 8th International Conference on Very Large Data Bases
The optimization of queries in relational databases
The optimization of queries in relational databases
Estimating selectivities in data bases
Estimating selectivities in data bases
Dynamic query evaluation plans
SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
Optimization Strategies for Relational Queries
IEEE Transactions on Software Engineering
Practical selectivity estimation through adaptive sampling
SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Expert design tools for physical database design
SIGBDP '90 Proceedings of the 1990 ACM SIGBDP conference on Trends and directions in expert systems
Statistical estimators for aggregate relational algebra queries
ACM Transactions on Database Systems (TODS)
On the propagation of errors in the size of join results
SIGMOD '91 Proceedings of the 1991 ACM SIGMOD international conference on Management of data
A Contingency Approach to Estimating Record Selectivities
IEEE Transactions on Software Engineering
Sequential sampling procedures for query size estimation
SIGMOD '92 Proceedings of the 1992 ACM SIGMOD international conference on Management of data
CSC '92 Proceedings of the 1992 ACM annual conference on Communications
Processing time-constrained aggregate queries in CASE-DB
ACM Transactions on Database Systems (TODS)
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
Optimal histograms for limiting worst-case error propagation in the size of join results
ACM Transactions on Database Systems (TODS)
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
On the development of a site selection optimizer for distributed and parallel database systems
CIKM '93 Proceedings of the second international conference on Information and knowledge management
Using statistical sampling for query optimization in heterogeneous library information systems
CSC '93 Proceedings of the 1993 ACM conference on Computer science
Adaptive selectivity estimation using query feedback
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Estimating page fetches for index scans with finite LRU buffers
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Balancing histogram optimality and practicality for query result size estimation
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Improved histograms for selectivity estimation of range predicates
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
ACM Computing Surveys (CSUR)
Iterated DFT based techniques for join size estimation
Proceedings of the seventh international conference on Information and knowledge management
Multi-dimensional selectivity estimation using compressed histogram information
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A comparison of selectivity estimators for range queries on metric attributes
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Optimal histograms for hierarchical range queries (extended abstract)
PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Estimating nested selectivity in object-oriented databases
Proceedings of the ninth international conference on Information and knowledge management
Optimal and approximate computation of summary statistics for range aggregates
PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Global optimization of histograms
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Exploiting constraint-like data characterizations in query optimization
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
ACM Transactions on Database Systems (TODS)
ACM-SE 30 Proceedings of the 30th annual Southeast regional conference
Fast algorithms for hierarchical range histogram construction
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Fuzzy Statistics Estimation in Supporting Multidatabase Query Optimization
Electronic Commerce Research
Dynamic maintenance of data distribution for selectivity estimation
The VLDB Journal — The International Journal on Very Large Data Bases
Estimating page fetches for index scans with finite LRU buffers
The VLDB Journal — The International Journal on Very Large Data Bases
What You Always Wanted to Know About Datalog (And Never Dared to Ask)
IEEE Transactions on Knowledge and Data Engineering
Incremental Implementation Model for Relational Databases with Transaction Time
IEEE Transactions on Knowledge and Data Engineering
Estimating Block Selectivities for Physical Database Design
IEEE Transactions on Knowledge and Data Engineering
IEEE Transactions on Knowledge and Data Engineering
Time-Constrained Query Processing in CASE-DB
IEEE Transactions on Knowledge and Data Engineering
A Hybrid Estimator for Selectivity Estimation
IEEE Transactions on Knowledge and Data Engineering
Block Access Estimation for Clustered Data Using a Finite LRU Buffer
IEEE Transactions on Software Engineering
Learning Transformation Rules for Semantic Query Optimization: A Data-Driven Approach
IEEE Transactions on Knowledge and Data Engineering
Query Merging: Improving Query Subscription Processing in a Multicast Environment
IEEE Transactions on Knowledge and Data Engineering
Optimal Histograms with Quality Guarantees
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Combining Histograms and Parametric Curve Fitting for Feedback-Driven Query Result-size Estimation
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Optimizing Queries on Compressed Bitmaps
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Universality of Serial Histograms
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
A Cost Model for Clustered Object-Oriented Databases
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Scalable and Dynamic Grouping of Continual Queries
ADVIS '02 Proceedings of the Second International Conference on Advances in Information Systems
Compressed Datacubes for fast OLAP Applications
DaWaK '99 Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery
Approximate Query Answering In Numerical Databases
SSDBM '96 Proceedings of the Eighth International Conference on Scientific and Statistical Database Management
Performance Analysis of Database Systems
Performance Evaluation: Origins and Directions
Sing the truth about ad hoc join costs
The VLDB Journal — The International Journal on Very Large Data Bases
Bounding the cardinality of aggregate views through domain-derived constraints
Data & Knowledge Engineering - Special issue: Advances in OLAP
Multiple-granularity interleaving for piggyback query processing
CASCON '99 Proceedings of the 1999 conference of the Centre for Advanced Studies on Collaborative research
A piggyback method to collect statistics for query optimization in database management systems
CASCON '98 Proceedings of the 1998 conference of the Centre for Advanced Studies on Collaborative research
IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
Optimizing Processing of Query Subscription in an WDM Network Environment
ITCC '00 Proceedings of the The International Conference on Information Technology: Coding and Computing (ITCC'00)
Interchanging group-by and join in distributed query processing
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
A simple model of prolog's performance: extensional predicates
CASCON '93 Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing - Volume 2
A new histogram method for sparse attributes: the averaged rectangular attribute cardinality map
ISICT '03 Proceedings of the 1st international symposium on Information and communication technologies
Completeness of integrated information sources
Information Systems - Special issue: Data quality in cooperative information systems
Towards a robust query optimizer: a principled and practical approach
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
A formal analysis of why heuristic functions work
Artificial Intelligence
Sample-Based Quality Estimation of Query Results in Relational Database Environments
IEEE Transactions on Knowledge and Data Engineering
Journal of Intelligent Information Systems
Estimating the output cardinality of partial preaggregation with a measure of clusteredness
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Analytic use of bitmap indices
AIKED'07 Proceedings of the 6th Conference on 6th WSEAS Int. Conf. on Artificial Intelligence, Knowledge Engineering and Data Bases - Volume 6
Analytic-based estimation of query result sizes
AIKED'05 Proceedings of the 4th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering Data Bases
Capturing semantics from bitmap indices for data analysis
SMO'06 Proceedings of the 6th WSEAS International Conference on Simulation, Modelling and Optimization
Scalable multi-query optimization for exploratory queries over federated scientific databases
Proceedings of the VLDB Endowment
Approximate similarity search: A multi-faceted problem
Journal of Discrete Algorithms
A formal analysis of why heuristic functions work
Artificial Intelligence
Information Sciences: an International Journal
Quality-driven query answering for integrated information systems
Quality-driven query answering for integrated information systems
The DataPath system: a data-centric analytic processing engine for large data warehouses
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Propagation of densities of streaming data within query graphs
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Heuristic strategies for the discovery of inclusion dependencies and other patterns
Journal on Data Semantics V
Kernel Weaver: Automatically Fusing Database Primitives for Efficient GPU Computation
MICRO-45 Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
QMapper: a tool for SQL optimization on hive using query rewriting
Proceedings of the 22nd international conference on World Wide Web companion
ACM SIGMOD Record
Hi-index | 0.00 |
A statistical profile summarizes the instances of a database. It describes aspects such as the number of tuples, the number of values, the distribution of values, the correlation between value sets, and the distribution of tuples among secondary storage units. Estimation of database profiles is critical in the problems of query optimization, physical database design, and database performance prediction. This paper describes a model of a database of profile, relates this model to estimating the cost of database operations, and surveys methods of estimating profiles. The operators and objects in the model include build profile, estimate profile, and update profile. The estimate operator is classified by the relational algebra operator (select, project, join), the property to be estimated (cardinality, distribution of values, and other parameters), and the underlying method (parametric, nonparametric, and ad-hoc). The accuracy, overhead, and assumptions of methods are discussed in detail. Relevant research in both the database and the statistics disciplines is incorporated in the detailed discussion.