Exploring optimization and caching for efficient collection operations

Authors:
Venkata Krishna Nerella;Swetha Surapaneni;Sanjay K. Madria;Thomas Weigert
Affiliations:
Department of Computer Science, Missouri University of Science and Technology, Rolla, USA;Department of Computer Science, Missouri University of Science and Technology, Rolla, USA;Department of Computer Science, Missouri University of Science and Technology, Rolla, USA;Department of Computer Science, Missouri University of Science and Technology, Rolla, USA
Venue:
Automated Software Engineering
Year:
2014

Citing 43
Cited 0

An incremental access method for ViewCache: concept, algorithms, and cost analysis

ACM Transactions on Database Systems (TODS)
Optimization of dynamic query evaluation plans

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Query execution techniques for caching expensive methods

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Materialized view maintenance and integrity constraint checking: trading space for time

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Query optimization

ACM Computing Surveys (CSUR)
Query-based debugging of object-oriented programs

Proceedings of the 12th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
An overview of query optimization in relational systems

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Efficient mid-query re-optimization of sub-optimal query execution plans

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Caching multidimensional queries using chunks

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Least expected cost query optimization: an exercise in utility

PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Self-tuning histograms: building histograms without looking at data

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
View indexing in relational databases

ACM Transactions on Database Systems (TODS)
A middleware system which intelligently caches query results

IFIP/ACM International Conference on Distributed systems platforms
Iterative dynamic programming: a new class of query optimization algorithms

ACM Transactions on Database Systems (TODS)
Materialized view selection and maintenance using multi-query optimization

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Selectivity estimation using probabilistic models

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Making views self-maintainable for data warehousing

DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
Fast incremental maintenance of approximate histograms

ACM Transactions on Database Systems (TODS)
Dynamic Query Optimization in Rdb/VMS

Proceedings of the Ninth International Conference on Data Engineering
Query Folding

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Performance Issues in Incremental Warehouse Maintenance

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Parametric Query Optimization

VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Semantic Data Caching and Replacement

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Semantic caching of Web queries

The VLDB Journal — The International Journal on Very Large Data Bases
Heuristic and randomized optimization for the join ordering problem

The VLDB Journal — The International Journal on Very Large Data Bases
A predicate-based caching scheme for client-server database architectures

The VLDB Journal — The International Journal on Very Large Data Bases
Query processing and optimization in Oracle Rdb

The VLDB Journal — The International Journal on Very Large Data Bases
Answering queries using views: A survey

The VLDB Journal — The International Journal on Very Large Data Bases
Predictive caching and prefetching of query results in search engines

WWW '03 Proceedings of the 12th international conference on World Wide Web
Adaptive Caching for Continuous Queries

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
LINQ: reconciling object, relations and XML in the .NET framework

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Cost-aware WWW proxy caching algorithms

USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Imperative self-adjusting computation

Proceedings of the 35th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Data access history cache and associated data prefetching mechanisms

Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Static query result caching revisited

Proceedings of the 17th international conference on World Wide Web
Caching and incrementalisation in the java query language

Proceedings of the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications
Progressive Parametric Query Optimization

IEEE Transactions on Knowledge and Data Engineering
An experimental analysis of self-adjusting computation

ACM Transactions on Programming Languages and Systems (TOPLAS)
Exploring Query Optimization in Programming Codes by Reducing Run-Time Execution

COMPSAC '10 Proceedings of the 2010 IEEE 34th Annual Computer Software and Applications Conference
Cost-Aware Strategies for Query Result Caching in Web Search Engines

ACM Transactions on the Web (TWEB)
Performance Improvement for Collection Operations Using Join Query Optimization

COMPSAC '11 Proceedings of the 2011 IEEE 35th Annual Computer Software and Applications Conference
Efficient object querying for java

ECOOP'06 Proceedings of the 20th European conference on Object-Oriented Programming
Exploring caching for efficient collection operations

ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many large programs operate on collection types. Extensive libraries are available in many programming languages, such as the C++ Standard Template Library, which make programming with collections convenient. Extending programming languages to provide collection queries as first class constructs in the language would not only allow programmers to write queries explicitly in their programs but it would also allow compilers to leverage the wealth of experience available from the database domain to optimize such queries. This paper describes an approach to reduce the run time of programs involving explicit collection queries by performing run time query optimization that is effective for single runs of a program. In addition, it also leverages a cache to store previously computed results. The proposed approach relies on histograms built from the data at run time to estimate the selectivity of joins and predicates in order to construct query plans. Information from earlier executions of the same query during run time is leveraged during the construction of the query plans, even when the data has changed between these executions. An effective cache policy is also determined for caching the results of join (sub) queries. The cache is maintained incrementally, when the underlying collections change, and use of the cache space is optimized by a cache replacement policy. Our approach has been implemented within the Java Query Language (JQL) framework using AspectJ. Our approach demonstrated that its run time query optimization in integration with caching sub query result significantly improves the run time of programs with explicit queries over equivalent programs performing collection operations by iterating over those collections. This paper evaluates our approach using synthetic as well as real world Robocode programs by comparing it to JQL as a benchmark. Experimental results show that our approach performs better than the JQL approach with respect to the program run time.