Improving query response time in scientific databases using data aggregation -a case study

Authors:
W. Lehner;T. Ruf;M. Teschke
Affiliations:
-;-;-
Venue:
DEXA '96 Proceedings of the 7th International Workshop on Database and Expert Systems Applications
Year:
1996

Citing 0
Cited 6

Data Mining in a Multidimensional Environment

ADBIS '99 Proceedings of the Third East European Conference on Advances in Databases and Information Systems
Dealing with Complex Reports in OLAP Applications

DaWaK '99 Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery
Mining information for constructing materialised views

International Journal of Information and Communication Technology
A query answering greedy algorithm for selecting materialized views

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume Part II
Materialized views selection for answering queries

ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
Materialised view construction in data warehouse for decision making

International Journal of Business Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Although most state-of-the-art database systems have no inherent limitations w.r.t. the amount of data they can handle, the huge data quantities typically found in scientific database applications often exceed the feasibility level from a practical point of view when query performance is the issue. One theoretically well-known concept of improving query response time in scientific database applications is using the categorization and classification facilities often found in scientific computing domains for storing data aggregations that allow to substitute expensive access to raw data by the use of stored aggregated values. The results of an empirical performance study carried out in the application domain of market research are presented which substantiate the practical importance of such work. Using real market research data, it is shown that query response time can be shortened in an order of magnitude if a proper data aggregation concept is used. If the data aggregates are designed properly, the overhead of generating and managing materializations of data aggregates is by far outweighed by the improved query performance in realistic scenarios.