This paper examines the strategic arrangement of fact data in a data warehouse so that analytical queries can be answered efficiently. Usually, the composite of the foreign keys from the dimension tables is defined as the fact table's primary key. We focus on analytical queries that specify a value for a randomly chosen foreign key. The data needed to answer such a query are typically located in different parts of the disk, thus requiring multiple disk I/Os to read them into memory. We formulate a cost model that expresses the expected time to read the desired data as a function of the disk system's parameters (seek time, rotational latency, and reading speed) and the lengths of the foreign keys. For a predetermined disk page size, we search for an arrangement of the fact data that minimizes the expected time cost. An algorithm is then provided for identifying the most desirable disk page size. Finally, we present a heuristic for answering complex queries that specify values for multiple foreign keys.
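To make the idea of a read-time cost model concrete, the following is a minimal illustrative sketch and not the paper's actual formulation. It assumes a dense fact table (one row per combination of foreign key values) stored in lexicographic order of the keys, and it uses each key's number of distinct values as a stand-in, which may not be what the paper means by the "lengths" of foreign keys. The names `expected_read_time`, `best_order`, `cardinalities`, and `rows_per_page` are hypothetical. It also charges one seek and one rotational delay per contiguous run of matching rows, ignoring the case where several short runs share a page.

```python
from itertools import permutations
from math import ceil, prod


def expected_read_time(order, cardinalities, rows_per_page, seek, latency, transfer):
    """Average time to answer a query that fixes one uniformly chosen key.

    Assumption: rows are sorted lexicographically by the keys in `order`,
    and every combination of key values occurs exactly once.
    """
    total = 0.0
    for pos, _key in enumerate(order):
        # One contiguous run of matching rows per combination of the
        # keys sorted before the queried key.
        runs = prod(cardinalities[k] for k in order[:pos])
        # Each run holds one row per combination of the keys sorted after it.
        rows_per_run = prod(cardinalities[k] for k in order[pos + 1:])
        pages_per_run = ceil(rows_per_run / rows_per_page)
        # Simplification: every run costs a seek, a rotational delay,
        # and a sequential transfer of its pages.
        total += runs * (seek + latency + pages_per_run * transfer)
    return total / len(order)


def best_order(cardinalities, rows_per_page, seek, latency, transfer):
    """Exhaustively pick the key order with the lowest expected read time."""
    return min(
        permutations(cardinalities),
        key=lambda o: expected_read_time(
            o, cardinalities, rows_per_page, seek, latency, transfer
        ),
    )
```

Under these assumptions, comparing `expected_read_time` across key orders shows how the sort order of the fact table trades a single long sequential read for one key against many scattered reads for the others. The exhaustive search in `best_order` is only practical for a handful of keys; it stands in for, and should not be confused with, the search procedure the paper describes.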