Logical, internal, and physical reference behavior in CODASYL database systems
ACM Transactions on Database Systems (TODS)
Disk cache—miss ratio analysis and design considerations
ACM Transactions on Computer Systems (TOCS)
Buffer management in relational database systems
ACM Transactions on Database Systems (TODS)
The design of the UNIX operating system
The design of the UNIX operating system
Caching in the Sprite network file system
ACM Transactions on Computer Systems (TOCS)
Diversity in database reference behavior
SIGMETRICS '89 Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Integration of buffer management and query optimization in relational database environment
VLDB '89 Proceedings of the 15th international conference on Very large data bases
Data cache management using frequency-based replacement
SIGMETRICS '90 Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
An approximate analysis of the LRU and FIFO buffer replacement schemes
SIGMETRICS '90 Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Flexible buffer allocation based on marginal gains
SIGMOD '91 Proceedings of the 1991 ACM SIGMOD international conference on Management of data
The design and implementation of a log-structured file system
SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Measurements of a distributed file system
SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
ACM Transactions on Database Systems (TODS)
Analysis of the Periodic Update Write Policy for Disk Cache
IEEE Transactions on Software Engineering
Analysis of the generalized clock buffer replacement scheme for database transaction processing
SIGMETRICS '92/PERFORMANCE '92 Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Analysis of file I/O traces in commercial computing environments
SIGMETRICS '92/PERFORMANCE '92 Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Numerical recipes in C (2nd ed.): the art of scientific computing
Numerical recipes in C (2nd ed.): the art of scientific computing
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
A modeling study of the TPC-C benchmark
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
The LRU-K page replacement algorithm for database disk buffering
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
RAID: high-performance, reliable secondary storage
ACM Computing Surveys (CSUR)
The HP AutoRAID hierarchical storage system
ACM Transactions on Computer Systems (TOCS) - Special issue on operating system principles
Database buffer size investigation for OLTP workloads
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Database performance in the real world: TPC-D and SAP R/3
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Trace-driven memory simulation: a survey
ACM Computing Surveys (CSUR)
Computer architecture (2nd ed.): a quantitative approach
Computer architecture (2nd ed.): a quantitative approach
The art of computer programming, volume 3: (2nd ed.) sorting and searching
The art of computer programming, volume 3: (2nd ed.) sorting and searching
Empirical results on locality in database referencing
SIGMETRICS '85 Proceedings of the 1985 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Sequentiality and prefetching in database systems
ACM Transactions on Database Systems (TODS)
Principles of Optimal Page Replacement
Journal of the ACM (JACM)
A trace-driven analysis of the UNIX 4.2 BSD file system
Proceedings of the tenth ACM symposium on Operating systems principles
Operating system support for database management
Communications of the ACM
A relational model of data for large shared data banks
Communications of the ACM
Operating Systems Theory
Performance analysis of a relational data base management system
SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
Access path selection in a relational database management system
SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
Buffer management based on return on consumption in a multi-query environment
The VLDB Journal — The International Journal on Very Large Data Bases
Directory Reference Patterns in Hierarchical File Systems
IEEE Transactions on Knowledge and Data Engineering
Starburst Mid-Flight: As the Dust Clears
IEEE Transactions on Knowledge and Data Engineering
A Buffer Management Model For Use In Predicting Overall Database System Performance
Proceedings of the Fifth International Conference on Data Engineering
Database Access Characterization for Buffer Hit Prediction
Proceedings of the Ninth International Conference on Data Engineering
Predictive Load Control for Flexible Buffer Allocation
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Adaptive Database Buffer Allocation Using Query Feedback
VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
A Mechanism for Managing the Buffer Pool in a Relational Database System Using the Hot Set Model
VLDB '82 Proceedings of the 8th International Conference on Very Large Data Bases
A performance comparison of RAID-5 and log-structured arrays
HPDC '95 Proceedings of the 4th IEEE International Symposium on High Performance Distributed Computing
Projecting the Performance of Decision Support Workloads on Systems with Smart Storage (SmartSTOR)
ICPADS '00 Proceedings of the Seventh International Conference on Parallel and Distributed Systems
I/O Reference Behavior of Production Database Workloads and the TPC
I/O Reference Behavior of Production Database Workloads and the TPC
Analysis of the Characteristics of Production Database Workloads and
Analysis of the Characteristics of Production Database Workloads and
Efficient Analysis of Caching Systems
Efficient Analysis of Caching Systems
Decision-Support Workload Characteristics on a Clustered Database Server from the OS Perspective
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
The performance impact of I/O optimizations and disk improvements
IBM Journal of Research and Development
Characteristics of I/O traffic in personal computer and server workloads
IBM Systems Journal
SVL: Storage Virtualization Engine Leveraging DBMS Technology
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Mining block correlations to improve storage performance
ACM Transactions on Storage (TOS)
Reducing Server Data Traffic Using a Hierarchical Computation Model
IEEE Transactions on Parallel and Distributed Systems
The automatic improvement of locality in storage systems
ACM Transactions on Computer Systems (TOCS)
SARC: sequential prefetching in adaptive replacement cache
ATEC '05 Proceedings of the annual conference on USENIX Annual Technical Conference
Design of flash-based DBMS: an in-page logging approach
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
WOW: wise ordering for writes - combining spatial and temporal locality in non-volatile caches
FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
A new approach to dynamic self-tuning of database buffers
ACM Transactions on Storage (TOS)
Context-aware prefetching at the storage server
ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Application of Queueing Network Models in the Performance Evaluation of Database Designs
Electronic Notes in Theoretical Computer Science (ENTCS)
STOW: a spatially and temporally optimized write caching algorithm
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Adaptive logging for mobile device
Proceedings of the VLDB Endowment
Survey and analysis of disk scheduling methods
ACM SIGARCH Computer Architecture News
Database system performance evaluation models: A survey
Performance Evaluation
The constraints of magnetic versus flash disk capabilities in big data analysis
Proceedings of the 2nd Workshop on Architectures and Systems for Big Data
Performance modeling and analysis of a database server with write-heavy workload
ESOCC'12 Proceedings of the First European conference on Service-Oriented and Cloud Computing
Analyzing and improving table space allocation
ADC '11 Proceedings of the Twenty-Second Australasian Database Conference - Volume 115
Characterization of incremental data changes for efficient data protection
USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
An equation-based Heap Sizing Rule
Performance Evaluation
Hi-index | 0.00 |
As improvements in processor performance continue to far outpace improvements in storage performance, I/O is increasingly the bottleneck in computer systems, especially in large database systems that manage huge amoungs of data. The key to achieving good I/O performance is to thoroughly understand its characteristics. In this article we present a comprehensive analysis of the logical I/O reference behavior of the peak productiondatabase workloads from ten of the world's largest corporations. In particular, we focus on how these workloads respond to different techniques for caching, prefetching, and write buffering. Our findings include several broadly applicable rules of thumb that describe how effective the various I/O optimization techniques are for the production workloads. For instance, our results indicate that the buffer pool miss ratio tends to be related to the ratio of buffer pool size to data size by an inverse square root rule. A similar fourth root rule relates the write miss ratio and the ration of buffer pool size to data size.In addition, we characterize the reference characteristics of workloads similar to the Transaction Processing Performance Council (TPC) benchmarks C (TPC-C) and D(TPC-D), which are de facto standard performance measures for online transaction processing (OLTP) systems and decision support systems (DSS), respectively. Since benchmarks such as TPC-C and TPC-D can only be used effectively if their strengths and limitations are understood, a major focus of our analysis is to identify aspects of the benchmarks that stress the system differently than the production workloads. We discover that for the most part, the reference behavior of TPC-C and TPC-D fall within the range of behavior exhibited by the production workloads. However, there are some noteworthy exceptions that affect well-known I/O optimization techniques such as caching (LRU is further from the optimal for TPC-C, while there is little sharing of pages between transactions for TPC-D), prefetching (TPC-C exhibits no significant sequentiality), and write buffering (write buffering is lees effective for the TPC benchmarks). While the two TPC benchmarks generally complement one another in reflecting the characteristics of the production workloads, there remain aspects of the real workloads that are not represented by either of the benchmarks.