We address the problem of assigning nonpartitioned files in a parallel I/O system where file accesses exhibit Poisson arrival rates and fixed service times. We present two new file assignment algorithms, based on open queuing networks, that aim to simultaneously minimize the load imbalance across all disks and the variance of the service time at each disk. We first present an off-line algorithm, Sort Partition, which assigns files with similar access times to the same disk. We then show that, if a perfectly balanced file assignment can be found for a given set of files, Sort Partition finds the one with minimal mean response time. Next, we present an on-line algorithm, Hybrid Partition, which assigns groups of files with similar service times in successive intervals while guaranteeing that the load imbalance at any point does not exceed a given threshold. We report on experiments with synthetic workloads that exhibit skew in file accesses and sizes, and we compare the performance of our new algorithms with that of the vanilla greedy file allocation algorithm.
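The abstract does not spell out either procedure, so the following Python sketch is only an illustration of the off-line idea under stated assumptions, not the authors' algorithm. It assumes each file has a known Poisson access rate and a fixed service time, sorts files by service time, and cuts the sorted list into contiguous groups of roughly equal load (rate times service time). Each disk is then modeled as an M/G/1 queue, and its mean response time is computed with the standard Pollaczek-Khinchine formula. The names `File`, `sort_partition`, `mean_response_time`, and `system_mean_response_time` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class File:
    name: str
    rate: float     # Poisson access rate (requests/second); assumed known
    service: float  # fixed service time per access (seconds)

    @property
    def load(self) -> float:
        # Fraction of one disk's time this file consumes.
        return self.rate * self.service


def sort_partition(files, n_disks):
    """Assumed sketch: sort by service time, then fill disks in order,
    moving to the next disk once the current one holds its load share."""
    ordered = sorted(files, key=lambda f: f.service)
    target = sum(f.load for f in ordered) / n_disks
    disks = [[] for _ in range(n_disks)]
    current, acc = 0, 0.0
    for f in ordered:
        # Small tolerance guards against floating-point rounding.
        if acc >= target - 1e-12 and current < n_disks - 1:
            current += 1
            acc = 0.0
        disks[current].append(f)
        acc += f.load
    return disks


def mean_response_time(disk):
    """M/G/1 mean response time (Pollaczek-Khinchine) for one disk."""
    lam = sum(f.rate for f in disk)
    if lam == 0:
        return 0.0
    es = sum(f.rate * f.service for f in disk) / lam        # E[S]
    es2 = sum(f.rate * f.service ** 2 for f in disk) / lam  # E[S^2]
    rho = lam * es                                          # utilization
    if rho >= 1:
        raise ValueError("disk is overloaded (utilization >= 1)")
    return es + lam * es2 / (2 * (1 - rho))


def system_mean_response_time(disks):
    """Mean response time over all requests, weighted by arrival rate."""
    total = sum(f.rate for d in disks for f in d)
    weighted = sum(sum(f.rate for f in d) * mean_response_time(d) for d in disks)
    return weighted / total
```

As a small made-up example, take two files with rate 2.0 and service time 0.0625 s and two files with rate 0.25 and service time 0.5 s, placed on two disks. Both the sorted grouping and a mixed grouping (one short and one long file per disk) are perfectly load balanced at 25% utilization. Under this model the sorted grouping gives a mean response time of roughly 0.130 s against roughly 0.158 s for the mixed one, because mixing short and long service times on one disk raises the service-time variance seen by every request queued there.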