GPFS: A Shared-Disk File System for Large Computing Clusters
FAST '02 Proceedings of the Conference on File and Storage Technologies
Replica Selection in the Globus Data Grid
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Matchmaking: Distributed Resource Management for High Throughput Computing
HPDC '98 Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing
Amazon S3 for science grids: a viable solution?
DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
The cost of doing science on the cloud: the Montage example
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Elastic management of cluster-based services in the cloud
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
The Eucalyptus Open-Source Cloud-Computing System
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Cost-benefit analysis of Cloud Computing versus desktop grids
IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
Virtual Infrastructure Management in Private and Hybrid Clouds
IEEE Internet Computing
Bridging the Gap between Desktop and the Cloud for eScience Applications
CLOUD '10 Proceedings of the 2010 IEEE 3rd International Conference on Cloud Computing
Early observations on the performance of Windows Azure
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
ElasTraS: an elastic transactional data store in the cloud
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
In search of an API for scalable file systems: under the table or above it?
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Cloud analytics: do we really need to reinvent the storage stack?
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
A Model and Decision Procedure for Data Storage in Cloud Computing
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Scalia: an adaptive scheme for efficient multi-cloud storage
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Cloud service selection based on variability modeling
ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
A declarative recommender system for cloud infrastructure services selection
GECON'12 Proceedings of the 9th international conference on Economics of Grids, Clouds, Systems, and Services
Lifecycle management of service-based applications on multi-clouds: a research roadmap
Proceedings of the 2013 international workshop on Multi-cloud applications and federated clouds
An approach for constructing private storage services as a unified fault-tolerant system
Journal of Systems and Software
Hi-index | 0.00 |
We present a new, automated approach to selecting the cloud storage service that best matches each dataset of a given application. Our approach relies on a machine readable description of the capabilities (features, performance, cost, etc.) of each storage system, which is processed together with the user's specified requirements. The result is an assignment of datasets to storage systems, that has multiple advantages: the resulting match meets performance requirements and estimates cost; users express their storage needs using high-level concepts rather than reading the documentation from different cloud providers and manually calculating or estimating a solution. Together with our storage capabilities XML schema we present different use cases for our system that evaluate the Amazon, Azure and local clouds under several scenarios: choosing cloud storage services for a new application, estimating cost savings by switching storage services, estimating the evolution over time of cost and performance and providing information in an Amazon EC2 to Eucalyptus migration. Our application is able to process each use case in under 70 ms; it is also possible to easily expand it to account for new features and data requirements.