A security architecture for computational grids
CCS '98 Proceedings of the 5th ACM conference on Computer and communications security
Condor: a distributed job scheduler
Beowulf cluster computing with Linux
The UK e-science core programme and the grid
Future Generation Computer Systems - Grid computing: Towards a new computing infrastructure
GridLab: a grid application toolkit and testbed
Future Generation Computer Systems - Grid computing: Towards a new computing infrastructure
Condor-G: A Computation Management Agent for Multi-Institutional Grids
HPDC '01 Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing
An Online Credential Repository for the Grid: MyProxy
HPDC '01 Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing
The Grid 2: Blueprint for a New Computing Infrastructure
The Grid 2: Blueprint for a New Computing Infrastructure
The Inca Test Harness and Reporting Framework
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Future Generation Computer Systems - Special issue: Advanced grid technologies
Advanced resource connector middleware for lightweight computational Grids
Future Generation Computer Systems - Special section: Information engineering and enterprise architecture in distributed computing environments
Future Generation Computer Systems
Integration of multiple middlewares on a single computing resource
Future Generation Computer Systems
A recursive architecture for hierarchical grid resource management
Future Generation Computer Systems
A decentralized model for scheduling independent tasks in Federated Grids
Future Generation Computer Systems
Grid broker selection strategies using aggregated resource information
Future Generation Computer Systems
An integrated security-aware job scheduling strategy for large-scale computational grids
Future Generation Computer Systems
Federation of TeraGrid, EGEE and OSG infrastructures through a metascheduler
Future Generation Computer Systems
Hybrid Computing-Where HPC meets grid and Cloud Computing
Future Generation Computer Systems
Service-oriented grid computing for SAFORAH
HPCS'09 Proceedings of the 23rd international conference on High Performance Computing Systems and Applications
Evaluation of a utility computing model based on the federation of grid infrastructures
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Enabling Interoperability among Grid Meta-Schedulers
Journal of Grid Computing
Hi-index | 0.00 |
The present paper discusses the design and application of GridX1, a computational grid project which uses shared resources at several Canadian research institutions. The infrastructure of GridX1 is built using off-the-shelf Globus Toolkit 2 middleware, a MyProxy credential server, and a resource broker based on Condor-G to manage the distributed computing environment. The broker-based job scheduling and management functionality are exposed as a Globus GRAM job service. Resource brokering is based on the Condor matchmaking mechanism, whereby job and resource attributes are expressed as ClassAds, with the attributes Requirements and Rank being used to define respectively the constraints and preferences that the matched entity must meet. Various strategies for ranking resources are presented, including an Estimated-Waiting-Time (EWT) algorithm, a throttled load balancing strategy, and a novel external ranking strategy based on data location. One of the unique features is a mechanism which transparently presents the GridX1 resources as a single compute element to the LHC Computing Grid (LCG), based at the CERN Laboratory in Geneva. This interface was used during the ATLAS data challenge 2 to federate the Canadian resources into the LCG without the overhead of maintaining separate LCG sites. Further, the BaBar particle physics simulation has been adapted to execute on GridX1 and resulted in a simplified management of the production. The usage of the throttled EWT and load balancing strategies combined with external data ranking was found to be very effective in improving efficiency and reducing the job failure rate.