The grid: blueprint for a new computing infrastructure
The grid: blueprint for a new computing infrastructure
The SDSC storage resource broker
CASCON '98 Proceedings of the 1998 conference of the Centre for Advanced Studies on Collaborative research
The Globus Striped GridFTP Framework and Server
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Future Generation Computer Systems
Grid Computing Solutions for Distributed Repositories of Protein Folding and Unfolding Simulations
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part III
Managing Large Volumes of Distributed Scientific Data
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part III
Exploiting locality for query processing and compression in scientific databases
Proceedings of the Fourth SIGMOD PhD Workshop on Innovative Database Research
Future Generation Computer Systems
Distance histogram computation based on spatiotemporal uniformity in scientific data
Proceedings of the 15th International Conference on Extending Database Technology
A framework for user driven data management
Information Systems
Hi-index | 0.00 |
In computational biomolecular research, large amounts of simulation data are generated to capture the motion of proteins. These massive simulation data can be analysed in a number of ways to reveal the biochemical properties of the proteins. However, the legacy way of storing these data (usually in the laboratory where the simulations have been run) often hinders a wider sharing and easier cross-comparison of simulation results. The data is commonly encoded in a way specific to the simulation package that produced the data and can only be analysed with tools developed specifically for that simulation package. The BioSimGrid platform seeks to provide a solution to these challenges by exploiting the potential of the Grid in facilitating data sharing. By using BioSimGrid either in a scripting or web environment, users can deposit their data and reuse it for analysis. BioSimGrid tools manage the multiple storage locations transparently to the users and provide a set of retrieval and analysis tools for processing the data in a convenient and efficient manner. This paper details the usage and implementation of BioSimGrid using a combination of commercial databases, the Storage Resource Broker and Python scripts, gluing the building blocks together. It introduces a case study of how BioSimGrid can be used for better storage, retrieval and analysis of biomolecular simulation data.