Guided self-scheduling: A practical scheduling scheme for parallel supercomputers
IEEE Transactions on Computers
Bioinformatics
Enabling Large-Scale Bioinformatics Data Analysis with Cloud Computing
ISPA '12 Proceedings of the 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications
Hi-index | 0.00 |
Technological advances in biological and biomedical data acquisition are creating mountains of data. Existing legacy applications are unable to process this data without using new strategies. However, some workloads in bioinformatics are easily parallelized by splitting the data, running legacy applications in parallel and then join the partial results into one final result. In this paper, we present Bio-Cirrus, a software package which facilitates this process. Our software consists of a user-friendly client (jORCA) for accessing Web Services and enacting workflows, and a module (Mr. Cirrus) for processing the data with a map/reduce style approach. Bio-Cirrus binaries and documentation are freely available at http://www.bitlab-es.com/cloud under the Creative Commons Attribution-No Derivative Works 2.5 Spain License and its source code is available under request. (GPL v3 license).