Methodological Review: 'Big data', Hadoop and cloud computing in genomics

Authors:
Aisling O' Driscoll;Jurate Daugelaite;Roy D. Sleator
Affiliations:
Department of Computing, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland;Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland;Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland
Venue:
Journal of Biomedical Informatics
Year:
2013

Citing 12
Cited 0

Taverna: a tool for the composition and enactment of bioinformatics workflows

Bioinformatics
CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications

ESCIENCE '08 Proceedings of the 2008 Fourth IEEE International Conference on eScience
CloudBurst

Bioinformatics
Biodoop: Bioinformatics on Hadoop

ICPPW '09 Proceedings of the 2009 International Conference on Parallel Processing Workshops
GPU-BLAST

Bioinformatics
SEAL

Bioinformatics
Gene set analysis in the cloud

Bioinformatics
FX

Bioinformatics
Hadoop-BAM

Bioinformatics
SOAP3

Bioinformatics
Eoulsan

Bioinformatics
BlueSNP

Bioinformatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Since the completion of the Human Genome project at the turn of the Century, there has been an unprecedented proliferation of genomic sequence data. A consequence of this is that the medical discoveries of the future will largely depend on our ability to process and analyse large genomic data sets, which continue to expand as the cost of sequencing decreases. Herein, we provide an overview of cloud computing and big data technologies, and discuss how such expertise can be used to deal with biology's big data sets. In particular, big data technologies such as the Apache Hadoop project, which provides distributed and parallelised data processing and analysis of petabyte (PB) scale data sets will be discussed, together with an overview of the current usage of Hadoop within the bioinformatics community.