ParTriCluster: A Scalable Parallel Algorithm for Gene Expression Analysis

  • Authors:
  • Renata Araujo;Guilherme Trielli;Gustavo Orair;Wagner Meira Jr.;Renato Ferreira;Dorgival Guedes

  • Affiliations:
  • Universidade Federal de Minas Gerais, Brazil;Universidade Federal de Minas Gerais, Brazil;Universidade Federal de Minas Gerais, Brazil;Universidade Federal de Minas Gerais, Brazil;Universidade Federal de Minas Gerais, Brazil;Universidade Federal de Minas Gerais, Brazil

  • Venue:
  • SBAC-PAD '06 Proceedings of the 18th International Symposium on Computer Architecture and High Performance Computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analyzing gene expression patterns is becoming a highly relevant task in the Bioinformatics area. This analysis makes it possible to determine the behavior patterns of genes under various conditions, a fundamental information for treating diseases, among other applications. A recent advance in this area is the Tricluster algorithm, which is the first algorithm capable of determining 3D clusters, that is, it determines clusters of sets of genes that behave similarly in a set of samples and set of timestamps. However, while biological experiments collect an increasing amount of data to be analyzed and correlated, the triclustering problem is NP-Complete, and its parallelization seems to be an essential step towards obtaining feasible solutions. In this work we propose and evaluate the implementation of a parallel version of the Tricluster algorithm using the filter-labeledstream paradigm supported by the Anthill parallel programming environment. The results show that our parallelization scales linearly with the data size. Further, the parallelization strategy is applicable to any depth-first searches.