MACE: lossless compression and analysis of microarray images

  • Authors:
  • Robert Bierman;Nidhi Maniyar;Charles Parsons;Rahul Singh

  • Affiliations:
  • San Francisco State University, San Francisco, CA;San Francisco State University, San Francisco, CA;University of Vermont, Burlington, VT;San Francisco State University, San Francisco, CA

  • Venue:
  • Proceedings of the 2006 ACM symposium on Applied computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The ubiquity of microarray expression data in state-of-the-art biology has been well established. The widespread adoption of this technology coupled with the significant volume of image-based experimental data generated per experiment (averaging 40 MB), have led to significant challenges in storage and query-retrieval of primary data from microarray experiments. Research in the yet nascent area of microarray data-compression seeks to address this problem. In this paper, we propose a conceptually novel approach that achieves significantly better lossless compression. Unlike lossy compression, our algorithm is guaranteed against loss of information that may have potential biological relevance. The proposed method supports key operations such as automated grid and spot finding, histogram-based automatic thresholding for spot segmentation, and subsequent foreground and background separation. Based on the proposed approach, we have developed a standardized format for storing microarray data that encapsulates all the relevant information, including both the Cy3 and Cy5 expression images significantly compressed. We have also developed a software application called MACE (Microarray Compression and Extraction application) to compress-decompress microarray data and generate the aforementioned format. Compression-decompression results on a wide class of microarray experiments involving different spot layouts validate the effectiveness of our approach and its potential to significantly address the aforementioned challenges in storage and management of microarray data.