Stargazing through a digital veil: managing a large scale sky survey using distributed databases on HPC clusters

  • Authors:
  • Yogesh Simmhan;Catharine van Ingen;Jim Heasley;Alex Szalay

  • Affiliations:
  • University of Southern California, Los Angeles, CA, USA;Microsoft Research, San Francisco, CA, USA;University of Hawaii, Honolulu, HI, USA;The Johns Hopkins University , Baltimore, MD, USA

  • Venue:
  • Proceedings of the first annual workshop on High performance computing meets databases
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

The Sloan Digital Sky Survey established the use of relational databases for the scans and cone searches common to astronomy analyses. The Pan-STARRS project scales up SDSS by melding HPC clusters with hierarchical and spatially partitioned distributed databases to meet the challenge of near realtime handling of the multiple data surveys generated by a GigaPixel telescope. This meld provides job management capabilities on the cluster for scientist query submission as well as the backend data updates and fault management necessary for a system with no traditional backup. This paper describes the Pan-STARRS HPC+database experience, highlights the current focus of our work and where further research is needed.