Build less code deliver more science: an experience report on composing scientific environments using component-based and commodity software platforms

  • Authors:
  • Ian Gorton;Yan Liu;Carina Lansing;Todd Elsethagen;Kerstin Kleese van Dam

  • Affiliations:
  • Software Engineering Institute, CMU, Pittsburgh, PA, USA;Concordia University, Montreal, PQ, Canada;Pacific Northwest National Laboratory, Richland, WA, USA;Pacific Northwest National Laboratory, Richland, WA, USA;Pacific Northwest National Laboratory, Richland, WA, USA

  • Venue:
  • Proceedings of the 16th International ACM Sigsoft symposium on Component-based software engineering
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern scientific software is daunting in its diversity and complexity. From massively parallel simulations running on the world's largest supercomputers, to visualizations and user support environments that manage ever growing complex data collections, the challenges for software engineers are plentiful. While high performance simulators are necessarily specialized codes to maximize performance on specific supercomputer architectures, we argue the vast majority of supporting infrastructure, data management and analysis tools can leverage commodity open source and component-based technologies. This approach can significantly drive down the effort and costs of building complex, collaborative scientific user environments, as well as increase their reliability and extensibility. In this paper we describe our experiences in creating an initial user environment for scientists involved in modeling the detailed effects of climate change on the environment of selected geographical regions. Our approach composes the user environment using the Velo scientific knowledge management platform and the MeDICi Integration Framework for scientific workflows. These established platforms leverage component-based technologies and extend commodity open source platforms with abstractions and capabilities that make them amenable for broad use in science. Using this approach we were able to deliver an operational user environment capable of running thousands of simulations in a 7 month period, and achieve significant software reuse.