Enabling 1,000,000-entity simulations on distributed Linux clusters

  • Authors:
  • Gene Wagenbreth;Ke-Thia Yao;Dan M. Davis;Robert F. Lucas;Thomas D. Gottschalk

  • Affiliations:
  • University of Southern California, Marina del Rey, CA;University of Southern California, Marina del Rey, CA;University of Southern California, Marina del Rey, CA;University of Southern California, Marina del Rey, CA;California Institute of Technology, Pasadena, CA

  • Venue:
  • WSC '05 Proceedings of the 37th conference on Winter simulation
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Information Sciences Institute and Caltech are enabling USJFCOM and the Institute for Defense Analyses to conduct entity-level simulation experiments using hundreds of distributed computer nodes on Linux Clusters as a vehicle for simulating millions of JSAF entities. Included below is the experience with the design and implementation of the code that increased scalability, thereby enabling two orders of magnitude growth and the effective use of DoD high-end computers. A typical JSAF experiment generates several terabytes of logged data, which is queried in near-real-time and for months afterward. The amount of logged data and the desired database query performance mandated the redesign of the original logger system's monolithic database, making it distributed and incorporating several advanced concepts. System procedures and practices were established to reliably execute the global-scale simulations, effectively operate the distributed computers, efficiently process and store terabytes of data, and provide straightforward access to the data by analysts.