On the simulation of large-scale architectures using multiple application abstraction levels

Authors:
Alejandro Rico;Felipe Cabarcas;Carlos Villavieja;Milan Pavlovic;Augusto Vega;Yoav Etsion;Alex Ramirez;Mateo Valero
Affiliations:
Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain;Barcelona Supercomputing Center and Universitat Politècnica de Catalunya, Spain
Venue:
ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
Year:
2012

Citing 29
Cited 0

Efficient trace-driven simulation method for cache performance analysis

SIGMETRICS '90 Proceedings of the 1990 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Cilk: an efficient multithreaded runtime system

PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Trace-driven memory simulation: a survey

ACM Computing Surveys (CSUR)
Wisconsin Wind Tunnel II: A Fast, Portable Parallel Architecture Simulator

IEEE Concurrency
Simics: A Full System Simulation Platform

Computer
SimpleScalar: An Infrastructure for Computer System Modeling

Computer
Can Trace-Driven Simulators Accurately Predict Superscalar Performance?

ICCD '96 Proceedings of the 1996 International Conference on Computer Design, VLSI in Computers and Processors
Using SimPoint for accurate and efficient simulation

SIGMETRICS '03 Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
SMARTS: accelerating microarchitecture simulation via rigorous statistical sampling

Proceedings of the 30th annual international symposium on Computer architecture
Analysis of cache replacement-algorithms

Analysis of cache replacement-algorithms
TurboSMARTS: accurate microarchitecture simulation sampling in minutes

SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Pin: building customized program analysis tools with dynamic instrumentation

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
X10: an object-oriented approach to non-uniform cluster computing

OOPSLA '05 Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset

ACM SIGARCH Computer Architecture News - Special issue: dasCMP'05
Introduction to the cell multiprocessor

IBM Journal of Research and Development - POWER5 and packaging
The M5 Simulator: Modeling Networked Systems

IEEE Micro
The Future of Simulation: A Field of Dreams

Computer
CellSs: a programming model for the cell BE architecture

Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Entering the petaflop era: the architecture and performance of Roadrunner

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Intel threading building blocks

Intel threading building blocks
Available task-level parallelism on the Cell BE

Scientific Programming - High Performance Computing with the Cell Broadband Engine
SlackSim: a platform for parallel simulations of CMPs on CMPs

ACM SIGARCH Computer Architecture News
PSINS: An Open Source Event Tracer and Execution Simulator for MPI Applications

Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Two-phase trace-driven simulation (TPTS): a fast multicore processor architecture simulation approach

Software—Practice & Experience
Comparing last-level cache designs for CMP architectures

Proceedings of the Second International Forum on Next-Generation Multicore/Manycore Technologies
The SARC Architecture

IEEE Micro
Simulating Whole Supercomputer Applications

IEEE Micro
Trace-driven simulation of multithreaded applications

ISPASS '11 Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software
Keynote II: Integrated modeling challenges in extreme-scale computing

ISPASS '11 Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software

Quantified Score

Hi-index	0.00

Visualization

Abstract

Simulation is a key tool for computer architecture research. In particular, cycle-accurate simulators are extremely important for microarchitecture exploration and detailed design decisions, but they are slow and, so, not suitable for simulating large-scale architectures, nor are they meant for this. Moreover, microarchitecture design decisions are irrelevant, or even misleading, for early processor design stages and high-level explorations. This allows one to raise the abstraction level of the simulated architecture, and also the application abstraction level, as it does not necessarily have to be represented as an instruction stream. In this paper we introduce a definition of different application abstraction levels, and how these are employed in TaskSim, a multi-core architecture simulator, to provide several architecture modeling abstractions, and simulate large-scale architectures with hundreds of cores. We compare the simulation speed of these abstraction levels to the ones in existing simulation tools, and also evaluate their utility and accuracy. Our simulations show that a very high-level abstraction, which may be even faster than native execution, is useful for scalability studies on parallel applications; and that just simulating explicit memory transfers, we achieve accurate simulations for architectures using non-coherent scratchpad memories, with just a 25x slowdown compared to native execution. Furthermore, we revisit trace memory simulation techniques, that are more abstract than instruction-by-instruction simulations and provide an 18x simulation speedup.