Improvements in clock frequency have traditionally come from technology scaling, but the most recent technology nodes no longer offer such benefits. Instead, parallelism has emerged as the key driver of chip-performance growth. Unfortunately, efficient simultaneous use of on-chip resources is hampered by sequential dependencies, as illustrated by Amdahl's law. Quantifying achievable parallelism in terms of provable mathematical results can help prevent futile programming efforts and guide innovation in computer architecture toward the most significant challenges. To complement Amdahl's law, we focus on stream processing and quantify the performance losses caused by stochastic runtimes. Using the spectral theory of random matrices, we derive new analytical results and validate them with numerical simulations. These results allow us to explore the unique benefits of stochasticity and to show how and when they outweigh its costs for software streams.
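As a rough illustration of the two effects named in the abstract, the sketch below evaluates Amdahl's law and estimates by Monte Carlo the cost of random runtimes in a lock-step parallel step. It is not the spectral analysis of the paper. The function names, the lock-step synchronization model, and the choice of independent exponential runtimes with mean 1 are all assumptions made for this example.

```python
import random


def amdahl_speedup(parallel_fraction, n_cores):
    """Amdahl's law: speedup when only `parallel_fraction` of the work
    is spread evenly over `n_cores`; the rest stays sequential."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / n_cores)


def mean_lockstep_step_time(n_stages, n_steps=20000, seed=0):
    """Monte Carlo estimate of the average time per step when `n_stages`
    run in lock-step and every step waits for the slowest stage.

    Assumption (not from the source): stage runtimes are independent and
    exponentially distributed with mean 1.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_steps):
        # The step finishes only when the slowest stage finishes.
        total += max(rng.expovariate(1.0) for _ in range(n_stages))
    return total / n_steps


if __name__ == "__main__":
    print("Amdahl speedup, 95% parallel, 16 cores:", amdahl_speedup(0.95, 16))
    for n in (1, 2, 4, 8):
        print(n, "stages, mean step time:", mean_lockstep_step_time(n))
```

Under these assumptions the mean step time grows with the number of synchronized stages even though every stage has the same mean runtime. For exponential runtimes the expected maximum of n stages is the n-th harmonic number. This is one simple way to see a performance loss that comes from runtime variability and not from sequential dependencies.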