Queue - Multiprocessors
IBM Journal of Research and Development - IBM BladeCenter systems
Introduction to the cell multiprocessor
IBM Journal of Research and Development - POWER5 and packaging
Computer Architecture, Fourth Edition: A Quantitative Approach
Computer Architecture, Fourth Edition: A Quantitative Approach
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Patterns for parallel programming
Patterns for parallel programming
3D chip stacking with C4 technology
IBM Journal of Research and Development
Applying Amdahl's other law to the data center
IBM Journal of Research and Development
Exploiting heterogeneous multicore-processor systems for high-performance network processing
IBM Journal of Research and Development
Introduction to the wire-speed processor and architecture
IBM Journal of Research and Development
Wireless network cloud: architecture and system requirements
IBM Journal of Research and Development
Exploiting heterogeneous multicore-processor systems for high-performance network processing
IBM Journal of Research and Development
Introduction to the wire-speed processor and architecture
IBM Journal of Research and Development
Case studies in hardware XPath acceleration
Proceedings of the 4th Annual International Conference on Systems and Storage
Multi-tiered, burstiness-aware bandwidth estimation and scheduling for VBR video flows
Proceedings of the Nineteenth International Workshop on Quality of Service
Reconstructing hardware transactional memory for workload optimized systems
APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Hi-index | 0.00 |
This paper describes a recent system-level trend toward the use of massive on-chip parallelism combined with efficient hardware accelerators and integrated networking to enable new classes of applications and computing-systems functionality. This system transition is driven by semiconductor physics and emerging network-application requirements. In contrast to general-purpose approaches, workload and network-optimized computing provides significant cost, performance, and power advantages relative to historical frequency-scaling approaches in a serial computational model. We highlight the advantages of on-chip network optimization that enables efficient computation and new services at the network edge of the data center. Software and application development challenges are presented, and a service-oriented architecture application example is shown that characterizes the power and performance advantages for these systems. We also discuss a roadmap for next-generation systems that proportionally scale with future networking bandwidth growth rates and employ 3-D chip integration methods for design flexibility and modularity.