Workload and network-optimized computing systems

Authors:
D. P. LaPotin;S. Daijavad;C. L. Johnson;S. W. Hunter;K. Ishizaki;H. Franke;H. D. Achilles;D. P. Dumarot;N. A. Greco;B. Davari
Affiliations:
IBM Research Division, Austin Research Laboratory, Austin, TX;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Research Division, Rochester, MN;IBM Research, Research Triangle Park, NC;IBM Research Division, IBM Tokyo Research Laboratory, Kanagawa, Japan;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Systems and Technology Group, Integrated Systems Development, Poughkeepsie, NY;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY
Venue:
IBM Journal of Research and Development
Year:
2010

Citing 12
Cited 6

The Future of Microprocessors

Queue - Multiprocessors
BladeCenter networking

IBM Journal of Research and Development - IBM BladeCenter systems
Introduction to the cell multiprocessor

IBM Journal of Research and Development - POWER5 and packaging
Computer Architecture, Fourth Edition: A Quantitative Approach

Computer Architecture, Fourth Edition: A Quantitative Approach
MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Processor Design in 3D Die-Stacking Technologies

IEEE Micro
Patterns for parallel programming

Patterns for parallel programming
3D chip stacking with C4 technology

IBM Journal of Research and Development
Applying Amdahl's other law to the data center

IBM Journal of Research and Development
Exploiting heterogeneous multicore-processor systems for high-performance network processing

IBM Journal of Research and Development
Introduction to the wire-speed processor and architecture

IBM Journal of Research and Development
Wireless network cloud: architecture and system requirements

IBM Journal of Research and Development

Exploiting heterogeneous multicore-processor systems for high-performance network processing

IBM Journal of Research and Development
Introduction to the wire-speed processor and architecture

IBM Journal of Research and Development
Case studies in hardware XPath acceleration

Proceedings of the 4th Annual International Conference on Systems and Storage
Multi-tiered, burstiness-aware bandwidth estimation and scheduling for VBR video flows

Proceedings of the Nineteenth International Workshop on Quality of Service
Reconstructing hardware transactional memory for workload optimized systems

APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
Parallel implementation of external sort and join operations on a multi-core network-optimized system on a chip

ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a recent system-level trend toward the use of massive on-chip parallelism combined with efficient hardware accelerators and integrated networking to enable new classes of applications and computing-systems functionality. This system transition is driven by semiconductor physics and emerging network-application requirements. In contrast to general-purpose approaches, workload and network-optimized computing provides significant cost, performance, and power advantages relative to historical frequency-scaling approaches in a serial computational model. We highlight the advantages of on-chip network optimization that enables efficient computation and new services at the network edge of the data center. Software and application development challenges are presented, and a service-oriented architecture application example is shown that characterizes the power and performance advantages for these systems. We also discuss a roadmap for next-generation systems that proportionally scale with future networking bandwidth growth rates and employ 3-D chip integration methods for design flexibility and modularity.