Parallelism versus memory allocation in pipelined router forwarding engines

Authors:
Fan Chung;Ronald Graham;George Varghese
Affiliations:
University of California, San Diego;University of California, San Diego;University of California, San Diego
Venue:
Proceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures
Year:
2004

Abstract

A crucial problem in pipelined router forwarding engines is the allocation of memory to the processors in the pipeline. Ideally, the processor memories should be totally separate (i.e., one-port memories) in order to minimize contention; however, this also minimizes memory sharing. Ideal sharing is achieved by using a single shared memory for all processors, but this maximizes contention. In this paper we show that perfect memory sharing can instead be achieved with a collection of *two*-port memories, as long as the number of processors is less than the number of memories. We show that the allocation problem is NP-complete in general, but has a fast approximation algorithm that comes within a factor of 3/2 of optimal. The proof uses a new bin packing model, which is interesting in its own right. Further, for important special cases that arise in practice, the approximation algorithm is in fact optimal. We also describe an incremental memory allocation algorithm that provides good memory utilization while allowing fast updates.
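
The abstract does not spell out the allocation algorithm, so the following is only an illustrative sketch of the kind of constraint involved, not the authors' method. It assumes a simplified model in which every memory has the same integer capacity, each processor has an integer memory demand that may be split across memories, and a two-port memory may serve at most two processors. The function name `allocate`, its parameters, and the next-fit greedy strategy are all assumptions made for this example; no approximation guarantee is claimed for it.

```python
from typing import List, Tuple

# Hypothetical helper: not taken from the paper.
# A "memory" is a list of (processor_index, amount) parts.
Memory = List[Tuple[int, int]]


def allocate(demands: List[int], capacity: int) -> List[Memory]:
    """Greedy next-fit sketch: split each processor's demand across
    equal-capacity memories so that no memory holds parts from more
    than two processors (the assumed two-port constraint)."""
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    memories: List[Memory] = []
    current: Memory = []
    free = 0  # free space left in the currently open memory
    for proc, need in enumerate(demands):
        while need > 0:
            # Open a new memory when the current one is full
            # or already serves two processors.
            if free == 0 or len(current) == 2:
                current = []
                memories.append(current)
                free = capacity
            part = min(need, free)
            current.append((proc, part))
            need -= part
            free -= part
    return memories


if __name__ == "__main__":
    # Three processors needing 3 units each, memories of 4 units.
    for i, mem in enumerate(allocate([3, 3, 3], 4)):
        print(f"memory {i}: {mem}")
```

Under this simplified model, the port limit can force extra memories even when capacity alone would suffice: four processors needing 1 unit each fit in one 4-unit memory by size, but the sketch uses two memories because each may serve only two processors.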