Remote Store Programming: Mechanisms and Performance

Wentzlaff, David; Agarwal, Anant; Hoffmann, Henry

dc.contributor.advisor	Anant Agarwal
dc.contributor.author	Wentzlaff, David	en_US
dc.contributor.author	Agarwal, Anant	en_US
dc.contributor.author	Hoffmann, Henry	en_US
dc.contributor.other	Computer Architecture	en_US
dc.date.accessioned	2009-05-06T15:30:08Z
dc.date.available	2009-05-06T15:30:08Z
dc.date.issued	2009-05-05
dc.identifier.uri	http://hdl.handle.net/1721.1/45509
dc.description.abstract	This paper presents remote store programming (RSP). This paradigm combines usability and efficiency through the exploitation of a simple hardware mechanism, the remote store, which can easily be added to existing multicores. Remote store programs are marked by fine-grained, one-sided communication, which results in a stream of data flowing from the registers of a sending process to the cache of a destination process. The RSP model and its hardware implementation trade a relatively high store latency for a low load latency because loads are more common than stores, and it is easier to tolerate store latency than load latency. This paper demonstrates the performance advantages of remote store programming by comparing it to both cache-coherent shared memory and direct memory access (DMA) based approaches using the TILEPro64 processor. The paper studies two applications: a two-dimensional Fast Fourier Transform (2D FFT) and an H.264 encoder for high-definition video. For a 2D FFT using 56 cores, RSP is 1.64x faster than DMA and 4.4x faster than shared memory. For an H.264 encoder using 40 cores, RSP achieves the same performance as DMA and 4.8x the performance of shared memory. Along with these performance advantages, RSP requires the least hardware support of the three. RSP's features, performance, and hardware simplicity make it well suited to the embedded processing domain.	en_US
dc.format.extent	11 p.	en_US
dc.relation.ispartofseries	MIT-CSAIL-TR-2009-017	en_US
dc.subject	Programming models	en_US
dc.subject	Multicore architecture	en_US
dc.title	Remote Store Programming: Mechanisms and Performance	en_US

Files in this item

Name: MIT-CSAIL-TR-2009-017.pdf
Size: 377.7Kb
Format: PDF

Name: MIT-CSAIL-TR-2009-017.ps
Size: 2.085Mb
Format: Postscript

This item appears in the following Collection(s)

CSAIL Technical Reports (July 1, 2003 - present)
