IMP: Indirect Memory Prefetcher

Yu, Xiangyao; Hughes, Christopher J.; Satish, Nadathur; Devadas, Srinivas

dc.contributor.author	Yu, Xiangyao
dc.contributor.author	Hughes, Christopher J.
dc.contributor.author	Satish, Nadathur
dc.contributor.author	Devadas, Srinivas
dc.date.accessioned	2017-12-01T22:03:16Z
dc.date.available	2017-12-01T22:03:16Z
dc.date.issued	2015-12
dc.identifier.isbn	9781450340342
dc.identifier.uri	http://hdl.handle.net/1721.1/112347
dc.description.abstract	Machine learning, graph analytics and sparse linear algebra-based applications are dominated by irregular memory accesses resulting from following edges in a graph or non-zero elements in a sparse matrix. These accesses have little temporal or spatial locality, and thus incur long memory stalls and large bandwidth requirements. A traditional streaming or striding prefetcher cannot capture these irregular access patterns. A majority of these irregular accesses come from indirect patterns of the form A[B[i]]. We propose an efficient hardware indirect memory prefetcher (IMP) to capture this access pattern and hide latency. We also propose a partial cacheline accessing mechanism for these prefetches to reduce the network and DRAM bandwidth pressure from the lack of spatial locality. Evaluated on 7 applications, IMP shows 56% speedup on average (up to 2.3×) compared to a baseline 64 core system with streaming prefetchers. This is within 23% of an idealized system. With partial cacheline accessing, we see another 9.4% speedup on average (up to 46.6%).	en_US
dc.description.sponsorship	Intel Science and Technology Center for Big Data	en_US
dc.language.iso	en_US
dc.publisher	Association for Computing Machinery (ACM)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/2830772.2830807	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT Web Domain	en_US
dc.title	IMP: Indirect Memory Prefetcher	en_US
dc.type	Article	en_US
dc.identifier.citation	Yu, Xiangyao, Christopher J. Hughes, Nadathur Satish, and Srinivas Devadas. “IMP: Indirect Memory Prefetcher” Proceedings of the 48th International Symposium on Microarchitecture - MICRO-48 (2015).	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Yu, Xiangyao
dc.contributor.mitauthor	Devadas, Srinivas
dc.relation.journal	Proceedings of the 48th International Symposium on Microarchitecture - MICRO-48	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Yu, Xiangyao; Hughes, Christopher J.; Satish, Nadathur; Devadas, Srinivas	en_US
dspace.embargo.terms	N	en_US
dc.identifier.orcid	https://orcid.org/0000-0003-4317-3457
dc.identifier.orcid	https://orcid.org/0000-0001-8253-7714
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: IMP Indirect memory prefetcher.pdf
Size:: 812.7Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record