| dc.contributor.author | Yu, Xiangyao | |
| dc.contributor.author | Hughes, Christopher J. | |
| dc.contributor.author | Satish, Nadathur | |
| dc.contributor.author | Devadas, Srinivas | |
| dc.date.accessioned | 2017-12-01T22:03:16Z | |
| dc.date.available | 2017-12-01T22:03:16Z | |
| dc.date.issued | 2015-12 | |
| dc.identifier.isbn | 9781450340342 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/112347 | |
| dc.description.abstract | Machine learning, graph analytics and sparse linear algebra-based applications are dominated by irregular memory accesses resulting from following edges in a graph or non-zero elements in a sparse matrix. These accesses have little temporal or spatial locality, and thus incur long memory stalls and large bandwidth requirements. A traditional streaming or striding prefetcher cannot capture these irregular access patterns.
A majority of these irregular accesses come from indirect patterns of the form A[B[i]]. We propose an efficient hardware indirect memory prefetcher (IMP) to capture this access pattern and hide latency. We also propose a partial cacheline accessing mechanism for these prefetches to reduce the network and DRAM bandwidth pressure from the lack of spatial locality.
Evaluated on 7 applications, IMP shows 56% speedup on average (up to 2.3×) compared to a baseline 64 core system with streaming prefetchers. This is within 23% of an idealized system. With partial cacheline accessing, we see another 9.4% speedup on average (up to 46.6%). | en_US |
| dc.description.sponsorship | Intel Science and Technology Center for Big Data | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Computing Machinery (ACM) | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1145/2830772.2830807 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT Web Domain | en_US |
| dc.title | IMP: Indirect Memory Prefetcher | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Yu, Xiangyao, Christopher J. Hughes, Nadathur Satish, and Srinivas Devadas. “IMP: Indirect Memory Prefetcher” Proceedings of the 48th International Symposium on Microarchitecture - MICRO-48 (2015). | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Yu, Xiangyao | |
| dc.contributor.mitauthor | Devadas, Srinivas | |
| dc.relation.journal | Proceedings of the 48th International Symposium on Microarchitecture - MICRO-48 | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Yu, Xiangyao; Hughes, Christopher J.; Satish, Nadathur; Devadas, Srinivas | en_US |
| dspace.embargo.terms | N | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0003-4317-3457 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-8253-7714 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |