Show simple item record

dc.contributor.authorYu, Xiangyao
dc.contributor.authorHughes, Christopher J.
dc.contributor.authorSatish, Nadathur
dc.contributor.authorDevadas, Srinivas
dc.date.accessioned2017-12-01T22:03:16Z
dc.date.available2017-12-01T22:03:16Z
dc.date.issued2015-12
dc.identifier.isbn9781450340342
dc.identifier.urihttp://hdl.handle.net/1721.1/112347
dc.description.abstractMachine learning, graph analytics and sparse linear algebra-based applications are dominated by irregular memory accesses resulting from following edges in a graph or non-zero elements in a sparse matrix. These accesses have little temporal or spatial locality, and thus incur long memory stalls and large bandwidth requirements. A traditional streaming or striding prefetcher cannot capture these irregular access patterns. A majority of these irregular accesses come from indirect patterns of the form A[B[i]]. We propose an efficient hardware indirect memory prefetcher (IMP) to capture this access pattern and hide latency. We also propose a partial cacheline accessing mechanism for these prefetches to reduce the network and DRAM bandwidth pressure from the lack of spatial locality. Evaluated on 7 applications, IMP shows 56% speedup on average (up to 2.3×) compared to a baseline 64 core system with streaming prefetchers. This is within 23% of an idealized system. With partial cacheline accessing, we see another 9.4% speedup on average (up to 46.6%).en_US
dc.description.sponsorshipIntel Science and Technology Center for Big Dataen_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machinery (ACM)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/2830772.2830807en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT Web Domainen_US
dc.titleIMP: Indirect Memory Prefetcheren_US
dc.typeArticleen_US
dc.identifier.citationYu, Xiangyao, Christopher J. Hughes, Nadathur Satish, and Srinivas Devadas. “IMP: Indirect Memory Prefetcher” Proceedings of the 48th International Symposium on Microarchitecture - MICRO-48 (2015).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorYu, Xiangyao
dc.contributor.mitauthorDevadas, Srinivas
dc.relation.journalProceedings of the 48th International Symposium on Microarchitecture - MICRO-48en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsYu, Xiangyao; Hughes, Christopher J.; Satish, Nadathur; Devadas, Srinivasen_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-4317-3457
dc.identifier.orcidhttps://orcid.org/0000-0001-8253-7714
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record