Show simple item record

dc.contributor.authorKurian, George
dc.contributor.authorShi, Qingchuan
dc.contributor.authorDevadas, Srinivas
dc.contributor.authorKhan, Omer
dc.date.accessioned2018-05-11T16:55:28Z
dc.date.available2018-05-11T16:55:28Z
dc.date.issued2016-03
dc.date.submitted2015-10
dc.identifier.isbn978-1-4673-9524-3
dc.identifier.urihttp://hdl.handle.net/1721.1/115325
dc.description.abstractData access in modern processors contributes significantly to the overall performance and energy consumption. Traditionally, data is distributed among the cores through an on-chip cache hierarchy, and each producer/consumer accesses data through its private level-1 cache relying on the cache coherence protocol for consistency. Recently, remote access, a mechanism that reduces energy and latency through word-level access to data anywhere on chip has been proposed. Remote access does not replicate data in the private caches, and thereby removes the need for expensive cache line invalidations or updates. Researchers have implemented remote access as an auxiliary mechanism in cache coherence to improve efficiency. Unfortunately, stronger memory models, such as Intel's TSO, require strict ordering among the loads and stores. This introduces serialization penalties for data classified to be accessed remotely, which hampers each core's ability to optimally exploit memory level parallelism. In this paper we propose a novel timestamp-based scheme to detect memory consistency violations. The proposed scheme enables remote accesses to be issued and completed in parallel while continuously detecting whether any ordering violations have occurred, and rolling back the pipeline state (if needed). We implement our scheme for the locality-aware cache coherence protocol that uses remote access as an auxiliary mechanism for efficient data access. Our evaluation using a 64-core multicore processor with out-of-order speculative cores shows that the proposed technique improves completion time by 26% and energy by 20% over a state-of-the-art cache management scheme.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Grant CCF-1452327)en_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/PACT.2015.45en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT Web Domainen_US
dc.titleOSPREY: Implementation of Memory Consistency Models for Cache Coherence Protocols involving Invalidation-Free Data Accessen_US
dc.typeArticleen_US
dc.identifier.citationKurian, George, et al. "OSPREY: Implementation of Memory Consistency Models for Cache Coherence Protocols Involving Invalidation-Free Data Access."2015 International Conference on Parallel Architecture and Compilation (PACT), 18-25 October, 2015, San Francisco, California, IEEE, 2015, pp. 392–405.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorKurian, George
dc.contributor.mitauthorDevadas, Srinivas
dc.relation.journal2015 International Conference on Parallel Architecture and Compilation (PACT)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsKurian, George; Shi, Qingchuan; Devadas, Srinivas; Khan, Omeren_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0001-8253-7714
mit.licenseOPEN_ACCESS_POLICYen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record