| dc.contributor.author | Krishna, Tushar | |
| dc.contributor.author | Peh, Li-Shiuan | |
| dc.contributor.author | Kwon, Woo Cheol | |
| dc.date.accessioned | 2014-10-02T16:56:05Z | |
| dc.date.available | 2014-10-02T16:56:05Z | |
| dc.date.issued | 2014-03 | |
| dc.identifier.isbn | 9781450323055 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/90541 | |
| dc.description.abstract | Locality has always been a critical factor in on-chip data placement on CMPs as accessing further-away caches has in the past been more costly than accessing nearby ones. Substantial research on locality-aware designs have thus focused on keeping a copy of the data private. However, this complicatesthe problem of data tracking and search/invalidation; tracking the state of a line at all on-chip caches at a directory or performing full-chip broadcasts are both non-scalable and extremely expensive solutions. In this paper, we make the case for Locality-Oblivious Cache Organization (LOCO), a CMP cache organization that leverages the on-chip network to create virtual single-cycle paths between distant caches, thus redefining the notion of locality. LOCO is a clustered cache organization, supporting both homogeneous and heterogeneous cluster sizes, and provides near single-cycle accesses to data anywhere within the cluster, just like a private cache. Globally, LOCO dynamically creates a virtual mesh connecting all the clusters, and performs an efficient global data search and migration over this virtual mesh, without having to resort to full-chip broadcasts or perform expensive directory lookups. Trace-driven and full system simulations running SPLASH-2 and PARSEC benchmarks show that LOCO improves application run time by up to 44.5% over baseline private and shared cache. | en_US |
| dc.description.sponsorship | Semiconductor Research Corporation | en_US |
| dc.description.sponsorship | United States. Defense Advanced Research Projects Agency (Semiconductor Technology Advanced Research Network) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Computing Machinery (ACM) | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1145/2541940.2541976 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | Locality-oblivious cache organization leveraging single-cycle multi-hop NoCs | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Woo-Cheol Kwon, Tushar Krishna, and Li-Shiuan Peh. 2014. Locality-oblivious cache organization leveraging single-cycle multi-hop NoCs. SIGPLAN Not. 49, 4 (February 2014), 715-728. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Kwon, Woo Cheol | en_US |
| dc.contributor.mitauthor | Peh, Li-Shiuan | en_US |
| dc.contributor.mitauthor | Krishna, Tushar | en_US |
| dc.relation.journal | Proceedings of the 19th international conference on Architectural support for programming languages and operating systems (ASPLOS '14) | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Kwon, Woo-Cheol; Krishna, Tushar; Peh, Li-Shiuan | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-2345-5791 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-9010-6519 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |