Show simple item record

dc.contributor.author: Shim, Keun Sup
dc.contributor.author: Lis, Mieszko
dc.contributor.author: Khan, Omer
dc.contributor.author: Devadas, Srinivas
dc.date.accessioned: 2012-09-24T18:49:52Z
dc.date.available: 2012-09-24T18:49:52Z
dc.date.issued: 2012-01
dc.identifier.uri: http://hdl.handle.net/1721.1/73130
dc.description.abstract: Chip-multiprocessors (CMPs) have become the mainstream chip design in recent years; for scalability reasons, designs with high core counts tend towards tiled CMPs with physically distributed shared caches. This naturally leads to a Non-Uniform Cache Architecture (NUCA) design, where on-chip access latencies depend on the physical distances between requesting cores and the home cores where the data is cached. Improving data locality is thus key to performance, and several studies have addressed this problem using data replication and data migration. In this paper, we consider another mechanism, hardware-level thread migration. This approach, we argue, can better exploit shared data locality for NUCA designs by effectively replacing multiple round-trip remote cache accesses with a smaller number of migrations. High migration costs, however, make it crucial to use thread migrations judiciously; we therefore propose a novel, on-line prediction scheme which decides, at the instruction level, whether to perform a remote access (as in traditional NUCA designs) or a thread migration. For a set of parallel benchmarks, our thread migration predictor improves performance by 18% on average and at best by 2.3X over the standard NUCA design that only uses remote accesses. [en_US]
dc.language.iso: en_US
dc.relation.isversionof: http://projects.csail.mit.edu/caos/caos_2012.pdf [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike 3.0 [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/3.0/ [en_US]
dc.source: MIT web domain [en_US]
dc.title: Judicious Thread Migration When Accessing Distributed Shared Caches [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Shim, Keun Sup, Mieszko Lis, Omer Khan, and Srinivas Devadas. "Judicious Thread Migration When Accessing Distributed Shared Caches." In Proceedings of the Third Workshop on Computer Architecture and Operating System Co-design (CAOS), January 25, 2012, Paris, France. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.contributor.approver: Devadas, Srinivas
dc.contributor.mitauthor: Devadas, Srinivas
dc.contributor.mitauthor: Shim, Keun Sup
dc.contributor.mitauthor: Lis, Mieszko
dc.contributor.mitauthor: Khan, Omer
dc.relation.journal: Proceedings of the Third Workshop on Computer Architecture and Operating System Co-design (CAOS), 2012 [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
dspace.orderedauthors: Shim, Keun Sup; Lis, Mieszko; Khan, Omer; Devadas, Srinivas [en_US]
dc.identifier.orcid: https://orcid.org/0000-0001-8253-7714
dc.identifier.orcid: https://orcid.org/0000-0001-5490-2323
mit.license: OPEN_ACCESS_POLICY [en_US]
mit.metadata.status: Complete

