Show simple item record

dc.contributor.authorZhao, Qin
dc.contributor.authorKoh, David F.
dc.contributor.authorRaza, Syed A.
dc.contributor.authorAmarasinghe, Saman P.
dc.contributor.authorBruening, Derek
dc.contributor.authorWong, Weng-Fai
dc.date.accessioned2011-05-04T19:18:08Z
dc.date.available2011-05-04T19:18:08Z
dc.date.issued2011-03
dc.identifier.isbn978-1-4503-0687-4
dc.identifier.urihttp://hdl.handle.net/1721.1/62586
dc.description.abstractIn today's multi-core systems, cache contention due to true and false sharing can cause unexpected and significant performance degradation. A detailed understanding of a given multi-threaded application's behavior is required to precisely identify such performance bottlenecks. Traditionally, however, such diagnostic information can only be obtained after lengthy simulation of the memory hierarchy. In this paper, we present a novel approach that efficiently analyzes interactions between threads to determine thread correlation and detect true and false sharing. It is based on the following key insight: although the slowdown caused by cache contention depends on factors including the thread-to-core binding and parameters of the memory hierarchy, the amount of data sharing is primarily a function of the cache line size and application behavior. Using memory shadowing and dynamic instrumentation, we implemented a tool that obtains detailed sharing information between threads without simulating the full complexity of the memory hierarchy. The runtime overhead of our approach --- a 5x slowdown on average relative to native execution --- is significantly less than that of detailed cache simulation. The information collected allows programmers to identify the degree of cache contention in an application, the correlation among its threads, and the sources of significant false sharing. Using our approach, we were able to improve the performance of some applications up to a factor of 12x. For other contention-intensive applications, we were able to shed light on the obstacles that prevent their performance from scaling to many cores.en_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machinery / ACM Special Interest Group on Programming Languages./ ACM Special Interest Group in Operating Systems.en_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/1952682.1952688en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleDynamic Cache Contention Detection in Multi-threaded Applicationsen_US
dc.typeArticleen_US
dc.identifier.citationZhao, Qin et al. “Dynamic Cache Contention Detection in Multi-threaded Applications.” Proceedings of the 7th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments - VEE ’11. Newport Beach, California, USA, 2011. 27. Copyright c2011 ACMen_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverAmarasinghe, Saman P.
dc.contributor.mitauthorZhao, Qin
dc.contributor.mitauthorKoh, David F.
dc.contributor.mitauthorRaza, Syed A.
dc.contributor.mitauthorAmarasinghe, Saman P.
dc.relation.journalVEE Proceedings (ACM SIGPLAN SIGOPS International Conference on Virtual Execution Environments)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsZhao, Qin; Koh, David; Raza, Syed; Bruening, Derek; Wong, Weng-Fai; Amarasinghe, Samanen
dc.identifier.orcidhttps://orcid.org/0000-0002-7231-7643
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record