A scalable architecture for ordered parallelism
Author(s)
Emer, Joel; Jeffrey, Mark Christopher; Subramanian, Suvinay; Yan, Cong; Sanchez, Daniel
DownloadSanchez_A scalable.pdf (899.9Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
We present Swarm, a novel architecture that exploits ordered irregular parallelism, which is abundant but hard to mine with current software and hardware techniques. In this architecture, programs consist of short tasks with programmer-specified timestamps. Swarm executes tasks speculatively and out of order, and efficiently speculates thousands of tasks ahead of the earliest active task to uncover ordered parallelism. Swarm builds on prior TLS and HTM schemes, and contributes several new techniques that allow it to scale to large core counts and speculation windows, including a new execution model, speculation-aware hardware task management, selective aborts, and scalable ordered commits.
We evaluate Swarm on graph analytics, simulation, and database benchmarks. At 64 cores, Swarm achieves 51--122× speedups over a single-core system, and out-performs software-only parallel algorithms by 3--18×.
Date issued
2015-12Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Proceedings of the 48th International Symposium on Microarchitecture (MICRO-48)
Publisher
Association for Computing Machinery (ACM)
Citation
Jeffrey, Mark C. et al. “A Scalable Architecture for Ordered Parallelism.” Proceedings of the 48th International Symposium on Microarchitecture (MICRO-48), December 5-9, 2015, Waikiki, Hawaii, USA, Association for Computing Machinery (ACM), December 2015 © 2015 Association for Computing Machinery (ACM)
Version: Author's final manuscript
ISSN
978-1-4503-4034-2