Executing dynamic data-graph computations deterministically using chromatic scheduling

Kaler, Tim; Hasenplaugh, William; Schardl, Tao B.; Leiserson, Charles E.

dc.contributor.author	Leiserson, Charles E.
dc.contributor.author	Kaler, Timothy
dc.contributor.author	Hasenplaugh, William Cleaburn
dc.contributor.author	Schardl, Tao Benjamin
dc.date.accessioned	2016-01-19T19:25:18Z
dc.date.available	2016-01-19T19:25:18Z
dc.date.issued	2014-06
dc.identifier.isbn	9781450328210
dc.identifier.uri	http://hdl.handle.net/1721.1/100928
dc.description.abstract	A data-graph computation — popularized by such programming systems as Galois, Pregel, GraphLab, PowerGraph, and GraphChi — is an algorithm that performs local updates on the vertices of a graph. During each round of a data-graph computation, an update function atomically modifies the data associated with a vertex as a function of the vertex's prior data and that of adjacent vertices. A dynamic data-graph computation updates only an active subset of the vertices during a round, and those updates determine the set of active vertices for the next round. This paper introduces PRISM, a chromatic-scheduling algorithm for executing dynamic data-graph computations. PRISM uses a vertex-coloring of the graph to coordinate updates performed in a round, precluding the need for mutual-exclusion locks or other nondeterministic data synchronization. A multibag data structure is used by PRISM to maintain a dynamic set of active vertices as an unordered set partitioned by color. We analyze PRISM using work-span analysis. Let G=(V,E) be a degree-Δ graph colored with Χ colors, and suppose that Q⊆V is the set of active vertices in a round. Define size(Q)=[Q] + Σ[subscript v∈Q]deg(v), which is proportional to the space required to store the vertices of Q using a sparse-graph layout. We show that a P-processor execution of PRISM performs updates in Q using O(Χ(lg (Q/Χ)+lgΔ)+ lgP) span and Θ(size(Q)+Χ+P) work. These theoretical guarantees are matched by good empirical performance. We modified GraphLab to incorporate PRISM and studied seven application benchmarks on a 12-core multicore machine. PRISM executes the benchmarks 1.2–2.1 times faster than GraphLab's nondeterministic lock-based scheduler while providing deterministic behavior. This paper also presents PRISM-R, a variation of PRISM that executes dynamic data-graph computations deterministically even when updates modify global variables with associative operations. PRISM-R satisfies the same theoretical bounds as PRISM, but its implementation is more involved, incorporating a multivector data structure to maintain an ordered set of vertices partitioned by color.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant CNS-1017058)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant CCF-1162148)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant CCF-1314547)	en_US
dc.description.sponsorship	Intel Corporation	en_US
dc.description.sponsorship	Foxconn International Holdings Ltd.	en_US
dc.description.sponsorship	National Science Foundation (U.S.). Graduate Research Fellowship	en_US
dc.language.iso	en_US
dc.publisher	Association for Computing Machinery (ACM)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/2612669.2612673	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Executing dynamic data-graph computations deterministically using chromatic scheduling	en_US
dc.type	Article	en_US
dc.identifier.citation	Tim Kaler, William Hasenplaugh, Tao B. Schardl, and Charles E. Leiserson. 2014. Executing dynamic data-graph computations deterministically using chromatic scheduling. In Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures (SPAA '14). ACM, New York, NY, USA, 154-165.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Kaler, Timothy	en_US
dc.contributor.mitauthor	Hasenplaugh, William Cleaburn	en_US
dc.contributor.mitauthor	Schardl, Tao Benjamin	en_US
dc.contributor.mitauthor	Leiserson, Charles E.	en_US
dc.relation.journal	Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures (SPAA '14)	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Kaler, Tim; Hasenplaugh, William; Schardl, Tao B.; Leiserson, Charles E.	en_US
dc.identifier.orcid	https://orcid.org/0000-0001-6915-0216
dc.identifier.orcid	https://orcid.org/0000-0002-3831-8255
dc.identifier.orcid	https://orcid.org/0000-0003-0198-3283
mit.license	OPEN_ACCESS_POLICY	en_US

Files in this item

Name:: Leiserson_Executing dynamic.pdf
Size:: 334.0Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record