Show simple item record

dc.contributor.author: Chan, Cy
dc.contributor.author: Wong, Yee Lok
dc.contributor.author: Edelman, Alan
dc.contributor.author: Ansel, Jason Andrew
dc.contributor.author: Amarasinghe, Saman P.
dc.date.accessioned: 2014-03-28T13:14:04Z
dc.date.available: 2014-03-28T13:14:04Z
dc.date.issued: 2009-11
dc.identifier.isbn: 9781605587448
dc.identifier.uri: http://hdl.handle.net/1721.1/85940
dc.description.abstract: Algorithmic choice is essential in any problem domain to realizing optimal computational performance. Multigrid is a prime example: not only is it possible to make choices at the highest grid resolution, but a program can switch techniques as the problem is recursively attacked on coarser grid levels to take advantage of algorithms with different scaling behaviors. Additionally, users with different convergence criteria must experiment with parameters to yield a tuned algorithm that meets their accuracy requirements. Even after a tuned algorithm has been found, users often have to start all over when migrating from one machine to another. We present an algorithm and autotuning methodology that address these issues in a near-optimal and efficient manner. The freedom of independently tuning both the algorithm and the number of iterations at each recursion level results in an exponential search space of tuned algorithms that have different accuracies and performances. To search this space efficiently, our autotuner utilizes a novel dynamic programming method to build efficient tuned algorithms from the bottom up. The results are customized multigrid algorithms that invest targeted computational power to yield the accuracy required by the user. The techniques we describe allow the user to automatically generate tuned multigrid cycles of different shapes targeted to the user's specific combination of problem, hardware, and accuracy requirements. These cycle shapes dictate the order in which grid coarsening and grid refinement are interleaved with both iterative methods, such as Jacobi or Successive Over-Relaxation, and direct methods, which tend to have superior performance for small problem sizes. The need to choose among all of these methods brings the issue of variable accuracy to the forefront. Not only must the autotuning framework compare different possible multigrid cycle shapes against each other, but it also needs the ability to compare tuned cycles against both direct and (non-multigrid) iterative methods. We address this problem by using an accuracy metric for measuring the effectiveness of tuned cycle shapes and making comparisons over all algorithmic types based on this common yardstick. In our results, we find that the flexibility to trade performance versus accuracy at all levels of recursive computation enables us to achieve excellent performance on a variety of platforms compared to algorithmically static implementations of multigrid. Our implementation uses PetaBricks, an implicitly parallel programming language where algorithmic choices are exposed in the language. The PetaBricks compiler uses these choices to analyze, autotune, and verify the PetaBricks program. These language features, most notably the autotuner, were key in enabling our implementation to be clear, correct, and fast. [en_US]
dc.description.sponsorship: National Science Foundation (U.S.) (Award CCF-0832997) [en_US]
dc.description.sponsorship: GigaScale Systems Research Center [en_US]
dc.language.iso: en_US
dc.publisher: Association for Computing Machinery (ACM) [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1145/1654059.1654065 [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ [en_US]
dc.source: MIT web domain [en_US]
dc.title: Autotuning multigrid with PetaBricks [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Cy Chan, Jason Ansel, Yee Lok Wong, Saman Amarasinghe, and Alan Edelman. 2009. Autotuning multigrid with PetaBricks. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC '09). ACM, New York, NY, USA, Article 5, 12 pages. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Mathematics [en_US]
dc.contributor.mitauthor: Chan, Cy [en_US]
dc.contributor.mitauthor: Ansel, Jason Andrew [en_US]
dc.contributor.mitauthor: Wong, Yee Lok [en_US]
dc.contributor.mitauthor: Amarasinghe, Saman P. [en_US]
dc.contributor.mitauthor: Edelman, Alan [en_US]
dc.relation.journal: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC '09) [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dspace.orderedauthors: Chan, Cy; Ansel, Jason; Wong, Yee Lok; Amarasinghe, Saman; Edelman, Alan [en_US]
dc.identifier.orcid: https://orcid.org/0000-0001-7676-3133
dc.identifier.orcid: https://orcid.org/0000-0002-7231-7643
mit.license: OPEN_ACCESS_POLICY [en_US]
mit.metadata.status: Complete
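
The abstract above describes a bottom-up dynamic programming autotuner that, at each recursion level, chooses among direct solvers, iterative smoothers (Jacobi, SOR), and recursive multigrid cycles while trading accuracy against time. The following is a minimal Python sketch of that idea only, not the authors' PetaBricks implementation: the accuracy targets, contraction factors, and cost models (direct_cost, smoother, the per-cycle estimate) are illustrative assumptions standing in for the measured benchmark runs a real autotuner would perform on the target machine.

    # Illustrative sketch only -- not the authors' PetaBricks implementation.
    # It mimics the bottom-up dynamic programming described in the abstract:
    # for every grid level and every accuracy target, keep the fastest plan,
    # where a plan is a direct solve, an iterative smoother (Jacobi/SOR) with
    # a tuned sweep count, or a multigrid cycle that reuses a plan already
    # tuned for the next-coarser level. All cost/accuracy numbers below are
    # assumptions standing in for real timed runs on the target machine.
    import math

    ACCURACY_TARGETS = [1e-1, 1e-3, 1e-5, 1e-7]   # assumed accuracy levels
    MAX_LEVEL = 8                                 # level k has (2^k)^2 unknowns

    def direct_cost(level):
        # Assumed cost of a sparse direct solve; cheap on small grids.
        n = (2 ** level) ** 2
        return 1e-9 * n ** 1.5

    def smoother(level, base_rho, cost_per_sweep, target):
        # Assumed relaxation model: the contraction factor rho worsens on
        # finer grids, so more sweeps are needed to reach the same accuracy.
        rho = 1.0 - (1.0 - base_rho) / (4 ** (level - 1))
        sweeps = max(1, math.ceil(math.log(target) / math.log(rho)))
        return sweeps * cost_per_sweep * (2 ** level) ** 2, sweeps

    best = {}  # best[(level, target)] = (estimated time, plan)
    for level in range(1, MAX_LEVEL + 1):
        for target in ACCURACY_TARGETS:
            jac_time, jac_sweeps = smoother(level, 0.66, 5e-9, target)
            sor_time, sor_sweeps = smoother(level, 0.33, 7e-9, target)
            candidates = [
                (direct_cost(level), ("direct",)),
                (jac_time, ("jacobi", jac_sweeps)),
                (sor_time, ("sor", sor_sweeps)),
            ]
            if level > 1:
                # Recursive choice: a cycle at this level may call any of the
                # plans already tuned for the coarser level, at any accuracy.
                for coarse_target in ACCURACY_TARGETS:
                    coarse_time, coarse_plan = best[(level - 1, coarse_target)]
                    rho = 0.1 if coarse_target <= 1e-5 else 0.3   # assumed
                    cycles = max(1, math.ceil(math.log(target) / math.log(rho)))
                    per_cycle = 4e-9 * (2 ** level) ** 2 + coarse_time
                    candidates.append(
                        (cycles * per_cycle, ("multigrid", cycles, coarse_plan)))
            best[(level, target)] = min(candidates, key=lambda c: c[0])

    time, plan = best[(MAX_LEVEL, 1e-7)]
    print(f"tuned plan for level {MAX_LEVEL} at accuracy 1e-7: {plan} ({time:.3g}s)")

The property this toy mirrors is the one the abstract emphasizes: plans tuned for coarser levels are reused as subroutines of candidate cycles at finer levels, so the exponentially large space of cycle shapes is searched level by level rather than enumerated.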

