Show simple item record

dc.contributor.authorPoulson, Jack
dc.contributor.authorDemanet, Laurent
dc.contributor.authorMaxwell, Nicholas
dc.contributor.authorYing, Lexing
dc.date.accessioned2014-07-01T21:10:24Z
dc.date.available2014-07-01T21:10:24Z
dc.date.issued2014-02
dc.identifier.issn1064-8275
dc.identifier.issn1095-7197
dc.identifier.urihttp://hdl.handle.net/1721.1/88176
dc.description.abstractThe butterfly algorithm is a fast algorithm which approximately evaluates a discrete analogue of the integral transform $\int_{\mathbb{R}^d} K(x,y) g(y) dy$ at large numbers of target points when the kernel, $K(x,y)$, is approximately low-rank when restricted to subdomains satisfying a certain simple geometric condition. In $d$ dimensions with $O(N^d)$ quasi-uniformly distributed source and target points, when each appropriate submatrix of $K$ is approximately rank-$r$, the running time of the algorithm is at most $O(r^2 N^d \log N)$. A parallelization of the butterfly algorithm is introduced which, assuming a message latency of $\alpha$ and per-process inverse bandwidth of $\beta$, executes in at most $O(r^2 \frac{N^d}{p} \log N + (\beta r\frac{N^d}{p}+\alpha)\log p)$ time using $p$ processes. This parallel algorithm was then instantiated in the form of the open-source \textttDistButterfly library for the special case where $K(x,y)=\exp(i \Phi(x,y))$, where $\Phi(x,y)$ is a black-box, sufficiently smooth, real-valued phase function. Experiments on Blue Gene/Q demonstrate impressive strong-scaling results for important classes of phase functions. Using quasi-uniform sources, hyperbolic Radon transforms, and an analogue of a three-dimensional generalized Radon transform were, respectively, observed to strong-scale from 1-node/16-cores up to 1024-nodes/16,384-cores with greater than 90% and 82% efficiency, respectively.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (NSF CAREER grant 0846501)en_US
dc.description.sponsorshipUnited States. Dept. of Energy (DOE grant DE-SC0009409)en_US
dc.description.sponsorshipKing Abdullah University of Science and Technologyen_US
dc.language.isoen_US
dc.publisherSociety for Industrial and Applied Mathematicsen_US
dc.relation.isversionofhttp://dx.doi.org/10.1137/130921544en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceSociety for Industrial and Applied Mathematicsen_US
dc.titleA Parallel Butterfly Algorithmen_US
dc.typeArticleen_US
dc.identifier.citationPoulson, Jack, Laurent Demanet, Nicholas Maxwell, and Lexing Ying. “A Parallel Butterfly Algorithm.” SIAM Journal on Scientific Computing 36, no. 1 (February 4, 2014): C49–C65.© 2014, Society for Industrial and Applied Mathematics.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Mathematicsen_US
dc.contributor.mitauthorDemanet, Laurenten_US
dc.relation.journalSIAM Journal on Scientific Computingen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsPoulson, Jack; Demanet, Laurent; Maxwell, Nicholas; Ying, Lexingen_US
dc.identifier.orcidhttps://orcid.org/0000-0001-7052-5097
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record