Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/137425.2

Show simple item record

dc.contributor.authorPalkar, S
dc.contributor.authorThomas, JJ
dc.contributor.authorShanbhag, A
dc.contributor.authorNarayanan, D
dc.contributor.authorPirk, H
dc.contributor.authorSchwarzkopf, M
dc.contributor.authorAmarasinghe, S
dc.contributor.authorZaharia, M
dc.date.accessioned2021-11-05T11:48:09Z
dc.date.available2021-11-05T11:48:09Z
dc.date.issued2017-01
dc.identifier.urihttps://hdl.handle.net/1721.1/137425
dc.description.abstract© 2017 Conference on Innovative Data Systems Research (CIDR). All rights reserved. Modern analytics applications combine multiple functions from different libraries and frameworks to build increasingly complex workflows. Even though each function may achieve high performance in isolation, the performance of the combined workflow is often an order of magnitude below hardware limits due to extensive data movement across the functions. To address this problem, we propose Weld, a runtime for data-intensive applications that optimizes across disjoint libraries and functions. Weld uses a common intermediate representation to capture the structure of diverse data-parallel workloads, including SQL, machine learning and graph analytics. It then performs key data movement optimizations and generates efficient parallel code for the whole workflow. Weld can be integrated incrementally into existing frameworks like TensorFlow, Apache Spark, NumPy and Pandas without changing their user-facing APIs. We show that Weld can speed up these frameworks, as well as applications that combine them, by up to 30×.en_US
dc.language.isoen
dc.rightsCreative Commons Attribution 3.0 unported licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/en_US
dc.source8th Biennial Conference on Innovative Data Systems Research (CIDR ‘17)en_US
dc.titleWeld: A common runtime for high performance data analyticsen_US
dc.typeArticleen_US
dc.identifier.citationPalkar, S, Thomas, JJ, Shanbhag, A, Narayanan, D, Pirk, H et al. 2017. "Weld: A common runtime for high performance data analytics." CIDR 2017 - 8th Biennial Conference on Innovative Data Systems Research.
dc.relation.journalCIDR 2017 - 8th Biennial Conference on Innovative Data Systems Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2020-11-23T19:33:31Z
dspace.orderedauthorsPalkar, S; Thomas, JJ; Shanbhag, A; Narayanan, D; Pirk, H; Schwarzkopf, M; Amarasinghe, S; Zaharia, Men_US
dspace.date.submission2020-11-23T19:33:43Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version