An order-aware dataflow model for parallel Unix pipelines
Author(s)
Handa, Shivam; Kallas, Konstantinos; Vasilakis, Nikos; Rinard, Martin C
DownloadPublished version (702.9Kb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
<jats:p>We present a dataflow model for modelling parallel Unix shell pipelines. To accurately capture the semantics of complex Unix pipelines, the dataflow model is order-aware, i.e., the order in which a node in the dataflow graph consumes inputs from different edges plays a central role in the semantics of the computation and therefore in the resulting parallelization. We use this model to capture the semantics of transformations that exploit data parallelism available in Unix shell computations and prove their correctness. We additionally formalize the translations from the Unix shell to the dataflow model and from the dataflow model back to a parallel shell script. We implement our model and transformations as the compiler and optimization passes of a system parallelizing shell pipelines, and use it to evaluate the speedup achieved on 47 pipelines.</jats:p>
Date issued
2021Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryJournal
Proceedings of the ACM on Programming Languages
Publisher
Association for Computing Machinery (ACM)
Citation
Handa, Shivam, Kallas, Konstantinos, Vasilakis, Nikos and Rinard, Martin C. 2021. "An order-aware dataflow model for parallel Unix pipelines." Proceedings of the ACM on Programming Languages, 5 (ICFP).
Version: Final published version