dc.contributor.author | Murray, Riley | |
dc.contributor.author | Khuller, Samir | |
dc.contributor.author | Chao, Megan C. | |
dc.date.accessioned | 2018-06-21T17:00:22Z | |
dc.date.available | 2018-06-21T17:00:22Z | |
dc.date.issued | 2017-07 | |
dc.date.submitted | 2016-09 | |
dc.identifier.issn | 0178-4617 | |
dc.identifier.issn | 1432-0541 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/116477 | |
dc.description.abstract | The Map-Reduce computing framework rose to prominence with datasets of such size that dozens of machines on a single cluster were needed for individual jobs. As datasets approach the exabyte scale, a single job may need distributed processing not only on multiple machines, but on multiple clusters. We consider a scheduling problem to minimize weighted average completion time of n jobs on m distributed clusters of parallel machines. In keeping with the scale of the problems motivating this work, we assume that (1) each job is divided into m “subjobs” and (2) distinct subjobs of a given job may be processed concurrently. When each cluster is a single machine, this is the NP-Hard concurrent open shop problem. A clear limitation of such a model is that a serial processing assumption sidesteps the issue of how different tasks of a given subjob might be processed in parallel. Our algorithms explicitly model clusters as pools of resources and effectively overcome this issue. Under a variety of parameter settings, we develop two constant factor approximation algorithms for this problem. The first algorithm uses an LP relaxation tailored to this problem from prior work. This LP-based algorithm provides strong performance guarantees. Our second algorithm exploits a surprisingly simple mapping to the special case of one machine per cluster. This mapping-based algorithm is combinatorial and extremely fast. These are the first constant factor approximations for this problem. | en_US |
dc.description.sponsorship | National Science Foundation (U.S.) | en_US |
dc.description.sponsorship | National Science Foundation (U.S.). Research Experience for Undergraduates (Program) (Grant CCF 1262805) | en_US |
dc.description.sponsorship | Winkler Foundation | en_US |
dc.publisher | Springer-Verlag | en_US |
dc.relation.isversionof | https://doi.org/10.1007/s00453-017-0345-x | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | Springer US | en_US |
dc.title | Scheduling Distributed Clusters of Parallel Machines : Primal-Dual and LP-based Approximation Algorithms | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Murray, Riley, Samir Khuller, and Megan Chao. “Scheduling Distributed Clusters of Parallel Machines : Primal-Dual and LP-Based Approximation Algorithms.” Algorithmica 80, no. 10 (July 19, 2017): 2777–2798. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.mitauthor | Chao, Megan C. | |
dc.relation.journal | Algorithmica | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dc.date.updated | 2018-05-31T05:10:36Z | |
dc.language.rfc3066 | en | |
dc.rights.holder | Springer Science+Business Media, LLC | |
dspace.orderedauthors | Murray, Riley; Khuller, Samir; Chao, Megan | en_US |
dspace.embargo.terms | N | en |
mit.license | PUBLISHER_POLICY | en_US |