Show simple item record

dc.contributor.authorDelimitrou, Christina
dc.contributor.authorSanchez, Daniel
dc.contributor.authorKozyrakis, Christos
dc.date.accessioned2017-10-27T13:46:53Z
dc.date.available2017-10-27T13:46:53Z
dc.date.issued2015-08
dc.identifier.isbn978-1-4503-3651-2
dc.identifier.urihttp://hdl.handle.net/1721.1/111979
dc.description.abstractScheduling diverse applications in large, shared clusters is particularly challenging. Recent research on cluster scheduling focuses either on scheduling speed, using sampling to quickly assign resources to tasks, or on scheduling quality, using centralized algorithms that search for the resources that improve both task performance and cluster utilization. We present Tarcil, a distributed scheduler that targets both scheduling speed and quality. Tarcil uses an analytically derived sampling framework that adjusts the sample size based on load, and provides statistical guarantees on the quality of allocated resources. It also implements admission control when sampling is unlikely to find suitable resources. This makes it appropriate for large, shared clusters hosting short- and long-running jobs. We evaluate Tarcil on clusters with hundreds of servers on EC2. For highly-loaded clusters running short jobs, Tarcil improves task execution time by 41% over a distributed, sampling-based scheduler. For more general scenarios, Tarcil achieves near-optimal performance for 4× and 2× more jobs than sampling-based and centralized schedulers respectively.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Grant CNS-1422088)en_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machinery (ACM)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/2806777.2806779en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT Web Domainen_US
dc.titleTarcil: reconciling scheduling speed and quality in large shared clustersen_US
dc.typeArticleen_US
dc.identifier.citationDelimitrou, Christina et al. “Tarcil: reconciling scheduling speed and quality in large shared clusters” Proceedings of the Sixth ACM Symposium on Cloud Computing (SoCC ’15), August 27-29 2015, Kohala Coast, Hawaii, Association for Computing Machinery (ACM), August 2015 © 2015 Association for Computing Machinery (ACM)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorSanchez, Daniel
dc.relation.journalProceedings of the Sixth ACM Symposium on Cloud Computing (SoCC '15)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsDelimitrou, Christina; Sanchez, Daniel; Kozyrakis, Christosen_US
dspace.embargo.termsNen_US
mit.licenseOPEN_ACCESS_POLICYen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record