dc.contributor.author | Li, Baolin | |
dc.contributor.author | Samsi, Siddharth | |
dc.contributor.author | Gadepally, Vijay | |
dc.contributor.author | Tiwari, Devesh | |
dc.date.accessioned | 2023-12-12T14:34:04Z | |
dc.date.available | 2023-12-12T14:34:04Z | |
dc.date.issued | 2023-11-12 | |
dc.identifier.isbn | 979-8-4007-0109-2 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/153142 | |
dc.description.abstract | This paper presents a solution to the challenge of mitigating carbon emissions from hosting large-scale machine learning (ML) inference services. ML inference is critical to modern technology products, but it is also a significant contributor to carbon footprint. We introduce, Clover, a carbon-friendly ML inference service runtime system that balances performance, accuracy, and carbon emissions through mixed-quality models and GPU resource partitioning. Our experimental results demonstrate that Clover is effective in substantially reducing carbon emissions while maintaining high accuracy and meeting service level agreement (SLA) targets. | en_US |
dc.publisher | ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis | en_US |
dc.relation.isversionof | https://doi.org/10.1145/3581784.3607034 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | Association for Computing Machinery | en_US |
dc.title | Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Li, Baolin, Samsi, Siddharth, Gadepally, Vijay and Tiwari, Devesh. 2023. "Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service." | |
dc.contributor.department | Lincoln Laboratory | |
dc.identifier.mitlicense | PUBLISHER_POLICY | |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2023-12-01T08:46:33Z | |
dc.language.rfc3066 | en | |
dc.rights.holder | The author(s) | |
dspace.date.submission | 2023-12-01T08:46:33Z | |
mit.license | PUBLISHER_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |