
dc.contributor.author	Li, Baolin
dc.contributor.author	Samsi, Siddharth
dc.contributor.author	Gadepally, Vijay
dc.contributor.author	Tiwari, Devesh
dc.date.accessioned	2023-12-12T14:34:04Z
dc.date.available	2023-12-12T14:34:04Z
dc.date.issued	2023-11-12
dc.identifier.isbn	979-8-4007-0109-2
dc.identifier.uri	https://hdl.handle.net/1721.1/153142
dc.description.abstract	This paper presents a solution to the challenge of mitigating carbon emissions from hosting large-scale machine learning (ML) inference services. ML inference is critical to modern technology products, but it is also a significant contributor to their carbon footprint. We introduce Clover, a carbon-friendly ML inference service runtime system that balances performance, accuracy, and carbon emissions through mixed-quality models and GPU resource partitioning. Our experimental results demonstrate that Clover is effective in substantially reducing carbon emissions while maintaining high accuracy and meeting service level agreement (SLA) targets.	en_US
dc.publisher	ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis	en_US
dc.relation.isversionof	https://doi.org/10.1145/3581784.3607034	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Association for Computing Machinery	en_US
dc.title	Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service	en_US
dc.type	Article	en_US
dc.identifier.citation	Li, Baolin, Samsi, Siddharth, Gadepally, Vijay and Tiwari, Devesh. 2023. "Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service."
dc.contributor.department	Lincoln Laboratory
dc.identifier.mitlicense	PUBLISHER_POLICY
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2023-12-01T08:46:33Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2023-12-01T08:46:33Z
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

