Show simple item record

dc.contributor.authorYang, Tien-Ju
dc.contributor.authorHoward, Andrew
dc.contributor.authorChen, Bo
dc.contributor.authorZhang, Xiao
dc.contributor.authorGo, Alec
dc.contributor.authorSandler, Mark
dc.contributor.authorSze, Vivienne
dc.contributor.authorAdam, Hartwig
dc.date.accessioned2021-11-02T16:26:15Z
dc.date.available2021-11-02T16:26:15Z
dc.date.issued2018
dc.identifier.issn0302-9743
dc.identifier.issn1611-3349
dc.identifier.urihttps://hdl.handle.net/1721.1/137102
dc.description.abstract© Springer Nature Switzerland AG 2018. This work proposes an algorithm, called NetAdapt, that automatically adapts a pre-trained deep neural network to a mobile platform given a resource budget. While many existing algorithms simplify networks based on the number of MACs or weights, optimizing those indirect metrics may not necessarily reduce the direct metrics, such as latency and energy consumption. To solve this problem, NetAdapt incorporates direct metrics into its adaptation algorithm. These direct metrics are evaluated using empirical measurements, so that detailed knowledge of the platform and toolchain is not required. NetAdapt automatically and progressively simplifies a pre-trained network until the resource budget is met while maximizing the accuracy. Experiment results show that NetAdapt achieves better accuracy versus latency trade-offs on both mobile CPU and mobile GPU, compared with the state-of-the-art automated network simplification algorithms. For image classification on the ImageNet dataset, NetAdapt achieves up to a 1.7 × speedup in measured inference latency with equal or higher accuracy on MobileNets (V1&V2).en_US
dc.language.isoen
dc.publisherSpringer International Publishingen_US
dc.relation.isversionof10.1007/978-3-030-01249-6_18en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleNetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applicationsen_US
dc.typeBooken_US
dc.identifier.citationYang, Tien-Ju, Howard, Andrew, Chen, Bo, Zhang, Xiao, Go, Alec et al. 2018. "NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications."
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-07-03T16:34:22Z
dspace.date.submission2019-07-03T16:34:23Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record