Show simple item record

dc.contributor.authorLin, Ji
dc.contributor.authorLiu, Zhijian
dc.contributor.authorWang, Hanrui
dc.contributor.authorHan, Song
dc.date.accessioned2021-01-26T18:39:53Z
dc.date.available2021-01-26T18:39:53Z
dc.date.issued2018-10
dc.identifier.isbn9783030012205
dc.identifier.urihttps://hdl.handle.net/1721.1/129576
dc.description.abstractModel compression is an effective technique to efficiently deploy neural network models on mobile devices which have limited computation resources and tight power budgets. Conventional model compression techniques rely on hand-crafted features and require domain experts to explore the large design space trading off among model size, speed, and accuracy, which is usually sub-optimal and time-consuming. In this paper, we propose AutoML for Model Compression (AMC) which leverages reinforcement learning to efficiently sample the design space and can improve the model compression quality. We achieved state-of-the-art model compression results in a fully automated way without any human efforts. Under 4 × FLOPs reduction, we achieved 2.7% better accuracy than the hand-crafted model compression method for VGG-16 on ImageNet. We applied this automated, push-the-button compression pipeline to MobileNet-V1 and achieved a speedup of 1.53 × on the GPU (Titan Xp) and 1.95 × on an Android phone (Google Pixel 1), with negligible loss of accuracy.en_US
dc.language.isoen
dc.publisherSpringer Science and Business Media LLCen_US
dc.relation.isversionof10.1007/978-3-030-01234-2_48en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleAMC: AutoML for Model Compression and Acceleration on Mobile Devicesen_US
dc.typeArticleen_US
dc.identifier.citationHe, Yihui et al. "AMC: AutoML for Model Compression and Acceleration on Mobile Devices." Computer vision -- ECCV 2018 : 15th European Conference, Lecture Notes in Computer Science, 11211, Springer, 2018, 815-832 © 2018 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.relation.journalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2020-12-17T15:33:56Z
dspace.orderedauthorsHe, Y; Lin, J; Liu, Z; Wang, H; Li, L-J; Han, Sen_US
dspace.date.submission2020-12-17T15:34:00Z
mit.journal.volume11211en_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record