DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates

Renda, Alex; Chen, Yishen; Mendis, Charith; Carbin, Michael

dc.contributor.author	Renda, Alex
dc.contributor.author	Chen, Yishen
dc.contributor.author	Mendis, Charith
dc.contributor.author	Carbin, Michael
dc.date.accessioned	2022-06-07T12:49:09Z
dc.date.available	2022-06-07T12:49:09Z
dc.date.issued	2020
dc.identifier.uri	https://hdl.handle.net/1721.1/142895
dc.description.abstract	© 2020 IEEE. CPU simulators are useful tools for modeling CPU execution behavior. However, they suffer from inaccuracies due to the cost and complexity of setting their fine-grained parameters, such as the latencies of individual instructions. This complexity arises from the expertise required to design benchmarks and measurement frameworks that can precisely measure the values of parameters at such fine granularity. In some cases, these parameters do not necessarily have a physical realization and are therefore fundamentally approximate, or even unmeasurable.In this paper we present DiffTune, a system for learning the parameters of x86 basic block CPU simulators from coarse-grained end-to-end measurements. Given a simulator, DiffTune learns its parameters by first replacing the original simulator with a differentiable surrogate, another function that approximates the original function; by making the surrogate differentiable, DiffTune is then able to apply gradient-based optimization techniques even when the original function is non-differentiable, such as is the case with CPU simulators. With this differentiable surrogate, DiffTune then applies gradient-based optimization to produce values of the simulator's parameters that minimize the simulator's error on a dataset of ground truth end-to-end performance measurements. Finally, the learned parameters are plugged back into the original simulator.DiffTune is able to automatically learn the entire set of microarchitecture-specific parameters within the Intel x86 simulation model of llvm-mca, a basic block CPU simulator based on LLVM's instruction scheduling model. DiffTune's learned parameters lead llvm-mca to an average error that not only matches but lowers that of its original, expert-provided parameter values.	en_US
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	10.1109/MICRO50266.2020.00045	en_US
dc.rights	Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International	en_US
dc.rights.uri	https://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates	en_US
dc.type	Article	en_US
dc.identifier.citation	Renda, Alex, Chen, Yishen, Mendis, Charith and Carbin, Michael. 2020. "DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates." Proceedings of the Annual International Symposium on Microarchitecture, MICRO, 2020-October.
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journal	Proceedings of the Annual International Symposium on Microarchitecture, MICRO	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2022-06-07T12:41:49Z
dspace.orderedauthors	Renda, A; Chen, Y; Mendis, C; Carbin, M	en_US
dspace.date.submission	2022-06-07T12:41:51Z
mit.journal.volume	2020-October	en_US
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 2010.04017.pdf
Size:: 800.4Kb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record