dc.contributor.author | Anderson, Ross | |
dc.contributor.author | Huchette, Joey | |
dc.contributor.author | Ma, Will | |
dc.contributor.author | Tjandraatmadja, Christian | |
dc.contributor.author | Vielma, Juan P | |
dc.date.accessioned | 2021-09-20T17:16:45Z | |
dc.date.available | 2021-09-20T17:16:45Z | |
dc.date.issued | 2020-02-13 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/131368 | |
dc.description.abstract | Abstract
We present strong mixed-integer programming (MIP) formulations for high-dimensional piecewise linear functions that correspond to trained neural networks. These formulations can be used for a number of important tasks, such as verifying that an image classification network is robust to adversarial inputs, or solving decision problems where the objective function is a machine learning model. We present a generic framework, which may be of independent interest, that provides a way to construct sharp or ideal formulations for the maximum of d affine functions over arbitrary polyhedral input domains. We apply this result to derive MIP formulations for a number of the most popular nonlinear operations (e.g. ReLU and max pooling) that are strictly stronger than other approaches from the literature. We corroborate this computationally, showing that our formulations are able to offer substantial improvements in solve time on verification tasks for image classification networks. | en_US |
dc.publisher | Springer Berlin Heidelberg | en_US |
dc.relation.isversionof | https://doi.org/10.1007/s10107-020-01474-5 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | Springer Berlin Heidelberg | en_US |
dc.title | Strong mixed-integer programming formulations for trained neural networks | en_US |
dc.type | Article | en_US |
dc.contributor.department | Sloan School of Management | |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dc.date.updated | 2020-09-24T21:02:19Z | |
dc.language.rfc3066 | en | |
dc.rights.holder | Springer-Verlag GmbH Germany, part of Springer Nature and Mathematical Optimization Society | |
dspace.embargo.terms | Y | |
dspace.date.submission | 2020-09-24T21:02:19Z | |
mit.license | PUBLISHER_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | |