dc.contributor.author | Solar Lezama, Armando | |
dc.contributor.author | Singh, Rishabh | |
dc.contributor.author | Zhang, Xin | |
dc.date.accessioned | 2021-11-09T15:09:50Z | |
dc.date.available | 2021-11-09T15:09:50Z | |
dc.date.issued | 2018 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/137906 | |
dc.description.abstract | © 2018 Curran Associates Inc..All rights reserved. We present a new algorithm to generate minimal, stable, and symbolic corrections to an input that will cause a neural network with ReLU activations to change its output. We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output. Our algorithm generates such a correction by solving a series of linear constraint satisfaction problems. The technique is evaluated on three neural network models: one predicting whether an applicant will pay a mortgage, one predicting whether a first-order theorem can be proved efficiently by a solver using certain heuristics, and the final one judging whether a drawing is an accurate rendition of a canonical drawing of a cat. | en_US |
dc.language.iso | en | |
dc.relation.isversionof | https://papers.nips.cc/paper/7736-interpreting-neural-network-judgments-via-minimal-stable-and-symbolic-corrections | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | Neural Information Processing Systems (NIPS) | en_US |
dc.title | Interpreting neural network judgments via minimal, stable, and symbolic corrections | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Solar Lezama, Armando, Singh, Rishabh and Zhang, Xin. 2018. "Interpreting neural network judgments via minimal, stable, and symbolic corrections." | |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2019-07-10T13:22:05Z | |
dspace.date.submission | 2019-07-10T13:22:06Z | |
mit.license | PUBLISHER_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |