Interpreting neural network judgments via minimal, stable, and symbolic corrections

Solar Lezama, Armando; Singh, Rishabh; Zhang, Xin

dc.contributor.author	Solar Lezama, Armando
dc.contributor.author	Singh, Rishabh
dc.contributor.author	Zhang, Xin
dc.date.accessioned	2021-11-09T15:09:50Z
dc.date.available	2021-11-09T15:09:50Z
dc.date.issued	2018
dc.identifier.uri	https://hdl.handle.net/1721.1/137906
dc.description.abstract	© 2018 Curran Associates Inc..All rights reserved. We present a new algorithm to generate minimal, stable, and symbolic corrections to an input that will cause a neural network with ReLU activations to change its output. We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output. Our algorithm generates such a correction by solving a series of linear constraint satisfaction problems. The technique is evaluated on three neural network models: one predicting whether an applicant will pay a mortgage, one predicting whether a first-order theorem can be proved efficiently by a solver using certain heuristics, and the final one judging whether a drawing is an accurate rendition of a canonical drawing of a cat.	en_US
dc.language.iso	en
dc.relation.isversionof	https://papers.nips.cc/paper/7736-interpreting-neural-network-judgments-via-minimal-stable-and-symbolic-corrections	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Neural Information Processing Systems (NIPS)	en_US
dc.title	Interpreting neural network judgments via minimal, stable, and symbolic corrections	en_US
dc.type	Article	en_US
dc.identifier.citation	Solar Lezama, Armando, Singh, Rishabh and Zhang, Xin. 2018. "Interpreting neural network judgments via minimal, stable, and symbolic corrections."
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2019-07-10T13:22:05Z
dspace.date.submission	2019-07-10T13:22:06Z
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 7736-interpreting-neural-netwo ...
Size:: 3.677Mb
Format:: PDF
Description:: Published version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record