Robustness may be at odds with accuracy

Tsipras, Dimitris; Santurkar, Shibani (Shibani Vinay); Engstrom, Logan G.; Turner, Alexander M.; Madry, Aleksander

dc.contributor.author	Tsipras, Dimitris
dc.contributor.author	Santurkar, Shibani (Shibani Vinay)
dc.contributor.author	Engstrom, Logan G.
dc.contributor.author	Turner, Alexander M.
dc.contributor.author	Madry, Aleksander
dc.date.accessioned	2021-03-05T14:59:33Z
dc.date.available	2021-03-05T14:59:33Z
dc.date.issued	2019-04
dc.identifier.uri	https://hdl.handle.net/1721.1/130090
dc.description.abstract	We show that there exists an inherent tension between the goal of adversarial robustness and that of standard generalization. Specifically, training robust models may not only be more resource-consuming, but also lead to a reduction of standard accuracy. We demonstrate that this trade-off between the standard accuracy of a model and its robustness to adversarial perturbations provably exists even in a fairly simple and natural setting. These findings also corroborate a similar phenomenon observed in practice. Further, we argue that this phenomenon is a consequence of robust classifiers learning fundamentally different feature representations than standard classifiers. These differences, in particular, seem to result in unexpected benefits: the features learned by robust models tend to align better with salient data characteristics and human perception.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grants S-1447786, IIS-1607189, CCF-1563880,CCF-1553428)	en_US
dc.language.iso	en
dc.publisher	ICLR	en_US
dc.relation.isversionof	https://openreview.net/forum?id=SyxAb30cY7	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	Robustness may be at odds with accuracy	en_US
dc.type	Article	en_US
dc.identifier.citation	Tsipras, Dimitris et al. “Robustness may be at odds with accuracy.” Paper presented at 7th International Conference on Learning Representations, ICLR 2019, New Orleans, Louisiana, May 6 - 9, 2019, ICLR © 2019 The Author(s)	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.relation.journal	7th International Conference on Learning Representations, ICLR 2019	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2021-02-05T15:05:55Z
dspace.orderedauthors	Tsipras, D; Santurkar, S; Engstrom, L; Turner, A; Madry, A	en_US
dspace.date.submission	2021-02-05T15:06:42Z
mit.journal.volume	2019	en_US
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Complete

Files in this item

Name:: 1805.12152.pdf
Size:: 5.904Mb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record