dc.contributor.author | Tsipras, Dimitris | |
dc.contributor.author | Santurkar, Shibani (Shibani Vinay) | |
dc.contributor.author | Engstrom, Logan G. | |
dc.contributor.author | Turner, Alexander M. | |
dc.contributor.author | Madry, Aleksander | |
dc.date.accessioned | 2021-03-05T14:59:33Z | |
dc.date.available | 2021-03-05T14:59:33Z | |
dc.date.issued | 2019-04 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/130090 | |
dc.description.abstract | We show that there exists an inherent tension between the goal of adversarial robustness and that of standard generalization. Specifically, training robust models may not only be more resource-consuming, but also lead to a reduction of standard accuracy. We demonstrate that this trade-off between the standard accuracy of a model and its robustness to adversarial perturbations provably exists even in a fairly simple and natural setting. These findings also corroborate a similar phenomenon observed in practice. Further, we argue that this phenomenon is a consequence of robust classifiers learning fundamentally different feature representations than standard classifiers. These differences, in particular, seem to result in unexpected benefits: the features learned by robust models tend to align better with salient data characteristics and human perception. | en_US |
dc.description.sponsorship | National Science Foundation (U.S.) (Grants S-1447786, IIS-1607189, CCF-1563880,CCF-1553428) | en_US |
dc.language.iso | en | |
dc.publisher | ICLR | en_US |
dc.relation.isversionof | https://openreview.net/forum?id=SyxAb30cY7 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | Robustness may be at odds with accuracy | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Tsipras, Dimitris et al. “Robustness may be at odds with accuracy.” Paper presented at 7th International Conference on Learning Representations, ICLR 2019, New Orleans, Louisiana, May 6 - 9, 2019, ICLR © 2019 The Author(s) | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.relation.journal | 7th International Conference on Learning Representations, ICLR 2019 | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2021-02-05T15:05:55Z | |
dspace.orderedauthors | Tsipras, D; Santurkar, S; Engstrom, L; Turner, A; Madry, A | en_US |
dspace.date.submission | 2021-02-05T15:06:42Z | |
mit.journal.volume | 2019 | en_US |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Complete | |