Show simple item record

dc.contributor.authorRadhakrishnan, Adityanarayanan
dc.contributor.authorBelkin, Mikhail
dc.contributor.authorUhler, Caroline
dc.date.accessioned2021-10-27T20:05:28Z
dc.date.available2021-10-27T20:05:28Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/134539
dc.description.abstract© 2020 National Academy of Sciences. All rights reserved. Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. We provide empirical evidence that 1) overparameterized autoencoders store training samples as attractors and thus iterating the learned map leads to sample recovery, and that 2) the same mechanism allows for encoding sequences of examples and serves as an even more efficient mechanism for memory than autoencoding. Theoretically, we prove that when trained on a single example, autoencoders store the example as an attractor. Lastly, by treating a sequence encoder as a composition of maps, we prove that sequence encoding provides a more efficient mechanism for memory than autoencoding.
dc.language.isoen
dc.publisherProceedings of the National Academy of Sciences
dc.relation.isversionof10.1073/PNAS.2005013117
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
dc.sourcePNAS
dc.titleOverparameterized neural networks implement associative memory
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systems
dc.contributor.departmentMassachusetts Institute of Technology. Institute for Data, Systems, and Society
dc.relation.journalProceedings of the National Academy of Sciences of the United States of America
dc.eprint.versionFinal published version
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/PeerReviewed
dc.date.updated2021-03-19T15:25:38Z
dspace.orderedauthorsRadhakrishnan, A; Belkin, M; Uhler, C
dspace.date.submission2021-03-19T15:25:47Z
mit.journal.volume117
mit.journal.issue44
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record