Show simple item record

dc.contributor.author: Hosseini, Eghbal A.
dc.contributor.author: Schrimpf, Martin
dc.contributor.author: Zhang, Yian
dc.contributor.author: Bowman, Samuel
dc.contributor.author: Zaslavsky, Noga
dc.contributor.author: Fedorenko, Evelina
dc.date.accessioned: 2024-05-31T20:55:14Z
dc.date.available: 2024-05-31T20:55:14Z
dc.date.issued: 2024
dc.identifier.issn: 2641-4368
dc.identifier.uri: https://hdl.handle.net/1721.1/155151
dc.description.abstract: Artificial neural networks have emerged as computationally plausible models of human language processing. A major criticism of these models is that the amount of training data they receive far exceeds that of humans during language learning. Here, we use two complementary approaches to ask how the models’ ability to capture human fMRI responses to sentences is affected by the amount of training data. First, we evaluate GPT-2 models trained on 1 million, 10 million, 100 million, or 1 billion words against an fMRI benchmark. We consider the 100-million-word model to be developmentally plausible in terms of the amount of training data given that this amount is similar to what children are estimated to be exposed to during the first 10 years of life. Second, we test the performance of a GPT-2 model trained on a 9-billion-token dataset to reach state-of-the-art next-word prediction performance on the human benchmark at different stages during training. Across both approaches, we find that (i) the models trained on a developmentally plausible amount of data already achieve near-maximal performance in capturing fMRI responses to sentences. Further, (ii) lower perplexity—a measure of next-word prediction performance—is associated with stronger alignment with human data, suggesting that models that have received enough training to achieve sufficiently high next-word prediction performance also acquire representations of sentences that are predictive of human fMRI responses. In tandem, these findings establish that although some training is necessary for the models’ predictive ability, a developmentally realistic amount of training (∼100 million words) may suffice. (en_US)
dc.language.iso: en_US
dc.publisher: MIT Press (en_US)
dc.relation.isversionof: 10.1162/nol_a_00137 (en_US)
dc.rights: Creative Commons Attribution (en_US)
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ (en_US)
dc.source: MIT Press (en_US)
dc.title: Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training (en_US)
dc.type: Article (en_US)
dc.identifier.citation: Eghbal A. Hosseini, Martin Schrimpf, Yian Zhang, Samuel Bowman, Noga Zaslavsky, Evelina Fedorenko; Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training. Neurobiology of Language 2024; 5 (1): 43–63. (en_US)
dc.contributor.department: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.contributor.department: McGovern Institute for Brain Research at MIT
dc.contributor.department: MIT Quest for Intelligence
dc.relation.journal: Neurobiology of Language (en_US)
dc.eprint.version: Final published version (en_US)
dc.type.uri: http://purl.org/eprint/type/JournalArticle (en_US)
eprint.status: http://purl.org/eprint/status/PeerReviewed (en_US)
dspace.date.submission: 2024-05-31T20:51:38Z
mit.journal.volume: 5 (en_US)
mit.journal.issue: 1 (en_US)
mit.license: PUBLISHER_CC
mit.metadata.status: Authority Work and Publication Information Needed (en_US)


