Show simple item record

dc.contributor.author: Hosseini, Eghbal A.
dc.contributor.author: Schrimpf, Martin
dc.contributor.author: Zhang, Yian
dc.contributor.author: Bowman, Samuel
dc.contributor.author: Zaslavsky, Noga
dc.contributor.author: Fedorenko, Evelina
dc.date.accessioned: 2024-05-31T20:55:14Z
dc.date.available: 2024-05-31T20:55:14Z
dc.date.issued: 2024
dc.identifier.issn: 2641-4368
dc.identifier.uri: https://hdl.handle.net/1721.1/155151
dc.description.abstract: Artificial neural networks have emerged as computationally plausible models of human language processing. A major criticism of these models is that the amount of training data they receive far exceeds that of humans during language learning. Here, we use two complementary approaches to ask how the models’ ability to capture human fMRI responses to sentences is affected by the amount of training data. First, we evaluate GPT-2 models trained on 1 million, 10 million, 100 million, or 1 billion words against an fMRI benchmark. We consider the 100-million-word model to be developmentally plausible in terms of the amount of training data given that this amount is similar to what children are estimated to be exposed to during the first 10 years of life. Second, we test the performance of a GPT-2 model trained on a 9-billion-token dataset to reach state-of-the-art next-word prediction performance on the human benchmark at different stages during training. Across both approaches, we find that (i) the models trained on a developmentally plausible amount of data already achieve near-maximal performance in capturing fMRI responses to sentences. Further, (ii) lower perplexity—a measure of next-word prediction performance—is associated with stronger alignment with human data, suggesting that models that have received enough training to achieve sufficiently high next-word prediction performance also acquire representations of sentences that are predictive of human fMRI responses. In tandem, these findings establish that although some training is necessary for the models’ predictive ability, a developmentally realistic amount of training (∼100 million words) may suffice. (en_US)
dc.language.iso: en_US
dc.publisher: MIT Press (en_US)
dc.relation.isversionof: 10.1162/nol_a_00137 (en_US)
dc.rights: Creative Commons Attribution (en_US)
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ (en_US)
dc.source: MIT Press (en_US)
dc.title: Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training (en_US)
dc.type: Article (en_US)
dc.identifier.citation: Eghbal A. Hosseini, Martin Schrimpf, Yian Zhang, Samuel Bowman, Noga Zaslavsky, Evelina Fedorenko; Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training. Neurobiology of Language 2024; 5 (1): 43–63. (en_US)
dc.contributor.department: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.contributor.department: McGovern Institute for Brain Research at MIT
dc.contributor.department: MIT Quest for Intelligence
dc.relation.journal: Neurobiology of Language (en_US)
dc.eprint.version: Final published version (en_US)
dc.type.uri: http://purl.org/eprint/type/JournalArticle (en_US)
eprint.status: http://purl.org/eprint/status/PeerReviewed (en_US)
dspace.date.submission: 2024-05-31T20:51:38Z
mit.journal.volume: 5 (en_US)
mit.journal.issue: 1 (en_US)
mit.license: PUBLISHER_CC
mit.metadata.status: Authority Work and Publication Information Needed (en_US)


