dc.contributor.author: Frey, Nathan C
dc.contributor.author: Soklaski, Ryan
dc.contributor.author: Axelrod, Simon
dc.contributor.author: Samsi, Siddharth
dc.contributor.author: Gómez-Bombarelli, Rafael
dc.contributor.author: Coley, Connor W
dc.contributor.author: Gadepally, Vijay
dc.date.accessioned: 2025-02-11T21:05:33Z
dc.date.available: 2025-02-11T21:05:33Z
dc.date.issued: 2023
dc.identifier.uri: https://hdl.handle.net/1721.1/158195
dc.description.abstract: Massive scale, in terms of both data availability and computation, enables important breakthroughs in key application areas of deep learning such as natural language processing and computer vision. There is emerging evidence that scale may be a key ingredient in scientific deep learning, but the importance of physical priors in scientific domains makes the strategies and benefits of scaling uncertain. Here we investigate neural-scaling behaviour in large chemical models by varying model and dataset sizes over many orders of magnitude, studying models with over one billion parameters, pre-trained on datasets of up to ten million datapoints. We consider large language models for generative chemistry and graph neural networks for machine-learned interatomic potentials. We investigate the interplay between physical priors and scale and discover empirical neural-scaling relations for language models in chemistry with a scaling exponent of 0.17 for the largest dataset size considered, and a scaling exponent of 0.26 for equivariant graph neural network interatomic potentials.
dc.language.iso: en
dc.publisher: Springer Science and Business Media LLC
dc.relation.isversionof: 10.1038/s42256-023-00740-3
dc.rights: Creative Commons Attribution
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.source: Springer Science and Business Media LLC
dc.title: Neural scaling of deep chemical models
dc.type: Article
dc.identifier.citation: Frey, N.C., Soklaski, R., Axelrod, S. et al. Neural scaling of deep chemical models. Nat Mach Intell 5, 1297–1305 (2023).
dc.contributor.department: Lincoln Laboratory
dc.contributor.department: Massachusetts Institute of Technology. Department of Materials Science and Engineering
dc.contributor.department: Massachusetts Institute of Technology. Department of Chemical Engineering
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journal: Nature Machine Intelligence
dc.eprint.version: Final published version
dc.type.uri: http://purl.org/eprint/type/JournalArticle
eprint.status: http://purl.org/eprint/status/PeerReviewed
dc.date.updated: 2025-02-11T20:58:27Z
dspace.orderedauthors: Frey, NC; Soklaski, R; Axelrod, S; Samsi, S; Gómez-Bombarelli, R; Coley, CW; Gadepally, V
dspace.date.submission: 2025-02-11T20:58:30Z
mit.journal.volume: 5
mit.journal.issue: 11
mit.license: PUBLISHER_CC
mit.metadata.status: Authority Work and Publication Information Needed