Show simple item record

dc.contributor.authorGreenman, Kevin P
dc.contributor.authorGreen Jr, William H
dc.contributor.authorGomez-Bombarelli, Rafael
dc.date.accessioned2022-01-05T19:36:22Z
dc.date.available2022-01-05T13:32:46Z
dc.date.available2022-01-05T19:36:22Z
dc.date.issued2022-01-04
dc.identifier.issn2041-6520
dc.identifier.issn2041-6539
dc.identifier.urihttps://hdl.handle.net/1721.1/138813.2
dc.description.abstractOptical properties are central to molecular design for many applications, including solar cells and biomedical imaging. A variety of ab initio and statistical methods have been developed for their prediction, each with a trade-off between accuracy, generality, and cost. Existing theoretical methods such as time-dependent density functional theory (TD-DFT) are generalizable across chemical space because of their robust physics-based foundations but still exhibit random and systematic errors with respect to experiment despite their high computational cost. Statistical methods can achieve high accuracy at a lower cost, but data sparsity and unoptimized molecule and solvent representations often limit their ability to generalize. Here, we utilize directed message passing neural networks (DMPNNs) to represent both dye molecules and solvents for predictions of molecular absorption peaks in solution. Additionally, we demonstrate a multi-fidelity approach based on an auxiliary model trained on over 28,000 TD-DFT calculations that further improves accuracy and generalizability, as shown through rigorous splitting strategies. Combining several openly-available experimental datasets, we benchmark these methods against a state-of-the-art regression tree algorithm and compare the DMPNN solvent representation to several alternatives. Finally, we explore the interpretability of the learned representations using dimensionality reduction and evaluate the use of ensemble variance as an estimator of the epistemic uncertainty in our predictions of molecular peak absorption in solution. The prediction methods proposed herein can be integrated with active learning, generative modeling, and experimental workflows to enable the more rapid design of molecules with targeted optical properties.en_US
dc.description.sponsorshipNational Science Foundation (Grant 1745302)en_US
dc.publisherRoyal Society of Chemistry (RSC)en_US
dc.relation.isversionof10.1039/d1sc05677hen_US
dc.rightsCreative Commons Attribution 3.0 unported licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/en_US
dc.sourceKevin P. Greenmanen_US
dc.titleMulti-fidelity prediction of molecular optical peaks with deep learningen_US
dc.typeArticleen_US
dc.identifier.citationGreenman, Kevin P, Green, William H. and Gomez-Bombarelli, Rafael. 2022. "Multi-fidelity prediction of molecular optical peaks with deep learning." Chemical Science.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Materials Science and Engineeringen_US
dc.relation.journalChemical Scienceen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.date.submission2022-01-05T01:15:21Z
mit.licensePUBLISHER_CC
mit.metadata.statusPublication Information Neededen_US


Files in this item

No Thumbnail [100%x160]

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version