Show simple item record

dc.contributor.authorFisher, KE
dc.contributor.authorHerbst, MF
dc.contributor.authorMarzouk, YM
dc.date.accessioned2026-03-05T14:19:08Z
dc.date.available2026-03-05T14:19:08Z
dc.date.issued2024-07-03
dc.identifier.urihttps://hdl.handle.net/1721.1/165037
dc.description.abstractData generation remains a bottleneck in training surrogate models to predict molecular properties. We demonstrate that multitask Gaussian process regression overcomes this limitation by leveraging both expensive and cheap data sources. In particular, we consider training sets constructed from coupled-cluster (CC) and density functional theory (DFT) data. We report that multitask surrogates can predict at CC-level accuracy with a reduction in data generation cost by over an order of magnitude. Of note, our approach allows the training set to include DFT data generated by a heterogeneous mix of exchange–correlation functionals without imposing any artificial hierarchy on functional accuracy. More generally, the multitask framework can accommodate a wider range of training set structures—including the full disparity between the different levels of fidelity—than existing kernel approaches based on Δ-learning although we show that the accuracy of the two approaches can be similar. Consequently, multitask regression can be a tool for reducing data generation costs even further by opportunistically exploiting existing data sources.en_US
dc.language.isoen
dc.publisherAIP Publishingen_US
dc.relation.isversionofhttps://doi.org/10.1063/5.0201681en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAIP Publishingen_US
dc.titleMultitask methods for predicting molecular properties from heterogeneous dataen_US
dc.typeArticleen_US
dc.identifier.citationK. E. Fisher, M. F. Herbst, Y. M. Marzouk; Multitask methods for predicting molecular properties from heterogeneous data. J. Chem. Phys. 7 July 2024; 161 (1): 014114.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronauticsen_US
dc.relation.journalThe Journal of Chemical Physicsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2026-03-05T14:12:26Z
dspace.orderedauthorsFisher, KE; Herbst, MF; Marzouk, YMen_US
dspace.date.submission2026-03-05T14:12:28Z
mit.journal.volume161en_US
mit.journal.issue1en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record