Show simple item record

dc.contributor.authorGuan, Yanfei
dc.contributor.authorColey, Connor Wilson
dc.contributor.authorWu, Haoyang
dc.contributor.authorRanasinghe, Duminda S
dc.contributor.authorHeid, Esther
dc.contributor.authorStruble, Thomas J
dc.contributor.authorPattanaik, Lagnajit
dc.contributor.authorGreen Jr, William H
dc.contributor.authorJensen, Klavs F
dc.date.accessioned2022-07-05T13:44:08Z
dc.date.available2021-10-27T19:53:02Z
dc.date.available2022-07-05T13:44:08Z
dc.date.issued2021
dc.identifier.urihttps://hdl.handle.net/1721.1/133475.2
dc.description.abstract© The Royal Society of Chemistry 2021. Accurate and rapid evaluation of whether substrates can undergo the desired the transformation is crucial and challenging for both human knowledge and computer predictions. Despite the potential of machine learning in predicting chemical reactivity such as selectivity, popular feature engineering and learning methods are either time-consuming or data-hungry. We introduce a new method that combines machine-learned reaction representation with selected quantum mechanical descriptors to predict regio-selectivity in general substitution reactions. We construct a reactivity descriptor database based onab initiocalculations of 130k organic molecules, and train a multi-task constrained model to calculate demanded descriptors on-the-fly. The proposed platform enhances the inter/extra-polated performance for regio-selectivity predictions and enables learning from small datasets with just hundreds of examples. Furthermore, the proposed protocol is demonstrated to be generally applicable to a diverse range of chemical spaces. For three general types of substitution reactions (aromatic C-H functionalization, aromatic C-X substitution, and other substitution reactions) curated from a commercial database, the fusion model achieves 89.7%, 96.7%, and 97.2% top-1 accuracy in predicting the major outcome, respectively, each using 5000 training reactions. Using predicted descriptors, the fusion model is end-to-end, and requires approximately only 70 ms per reaction to predict the selectivity from reaction SMILES strings.en_US
dc.language.isoen
dc.publisherRoyal Society of Chemistry (RSC)en_US
dc.relation.isversionof10.1039/d0sc04823ben_US
dc.rightsCreative Commons Attribution 3.0 unported licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/3.0/en_US
dc.sourceRoyal Society of Chemistry (RSC)en_US
dc.titleRegio-selectivity prediction with a machine-learned reaction representation and on-the-fly quantum mechanical descriptorsen_US
dc.typeArticleen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.relation.journalChemical Scienceen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2021-06-09T16:42:56Z
dspace.orderedauthorsguan, Y; Coley, CW; Wu, H; Duminda, R; Heid, E; Struble, TJ; Pattanaik, L; Green, WH; Jensen, KFen_US
dspace.date.submission2021-06-09T16:42:58Z
mit.journal.volume12en_US
mit.journal.issue6en_US
mit.licensePUBLISHER_CC
mit.metadata.statusPublication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version