Show simple item record

dc.contributor.authorEyke, Natalie S.
dc.contributor.authorGreen Jr, William H
dc.contributor.authorJensen, Klavs F
dc.date.accessioned2021-01-12T15:37:34Z
dc.date.available2021-01-12T15:37:34Z
dc.date.issued2020-08
dc.date.submitted2020-06
dc.identifier.issn2058-9883
dc.identifier.urihttps://hdl.handle.net/1721.1/129381
dc.description.abstractHigh-throughput reaction screening has emerged as a useful means of rapidly identifying the influence of key reaction variables on reaction outcomes. We show that active machine learning can further this objective by eliminating dependence on "exhaustive"screens (screens in which all possible combinations of the reaction variables of interest are examined). This is achieved through iterative selection of maximally informative experiments from the subset of all possible experiments in the domain. These experiments can be used to train accurate machine learning models that can be used to predict the outcomes of reactions that were not performed, thus reducing the overall experimental burden. To demonstrate our approach, we conduct retrospective analyses of the preexisting results of high-throughput reaction screening experiments. We compare the test set errors of models trained on actively-selected reactions to models trained on reactions selected at random from the same domain. We find that the degree to which models trained on actively-selected data outperform models trained on randomly-selected data depends on the domain being modeled, with it being possible to achieve very low test set errors when the dataset is heavily skewed in favor of low- or zero-yielding reactions. Our results confirm that this algorithm is a useful experiment planning tool that can change the reaction screening paradigm, by allowing medicinal and process chemists to focus their reaction screening efforts on the generation of a small amount of high-quality data.en_US
dc.language.isoen
dc.publisherRoyal Society of Chemistry (RSC)en_US
dc.relation.isversionof10.1039/D0RE00232Aen_US
dc.rightsCreative Commons Attribution Noncommercial 3.0 unported licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc/3.0/en_US
dc.sourceRoyal Society of Chemistry (RSC)en_US
dc.titleIterative experimental design based on active machine learning reduces the experimental burden associated with reaction screeningen_US
dc.typeArticleen_US
dc.identifier.citationEyke, Natalie S. et al. “Iterative experimental design based on active machine learning reduces the experimental burden associated with reaction screening.” Reaction Chemistry and Engineering 5, 10 (August 2020): 1963–1972 © 2020 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.relation.journalReaction Chemistry and Engineeringen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2020-12-21T15:14:29Z
dspace.orderedauthorsEyke, NS; Green, WH; Jensen, KFen_US
dspace.date.submission2020-12-21T15:14:34Z
mit.journal.volume5en_US
mit.journal.issue10en_US
mit.licensePUBLISHER_CC
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record