Show simple item record

dc.contributor.authorNandy, Aditya
dc.contributor.authorDuan, Chenru
dc.contributor.authorJanet, Jon Paul
dc.contributor.authorGugler, Stefan O
dc.contributor.authorKulik, Heather Janine
dc.date.accessioned2019-11-11T19:16:07Z
dc.date.available2019-11-11T19:16:07Z
dc.date.issued2018-09
dc.date.submitted2018-08
dc.identifier.issn0888-5885
dc.identifier.urihttps://hdl.handle.net/1721.1/122820
dc.description.abstractMachine learning the electronic structure of open shell transition metal complexes presents unique challenges, including robust and automated data set generation. Here, we introduce tools that simplify data acquisition from density functional theory (DFT) and validation of trained machine learning models using the molSimplify automatic design (mAD) workflow. We demonstrate this workflow by training and comparing the performance of LASSO, kernel ridge regression (KRR), and artificial neural network (ANN) models using heuristic, topological revised autocorrelation (RAC) descriptors we have recently introduced for machine learning inorganic chemistry. On a series of open shell transition metal complexes, we evaluate set aside test errors of these models for predicting the HOMO level and HOMO-LUMO gap. The best performing models are ANNs, which show 0.15 and 0.25 eV test set mean absolute errors on the HOMO level and HOMO-LUMO gap, respectively. Poor performing KRR models using the full 153-feature RAC set are improved to nearly the same performance as the ANNs when trained on down-selected subsets of 20-30 features. Analysis of the essential descriptors for HOMO level and HOMO-LUMO gap prediction as well as comparison to subsets previously obtained for other properties reveal the paramount importance of nonlocal, steric properties in determining frontier molecular orbital energetics. We demonstrate our model performance on diverse complexes and in the discovery of molecules with target HOMO-LUMO gaps from a large 15,000 molecule design space in minutes rather than days that full DFT evaluation would require.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N00014-17-1-2956)en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N00014-18-1-2434)en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant D18AP00039)en_US
dc.description.sponsorshipUnited States. Department of Energy (Grant DE-SC0018096)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Grant CBET-1704266)en_US
dc.language.isoen
dc.publisherAmerican Chemical Societyen_US
dc.relation.isversionofhttp://dx.doi.org/10.1021/ACS.IECR.8B04015en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceOther repositoryen_US
dc.titleStrategies and Software for Machine Learning Accelerated Discovery in Transition Metal Chemistryen_US
dc.typeArticleen_US
dc.identifier.citationNandy, Aditya et al. "Strategies and software for machine learning accelerated discovery in transition metal chemistry." Industrial & Engineering Chemistry Research 57, 42 (2018): 13973-13986 © 2018 American Chemical Societyen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemistryen_US
dc.relation.journalIndustrial & Engineering Chemistry Researchen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2019-08-22T15:42:34Z
dspace.date.submission2019-08-22T15:42:37Z
mit.journal.volume57en_US
mit.journal.issue42en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record