Show simple item record

dc.contributor.authorMeyer, Ralf
dc.contributor.authorChu, Daniel BK
dc.contributor.authorKulik, Heather J
dc.date.accessioned2025-09-22T20:56:11Z
dc.date.available2025-09-22T20:56:11Z
dc.date.issued2025-01-06
dc.identifier.urihttps://hdl.handle.net/1721.1/162780
dc.description.abstractGraph-based machine learning (ML) models for material properties show great potential to accelerate virtual high-throughput screening of large chemical spaces. However, in their simplest forms, graph-based models do not include any 3D information and are unable to distinguish stereoisomers such as those arising from different orderings of ligands around a metal center in coordination complexes. In this work we present a modification to revised autocorrelation descriptors, a molecular graph featurization method, for predicting spin state dependent properties of octahedral transition metal complexes (TMCs). Inspired by analytical semi-empirical models for TMCs, the new modeling strategy is based on the many-body expansion (MBE) and allows one to tune the captured stereoisomer information by changing the truncation order of the MBE. We present the necessary modifications to include this approach in two commonly used ML methods, kernel ridge regression and feed-forward neural networks. On a test set composed of all possible isomers of binary TMCs, the best MBE models achieve mean absolute errors (MAEs) of 2.75 kcal mol−1 on spin-splitting energies and 0.26 eV on frontier orbital energy gaps, a 30%–40% reduction in error compared to models based on our previous approach. We also observe improved generalization to previously unseen ligands where the best-performing models exhibit MAEs of 4.00 kcal mol−1 (i.e. a 0.73 kcal mol−1 reduction) on the spin-splitting energies and 0.53 eV (i.e. a 0.10 eV reduction) on the frontier orbital energy gaps. Because the new approach incorporates insights from electronic structure theory, such as ligand additivity relationships, these models exhibit systematic generalization from homoleptic to heteroleptic complexes, allowing for efficient screening of TMC search spaces.en_US
dc.language.isoen
dc.publisherIOP Publishingen_US
dc.relation.isversionofhttps://doi.org/10.1088/2632-2153/ad9f22en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceIOP Publishingen_US
dc.titleMany-body expansion based machine learning models for octahedral transition metal complexesen_US
dc.typeArticleen_US
dc.identifier.citationRalf Meyer et al 2024 Mach. Learn.: Sci. Technol. 5 045080en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemistryen_US
dc.relation.journalMachine Learning: Science and Technologyen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2025-09-22T20:51:03Z
dspace.orderedauthorsMeyer, R; Chu, DBK; Kulik, HJen_US
dspace.date.submission2025-09-22T20:51:05Z
mit.journal.volume5en_US
mit.journal.issue4en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record