Show simple item record

dc.contributor.authorHaas, Brittany C
dc.contributor.authorHardy, Melissa A
dc.contributor.authorSowndarya S. V., Shree
dc.contributor.authorAdams, Keir
dc.contributor.authorColey, Connor W
dc.contributor.authorPaton, Robert S
dc.contributor.authorSigman, Matthew S
dc.date.accessioned2025-01-28T16:55:47Z
dc.date.available2025-01-28T16:55:47Z
dc.date.issued2025-01-15
dc.identifier.urihttps://hdl.handle.net/1721.1/158096
dc.description.abstractData-driven reaction discovery and development is a growing field that relies on the use of molecular descriptors to capture key information about substrates, ligands, and targets. Broad adaptation of this strategy is hindered by the associated computational cost of descriptor calculation, especially when considering conformational flexibility. Descriptor libraries can be precomputed agnostic of application to reduce the computational burden of data-driven reaction development. However, as one often applies these models to evaluate novel hypothetical structures, it would be ideal to predict the descriptors of compounds on-the-fly. Herein, we report DFT-level descriptor libraries for conformational ensembles of 8528 carboxylic acids and 8172 alkyl amines towards this goal. Employing 2D and 3D graph neural network architectures trained on these libraries culminated in the development of predictive models for molecule-level descriptors, as well as the bond- and atom-level descriptors for the conserved reactive site (carboxylic acid or amine). The predictions were confirmed to be robust for an external validation set of medicinally-relevant carboxylic acids and alkyl amines. Additionally, a retrospective study correlating the rate of amide coupling reactions demonstrated the suitability of the predicted DFT-level descriptors for downstream applications. Ultimately, these models enable high-fidelity predictions for a vast number of potential substrates, greatly increasing accessibility to the field of data-driven reaction development.en_US
dc.language.isoen
dc.publisherRoyal Society of Chemistryen_US
dc.relation.isversionofhttps://doi.org/10.1039/D4DD00284Aen_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceRoyal Society of Chemistryen_US
dc.titleRapid prediction of conformationally-dependent DFT-level descriptors using graph neural networks for carboxylic acids and alkyl aminesen_US
dc.typeArticleen_US
dc.identifier.citationHaas, Brittany C, Hardy, Melissa A, Sowndarya S. V., Shree, Adams, Keir, Coley, Connor W et al. 2025. "Rapid prediction of conformationally-dependent DFT-level descriptors using graph neural networks for carboxylic acids and alkyl amines." Digital Discovery, 4 (1).
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.relation.journalDigital Discoveryen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2025-01-28T16:49:06Z
dspace.orderedauthorsHaas, BC; Hardy, MA; Sowndarya S. V., S; Adams, K; Coley, CW; Paton, RS; Sigman, MSen_US
dspace.date.submission2025-01-28T16:49:07Z
mit.journal.volume4en_US
mit.journal.issue1en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record