Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/133389.2

Show simple item record

dc.contributor.authorGao, Wenhao
dc.contributor.authorColey, Connor W
dc.date.accessioned2021-10-27T19:52:34Z
dc.date.available2021-10-27T19:52:34Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/133389
dc.description.abstract© 2020 American Chemical Society. All rights reserved. The discovery of functional molecules is an expensive and time-consuming process, exemplified by the rising costs of small molecule therapeutic discovery. One class of techniques of growing interest for early stage drug discovery is de novo molecular generation and optimization, catalyzed by the development of new deep learning approaches. These techniques can suggest novel molecular structures intended to maximize a multiobjective function, e.g., suitability as a therapeutic against a particular target, without relying on brute-force exploration of a chemical space. However, the utility of these approaches is stymied by ignorance of synthesizability. To highlight the severity of this issue, we use a data-driven computer-aided synthesis planning program to quantify how often molecules proposed by state-of-the-art generative models cannot be readily synthesized. Our analysis demonstrates that there are several tasks for which these models generate unrealistic molecular structures despite performing well on popular quantitative benchmarks. Synthetic complexity heuristics can successfully bias generation toward synthetically tractable chemical space, although doing so necessarily detracts from the primary objective. This analysis suggests that to improve the utility of these models in real discovery workflows, new algorithm development is warranted.
dc.language.isoen
dc.publisherAmerican Chemical Society (ACS)
dc.relation.isversionof10.1021/ACS.JCIM.0C00174
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/
dc.sourcearXiv
dc.titleThe Synthesizability of Molecules Proposed by Generative Models
dc.typeArticle
dc.relation.journalJournal of Chemical Information and Modeling
dc.eprint.versionOriginal manuscript
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/NonPeerReviewed
dc.date.updated2021-06-09T16:33:11Z
dspace.orderedauthorsGao, W; Coley, CW
dspace.date.submission2021-06-09T16:33:13Z
mit.journal.volume60
mit.journal.issue12
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version