Show simple item record

dc.contributor.authorYang, Kevin
dc.contributor.authorSwanson, Kyle
dc.contributor.authorJin, Wengong
dc.contributor.authorColey, Connor
dc.contributor.authorEiden, Philipp
dc.contributor.authorGao, Hua
dc.contributor.authorGuzman-Perez, Angel
dc.contributor.authorHopper, Timothy
dc.contributor.authorKelley, Brian
dc.contributor.authorMathea, Miriam
dc.contributor.authorPalmer, Andrew
dc.contributor.authorSettels, Volker
dc.contributor.authorJaakkola, Tommi
dc.contributor.authorJensen, Klavs
dc.contributor.authorBarzilay, Regina
dc.date.accessioned2021-10-27T20:05:52Z
dc.date.available2021-10-27T20:05:52Z
dc.date.issued2019
dc.identifier.urihttps://hdl.handle.net/1721.1/134630
dc.description.abstract© 2019 American Chemical Society. Advancements in neural machinery have led to a wide range of algorithmic solutions for molecular property prediction. Two classes of models in particular have yielded promising results: neural networks applied to computed molecular fingerprints or expert-crafted descriptors and graph convolutional neural networks that construct a learned molecular representation by operating on the graph structure of the molecule. However, recent literature has yet to clearly determine which of these two methods is superior when generalizing to new chemical space. Furthermore, prior research has rarely examined these new models in industry research settings in comparison to existing employed models. In this paper, we benchmark models extensively on 19 public and 16 proprietary industrial data sets spanning a wide variety of chemical end points. In addition, we introduce a graph convolutional model that consistently matches or outperforms models using fixed molecular descriptors as well as previous graph neural architectures on both public and proprietary data sets. Our empirical findings indicate that while approaches based on these representations have yet to reach the level of experimental reproducibility, our proposed model nevertheless offers significant improvements over models currently used in industrial workflows.
dc.language.isoen
dc.publisherAmerican Chemical Society (ACS)
dc.relation.isversionof10.1021/acs.jcim.9b00237
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs License
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.sourceACS
dc.titleAnalyzing Learned Molecular Representations for Property Prediction
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journalJournal of Chemical Information and Modeling
dc.eprint.versionAuthor's final manuscript
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/PeerReviewed
dc.date.updated2019-08-22T13:08:28Z
dspace.orderedauthorsYang, K; Swanson, K; Jin, W; Coley, C; Eiden, P; Gao, H; Guzman-Perez, A; Hopper, T; Kelley, B; Mathea, M; Palmer, A; Settels, V; Jaakkola, T; Jensen, K; Barzilay, R
dspace.date.submission2019-08-22T13:08:30Z
mit.journal.volume59
mit.journal.issue8
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record