Neural graph representation learning with application to chemistry
Author(s)
Jin, Wengong
DownloadFull printable version (3.427Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
Regina Barzilay.
Terms of use
Metadata
Show full item recordAbstract
This thesis focus on deep learning algorithms for learning continuous representation of molecular graphs, a much more compact representation than traditional fingerprints. We demonstrate its better predictive performance in two tasks. First, we seek to automate the prediction of organic reaction outcomes. The previous solution utilizes reaction templates to limit the space, but it suffers from coverage and efficiency issues due to its discrete nature. We propose a template-free approach to efficiently explore the space of product molecules by pinpointing the reaction center. The candidates products are scored by a Weisfeiler-Lehman Difference Network that models high-order interactions between changes occurring at nodes across the molecule. Our framework outperforms the top-performing template-based approach with a 10% margin, while running orders of magnitude faster. Moreover, we demonstrate that the model accuracy rivals the performance of domain experts. Secondly, we seek to automate the design of molecules based on specific chemical properties. Our primary contribution is the direct realization of molecular graphs from continuous space. Our junction tree variational autoencoder generates molecular graphs in two phases, by first generating a tree-structured scaffold over chemical substructures, and then combining them into a molecule with a graph message passing network. This approach allows us to incrementally expand molecules while maintaining chemical validity at every step. We evaluate our model on multiple tasks ranging from molecular generation to optimization. Across these tasks, our model outperforms previous state-of-the-art baselines by a significant margin.
Description
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 49-53).
Date issued
2018Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.