| dc.contributor.advisor | Jaakkola, Tommi S. | |
| dc.contributor.advisor | Barzilay, Regina | |
| dc.contributor.author | Yim, Jason | |
| dc.date.accessioned | 2023-03-31T14:41:12Z | |
| dc.date.available | 2023-03-31T14:41:12Z | |
| dc.date.issued | 2023-02 | |
| dc.date.submitted | 2023-02-28T14:36:10.371Z | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/150230 | |
| dc.description.abstract | Construction of a scaffold structure that supports a desired motif, conferring protein function, shows promise for the design of vaccines and enzymes. But a general solution to this motif-scaffolding problem remains open. Current machine-learning techniques for scaffold design are either limited to unrealistically small scaffolds (up to length 20) or struggle to produce multiple diverse scaffolds. We propose to learn a distribution over diverse and longer protein backbone structures via an E(3)-equivariant graph neural network. We develop SMCDiff to efficiently sample scaffolds from this distribution conditioned on a given motif; our algorithm is the first to theoretically guarantee conditional samples from a diffusion model in the large-compute limit. We evaluate our designed backbones by how well they align with AlphaFold2-predicted structures. We show that our method can (1) sample scaffolds up to 80 residues and (2) achieve structurally diverse scaffolds for a fixed motif. | |
| dc.publisher | Massachusetts Institute of Technology | |
| dc.rights | In Copyright - Educational Use Permitted | |
| dc.rights | Copyright MIT | |
| dc.rights.uri | http://rightsstatements.org/page/InC-EDU/1.0/ | |
| dc.title | Diffusion Probabilistic Modeling of Protein Backbones in 3D for the Motif-Scaffolding problem | |
| dc.type | Thesis | |
| dc.description.degree | S.M. | |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
| mit.thesis.degree | Master | |
| thesis.degree.name | Master of Science in Electrical Engineering and Computer Science | |