Show simple item record

dc.contributor.advisorJacobson, Joseph
dc.contributor.authorRadev, Simeon
dc.date.accessioned2024-09-24T18:23:01Z
dc.date.available2024-09-24T18:23:01Z
dc.date.issued2024-05
dc.date.submitted2024-08-05T13:48:26.256Z
dc.identifier.urihttps://hdl.handle.net/1721.1/156959
dc.description.abstractA classical problem of machine learning is the interpretability of a model’s latent information processing. This is particularly the case in the richly complex field of protein analysis, whereby unique and novel insights into the structural organization of proteins can help illuminate their functional space, and in particular lead toward a factorization of the structural space into a set of motif building blocks, which completely span this universe. This thesis creates a new inference interface for performing such analysis, by leveraging the sequential learning process of a neural autoencoder to construct a decomposition of proteins as a hierarchical sequence of embedded representation vectors. The further development of this work could lead to a greater understanding of the organizational complexity of natural phenomena, and in particular, as it relates to the uniquely complex relationship between protein structures and their function.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://rightsstatements.org/page/InC-EDU/1.0/
dc.titleTowards a Prime Factorization of Proteins
dc.typeThesis
dc.description.degreeS.M.
dc.contributor.departmentProgram in Media Arts and Sciences (Massachusetts Institute of Technology)
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science in Media Arts and Sciences


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record