Show simple item record

dc.contributor.advisorBepler, Tristan
dc.contributor.authorRam, Soumya
dc.date.accessioned2022-01-14T15:04:53Z
dc.date.available2022-01-14T15:04:53Z
dc.date.issued2021-06
dc.date.submitted2021-06-17T20:14:07.377Z
dc.identifier.urihttps://hdl.handle.net/1721.1/139337
dc.description.abstractProtein engineering has the potential to solve complex global problems in medicine, clean energy, and manufacturing. However, current protein engineering efforts are hampered by a lack of supervised data. We help recitify this issue by developing supervised models that perform well in data-constrained settings by generalizing across protein engineering tasks and better incorporating coevolutionary and structural information. We also develop an unsupervised language model that conditions the target sequence on its multiple sequence alignment, allowing us to better model protein families.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright MIT
dc.rights.urihttp://rightsstatements.org/page/InC-EDU/1.0/
dc.titleUsing Co-evolutionary Information to Improve Protein Language Modelling
dc.typeThesis
dc.description.degreeM.Eng.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degreeMaster
thesis.degree.nameMaster of Engineering in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record