Show simple item record

dc.contributor.advisorEsvelt, Kevin Michael
dc.contributor.authorEthan Chase Alley
dc.date.accessioned2022-03-03T19:28:35Z
dc.date.available2022-03-03T19:28:35Z
dc.date.issued2021-06
dc.date.submitted2022-02-27T16:47:23.914Z
dc.identifier.urihttps://hdl.handle.net/1721.1/140985
dc.description.abstractThe promise of biotechnology is tempered by its potential for accidental or deliberate misuse. Reliably identifying provenance by examining telltale signatures characteristic to different genetic designers, termed genetic engineering attribution, would deter misuse, yet is still considered unsolved. In this work, we present analysis of the biosecurity implications of improved tools for attribution, arguing that the technology has robust co-benefits for deterring misuse and promoting responsible innovation. Then, we demonstrate that recurrent neural networks trained on DNA motifs and basic phenotype data can reach 70% attribution accuracy distinguishing between over 1,300 labs. To make these models usable in practice, we introduce a framework for weighing predictions against other investigative evidence using calibration, and bring our model to within 1.6% of perfect calibration. Additionally, we demonstrate that simple models can accurately predict both the nation-state-of-origin and ancestor labs, forming the foundation of an integrated attribution toolkit which should promote responsible innovation and international security alike. Finally, we discuss ongoing work to crowdsource improved attribution tools via an open data science challenge.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright MIT
dc.rights.urihttp://rightsstatements.org/page/InC-EDU/1.0/
dc.titleMachine learning to promote transparent provenance of genetic engineering
dc.typeThesis
dc.description.degreeS.M.
dc.contributor.departmentProgram in Media Arts and Sciences (Massachusetts Institute of Technology)
dc.identifier.orcid0000-0002-8219-7382
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record