DSpace@MIT, MIT Libraries

Machine learning to promote transparent provenance of genetic engineering

Author(s)
Ethan Chase Alley
Advisor
Esvelt, Kevin Michael
Terms of use
In Copyright - Educational Use Permitted. Copyright MIT. http://rightsstatements.org/page/InC-EDU/1.0/
Abstract
The promise of biotechnology is tempered by its potential for accidental or deliberate misuse. Reliably identifying provenance by examining telltale signatures characteristic of different genetic designers, termed genetic engineering attribution, would deter misuse, yet the problem is still considered unsolved. In this work, we present an analysis of the biosecurity implications of improved tools for attribution, arguing that the technology has robust co-benefits for deterring misuse and promoting responsible innovation. Then, we demonstrate that recurrent neural networks trained on DNA motifs and basic phenotype data can reach 70% attribution accuracy when distinguishing between over 1,300 labs. To make these models usable in practice, we introduce a framework for weighing predictions against other investigative evidence using calibration, and bring our model to within 1.6% of perfect calibration. Additionally, we demonstrate that simple models can accurately predict both the nation-state of origin and ancestor labs, forming the foundation of an integrated attribution toolkit that should promote responsible innovation and international security alike. Finally, we discuss ongoing work to crowdsource improved attribution tools via an open data science challenge.
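As a rough illustration of what "calibration" means for a classifier like the one described in the abstract, the sketch below computes expected calibration error (ECE), a common way to measure the gap between a model's stated confidence and its observed accuracy. The abstract does not say which calibration metric the thesis uses, so ECE, the function name `expected_calibration_error`, and the choice of 10 equal-width bins are all assumptions made for illustration only.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Hypothetical helper: ECE over equal-width confidence bins.

    confidences: top-class probability for each prediction, in [0, 1]
    correct:     1 if that prediction was right, else 0
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)

    # Interior bin edges; np.digitize then maps each confidence to 0..n_bins-1.
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    bin_ids = np.digitize(confidences, edges[1:-1])

    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if not mask.any():
            continue  # empty bins contribute nothing
        # Gap between observed accuracy and mean stated confidence in this bin,
        # weighted by the fraction of predictions that fall in the bin.
        gap = abs(correct[mask].mean() - confidences[mask].mean())
        ece += mask.mean() * gap
    return ece
```

Under this measure, a model that reports 90% confidence and is right 9 times out of 10 scores close to zero, while one that reports 80% confidence and is always right scores about 0.2.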
Date issued
2021-06
URI
https://hdl.handle.net/1721.1/140985
Department
Program in Media Arts and Sciences (Massachusetts Institute of Technology)
Publisher
Massachusetts Institute of Technology

Collections
  • Graduate Theses

Content created by the MIT Libraries, CC BY-NC unless otherwise noted.