| dc.contributor.author | Blatt, Marcelo | |
| dc.contributor.author | Gusev, Alexander | |
| dc.contributor.author | Polyakov, Yuriy | |
| dc.contributor.author | Rohloff, Kurt | |
| dc.contributor.author | Vaikuntanathan, Vinod | |
| dc.date.accessioned | 2022-06-28T20:59:32Z | |
| dc.date.available | 2021-10-27T20:23:33Z | |
| dc.date.available | 2022-06-28T20:59:32Z | |
| dc.date.issued | 2020 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/135463.2 | |
| dc.description.abstract | © 2020 The Author(s). Background: Genome-Wide Association Studies (GWAS) refer to observational studies of a genome-wide set of genetic variants across many individuals to see if any genetic variants are associated with a certain trait. A typical GWAS analysis of a disease phenotype involves iterative logistic regression of a case/control phenotype on a single-neuclotide polymorphism (SNP) with quantitative covariates. GWAS have been a highly successful approach for identifying genetic-variant associations with many poorly-understood diseases. However, a major limitation of GWAS is the dependence on individual-level genotype/phenotype data and the corresponding privacy concerns. Methods: We present a solution for secure GWAS using homomorphic encryption (HE) that keeps all individual data encrypted throughout the association study. Our solution is based on an optimized semi-parallel GWAS compute model, a new Residue-Number-System (RNS) variant of the Cheon-Kim-Kim-Song (CKKS) HE scheme, novel techniques to switch between data encodings, and more than a dozen crypto-engineering optimizations. Results: Our prototype can perform the full GWAS computation for 1,000 individuals, 131,071 SNPs, and 3 covariates in about 10 minutes on a modern server computing node (with 28 cores). Our solution for a smaller dataset was awarded co-first place in iDASH'18 Track 2: "Secure Parallel Genome Wide Association Studies using HE". Conclusions: Many of the HE optimizations presented in our paper are general-purpose, and can be used in solving challenging problems with large datasets in other application domains. | en_US |
| dc.language.iso | en | |
| dc.publisher | Springer Science and Business Media LLC | en_US |
| dc.relation.isversionof | 10.1186/S12920-020-0719-9 | en_US |
| dc.rights | Creative Commons Attribution 4.0 International license | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.source | BMC | en_US |
| dc.title | Optimized homomorphic encryption solution for secure genome-wide association studies | en_US |
| dc.type | Article | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.relation.journal | BMC Medical Genomics | en_US |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
| eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
| dc.date.updated | 2021-03-19T15:40:15Z | |
| dspace.orderedauthors | Blatt, M; Gusev, A; Polyakov, Y; Rohloff, K; Vaikuntanathan, V | en_US |
| dspace.date.submission | 2021-03-19T15:40:16Z | |
| mit.journal.volume | 13 | en_US |
| mit.journal.issue | S7 | en_US |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Publication Information Needed | en_US |