| dc.contributor.advisor | Aviv Regev. | en_US |
| dc.contributor.author | Hu, Eileen. | en_US |
| dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
| dc.date.accessioned | 2021-05-24T19:52:07Z | |
| dc.date.available | 2021-05-24T19:52:07Z | |
| dc.date.copyright | 2021 | en_US |
| dc.date.issued | 2021 | en_US |
| dc.identifier.uri | https://hdl.handle.net/1721.1/130692 | |
| dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2021 | en_US |
| dc.description | Cataloged from the official PDF of thesis. | en_US |
| dc.description | Includes bibliographical references (pages 28-30). | en_US |
| dc.description.abstract | Polygenic risk score (PRS) models are a valuable tool for assessing risk of developing a disease, given an individual's genotype. The unique combination of germline variants in each genome can be informative in risk calculations, especially in complex polygenic traits that are affected by a broad set of variants, many of which may contribute a small amount to the disease individually but combine to have significant effects. Recent studies have refined the basic linear PRS model by using functional annotations and linkage disequilibrium modeling to shift towards variant effect sizes that reflect genetic mechanisms. Here, I refine the standard PRS method to incorporate the complex, non-linear interactions between variants. I perform fine mapping on the input genome-wide association study (GWAS) variant effects using a convolutional neural network trained on regulatory feature annotations. I further extend the PRS method by calculating scores for a set of genes that form functional gene expression modules and using this partitioned PRS to learn different disease subtypes. Module-based PRS could improve clinical utility of PRS results by providing a fine-grained method for stratifying different aspects of disease risk, especially those that are based on biological mechanisms occurring in specific cell-types. | en_US |
| dc.description.statementofresponsibility | by Eileen Hu. | en_US |
| dc.format.extent | 30 pages | en_US |
| dc.language.iso | eng | en_US |
| dc.publisher | Massachusetts Institute of Technology | en_US |
| dc.rights | MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. | en_US |
| dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
| dc.subject | Electrical Engineering and Computer Science. | en_US |
| dc.title | Refining polygenic risk score models through fine mapping and functional gene modules | en_US |
| dc.type | Thesis | en_US |
| dc.description.degree | M. Eng. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.identifier.oclc | 1251799793 | en_US |
| dc.description.collection | M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science | en_US |
| dspace.imported | 2021-05-24T19:52:07Z | en_US |
| mit.thesis.degree | Master | en_US |
| mit.thesis.department | EECS | en_US |