Representation Learning Associates Patients’ Risks for Metabolic Diseases with Features of Their Lipocytes
Author(s)
Tan, Zipei
DownloadThesis PDF (1.950Mb)
Advisor
Uhler, Caroline
Terms of use
Metadata
Show full item recordAbstract
Polygenic risk scores (PRS) estimate an individual’s risk of developing a certain disease, suggesting that differences between cells of individuals with high versus low PRS could give us insight into the cellular disease mechanisms. To study metabolic diseases, we analyze the distribution of cell states of lipocytes of individuals with different PRS for metabolic diseases, thereby associating individual-level genotypes with cell-level features. To accomplish this, we make use of a recent large-scale lipocyte microscopy imaging dataset. By learning a representation of multi-channel lipocyte microscopy images using a convolutional autoencoder, we perform unsupervised clustering on the learnt representations to identify different cell states. We analyze the distribution of these cell states in different individuals and associate their PRS to the observed cell state distributions. Finally, we show that it is possible to generate counterfactual lipocyte images and understand the effect of increased or reduced PRS on cell states through transforming the learnt representations.
Date issued
2024-05Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology