A Transformer for scATAC-scRNA Translation
Author(s)
Jin, Roger
DownloadThesis PDF (2.576Mb)
Advisor
Kellis, Manolis
Terms of use
Metadata
Show full item recordAbstract
scATAC-seq gives a comprehensive picture of the chromatin accessibility profile of a cell, covering not only protein-coding regions but also non-coding regulatory regions which are in theory missed by scRNA-seq. However, scATAC-seq data is highdimensional and noisy, aspects which when compounded with data scarcity present challenges for modeling on even seemingly-simple downstream tasks such as cell-type prediction. As such, researchers may benefit from access to a large library of models to evaluate. While we do not demonstrate state of the art results in any capacity, we provide an implementation of a simple representation of sparse tabular data that allows it to be inputted into the popular transformer family of architectures, and use this representation to train a transformer that predicts scRNA-seq given scATAC-seq. Our code is made available here https://github.com/rogershijin/GANOLI.
Date issued
2022-09Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology