Multi-domain translation between single-cell imaging and sequencing data using autoencoders
Author(s)
Yang, Karren Dai; Belyaeva, Anastasiya; Venkatachalapathy, Saradha; Damodaran, Karthik; Katcoff, Abigail; Radhakrishnan, Adityanarayanan; Shivashankar, GV; Uhler, Caroline
Publisher with Creative Commons License
Creative Commons Attribution
Abstract
© 2021, The Author(s). The development of single-cell methods for capturing different data modalities including imaging and sequencing has revolutionized our ability to identify heterogeneous cell states. Different data modalities provide different perspectives on a population of cells, and their integration is critical for studying cellular heterogeneity and its function. While various methods have been proposed to integrate different sequencing data modalities, coupling imaging and sequencing has been an open challenge. Here we present an approach for integrating vastly different modalities by learning a probabilistic coupling between the different data modalities using autoencoders to map to a shared latent space. We validate this approach by integrating single-cell RNA-seq and chromatin images to identify distinct subpopulations of human naive CD4+ T-cells that are poised for activation. Collectively, our approach provides a framework to integrate and translate between data modalities that cannot yet be measured within the same cell for diverse applications in biomedical discovery.
Date issued
2021
Department
Massachusetts Institute of Technology. Laboratory for Information and Decision Systems; Massachusetts Institute of Technology. Institute for Data, Systems, and Society; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Journal
Nature Communications
Publisher
Springer Science and Business Media LLC