Efficient integration of heterogeneous single-cell transcriptomes using Scanorama
Author(s)
Hie, Brian; Bryson, Bryan D.; Berger Leighton, Bonnie
DownloadAccepted version (1.074Mb)
Terms of use
Metadata
Show full item recordAbstract
ntegration of single-cell RNA sequencing (scRNA-seq) data from multiple experiments, laboratories and technologies can uncover biological insights, but current methods for scRNA-seq data integration are limited by a requirement for datasets to derive from functionally similar cells. We present Scanorama, an algorithm that identifies and merges the shared cell types among all pairs of datasets and accurately integrates heterogeneous collections of scRNA-seq data. We applied Scanorama to integrate and remove batch effects across 105,476 cells from 26 diverse scRNA-seq experiments representing 9 different technologies. Scanorama is sensitive to subtle temporal changes within the same cell lineage, successfully integrating functionally similar cells across time series data of CD14⁺ monocytes at different stages of differentiation into macrophages. Finally, we show that Scanorama is orders of magnitude faster than existing techniques and can integrate a collection of 1,095,538 cells in just ~9 h.
Date issued
2019-05Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Biological Engineering; Massachusetts Institute of Technology. Department of MathematicsJournal
Nature Biotechnology
Publisher
Springer Science and Business Media LLC
Citation
Hie, Brian et al. "Efficient integration of heterogeneous single-cell transcriptomes using Scanorama." Nature Biotechnology (May 2019): 685–691 © 2019 Springer Nature
Version: Author's final manuscript
ISSN
1087-0156
1546-1696