NEBULA is a fast negative binomial mixed model for differential or co-expression analysis of large-scale multi-subject single-cell data
Author(s)
He, Liang; Davila-Velderrain, Jose; Sumida, Tomokazu S; Hafler, David A; Kellis, Manolis; Kulminski, Alexander M; ... Show more Show less
DownloadPublished version (3.483Mb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
<jats:title>Abstract</jats:title><jats:p>The increasing availability of single-cell data revolutionizes the understanding of biological mechanisms at cellular resolution. For differential expression analysis in multi-subject single-cell data, negative binomial mixed models account for both subject-level and cell-level overdispersions, but are computationally demanding. Here, we propose an efficient NEgative Binomial mixed model Using a Large-sample Approximation (NEBULA). The speed gain is achieved by analytically solving high-dimensional integrals instead of using the Laplace approximation. We demonstrate that NEBULA is orders of magnitude faster than existing tools and controls false-positive errors in marker gene identification and co-expression analysis. Using NEBULA in Alzheimer’s disease cohort data sets, we found that the cell-level expression of <jats:italic>APOE</jats:italic> correlated with that of other genetic risk factors (including <jats:italic>CLU, CST3, TREM2</jats:italic>, C1q, and <jats:italic>ITM2B</jats:italic>) in a cell-type-specific pattern and an isoform-dependent manner in microglia. NEBULA opens up a new avenue for the broad application of mixed models to large-scale multi-subject single-cell data.</jats:p>
Date issued
2021Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryJournal
Communications Biology
Publisher
Springer Science and Business Media LLC
Citation
He, Liang, Davila-Velderrain, Jose, Sumida, Tomokazu S, Hafler, David A, Kellis, Manolis et al. 2021. "NEBULA is a fast negative binomial mixed model for differential or co-expression analysis of large-scale multi-subject single-cell data." Communications Biology, 4 (1).
Version: Final published version