| dc.contributor.author | Hrovatin, Karin | |
| dc.contributor.author | Moinfar, Amir Ali | |
| dc.contributor.author | Zappia, Luke | |
| dc.contributor.author | Parikh, Shrey | |
| dc.contributor.author | Lapuerta, Alejandro T. | |
| dc.contributor.author | Lengerich, Ben | |
| dc.contributor.author | Kellis, Manolis | |
| dc.contributor.author | Theis, Fabian J. | |
| dc.date.accessioned | 2025-11-18T15:41:31Z | |
| dc.date.available | 2025-11-18T15:41:31Z | |
| dc.date.issued | 2025-10-30 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/163744 | |
| dc.description.abstract | Integration of single-cell RNA-sequencing (scRNA-seq) datasets is standard in scRNA-seq analysis. Nevertheless, current computational methods struggle to harmonize datasets across systems such as species, organoids and primary tissue, or different scRNA-seq protocols, including single-cell and single-nuclei. Conditional variational autoencoders (cVAE) are a popular integration method, however, existing strategies for stronger batch correction have limitations. Increasing the Kullback–Leibler divergence regularization does not improve integration and adversarial learning removes biological signals. Here, we propose sysVI, a cVAE-based method employing VampPrior and cycle-consistency constraints. We show that sysVI integrates across systems and improves biological signals for downstream interpretation of cell states and conditions. | en_US |
| dc.publisher | BioMed Central | en_US |
| dc.relation.isversionof | https://doi.org/10.1186/s12864-025-12126-3 | en_US |
| dc.rights | Creative Commons Attribution | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.source | BioMed Central | en_US |
| dc.title | Integrating single-cell RNA-seq datasets with substantial batch effects | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Hrovatin, K., Moinfar, A., Zappia, L. et al. Integrating single-cell RNA-seq datasets with substantial batch effects. BMC Genomics 26, 974 (2025). | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Broad Institute of MIT and Harvard | en_US |
| dc.relation.journal | BMC Genomics | en_US |
| dc.identifier.mitlicense | PUBLISHER_CC | |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
| eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
| dc.date.updated | 2025-11-02T04:16:20Z | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | The Author(s) | |
| dspace.date.submission | 2025-11-02T04:16:20Z | |
| mit.journal.volume | 26 | en_US |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |