Show simple item record

dc.contributor.authorde Boer, Carl G.
dc.contributor.authorRegev, Aviv
dc.date.accessioned2018-07-11T13:35:51Z
dc.date.available2018-07-11T13:35:51Z
dc.date.issued2018-07
dc.date.submitted2017-09
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/1721.1/116880
dc.description.abstractBackground: Variation in chromatin organization across single cells can help shed important light on the mechanisms controlling gene expression, but scale, noise, and sparsity pose significant challenges for interpretation of single cell chromatin data. Here, we develop BROCKMAN (Brockman Representation Of Chromatin by K-mers in Mark-Associated Nucleotides), an approach to infer variation in transcription factor (TF) activity across samples through unsupervised analysis of the variation in DNA sequences associated with an epigenomic mark. Results: BROCKMAN represents each sample as a vector of epigenomic-mark-associated DNA word frequencies, and decomposes the resulting matrix to find hidden structure in the data, followed by unsupervised grouping of samples and identification of the TFs that distinguish groups. Applied to single cell ATAC-seq, BROCKMAN readily distinguished cell types, treatments, batch effects, experimental artifacts, and cycling cells. We show that each variable component in the k-mer landscape reflects a set of co-varying TFs, which are often known to physically interact. For example, in K562 cells, AP-1 TFs were central determinant of variability in chromatin accessibility through their variable expression levels and diverse interactions with other TFs. We provide a theoretical basis for why cooperative TF binding – and any associated epigenomic mark – is inherently more variable than non-cooperative binding. Conclusions: BROCKMAN and related approaches will help gain a mechanistic understanding of the trans determinants of chromatin variability between cells, treatments, and individuals. Keywords: Single-cell, Epigenome, Chromatin, scATAC-seq, K-mer, N-gram, Factorization, Decomposition, Clustering, Transcription factoren_US
dc.description.sponsorshipNational Human Genome Research Institute (U.S.) (Centers of Excellence in Genomic Science Grant)en_US
dc.description.sponsorshipHoward Hughes Medical Institute (Centers of Excellence in Genomic Science Grant)en_US
dc.publisherBioMed Centralen_US
dc.relation.isversionofhttps://doi.org/10.1186/s12859-018-2255-6en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en_US
dc.sourceBioMed Centralen_US
dc.titleBROCKMAN: deciphering variance in epigenomic regulators by k-mer factorizationen_US
dc.typeArticleen_US
dc.identifier.citationde Boer, Carl G., and Aviv Regev. “BROCKMAN: Deciphering Variance in Epigenomic Regulators by k-Mer Factorization.” BMC Bioinformatics, vol. 19, no. 1, Dec. 2018. © 2018 The Authorsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Biologyen_US
dc.contributor.mitauthorRegev, Aviv
dc.relation.journalBMC Bioinformaticsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2018-07-08T03:43:12Z
dc.language.rfc3066en
dc.rights.holderThe Author(s).
dspace.orderedauthorsde Boer, Carl G.; Regev, Aviven_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0001-8567-2049
mit.licensePUBLISHER_CCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record