Show simple item record

dc.contributor.authorde Boer, Carl G.
dc.contributor.authorVaishnav, Eeshit Dhaval
dc.contributor.authorSadeh, Ronen
dc.contributor.authorAbeyta, Esteban Luis
dc.contributor.authorFriedman, Nir
dc.contributor.authorRegev, Aviv
dc.date.accessioned2020-06-25T17:07:04Z
dc.date.available2020-06-25T17:07:04Z
dc.date.issued2019-12
dc.date.submitted2019-01
dc.identifier.issn1546-1696
dc.identifier.urihttps://hdl.handle.net/1721.1/125982
dc.description.abstractHow transcription factors (TFs) interpret cis-regulatory DNA sequence to control gene expression remains unclear, largely because past studies using native and engineered sequences had insufficient scale. Here, we measure the expression output of >100 million synthetic yeast promoter sequences that are fully random. These sequences yield diverse, reproducible expression levels that can be explained by their chance inclusion of functional TF binding sites. We use machine learning to build interpretable models of transcriptional regulation that predict ~94% of the expression driven from independent test promoters and ~89% of the expression driven from native yeast promoter fragments. These models allow us to characterize each TF’s specificity, activity and interactions with chromatin. TF activity depends on binding-site strand, position, DNA helical face and chromatin context. Notably, expression level is influenced by weak regulatory interactions, which confound designed-sequence studies. Our analyses show that massive-throughput assays of fully random DNA can provide the big data necessary to develop complex, predictive models of gene regulation. ©2019, The Author(s), under exclusive licence to Springer Nature America, Inc.en_US
dc.description.sponsorshipNIH (grant no. K99-HG009920-01)en_US
dc.description.sponsorshipFellowship from the Canadian Institutes for Health Researchen_US
dc.description.sponsorshipMIT Presidential Fellowshipen_US
dc.language.isoen
dc.publisherSpringer Science and Business Media LLCen_US
dc.relation.isversionofhttps://dx.doi.org/10.1038/s41587-019-0315-8en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourcePMCen_US
dc.titleDeciphering eukaryotic gene-regulatory logic with 100 million random promotersen_US
dc.typeArticleen_US
dc.identifier.citationde Boer, Carl G. et al., "Deciphering eukaryotic gene-regulatory logic with 100 million random promoters." Nature Biotechnology 38, 1 (January 2020): p. 56–65 doi. 10.1038/s41587-019-0315-8 ©2019 Author(s)en_US
dc.contributor.departmentBroad Institute of MIT and Harvarden_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Biologyen_US
dc.contributor.departmentKoch Institute for Integrative Cancer Research at MITen_US
dc.relation.journalNature Biotechnologyen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2020-06-19T17:41:43Z
dspace.date.submission2020-06-19T17:41:46Z
mit.journal.volume38en_US
mit.journal.issue1en_US
mit.licensePUBLISHER_POLICY
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record