MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Deciphering eukaryotic gene-regulatory logic with 100 million random promoters

Author(s)
de Boer, Carl G.; Vaishnav, Eeshit Dhaval; Sadeh, Ronen; Abeyta, Esteban Luis; Friedman, Nir; Regev, Aviv; ... Show more Show less
Thumbnail
DownloadAccepted version (2.246Mb)
Publisher Policy

Publisher Policy

Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.

Terms of use
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Metadata
Show full item record
Abstract
How transcription factors (TFs) interpret cis-regulatory DNA sequence to control gene expression remains unclear, largely because past studies using native and engineered sequences had insufficient scale. Here, we measure the expression output of >100 million synthetic yeast promoter sequences that are fully random. These sequences yield diverse, reproducible expression levels that can be explained by their chance inclusion of functional TF binding sites. We use machine learning to build interpretable models of transcriptional regulation that predict ~94% of the expression driven from independent test promoters and ~89% of the expression driven from native yeast promoter fragments. These models allow us to characterize each TF’s specificity, activity and interactions with chromatin. TF activity depends on binding-site strand, position, DNA helical face and chromatin context. Notably, expression level is influenced by weak regulatory interactions, which confound designed-sequence studies. Our analyses show that massive-throughput assays of fully random DNA can provide the big data necessary to develop complex, predictive models of gene regulation. ©2019, The Author(s), under exclusive licence to Springer Nature America, Inc.
Date issued
2019-12
URI
https://hdl.handle.net/1721.1/125982
Department
Broad Institute of MIT and Harvard; Massachusetts Institute of Technology. Department of Biology; Koch Institute for Integrative Cancer Research at MIT
Journal
Nature Biotechnology
Publisher
Springer Science and Business Media LLC
Citation
de Boer, Carl G. et al., "Deciphering eukaryotic gene-regulatory logic with 100 million random promoters." Nature Biotechnology 38, 1 (January 2020): p. 56–65 doi. 10.1038/s41587-019-0315-8 ©2019 Author(s)
Version: Author's final manuscript
ISSN
1546-1696

Collections
  • MIT Open Access Articles

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.