Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes
Author(s)
Lin, Michael F.; Kheradpour, Pouya; Mag Washietl, Stefan; Parker, Brian J.; Pedersen, Jakob S.; Kellis, Manolis; ... Show more Show less
DownloadKellis_Locating protein-coding.pdf (1.735Mb)
PUBLISHER_CC
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
The degeneracy of the genetic code allows protein-coding DNA and RNA sequences to simultaneously encode additional, overlapping functional elements. A sequence in which both protein-coding and additional overlapping functions have evolved under purifying selection should show increased evolutionary conservation compared to typical protein-coding genes—especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ∼2% of their synonymous sites. We collect numerous lines of evidence that the observed synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian genes, despite the vast genomic landscape.
Date issued
2011-10Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Genome Research
Publisher
Cold Spring Harbor Laboratory Press
Citation
Lin, M. F. et al. “Locating Protein-coding Sequences Under Selection for Additional, Overlapping Functions in 29 Mammalian Genomes.” Genome Research 21.11 (2011): 1916–1928. © 2011 by Cold Spring Harbor Laboratory Press
Version: Final published version
ISSN
1088-9051