Show simple item record

dc.contributor.authorKheradpour, Pouya
dc.contributor.authorKellis, Manolis
dc.date.accessioned2014-04-04T13:18:07Z
dc.date.available2014-04-04T13:18:07Z
dc.date.issued2013-12
dc.date.submitted2013-11
dc.identifier.issn0305-1048
dc.identifier.issn1362-4962
dc.identifier.urihttp://hdl.handle.net/1721.1/86012
dc.description.abstractRecent advances in technology have led to a dramatic increase in the number of available transcription factor ChIP-seq and ChIP-chip data sets. Understanding the motif content of these data sets is an important step in understanding the underlying mechanisms of regulation. Here we provide a systematic motif analysis for 427 human ChIP-seq data sets using motifs curated from the literature and also discovered de novo using five established motif discovery tools. We use a systematic pipeline for calculating motif enrichment in each data set, providing a principled way for choosing between motif variants found in the literature and for flagging potentially problematic data sets. Our analysis confirms the known specificity of 41 of the 56 analyzed factor groups and reveals motifs of potential cofactors. We also use cell type-specific binding to find factors active in specific conditions. The resource we provide is accessible both for browsing a small number of factors and for performing large-scale systematic analyses. We provide motif matrices, instances and enrichments in each of the ENCODE data sets. The motifs discovered here have been used in parallel studies to validate the specificity of antibodies, understand cooperativity between data sets and measure the variation of motif binding across individuals and species.en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (HG004037)en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (HG007000)en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (HG006991)en_US
dc.language.isoen_US
dc.publisherOxford University Pressen_US
dc.relation.isversionofhttp://dx.doi.org/10.1093/nar/gkt1249en_US
dc.rightsPublisher with Creative Commons Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/ by-nc/3.0/en_US
dc.sourceOxford University Pressen_US
dc.titleSystematic discovery and characterization of regulatory motifs in ENCODE TF binding experimentsen_US
dc.typeArticleen_US
dc.identifier.citationKheradpour, P., and M. Kellis. “Systematic Discovery and Characterization of Regulatory Motifs in ENCODE TF Binding Experiments.” Nucleic Acids Research 42, no. 5 (March 1, 2014): 2976–2987.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorKheradpour, Pouyaen_US
dc.contributor.mitauthorKellis, Manolisen_US
dc.relation.journalNucleic Acids Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsKheradpour, P.; Kellis, M.en_US
mit.licensePUBLISHER_CCen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record