dc.contributor.advisor | Gregory Stephanopoulos. | en_US |
dc.contributor.author | Gupta, Vipin, 1978- | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Chemical Engineering. | en_US |
dc.date.accessioned | 2005-09-27T18:40:19Z | |
dc.date.available | 2005-09-27T18:40:19Z | |
dc.date.copyright | 2004 | en_US |
dc.date.issued | 2004 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/28847 | |
dc.description | Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Chemical Engineering, 2004. | en_US |
dc.description | Includes bibliographical references. | en_US |
dc.description.abstract | (cont.) algorithm was validated on synthetic as well as real datasets. When tested on a set of 30 well-studied regulons in Escherichia Coli, with known instances of regulatory motifs collected from biological literature, the algorithm showed, in 14 cases, a high sensitivity and specificity of 70% and 80%, respectively. TABS was shown to perform better than two other popular state-of-the-art motif-finding algorithms. In addition, its applicability on synthetic microarray-like data was demonstrated. Several significant novel motifs detected by the algorithm that form good targets for investigation of regulatory function by biological experiments were reported. | en_US |
dc.description.abstract | One of the major challenges facing biologists is to understand the mechanisms governing the regulation of gene expression. Completely sequenced genomes, together with the emerging DNA microarray technologies have enabled the measurement of gene expression levels in cell cultures and opened new possibilities for studying gene regulation. A fundamental sub-problem in unraveling regulatory interactions in both prokaryotes and eukaryotes is to identify common binding sites or promoters in the regulatory regions of genes. For a gene's mRNA to be expressed, a class of proteins called transcription factors must bind to the cis-regulatory elements on the DNA sequence upstream of the gene, to enhance RNA polymerase binding and hence initiate transcription. These binding sites are believed to be located within several hundred base pairs upstream of the respective ORFs. Biological methods for discovering regulatory binding sites are slow and time consuming. To address this problem, several heuristic-based computational methods have been developed in the past with either of two approaches--sequence-driven or pattern-driven. In this dissertation, we propose a novel approach for finding shared motifs in DNA sequences based on an exhaustive pattern enumeration algorithm, that combines the benefits of the pattern-driven and sequence-driven approaches. We developed TABS, a method that identifies local regions of high similarity by clustering statistically significant patterns to obtain putative binding sites. The method assumes minimal apriori information about the sites and can detect signals in a subset of the input sequences, making it amenable for motif-discovery in gene clusters obtained from microarray experiments. The performance of the | en_US |
dc.description.statementofresponsibility | by Vipin Gupta. | en_US |
dc.format.extent | 126, iii, 39 p. | en_US |
dc.format.extent | 8458873 bytes | |
dc.format.extent | 8479756 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | |
dc.subject | Chemical Engineering. | en_US |
dc.title | Extracting regulatory signals from DNA sequences using syntactic pattern discovery | en_US |
dc.title.alternative | Capstone report : commercialization of bioinformatics tools | en_US |
dc.type | Thesis | en_US |
dc.description.degree | Ph.D. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Chemical Engineering | |
dc.identifier.oclc | 60398497 | en_US |