Show simple item record

dc.contributor.advisorGregory Stephanopoulos.en_US
dc.contributor.authorGupta, Vipin, 1978-en_US
dc.contributor.otherMassachusetts Institute of Technology. Dept. of Chemical Engineering.en_US
dc.date.accessioned2005-09-27T18:40:19Z
dc.date.available2005-09-27T18:40:19Z
dc.date.copyright2004en_US
dc.date.issued2004en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/28847
dc.descriptionThesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Chemical Engineering, 2004.en_US
dc.descriptionIncludes bibliographical references.en_US
dc.description.abstract(cont.) algorithm was validated on synthetic as well as real datasets. When tested on a set of 30 well-studied regulons in Escherichia Coli, with known instances of regulatory motifs collected from biological literature, the algorithm showed, in 14 cases, a high sensitivity and specificity of 70% and 80%, respectively. TABS was shown to perform better than two other popular state-of-the-art motif-finding algorithms. In addition, its applicability on synthetic microarray-like data was demonstrated. Several significant novel motifs detected by the algorithm that form good targets for investigation of regulatory function by biological experiments were reported.en_US
dc.description.abstractOne of the major challenges facing biologists is to understand the mechanisms governing the regulation of gene expression. Completely sequenced genomes, together with the emerging DNA microarray technologies have enabled the measurement of gene expression levels in cell cultures and opened new possibilities for studying gene regulation. A fundamental sub-problem in unraveling regulatory interactions in both prokaryotes and eukaryotes is to identify common binding sites or promoters in the regulatory regions of genes. For a gene's mRNA to be expressed, a class of proteins called transcription factors must bind to the cis-regulatory elements on the DNA sequence upstream of the gene, to enhance RNA polymerase binding and hence initiate transcription. These binding sites are believed to be located within several hundred base pairs upstream of the respective ORFs. Biological methods for discovering regulatory binding sites are slow and time consuming. To address this problem, several heuristic-based computational methods have been developed in the past with either of two approaches--sequence-driven or pattern-driven. In this dissertation, we propose a novel approach for finding shared motifs in DNA sequences based on an exhaustive pattern enumeration algorithm, that combines the benefits of the pattern-driven and sequence-driven approaches. We developed TABS, a method that identifies local regions of high similarity by clustering statistically significant patterns to obtain putative binding sites. The method assumes minimal apriori information about the sites and can detect signals in a subset of the input sequences, making it amenable for motif-discovery in gene clusters obtained from microarray experiments. The performance of theen_US
dc.description.statementofresponsibilityby Vipin Gupta.en_US
dc.format.extent126, iii, 39 p.en_US
dc.format.extent8458873 bytes
dc.format.extent8479756 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582
dc.subjectChemical Engineering.en_US
dc.titleExtracting regulatory signals from DNA sequences using syntactic pattern discoveryen_US
dc.title.alternativeCapstone report : commercialization of bioinformatics toolsen_US
dc.typeThesisen_US
dc.description.degreePh.D.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineering
dc.identifier.oclc60398497en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record