dc.contributor.advisor | Manolis Kellis. | en_US |
dc.contributor.author | Lin, Michael F. (Michael Fong-Jay) | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2007-03-12T17:55:35Z | |
dc.date.available | 2007-03-12T17:55:35Z | |
dc.date.copyright | 2006 | en_US |
dc.date.issued | 2006 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/36807 | |
dc.description | Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. | en_US |
dc.description | Includes bibliographical references (leaves 55-56). | en_US |
dc.description.abstract | An important step in genome interpretation is the accurate identification of protein-coding genes. One approach to gene identification is comparative analysis of the genomes of several related species, to find genes that have been conserved by natural selection over millions of years of evolution. I develop general computational methods that combine statistical analysis of genome sequence alignments with classification algorithms in order to detect the distinctive signatures of protein-coding DNA sequence evolution. I implement these methods as a software system, which I then use to identify previously unknown genes, and cast doubt on some existing gene annotations, in the genomes of the fungi Saccharomyces cerevisiae and Candida albicans, the fruit fly Drosophila melanogaster, and the human. These methods perform competitively with the best existing de novo gene identification systems, and are practically applicable to the goal of improving existing gene annotations through comparative genomics. | en_US |
dc.description.statementofresponsibility | by Michael F. Lin. | en_US |
dc.format.extent | 56 leaves | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Comparative gene identification in mammalian, fly, and fungal genomes | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M.Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 80777803 | en_US |