dc.contributor.advisor | Christopher Burge. | en_US |
dc.contributor.author | Lazarovici, Allan, 1979- | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2006-03-24T16:14:44Z | |
dc.date.available | 2006-03-24T16:14:44Z | |
dc.date.copyright | 2003 | en_US |
dc.date.issued | 2003 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/29681 | |
dc.description | Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2003. | en_US |
dc.description | Includes bibliographical references (leaves 60-62). | en_US |
dc.description.abstract | A computer program called FUNSCAN was developed which identifies protein coding regions in fungal genomes. Gene structural and compositional properties are modeled using a Hidden Markov Model. Separate training and testing sets for FUNSCAN were obtained by aligning cDNAs from an organism to their genomic loci, generating a 'gold standard' set of annotated genes. The performance of FUNSCAN is competitive with other computer programs design to identify protein coding regions in fungal genomes. A technique called 'Training Set Augmentation' is described which can be used to train FUNSCAN when only a small training set of genes is available. Techniques that combine alignment algorithms with FUNSCAN to identify novel genes are also discussed and explored. | en_US |
dc.description.statementofresponsibility | by Allan Lazarovici. | en_US |
dc.format.extent | 62 leaves | en_US |
dc.format.extent | 2572412 bytes | |
dc.format.extent | 2572221 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Development of gene-finding algorithms for fungal genomes : dealing with small datasets and leveraging comparative genomics | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M.Eng.and S.B. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 53843099 | en_US |