dc.contributor.advisor | Manolis Kellis. | en_US |
dc.contributor.author | Kumar, Nischay | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2014-03-06T15:41:44Z | |
dc.date.available | 2014-03-06T15:41:44Z | |
dc.date.copyright | 2013 | en_US |
dc.date.issued | 2013 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/85434 | |
dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2013. | en_US |
dc.description | Title as it appears in MIT Commencement Exercises program, June 7, 2013: Classification of existing genes and de novo discovery of new genes using chromatin state segmentations in human epigenomes. Cataloged from PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (pages 49-50). | en_US |
dc.description.abstract | Combinatorial patterns of chromatin marks have been shown to play a significant role in gene regulation activities by changing the landscape of the DNA through chemical means. Recent work has expanded on this observation using ChIP-seq signals of chromatin marks and supervised algorithms to build gene expression prediction models based on correlation analysis. However, no approach to date has attempted to use chromatin states to identify various classes of genes outside of the high-low expression classes. This research aims to fill this void by utilizing chromatin state segmentation and RNA-seq expression datasets from the NIH Roadmap Epigenomes project. A gene classification model was built using a k-fuzzy clustering approach of chromatin state features from a subset of training genes and then applied to a larger test set of genes. The models were found to be robust and show striking correspondence between training and test sets. 8 classes of genes that represent silent, repressed, and subsets of actively transcribed genes were identified and several metrics to validate the classes were computed. The systematic analysis outlined in this research is shown to a be promising approach for gene classification and future de novo discovery of gene like regions. | en_US |
dc.description.statementofresponsibility | by Nischay Kumar. | en_US |
dc.format.extent | 50 pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Classification of genes using clustering of chromatin state segmentations in human epigenomes | en_US |
dc.title.alternative | Classification of existing genes and de novo discovery of new genes using chromatin state segmentations in human epigenomes | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M. Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 870678983 | en_US |