Show simple item record

dc.contributor.advisorMatthew Meyerson.en_US
dc.contributor.authorHelman, Elenaen_US
dc.contributor.otherHarvard--MIT Program in Health Sciences and Technology.en_US
dc.date.accessioned2014-05-23T19:39:06Z
dc.date.available2014-05-23T19:39:06Z
dc.date.copyright2013en_US
dc.date.issued2014en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/87504
dc.descriptionThesis: Ph. D., Harvard-MIT Program in Health Sciences and Technology, February 2014.en_US
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 128-144).en_US
dc.description.abstractCancer is a complex disease of the genome exhibiting myriad somatic mutations, from single nucleotide changes to various chromosomal rearrangements. The technological advances of next-generation sequencing enable high-throughput identification and characterization of these events genome-wide using computational algorithms. Retrotransposons comprise 42% of the human genome and have the capacity to "jump" across the genome in a copy-and-paste manner. Recent studies have identified families of retrotransposable elements that are currently active. In fact, retrotransposons constitute a major source of human genetic variation, and somatic retrotransposon insertions have been implicated in several cancers, including an insertion into the APC tumor suppressor in a colorectal tumor. Because of the highly repetitive nature of these elements, however, the full extent of somatic retrotransposon movement across cancer remains largely unexplored. To this end, we developed TranspoSeq, a computational framework that identifies retrotransposon insertions from paired-end whole genome sequencing data, and TranspoSeq- Exome, a tool that localizes these insertions from whole-exome data. TranspoSeq identifies novel somatic retrotransposon insertions with high sensitivity and specificity in simulated data and with a 94% validation rate via site-specific PCR. Next, we applied these methods to wholegenomes from 200 tumor/normal pairs and whole-exomes from 767 tumor/normal pairs across 11 tumor types as part of The Cancer Genome Atlas (TCGA) Pan-Cancer Project. We discover more than 800 somatic retrotransposon insertions primarily in lung squamous, head and neck, colorectal and endometrial carcinomas, while glioblastoma multiforme and acute myeloid leukemia show no evidence of somatic retrotransposition. Moreover, many somatic retrotransposon insertions occur in known cancer genes. TranspoSeq-Exome uncovers 35 additional somatic retrotransposon insertions into exonic regions, including an insertion into an exon of the PTEN tumor suppressor in endometrial cancer. Finally, we integrate orthogonal genomic and clinical data to characterize features of retrotransposon insertion and samples that exhibit extensive somatic retrotransposition. We present a large-scale, comprehensive analysis of retrotransposon movement across tumor types using next-generation sequencing data. Our results suggest that somatic retrotransposon insertions may represent an important class of tumor-specific structural variation in cancer and future studies should incorporate this form of somatic genome aberration.en_US
dc.description.statementofresponsibilityby Elena Helman.en_US
dc.format.extent144 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectHarvard--MIT Program in Health Sciences and Technology.en_US
dc.titleSomatic retrotransposition in the cancer genomeen_US
dc.typeThesisen_US
dc.description.degreePh. D.en_US
dc.contributor.departmentHarvard University--MIT Division of Health Sciences and Technology
dc.identifier.oclc879668800en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record