MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Doctoral Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Doctoral Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Characterizing variation at short tandem repeats and their role in human genome regulation

Author(s)
Gymrek, Melissa A
Thumbnail
DownloadFull printable version (21.22Mb)
Other Contributors
Harvard--MIT Program in Health Sciences and Technology.
Advisor
Yaniv Erlich and Mark J. Daly.
Terms of use
M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
A central goal in genomics is to understand the genetic variants that underlie molecular changes and lead to disease. Recent studies have identified thousands of genetic loci associated with human phenotypes. These have primarily analyzed point mutations, ignoring more complex types of variation. Here we focus on Short Tandem Repeats (STRs) as a model for complex variation. STRs are comprised of repeating motifs of 1-6bp that span over 1% of the human genome. The level of STR variation and its effect on phenotypes remains mostly uncharted, mainly due to the difficulty in accurately genotyping STRs on a large scale. To overcome bioinformatic challenges in STR genotyping, we developed lobSTR, an algorithm for profiling STRs from high throughput sequencing data. lobSTR employs a unique mapping strategy to rapidly align repetitive reads, and uses statistical learning techniques to account for STR-specific noise patterns. We applied lobSTR to generate the largest and highest quality STR catalog to date. This provided the first characterization of more than a million loci and gave novel insights into population-wide trends of STR variation. We used our catalog to conduct a genome-wide analysis of the contribution of STRs to gene expression in humans. This revealed that STRs explain 10-15% of the cis heritability of expression mediated by common variants and potentially play a role in various clinically relevant conditions. Overall these studies highlight the contribution of STRs to the genetic architecture of quantitative traits. We anticipate that integrating repetitive elements, specifically STRs, into genome-wide analyses will lead to the discovery of new genetic variants relevant to human conditions.
Description
Thesis: Ph. D., Harvard-MIT Program in Health Sciences and Technology, 2016.
 
Cataloged from PDF version of thesis.
 
Includes bibliographical references (pages 225-251).
 
Date issued
2016
URI
http://hdl.handle.net/1721.1/103501
Department
Harvard University--MIT Division of Health Sciences and Technology
Publisher
Massachusetts Institute of Technology
Keywords
Harvard--MIT Program in Health Sciences and Technology.

Collections
  • Doctoral Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.