Show simple item record

dc.contributor.advisorDavid Reich and Nick Patterson.en_US
dc.contributor.authorSun, James Xinen_US
dc.contributor.otherHarvard--MIT Program in Health Sciences and Technology.en_US
dc.date.accessioned2012-09-13T19:01:56Z
dc.date.available2012-09-13T19:01:56Z
dc.date.copyright2012en_US
dc.date.issued2012en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/72915
dc.descriptionThesis (Ph. D. in Electrical Engineering and Bioinformatics)--Harvard-MIT Program in Health Sciences and Technology, 2012.en_US
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (p. 102-103).en_US
dc.description.abstractIn the past decade, thousands of human genomes have been catalogued, either by whole-genome sequencing or by targeted genotyping. The variability between human genomes encodes invaluable information about human traits and genetic diseases, as well as human migration patterns and population interactions. A key challenge is to understand and characterize the evolution of the variability between human genomes. In this thesis, I focus on studying human evolution through the use of microsatellites, which are simple repetitive sections of DNA of typically 1-6bp motifs (e.g. CACACACACA) that are highly polymorphic and highly mutable. The first aim is to establish that microsatellites are useful as reliable molecular clocks, such that its evolution highly correlates to time, especially when applied to the time range appropriate for human history. Using existing models of microsatellites, we examine microsatellite data from populations around the world to demonstrate that microsatellites are accurate molecular clocks for coalescent times of at least two million years. These results raise the prospect of using microsatellite data sets to determine parameters of population history. In order to calibrate genetic distances into time, the mutation rate must be known. This leads to the second aim, which is to directly measure the microsatellite mutation rate from largescale pedigree genetics data and provide a precision that is unprecedented. To do so, we use data from over 95,000 individuals in Icelandic pedigrees, genotyped in over 3000 microsatellite loci. Using trio and extended-family based approaches, we discover 2058 denovo mutations. In addition, we also attempt to capture many features that are covariates with the mutation rate, such as parental gender and age. The third aim takes our empirical observations of the microsatellite mutation process to build a new model of microsatellite evolution. This model improves upon the standard random walk model with features we have captured from aim 2. We use a Bayesian coalescent approach to provide a model that estimates the sequence mutation rate, European genetic divergence times, and human-chimpanzee speciation time.en_US
dc.description.statementofresponsibilityby James Xin Sun.en_US
dc.format.extent155 p.en_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectHarvard--MIT Program in Health Sciences and Technology.en_US
dc.titleThe human molecular clock and mutation process : a characterization using microsatellite DNAen_US
dc.typeThesisen_US
dc.description.degreePh.D.in Electrical Engineering and Bioinformaticsen_US
dc.contributor.departmentHarvard University--MIT Division of Health Sciences and Technology
dc.identifier.oclc809078450en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record