dc.contributor.advisor | David Reich and Nick Patterson. | en_US |
dc.contributor.author | Sun, James Xin | en_US |
dc.contributor.other | Harvard--MIT Program in Health Sciences and Technology. | en_US |
dc.date.accessioned | 2012-09-13T19:01:56Z | |
dc.date.available | 2012-09-13T19:01:56Z | |
dc.date.copyright | 2012 | en_US |
dc.date.issued | 2012 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/72915 | |
dc.description | Thesis (Ph. D. in Electrical Engineering and Bioinformatics)--Harvard-MIT Program in Health Sciences and Technology, 2012. | en_US |
dc.description | Cataloged from PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (p. 102-103). | en_US |
dc.description.abstract | In the past decade, thousands of human genomes have been catalogued, either by whole-genome sequencing or by targeted genotyping. The variability between human genomes encodes invaluable information about human traits and genetic diseases, as well as human migration patterns and population interactions. A key challenge is to understand and characterize the evolution of the variability between human genomes. In this thesis, I focus on studying human evolution through the use of microsatellites, which are simple repetitive sections of DNA, typically with 1-6 bp motifs (e.g. CACACACACA), that are highly polymorphic and highly mutable. The first aim is to establish that microsatellites are reliable molecular clocks, such that their evolution correlates highly with time, especially over the time range appropriate for human history. Using existing models of microsatellites, we examine microsatellite data from populations around the world to demonstrate that microsatellites are accurate molecular clocks for coalescent times of at least two million years. These results raise the prospect of using microsatellite data sets to determine parameters of population history. In order to calibrate genetic distances into time, the mutation rate must be known. This leads to the second aim, which is to directly measure the microsatellite mutation rate from large-scale pedigree genetics data with unprecedented precision. To do so, we use data from over 95,000 individuals in Icelandic pedigrees, genotyped at over 3000 microsatellite loci. Using trio and extended-family based approaches, we discover 2058 de novo mutations. In addition, we attempt to capture many features that covary with the mutation rate, such as parental gender and age. The third aim uses our empirical observations of the microsatellite mutation process to build a new model of microsatellite evolution. This model improves upon the standard random walk model with features we have captured from aim 2. We use a Bayesian coalescent approach to provide a model that estimates the sequence mutation rate, European genetic divergence times, and human-chimpanzee speciation time. | en_US |
dc.description.statementofresponsibility | by James Xin Sun. | en_US |
dc.format.extent | 155 p. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Harvard--MIT Program in Health Sciences and Technology. | en_US |
dc.title | The human molecular clock and mutation process : a characterization using microsatellite DNA | en_US |
dc.type | Thesis | en_US |
dc.description.degree | Ph.D. in Electrical Engineering and Bioinformatics | en_US |
dc.contributor.department | Harvard University--MIT Division of Health Sciences and Technology | |
dc.identifier.oclc | 809078450 | en_US |