Efficient haplotyping for families
Author(s)
Williams, Amy Lynne, Ph.D. Massachusetts Institute of Technology
DownloadFull printable version (6.708Mb)
Other Contributors
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Advisor
David K. Gifford.
Terms of use
Metadata
Show full item recordAbstract
Hapi is a novel dynamic programming algorithm for haplotyping nuclear families that outperforms contemporary family-based haplotyping algorithms. Haplotypes are useful for mapping and identifying genes which cause and contribute to the etiology of human disease, and for analyzing the products of meiosis to locate recombinations, enabling the identification of recombination hotspots and gene conversions. They can also be used to study population history, including expansion, contraction, and migration patterns in humans and other species. Hapi's efficiency is a result of eliminating or ignoring states and state transitions that are unnecessary for computing haplotypes. When applied to a dataset containing 103 families, Hapi performs over 3.8-320 times faster than state-of-the-art algorithms. These efficiency gains are practically important as they enable Hapi to haplotype family datasets which current algorithms are either unable to handle or are impractical for because of time constraints. Hapi infers both minimum-recombinant and maximum likelihood haplotypes, and because it applies to related individuals, the haplotypes it infers are highly accurate over large genomic distances.
Description
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010. Cataloged from PDF version of thesis. Includes bibliographical references (p. 81-83).
Date issued
2010Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.