Show simple item record

dc.contributor.authorFlannick, Jason
dc.contributor.authorKorn, Joshua M.
dc.contributor.authorFontanillas, Pierre
dc.contributor.authorGrant, George B.
dc.contributor.authorDepristo, Mark A.
dc.contributor.authorAltshuler, David
dc.contributor.authorBanks, Eric, 1976-
dc.date.accessioned2012-08-29T15:25:17Z
dc.date.available2012-08-29T15:25:17Z
dc.date.issued2012-07
dc.date.submitted2012-03
dc.identifier.issn1553-734X
dc.identifier.issn1553-7358
dc.identifier.urihttp://hdl.handle.net/1721.1/72418
dc.description.abstractHigh coverage whole genome sequencing provides near complete information about genetic variation. However, other technologies can be more efficient in some settings by (a) reducing redundant coverage within samples and (b) exploiting patterns of genetic variation across samples. To characterize as many samples as possible, many genetic studies therefore employ lower coverage sequencing or SNP array genotyping coupled to statistical imputation. To compare these approaches individually and in conjunction, we developed a statistical framework to estimate genotypes jointly from sequence reads, array intensities, and imputation. In European samples, we find similar sensitivity (89%) and specificity (99.6%) from imputation with either 1× sequencing or 1 M SNP arrays. Sensitivity is increased, particularly for low-frequency polymorphisms (MAF <5%), when low coverage sequence reads are added to dense genome-wide SNP arrays — the converse, however, is not true. At sites where sequence reads and array intensities produce different sample genotypes, joint analysis reduces genotype errors and identifies novel error modes. Our joint framework informs the use of next-generation sequencing in genome wide association studies and supports development of improved methods for genotype calling.en_US
dc.language.isoen_US
dc.publisherPublic Library of Scienceen_US
dc.relation.isversionofhttp://dx.doi.org/10.1371/journal.pcbi.1002604en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttp://creativecommons.org/licenses/by/2.5/en_US
dc.sourcePLoSen_US
dc.titleEfficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputationen_US
dc.typeArticleen_US
dc.identifier.citationFlannick, Jason et al. “Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation.” Ed. Jan Korbel. PLoS Computational Biology 8.7 (2012): e1002604.en_US
dc.contributor.departmentHarvard University--MIT Division of Health Sciences and Technologyen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Biologyen_US
dc.contributor.approverAltshuler, David
dc.contributor.mitauthorKorn, Joshua M.
dc.contributor.mitauthorAltshuler, David
dc.relation.journalPLoS Computational Biologyen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsFlannick, Jason; Korn, Joshua M.; Fontanillas, Pierre; Grant, George B.; Banks, Eric; Depristo, Mark A.; Altshuler, Daviden
dc.identifier.orcidhttps://orcid.org/0000-0002-7250-4107
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record