Show simple item record

dc.contributor.authorMasoero, Lorenzo
dc.contributor.authorCamerlenghi, Federico
dc.contributor.authorFavaro, Stefano
dc.contributor.authorBroderick, Tamara
dc.date.accessioned2022-06-07T12:26:26Z
dc.date.available2022-06-07T12:26:26Z
dc.date.issued2022
dc.identifier.urihttps://hdl.handle.net/1721.1/142893
dc.description.abstract<jats:title>Summary</jats:title> <jats:p>While the cost of sequencing genomes has decreased dramatically in recent years, this expense often remains nontrivial. Under a fixed budget, scientists face a natural trade-off between quantity and quality: spending resources to sequence a greater number of genomes or spending resources to sequence genomes with increased accuracy. Our goal is to find the optimal allocation of resources between quantity and quality. Optimizing resource allocation promises to reveal as many new variations in the genome as possible. We introduce a Bayesian nonparametric methodology to predict the number of new variants in a follow-up study based on a pilot study. When experimental conditions are kept constant between the pilot and follow-up, we find that our prediction is competitive with the best existing methods. Unlike current methods, though, our new method allows practitioners to change experimental conditions between the pilot and the follow-up. We demonstrate how this distinction allows our method to be used for more realistic predictions and for optimal allocation of a fixed budget between quality and quantity. We validate our method on cancer and human genomics data.</jats:p>en_US
dc.language.isoen
dc.publisherOxford University Press (OUP)en_US
dc.relation.isversionof10.1093/BIOMET/ASAB012en_US
dc.rightsCreative Commons Attribution-NonCommercial-ShareAlike 4.0 Internationalen_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleMore for less: predicting and maximizing genomic variant discovery via Bayesian nonparametricsen_US
dc.typeArticleen_US
dc.identifier.citationMasoero, Lorenzo, Camerlenghi, Federico, Favaro, Stefano and Broderick, Tamara. 2022. "More for less: predicting and maximizing genomic variant discovery via Bayesian nonparametrics." Biometrika, 109 (1).
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journalBiometrikaen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2022-06-07T11:57:16Z
dspace.orderedauthorsMasoero, L; Camerlenghi, F; Favaro, S; Broderick, Ten_US
dspace.date.submission2022-06-07T11:57:27Z
mit.journal.volume109en_US
mit.journal.issue1en_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record