Show simple item record

dc.contributor.authorGaston, Jeffry M.
dc.contributor.authorAlm, Eric J.
dc.contributor.authorZhang, An-Ni
dc.date.accessioned2024-04-29T18:11:46Z
dc.date.available2024-04-29T18:11:46Z
dc.date.issued2024-04-22
dc.identifier.issn1741-7007
dc.identifier.urihttps://hdl.handle.net/1721.1/154309
dc.description.abstractBackground Accurate identification of genetic variants, such as point mutations and insertions/deletions (indels), is crucial for various genetic studies into epidemic tracking, population genetics, and disease diagnosis. Genetic studies into microbiomes often require processing numerous sequencing datasets, necessitating variant identifiers with high speed, accuracy, and robustness. Results We present QuickVariants, a bioinformatics tool that effectively summarizes variant information from read alignments and identifies variants. When tested on diverse bacterial sequencing data, QuickVariants demonstrates a ninefold higher median speed than bcftools, a widely used variant identifier, with higher accuracy in identifying both point mutations and indels. This accuracy extends to variant identification in virus samples, including SARS-CoV-2, particularly with significantly fewer false negative indels than bcftools. The high accuracy of QuickVariants is further demonstrated by its detection of a greater number of Omicron-specific indels (5 versus 0) and point mutations (61 versus 48–54) than bcftools in sewage metagenomes predominated by Omicron variants. Much of the reduced accuracy of bcftools was attributable to its misinterpretation of indels, often producing false negative indels and false positive point mutations at the same locations. Conclusions We introduce QuickVariants, a fast, accurate, and robust bioinformatics tool designed for identifying genetic variants for microbial studies. QuickVariants is available at https://github.com/caozhichongchong/QuickVariants .en_US
dc.publisherSpringer Science and Business Media LLCen_US
dc.relation.isversionof10.1186/s12915-024-01891-4en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceBioMed Centralen_US
dc.titleFast and accurate variant identification tool for sequencing-based studiesen_US
dc.typeArticleen_US
dc.identifier.citationGaston, J.M., Alm, E.J. & Zhang, AN. Fast and accurate variant identification tool for sequencing-based studies. BMC Biol 22, 90 (2024).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Biological Engineering
dc.contributor.departmentMassachusetts Institute of Technology. Center for Microbiome Informatics and Therapeutics
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2024-04-28T03:16:24Z
dc.language.rfc3066en
dc.rights.holderThe Author(s)
dspace.date.submission2024-04-28T03:16:24Z
mit.journal.volume22en_US
mit.journal.issue1en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record