
dc.contributor.author: Ondov, Brian D.
dc.contributor.author: Starrett, Gabriel J.
dc.contributor.author: Sappington, Anna
dc.contributor.author: Kostic, Aleksandra
dc.contributor.author: Koren, Sergey
dc.contributor.author: Buck, Christopher B.
dc.contributor.author: Phillippy, Adam M.
dc.date.accessioned: 2020-07-22T18:14:15Z
dc.date.available: 2020-07-22T18:14:15Z
dc.date.issued: 2019-11-05
dc.date.submitted: 2019-02
dc.identifier.issn: 1474-760X
dc.identifier.uri: https://hdl.handle.net/1721.1/126316
dc.description.abstract: The MinHash algorithm has proven effective for rapidly estimating the resemblance of two genomes or metagenomes. However, this method cannot reliably estimate the containment of a genome within a metagenome. Here, we describe an online algorithm capable of measuring the containment of genomes and proteomes within either assembled or unassembled sequencing read sets. We describe several use cases, including contamination screening and retrospective analysis of metagenomes for novel genome discovery. Using this tool, we provide containment estimates for every NCBI RefSeq genome within every SRA metagenome and demonstrate the identification of a novel polyomavirus species from a public metagenome. [en_US]
dc.publisher: BioMed Central [en_US]
dc.relation.isversionof: https://doi.org/10.1186/s13059-019-1841-x [en_US]
dc.rights: Creative Commons Attribution [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [en_US]
dc.source: BioMed Central [en_US]
dc.title: Mash Screen: high-throughput sequence containment estimation for genome discovery [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Ondov, Brian D. et al. "Mash Screen: high-throughput sequence containment estimation for genome discovery." Genome Biology 20 (Nov. 2019): 232 doi https://doi.org/10.1186/s13059-019-1841-x ©2019 Author(s) [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.relation.journal: Genome Biology [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/JournalArticle [en_US]
eprint.status: http://purl.org/eprint/status/PeerReviewed [en_US]
dc.date.updated: 2020-06-26T11:08:06Z
dc.language.rfc3066: en
dc.rights.holder: The Author(s)
dspace.date.submission: 2020-06-26T11:08:06Z
mit.journal.volume: 20 [en_US]
mit.license: PUBLISHER_CC
mit.metadata.status: Complete

