Show simple item record

dc.contributor.authorPreheim, Sarah Pacocha
dc.contributor.authorPerrotta, Allison Rose
dc.contributor.authorMartin-Platero, Antonio M.
dc.contributor.authorGupta, Anika
dc.contributor.authorAlm, Eric J.
dc.date.accessioned2014-11-05T20:30:29Z
dc.date.available2014-11-05T20:30:29Z
dc.date.issued2013-08
dc.date.submitted2013-01
dc.identifier.issn0099-2240
dc.identifier.urihttp://hdl.handle.net/1721.1/91469
dc.description.abstract16S rRNA sequencing, commonly used to survey microbial communities, begins by grouping individual reads into operational taxonomic units (OTUs). There are two major challenges in calling OTUs: identifying bacterial population boundaries and differentiating true diversity from sequencing errors. Current approaches to identifying taxonomic groups or eliminating sequencing errors rely on sequence data alone, but both of these activities could be informed by the distribution of sequences across samples. Here, we show that using the distribution of sequences across samples can help identify population boundaries even in noisy sequence data. The logic underlying our approach is that bacteria in different populations will often be highly correlated in their abundance across different samples. Conversely, 16S rRNA sequences derived from the same population, whether slightly different copies in the same organism, variation of the 16S rRNA gene within a population, or sequences generated randomly in error, will have the same underlying distribution across sampled environments. We present a simple OTU-calling algorithm (distribution-based clustering) that uses both genetic distance and the distribution of sequences across samples and demonstrate that it is more accurate than other methods at grouping reads into OTUs in a mock community. Distribution-based clustering also performs well on environmental samples: it is sensitive enough to differentiate between OTUs that differ by a single base pair yet predicts fewer overall OTUs than most other methods. The program can decrease the total number of OTUs with redundant information and improve the power of many downstream analyses to describe biologically relevant trends.en_US
dc.description.sponsorshipUnited States. Dept. of Energy (Office of Science, contract no. DEAC02-05CH11231)en_US
dc.language.isoen_US
dc.publisherAmerican Society for Microbiologyen_US
dc.relation.isversionofhttp://dx.doi.org/10.1128/AEM.00342-13en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcePreheimen_US
dc.titleDistribution-Based Clustering: Using Ecology To Refine the Operational Taxonomic Uniten_US
dc.typeArticleen_US
dc.identifier.citationPreheim, S. P., A. R. Perrotta, A. M. Martin-Platero, A. Gupta, and E. J. Alm. “Distribution-Based Clustering: Using Ecology To Refine the Operational Taxonomic Unit.” Applied and Environmental Microbiology 79, no. 21 (August 23, 2013): 6593–6603.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Biological Engineeringen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Civil and Environmental Engineeringen_US
dc.contributor.approverAlm, Eric J.en_US
dc.contributor.mitauthorPreheim, Sarah Pacochaen_US
dc.contributor.mitauthorPerrotta, Allison Roseen_US
dc.contributor.mitauthorMartin-Platero, Antonio M.en_US
dc.contributor.mitauthorGupta, Anikaen_US
dc.contributor.mitauthorAlm, Eric J.en_US
dc.relation.journalApplied and Environmental Microbiologyen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsPreheim, S. P.; Perrotta, A. R.; Martin-Platero, A. M.; Gupta, A.; Alm, E. J.en_US
dc.identifier.orcidhttps://orcid.org/0000-0001-8294-9364
dc.identifier.orcidhttps://orcid.org/0000-0003-4378-9542
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record