Show simple item record

dc.contributor.authorTzahor, Shani
dc.contributor.authorMan-Aharonovich, Dikla
dc.contributor.authorKirkup, Benjamin
dc.contributor.authorYogev, Tali
dc.contributor.authorBerman-Frank, Ilana
dc.contributor.authorPolz, Martin F.
dc.contributor.authorBeja, Oded
dc.contributor.authorMandel-Gutfreund, Yael
dc.date.accessioned2010-03-09T21:28:54Z
dc.date.available2010-03-09T21:28:54Z
dc.date.issued2009-05
dc.date.submitted2008-09
dc.identifier.issn1471-2164
dc.identifier.urihttp://hdl.handle.net/1721.1/52448
dc.description.abstractBackground Cyanobacteria of the genera Synechococcus and Prochlorococcus play a key role in marine photosynthesis, which contributes to the global carbon cycle and to the world oxygen supply. Recently, genes encoding the photosystem II reaction center (psbA and psbD) were found in cyanophage genomes. This phenomenon suggested that the horizontal transfer of these genes may be involved in increasing phage fitness. To date, a very small percentage of marine bacteria and phages has been cultured. Thus, mapping genomic data extracted directly from the environment to its taxonomic origin is necessary for a better understanding of phage-host relationships and dynamics. Results To achieve an accurate and rapid taxonomic classification, we employed a computational approach combining a multi-class Support Vector Machine (SVM) with a codon usage position specific scoring matrix (cuPSSM). Our method has been applied successfully to classify core-photosystem-II gene fragments, including partial sequences coming directly from the ocean, to seven different taxonomic classes. Applying the method on a large set of DNA and RNA psbA clones from the Mediterranean Sea, we studied the distribution of cyanobacterial psbA genes and transcripts in their natural environment. Using our approach, we were able to simultaneously examine taxonomic and ecological distributions in the marine environment. Conclusion The ability to accurately classify the origin of individual genes and transcripts coming directly from the environment is of great importance in studying marine ecology. The classification method presented in this paper could be applied further to classify other genes amplified from the environment, for which training data is available.en
dc.language.isoen_US
dc.publisherBioMed Central Ltd.en
dc.relation.isversionofhttp://dx.doi.org/10.1186/1471-2164-10-229en
dc.rightsCreative Commons Attributionen
dc.rights.urihttp://creativecommons.org/licenses/by/2.0/en
dc.sourceBioMed Centralen
dc.titleA supervised learning approach for taxonomic classification of core-photosystem-II genes and transcripts in the marine environmenten
dc.typeArticleen
dc.identifier.citationTzahor, Shani et al. “A supervised learning approach for taxonomic classification of core-photosystem-II genes and transcripts in the marine environment.” BMC Genomics 10.1 (2009): 229.en
dc.contributor.departmentMassachusetts Institute of Technology. Department of Civil and Environmental Engineeringen_US
dc.contributor.approverPolz, Martin F.
dc.contributor.mitauthorKirkup, Benjamin
dc.contributor.mitauthorPolz, Martin F.
dc.relation.journalBMC Genomicsen
dc.eprint.versionFinal published versionen
dc.identifier.pmid19445709
dc.type.urihttp://purl.org/eprint/type/JournalArticleen
eprint.statushttp://purl.org/eprint/status/PeerRevieweden
dspace.orderedauthorsTzahor, Shani; Man-Aharonovich, Dikla; Kirkup, Benjamin C; Yogev, Tali; Berman-Frank, Ilana; Polz, Martin F; Béjà, Oded; Mandel-Gutfreund, Yaelen
dc.identifier.orcidhttps://orcid.org/0000-0001-9296-3733
dspace.mitauthor.errortrue
mit.licensePUBLISHER_CCen
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record