Show simple item record

dc.contributor.authorRangel, Luiz Thibério
dc.contributor.authorSoucy, Shannon M
dc.contributor.authorSetubal, João C
dc.contributor.authorGogarten, Johann Peter
dc.contributor.authorFournier, Gregory P.
dc.date.accessioned2021-12-02T16:25:22Z
dc.date.available2021-12-02T14:13:25Z
dc.date.available2021-12-02T16:25:22Z
dc.date.issued2021-09
dc.identifier.urihttps://hdl.handle.net/1721.1/138288.2
dc.description.abstractAssessing the compatibility between gene family phylogenies is a crucial and often computationally demanding step in many phylogenomic analyses. Here, we describe the Evolutionary Similarity Index (IES), a means to assess shared evolution between gene families using a weighted orthogonal distance regression model applied to sequence distances. The utilization of pairwise distance matrices circumvents comparisons between gene tree topologies, which are inherently uncertain and sensitive to evolutionary model choice, phylogenetic reconstruction artifacts, and other sources of error. Furthermore, IES enables the many-to-many pairing of multiple copies between similarly evolving gene families. This is done by selecting non-overlapping pairs of copies, one from each assessed family, and yielding the least sum of squared residuals. Analyses of simulated gene family data sets show that IES’s accuracy is on par with popular tree-based methods while also less susceptible to noise introduced by sequence alignment and evolutionary model fitting. Applying IES to an empirical data set of 1,322 genes from 42 archaeal genomes identified eight major clusters of gene families with compatible evolutionary trends. The most cohesive cluster consisted of 62 genes with compatible evolutionary signal, which occur as both single-copy and multiple homologs per genome; phylogenetic analysis of concatenated alignments from this cluster produced a tree closely matching previously published species trees for Archaea. Four other clusters are mainly composed of accessory genes with limited distribution among Archaea and enriched toward specific metabolic functions. Pairwise evolutionary distances obtained from these accessory gene clusters suggest patterns of interphyla horizontal gene transfer. An IES implementation is available at https://github.com/lthiberiol/evolSimIndex.</jats:p>en_US
dc.description.sponsorshipSimons Foundation (Award 339603)en_US
dc.description.sponsorshipNSF (Award 1615426)en_US
dc.language.isoen
dc.publisherOxford University Press (OUP)en_US
dc.relation.isversionof10.1093/gbe/evab187en_US
dc.rightsCreative Commons Attribution NonCommercial License 4.0en_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc/4.0/en_US
dc.sourceOxford University Pressen_US
dc.titleAn Efficient, Nonphylogenetic Method for Detecting Genes Sharing Evolutionary Signals in Phylogenomic Data Setsen_US
dc.typeArticleen_US
dc.identifier.citationRangel, Luiz Thibério, Soucy, Shannon M, Setubal, João C, Gogarten, Johann Peter and Fournier, Gregory P. 2021. "An Efficient, Nonphylogenetic Method for Detecting Genes Sharing Evolutionary Signals in Phylogenomic Data Sets." Genome Biology and Evolution, 13 (9).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Earth, Atmospheric, and Planetary Sciencesen_US
dc.relation.journalGenome Biology and Evolutionen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2021-12-02T14:10:16Z
dspace.orderedauthorsRangel, LT; Soucy, SM; Setubal, JC; Gogarten, JP; Fournier, GPen_US
dspace.date.submission2021-12-02T14:10:21Z
mit.journal.volume13en_US
mit.journal.issue9en_US
mit.licensePUBLISHER_CC
mit.metadata.statusPublication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version