X-Mapper: fast and accurate sequence alignment via gapped x-mers
Author(s)
Gaston, Jeffry M.; Alm, Eric J.; Zhang, An-Ni
Download13059_2024_Article_3473.pdf (2.397Mb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Additional downloads
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aiming to offer high speed and accuracy via dynamic-length seeds containing gaps, called gapped x-mers. We observe 11–24-fold fewer suboptimal alignments analyzing a human reference and 3–579-fold lower inconsistency across bacterial references than other aligners, improving on 53% and 30% of reads aligned to non-target strains and species, respectively. Other seed-based analysis algorithms might benefit from gapped x-mers too.
Date issued
2025-01-22Department
Massachusetts Institute of Technology. Department of Biological EngineeringJournal
Genome Biology
Publisher
BioMed Central
Citation
Gaston, J.M., Alm, E.J. & Zhang, AN. X-Mapper: fast and accurate sequence alignment via gapped x-mers. Genome Biol 26, 15 (2025).
Version: Final published version