rSW-seq: Algorithm for detection of copy number alterations in deep sequencing data
Author(s)
Kim, Tae-Min; Luquette, Lovelace J.; Xi, Ruibin; Park, Peter J.
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Abstract
Background
Recent advances in sequencing technologies have enabled generation of large-scale genome sequencing data. These data can be used to characterize a variety of genomic features, including the DNA copy number profile of a cancer genome. A robust and reliable method for screening chromosomal alterations would allow a detailed characterization of the cancer genome with unprecedented accuracy.
Results
We develop a method for identifying copy number alterations in a tumor genome relative to its matched control, based on applying the Smith-Waterman algorithm to single-end sequencing data. In a performance test with simulated data, our algorithm shows >90% sensitivity and >90% precision in detecting a single copy number change that contains approximately 500 reads for the normal sample. With 100-bp reads, this corresponds to a ~50 kb region at 1X coverage of the human genome. We further refine the algorithm to develop rSW-seq (recursive Smith-Waterman-seq), which identifies alterations in complex configurations of the kind commonly observed in human cancer genomes. To validate our approach, we compare our algorithm with an existing algorithm using simulated and publicly available datasets. We also compare the sequencing-based profiles to microarray-based results.
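The two quantitative ideas in this paragraph can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the integer read weights (each tumor read scores +N_normal, each normal read scores -N_tumor, so the total is zero when there is no copy number change), and the toy coordinates are all assumptions made for illustration. The first function reproduces the read-count to region-size arithmetic; the second applies a Smith-Waterman-style local score, clamped at zero, to the merged tumor and normal read positions to find the single best candidate gain. The recursive refinement that gives rSW-seq its name is not shown.

```python
from typing import List, Optional, Tuple


def region_size_bp(n_reads: int, read_len_bp: int, coverage: float) -> float:
    """Genomic span (bp) expected to contain n_reads at the given coverage."""
    return n_reads * read_len_bp / coverage


def best_gain_segment(
    tumor: List[int], normal: List[int]
) -> Tuple[int, Optional[int], Optional[int]]:
    """Return (score, start, end) of the highest-scoring tumor-enriched run.

    Assumed weighting (hypothetical, chosen so the weights sum to zero):
    tumor read = +len(normal), normal read = -len(tumor).
    """
    w_tumor, w_normal = len(normal), -len(tumor)
    # Merge both samples into one list of (position, weight) sorted by position.
    events = sorted([(p, w_tumor) for p in tumor] + [(p, w_normal) for p in normal])

    score = 0
    seg_start: Optional[int] = None
    best: Tuple[int, Optional[int], Optional[int]] = (0, None, None)
    for pos, weight in events:
        if score == 0:
            seg_start = pos  # a new local segment can only begin from zero
        score = max(0, score + weight)  # Smith-Waterman style clamp at zero
        if score > best[0]:
            best = (score, seg_start, pos)
    return best


# ~500 reads of 100 bp at 1X coverage span roughly 50 kb.
size = region_size_bp(500, 100, 1.0)

# Toy data: normal reads every 100 bp; tumor has five extra reads near 300-350.
normal_reads = list(range(0, 1000, 100))
tumor_reads = list(range(0, 1000, 100)) + [310, 320, 330, 340, 350]
gain = best_gain_segment(tumor_reads, normal_reads)
```

Swapping the two arguments would score tumor-depleted runs instead, which is one simple way to look for candidate losses under the same assumed weighting.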
Conclusion
We propose rSW-seq as an efficient method for detecting copy number changes in the tumor genome.
Date issued
2010-08
Department
Harvard University--MIT Division of Health Sciences and Technology
Journal
BMC Bioinformatics
Publisher
Springer (Biomed Central Ltd.)
Citation
Kim, Tae-Min et al. “rSW-seq: Algorithm for Detection of Copy Number Alterations in Deep Sequencing Data.” BMC Bioinformatics 11.1 (2010): 432. Web. 9 Mar. 2012.
Version: Final published version
ISSN
1471-2105