Simultaneous alignment and folding of protein sequences
Author(s)
Waldispuhl, Jerome; O'Donnell, Charles William; Will, Sebastian; Devadas, Srinivas; Backofen, Rolf; Berger, Bonnie
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Abstract
Building accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially for sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Tested against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu.
Date issued
2009-05
Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; Massachusetts Institute of Technology. Department of Mathematics
Journal
Research in Computational Molecular Biology
Publisher
Springer Berlin Heidelberg
Citation
Waldispühl, Jérôme et al. “Simultaneous Alignment and Folding of Protein Sequences.” Research in Computational Molecular Biology 2009.
Version: Author's final manuscript
ISSN
1611-3349
0302-9743