Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds

Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S.; Sasisekharan, V.; Sasisekharan, Ram

Author(s)

Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S; Sasisekharan, Viswanathan; Sasisekharan, Ram

Downloadjournal.pone.0009391.PDF (901.8Kb)

PUBLISHER_CC

Terms of use

Attribution 4.0 International (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/

Metadata

Show full item record

Abstract

Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains are fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN) is significantly distinguishable across different folds, thus appearing to be "signature" of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1-2 angstroms (mean 1.61A) Cα RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the 'twilight' and 'midnight' zones wherein < 30% and < 10% target-template sequence identity prevails (mean twilight RMSD of 1.69A). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from plaguecausative bacterium Yersinia Pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools.

Date issued

2010-02

URI

http://hdl.handle.net/1721.1/116196

Department

Massachusetts Institute of Technology. Department of Biology; Koch Institute for Integrative Cancer Research at MIT

Journal

PLoS ONE

Publisher

Public Library of Science (PLoS)

Citation

Soundararajan, Venkataramanan et al. “Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds.” Edited by Neeraj Vij. PLoS ONE 5, 2 (February 2010): e9391 © 2010 Soundararajan et al

Version: Final published version

ISSN

1932-6203

Collections

MIT Open Access Articles

DSpace@MIT