Show simple item record

dc.contributor.authorQin, Zhao
dc.contributor.authorYu, Qingyi
dc.contributor.authorBuehler, Markus J
dc.date.accessioned2020-07-01T22:56:06Z
dc.date.available2020-07-01T22:56:06Z
dc.date.issued2020-04
dc.date.submitted2019-06
dc.identifier.issn2046-2069
dc.identifier.urihttps://hdl.handle.net/1721.1/126049
dc.description.abstractNatural vibrations and resonances are intrinsic features of protein structures and enable differentiation of one structure from another. These nanoscale features are important to help to understand the dynamics of a protein molecule and identify the effects of small sequence or other geometric alterations that may not cause significant visible structural changes, such as point mutations associated with disease or drug design. Although normal mode analysis provides a powerful way to accurately extract the natural frequencies of a protein, it must meet several critical conditions, including availability of high-resolution structures, availability of good chemical force fields and memory-intensive large-scale computing resources. Here, we study the natural frequency of over 100?000 known protein molecular structures from the Protein Data Bank and use this dataset to carefully investigate the correlation between their structural features and these natural frequencies by using a machine learning model composed of a Feedforward Neural Network made of four hidden layers that predicts the natural frequencies in excellent agreement with full-atomistic normal mode calculations, but is significantly more computationally efficient. In addition to the computational advance, we demonstrate that this model can be used to directly obtain the natural frequencies by merely using five structural features of protein molecules as predictor variables, including the largest and smallest diameter, and the ratio of amino acid residues with alpha-helix, beta strand and 3-10 helix domains. These structural features can be either experimentally or computationally obtained, and do not require a full-atomistic model of a protein of interest. This method is helpful in predicting the absorption and resonance functions of an unknown protein molecule without solving its full atomic structure. ©2020 The Royal Society of Chemistry.en_US
dc.description.sponsorshipONR (grant #N00014-16-1-2333)en_US
dc.description.sponsorshipNIH (U01 EB014976)en_US
dc.description.sponsorshipArmy Research Office - ARO (73793EG)en_US
dc.language.isoen
dc.publisherRoyal Society of Chemistry (RSC)en_US
dc.relation.isversionofhttps://dx.doi.org/10.1039/c9ra04186aen_US
dc.rightsCreative Commons Attribution Noncommercial 3.0 unported licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc/3.0/en_US
dc.sourceRoyal Society of Chemistry (RSC)en_US
dc.titleMachine learning model for fast prediction of the natural frequencies of protein moleculesen_US
dc.typeArticleen_US
dc.identifier.citationQin, Zhao et al., "Machine learning model for fast prediction of the natural frequencies of protein molecules." RSC Advances 10, 28 (April 2020): p. 16607-15 doi. 10.1039/C9RA04186A ©2020 Authorsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Atomistic and Molecular Mechanicsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Civil and Environmental Engineeringen_US
dc.relation.journalRSC Advancesen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2020-05-19T15:16:15Z
dspace.date.submission2020-05-19T15:16:17Z
mit.journal.volume10en_US
mit.journal.issue28en_US
mit.licensePUBLISHER_CC
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record