MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Enhanced sampling of robust molecular datasets with uncertainty-based collective variables

Author(s)
Tan, Aik Rui; Dietschreit, Johannes CB; Gómez-Bombarelli, Rafael
Thumbnail
DownloadPublished version (9.638Mb)
Publisher with Creative Commons License

Publisher with Creative Commons License

Creative Commons Attribution

Terms of use
Creative Commons Attribution-Noncommercial https://creativecommons.org/licenses/by-nc/4.0/
Metadata
Show full item record
Abstract
Generating a dataset that is representative of the accessible configuration space of a molecular system is crucial for the robustness of machine-learned interatomic potentials. However, the complexity of molecular systems, characterized by intricate potential energy surfaces, with numerous local minima and energy barriers, presents a significant challenge. Traditional methods of data generation, such as random sampling or exhaustive exploration, are either intractable or may not capture rare, but highly informative configurations. In this study, we propose a method that leverages uncertainty as the collective variable (CV) to guide the acquisition of chemically relevant data points, focusing on regions of configuration space where ML model predictions are most uncertain. This approach employs a Gaussian Mixture Model-based uncertainty metric from a single model as the CV for biased molecular dynamics simulations. The effectiveness of our approach in overcoming energy barriers and exploring unseen energy minima, thereby enhancing the dataset in an active learning framework, is demonstrated on alanine dipeptide and bulk silica.
Date issued
2025-01-15
URI
https://hdl.handle.net/1721.1/165353
Department
Massachusetts Institute of Technology. Department of Materials Science and Engineering
Journal
The Journal of Chemical Physics
Publisher
AIP Publishing
Citation
Aik Rui Tan, Johannes C. B. Dietschreit, Rafael Gómez-Bombarelli; Enhanced sampling of robust molecular datasets with uncertainty-based collective variables. J. Chem. Phys. 21 January 2025; 162 (3): 034114.
Version: Final published version

Collections
  • MIT Open Access Articles

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.