{"title":"On the reproducibility of free energy surfaces in machine-learned collective variable spaces.","authors":"Florian M Dietrich, Matteo Salvalaglio","doi":"10.1063/5.0287912","DOIUrl":null,"url":null,"abstract":"<p><p>As Machine-Learned Collective Variables (MLCVs) are becoming increasingly relevant in the molecular simulation literature, we discuss the necessary conditions to enable reproducibility in calculating and representing free energy surfaces. We note that the variability of the training process and the roughness of the hyperparameter space impose inherent limits on the reproducibility of results even when the mathematical structure of the model defining a collective variable is consistent. To this end, we propose the adoption of a geometric (gauge invariant) free energy representation to obtain consistent free energy differences across training instances and architectures. Furthermore, we introduce a normalization factor to model gradients for biased enhanced sampling. This factor effectively unifies free energy definitions and addresses practical issues preventing the widespread use and deployment of MLCVs.</p>","PeriodicalId":15313,"journal":{"name":"Journal of Chemical Physics","volume":"163 14","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Physics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1063/5.0287912","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
As Machine-Learned Collective Variables (MLCVs) are becoming increasingly relevant in the molecular simulation literature, we discuss the necessary conditions to enable reproducibility in calculating and representing free energy surfaces. We note that the variability of the training process and the roughness of the hyperparameter space impose inherent limits on the reproducibility of results even when the mathematical structure of the model defining a collective variable is consistent. To this end, we propose the adoption of a geometric (gauge invariant) free energy representation to obtain consistent free energy differences across training instances and architectures. Furthermore, we introduce a normalization factor to model gradients for biased enhanced sampling. This factor effectively unifies free energy definitions and addresses practical issues preventing the widespread use and deployment of MLCVs.
期刊介绍:
The Journal of Chemical Physics publishes quantitative and rigorous science of long-lasting value in methods and applications of chemical physics. The Journal also publishes brief Communications of significant new findings, Perspectives on the latest advances in the field, and Special Topic issues. The Journal focuses on innovative research in experimental and theoretical areas of chemical physics, including spectroscopy, dynamics, kinetics, statistical mechanics, and quantum mechanics. In addition, topical areas such as polymers, soft matter, materials, surfaces/interfaces, and systems of biological relevance are of increasing importance.
Topical coverage includes:
Theoretical Methods and Algorithms
Advanced Experimental Techniques
Atoms, Molecules, and Clusters
Liquids, Glasses, and Crystals
Surfaces, Interfaces, and Materials
Polymers and Soft Matter
Biological Molecules and Networks.