Accurate Lattice Free Energies of Packing Polymorphs from Probabilistic Generative Models.

IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL
Journal of Chemical Theory and Computation Pub Date : 2025-03-11 Epub Date: 2025-02-21 DOI:10.1021/acs.jctc.4c01612
Edgar Olehnovics, Yifei Michelle Liu, Nada Mehio, Ahmad Y Sheikh, Michael R Shirts, Matteo Salvalaglio
{"title":"Accurate Lattice Free Energies of Packing Polymorphs from Probabilistic Generative Models.","authors":"Edgar Olehnovics, Yifei Michelle Liu, Nada Mehio, Ahmad Y Sheikh, Michael R Shirts, Matteo Salvalaglio","doi":"10.1021/acs.jctc.4c01612","DOIUrl":null,"url":null,"abstract":"<p><p>Finite-temperature lattice free energy differences between polymorphs of molecular crystals are fundamental to understanding and predicting the relative stability relationships underpinning polymorphism, yet are computationally expensive to obtain. Here, we implement and critically assess machine-learning-enabled targeted free energy calculations derived from flow-based generative models to compute the free energy difference between two ice crystal polymorphs (Ice XI and Ic), modeled with a fully flexible empirical classical force field. We demonstrate that even when remapping from an analytical reference distribution, such methods enable a cost-effective and accurate calculation of free energy differences between disconnected metastable ensembles when trained on locally ergodic data sampled exclusively from the ensembles of interest. Unlike classical free energy perturbation methods, such as the Einstein crystal method, the targeted approach analyzed in this work requires no additional sampling of intermediate perturbed Hamiltonians, offering significant computational savings. To systematically assess the accuracy of the method, we monitored the convergence of free energy estimates during training by implementing an overfitting-aware weighted averaging strategy. By comparing our results with ground-truth free energy differences computed with the Einstein crystal method, we assess the accuracy and efficiency of two different model architectures, employing two different representations of the supercell degrees of freedom (Cartesian vs quaternion-based). We conduct our assessment by comparing free energy differences between crystal supercells of different sizes and temperatures and assessing the accuracy in extrapolating lattice free energies to the thermodynamic limit. While at low temperatures and in small system sizes, the models perform with similar accuracy. We note that for larger systems and high temperatures, the choice of representation is key to obtaining generalizable results of quality comparable to that obtained from the Einstein crystal method. We believe this work to be a stepping stone toward efficient free energy calculations in larger, more complex molecular crystals.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":" ","pages":"2244-2255"},"PeriodicalIF":5.5000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11912200/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.4c01612","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Finite-temperature lattice free energy differences between polymorphs of molecular crystals are fundamental to understanding and predicting the relative stability relationships underpinning polymorphism, yet are computationally expensive to obtain. Here, we implement and critically assess machine-learning-enabled targeted free energy calculations derived from flow-based generative models to compute the free energy difference between two ice crystal polymorphs (Ice XI and Ic), modeled with a fully flexible empirical classical force field. We demonstrate that even when remapping from an analytical reference distribution, such methods enable a cost-effective and accurate calculation of free energy differences between disconnected metastable ensembles when trained on locally ergodic data sampled exclusively from the ensembles of interest. Unlike classical free energy perturbation methods, such as the Einstein crystal method, the targeted approach analyzed in this work requires no additional sampling of intermediate perturbed Hamiltonians, offering significant computational savings. To systematically assess the accuracy of the method, we monitored the convergence of free energy estimates during training by implementing an overfitting-aware weighted averaging strategy. By comparing our results with ground-truth free energy differences computed with the Einstein crystal method, we assess the accuracy and efficiency of two different model architectures, employing two different representations of the supercell degrees of freedom (Cartesian vs quaternion-based). We conduct our assessment by comparing free energy differences between crystal supercells of different sizes and temperatures and assessing the accuracy in extrapolating lattice free energies to the thermodynamic limit. While at low temperatures and in small system sizes, the models perform with similar accuracy. We note that for larger systems and high temperatures, the choice of representation is key to obtaining generalizable results of quality comparable to that obtained from the Einstein crystal method. We believe this work to be a stepping stone toward efficient free energy calculations in larger, more complex molecular crystals.

基于概率生成模型的包装多态的精确晶格自由能。
分子晶体多晶态之间的有限温度晶格自由能差是理解和预测支撑多晶态的相对稳定性关系的基础,但计算成本很高。在这里,我们实施并批判性地评估了基于流动生成模型的机器学习目标自由能计算,以计算两种冰晶多晶(ice XI和Ic)之间的自由能差,并采用完全灵活的经验经典力场建模。我们证明,即使从分析参考分布重新映射,当训练局部遍历数据时,这种方法也能够经济有效地准确计算断开亚稳态系之间的自由能差,只从感兴趣的系中采样。与经典的自由能摄动方法(如爱因斯坦晶体方法)不同,本工作中分析的目标方法不需要额外的中间扰动哈密顿量采样,从而节省了大量的计算量。为了系统地评估该方法的准确性,我们通过实施一种感知过拟合的加权平均策略来监测训练期间自由能估计的收敛性。通过将我们的结果与用爱因斯坦晶体方法计算的地真自由能差进行比较,我们评估了两种不同模型架构的准确性和效率,采用了两种不同的超级单体自由度表示(笛卡儿和基于四元数的)。我们通过比较不同尺寸和温度的晶体超级电池之间的自由能差异,并评估将晶格自由能外推到热力学极限的准确性来进行评估。而在低温和小系统尺寸下,模型表现出相似的精度。我们注意到,对于较大的系统和高温,表示的选择是获得与爱因斯坦晶体方法获得的质量相当的可推广结果的关键。我们相信这项工作是在更大、更复杂的分子晶体中进行有效自由能计算的垫脚石。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Chemical Theory and Computation
Journal of Chemical Theory and Computation 化学-物理:原子、分子和化学物理
CiteScore
9.90
自引率
16.40%
发文量
568
审稿时长
1 months
期刊介绍: The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信