Does Hessian Data Improve the Performance of Machine Learning Potentials?

IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL
Journal of Chemical Theory and Computation Pub Date : 2025-07-22 Epub Date: 2025-07-02 DOI:10.1021/acs.jctc.5c00402
Austin Rodriguez, Justin S Smith, Jose L Mendoza-Cortes
{"title":"Does Hessian Data Improve the Performance of Machine Learning Potentials?","authors":"Austin Rodriguez, Justin S Smith, Jose L Mendoza-Cortes","doi":"10.1021/acs.jctc.5c00402","DOIUrl":null,"url":null,"abstract":"<p><p>The integration of machine learning into reactive chemistry, materials discovery, and drug design is transforming the development of novel molecules and materials. Machine Learning Interatomic Potentials (MLIPs) predict potential energies and forces with quantum chemistry accuracy, surpassing traditional approaches. Incorporating force fitting in MLIP training enhances potential-energy surface predictions and improves model transferability and reliability. This paper introduces and evaluates the integration of Hessian matrix training in MLIPs, which encodes second-order information about the PES curvature. Our evaluation focuses on models trained only to equilibrium geometries and first-order saddle points (i.e., critical points on the potential surface), demonstrating their ability to extrapolate to nonequilibrium geometries. This integration improves extrapolation capabilities, allowing MLIPs to accurately predict energies, forces, and Hessian predictions for nonequilibrium geometries. Hessian-trained MLIPs enhance reaction pathway modeling, transition state identification, and vibrational spectra predictions, benefiting molecular dynamics (MD) simulations and Nudged Elastic Band (NEB) calculations. By analyzing models trained with varying combinations of energy, force, and Hessian data on a small molecule reactive data set, we demonstrate that models including Hessian information not only extrapolate more accurately to unseen molecular systems, improving accuracy in reaction modeling and vibrational analysis, but also reduce the total amount of data required for effective training. However, the primary trade-off is increased computational expense, as Hessian training requires more resources than conventional energy-force training. Our findings provide comprehensive insights into the advantages and limitations of Hessian integration in MLIP training, allowing practitioners in computational chemistry to make informed decisions about employing this method in accordance with their specific research objectives and computational constraints.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":" ","pages":"6698-6710"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288065/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Theory and Computation","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jctc.5c00402","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/2 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The integration of machine learning into reactive chemistry, materials discovery, and drug design is transforming the development of novel molecules and materials. Machine Learning Interatomic Potentials (MLIPs) predict potential energies and forces with quantum chemistry accuracy, surpassing traditional approaches. Incorporating force fitting in MLIP training enhances potential-energy surface predictions and improves model transferability and reliability. This paper introduces and evaluates the integration of Hessian matrix training in MLIPs, which encodes second-order information about the PES curvature. Our evaluation focuses on models trained only to equilibrium geometries and first-order saddle points (i.e., critical points on the potential surface), demonstrating their ability to extrapolate to nonequilibrium geometries. This integration improves extrapolation capabilities, allowing MLIPs to accurately predict energies, forces, and Hessian predictions for nonequilibrium geometries. Hessian-trained MLIPs enhance reaction pathway modeling, transition state identification, and vibrational spectra predictions, benefiting molecular dynamics (MD) simulations and Nudged Elastic Band (NEB) calculations. By analyzing models trained with varying combinations of energy, force, and Hessian data on a small molecule reactive data set, we demonstrate that models including Hessian information not only extrapolate more accurately to unseen molecular systems, improving accuracy in reaction modeling and vibrational analysis, but also reduce the total amount of data required for effective training. However, the primary trade-off is increased computational expense, as Hessian training requires more resources than conventional energy-force training. Our findings provide comprehensive insights into the advantages and limitations of Hessian integration in MLIP training, allowing practitioners in computational chemistry to make informed decisions about employing this method in accordance with their specific research objectives and computational constraints.

Hessian数据是否提高了机器学习潜力的性能?
将机器学习整合到反应化学、材料发现和药物设计中,正在改变新分子和材料的发展。机器学习原子间势(MLIPs)以量子化学的精度预测势能和力,超越传统方法。在MLIP训练中加入力拟合可以增强势能面预测,提高模型的可转移性和可靠性。本文介绍并评价了MLIPs中Hessian矩阵训练的集成方法,该方法对PES曲率的二阶信息进行编码。我们的评估集中在只训练到平衡几何和一阶鞍点(即势表面上的临界点)的模型上,展示了它们外推到非平衡几何的能力。这种集成提高了外推能力,使MLIPs能够准确地预测非平衡几何的能量、力和Hessian预测。hessian训练的MLIPs增强了反应途径建模、过渡态识别和振动谱预测,有利于分子动力学(MD)模拟和微推弹性带(NEB)计算。通过分析在小分子反应数据集上使用能量、力和Hessian数据的不同组合训练的模型,我们证明了包含Hessian信息的模型不仅可以更准确地推断未知的分子系统,提高反应建模和振动分析的准确性,而且还减少了有效训练所需的数据总量。然而,主要的代价是增加了计算费用,因为Hessian训练比传统的能量-力量训练需要更多的资源。我们的研究结果为MLIP训练中Hessian集成的优势和局限性提供了全面的见解,使计算化学的从业者能够根据他们具体的研究目标和计算限制做出采用这种方法的明智决定。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Chemical Theory and Computation
Journal of Chemical Theory and Computation 化学-物理:原子、分子和化学物理
CiteScore
9.90
自引率
16.40%
发文量
568
审稿时长
1 months
期刊介绍: The Journal of Chemical Theory and Computation invites new and original contributions with the understanding that, if accepted, they will not be published elsewhere. Papers reporting new theories, methodology, and/or important applications in quantum electronic structure, molecular dynamics, and statistical mechanics are appropriate for submission to this Journal. Specific topics include advances in or applications of ab initio quantum mechanics, density functional theory, design and properties of new materials, surface science, Monte Carlo simulations, solvation models, QM/MM calculations, biomolecular structure prediction, and molecular dynamics in the broadest sense including gas-phase dynamics, ab initio dynamics, biomolecular dynamics, and protein folding. The Journal does not consider papers that are straightforward applications of known methods including DFT and molecular dynamics. The Journal favors submissions that include advances in theory or methodology with applications to compelling problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信