Extending atomic decomposition and many-body representation with a chemistry-motivated approach to machine learning potentials.

IF 12 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Nature computational science Pub Date : 2025-04-14 DOI:10.1038/s43588-025-00790-0

Qi Yu, Ruitao Ma, Chen Qu, Riccardo Conte, Apurba Nandi, Priyanka Pandey, Paul L Houston, Dong H Zhang, Joel M Bowman

{"title":"Extending atomic decomposition and many-body representation with a chemistry-motivated approach to machine learning potentials.","authors":"Qi Yu, Ruitao Ma, Chen Qu, Riccardo Conte, Apurba Nandi, Priyanka Pandey, Paul L Houston, Dong H Zhang, Joel M Bowman","doi":"10.1038/s43588-025-00790-0","DOIUrl":null,"url":null,"abstract":"<p><p>Most widely used machine learning potentials for condensed-phase applications rely on many-body permutationally invariant polynomial or atom-centered neural networks. However, these approaches face challenges in achieving chemical interpretability in atomistic energy decomposition and fully matching the computational efficiency of traditional force fields. Here we present a method that combines aspects of both approaches and balances accuracy and force-field-level speed. This method utilizes a monomer-centered representation, where the potential energy is decomposed into the sum of chemically meaningful monomeric energies. The structural descriptors of monomers are described by one-body and two-body effective interactions, enforced by appropriate sets of permutationally invariant polynomials as inputs to the feed-forward neural networks. Systematic assessments of models for gas-phase water trimer, liquid water, methane-water cluster and liquid carbon dioxide are performed. The improved accuracy, efficiency and flexibility of this method have promise for constructing accurate machine learning potentials and enabling large-scale quantum and classical simulations for complex molecular systems.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":12.0000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature computational science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s43588-025-00790-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Most widely used machine learning potentials for condensed-phase applications rely on many-body permutationally invariant polynomial or atom-centered neural networks. However, these approaches face challenges in achieving chemical interpretability in atomistic energy decomposition and fully matching the computational efficiency of traditional force fields. Here we present a method that combines aspects of both approaches and balances accuracy and force-field-level speed. This method utilizes a monomer-centered representation, where the potential energy is decomposed into the sum of chemically meaningful monomeric energies. The structural descriptors of monomers are described by one-body and two-body effective interactions, enforced by appropriate sets of permutationally invariant polynomials as inputs to the feed-forward neural networks. Systematic assessments of models for gas-phase water trimer, liquid water, methane-water cluster and liquid carbon dioxide are performed. The improved accuracy, efficiency and flexibility of this method have promise for constructing accurate machine learning potentials and enabling large-scale quantum and classical simulations for complex molecular systems.

查看原文本刊更多论文

用化学驱动的方法扩展原子分解和多体表示到机器学习潜能。

在凝聚态应用中，最广泛使用的机器学习潜力依赖于多体排列不变多项式或原子中心神经网络。然而，这些方法在实现原子能量分解的化学可解释性和完全匹配传统力场的计算效率方面面临挑战。在这里，我们提出了一种结合了这两种方法的方法，并平衡了精度和力场级速度。该方法采用以单体为中心的表示，将势能分解为具有化学意义的单体能量的总和。单体的结构描述符由单体和双体有效相互作用来描述，通过适当的排列不变多项式集作为前馈神经网络的输入来强制执行。对气相水三聚体、液态水、甲烷-水簇和液态二氧化碳模型进行了系统评估。该方法提高了精度、效率和灵活性，有望构建准确的机器学习势，并实现复杂分子系统的大规模量子和经典模拟。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Nature computational science

CiteScore

11.70

自引率

0.00%

发文量