混合泛函DFT计算训练机器学习潜力的有效配置采样:在大气化学中的应用。

IF 10.7 2区 材料科学 Q1 CHEMISTRY, PHYSICAL
Sungwoo Kang, Runlong Cai, Dong Sik Yang, Dong Jin Ham, Markku Kulmala, John H Seinfeld, Jingkun Jiang, Hyun Chul Lee
{"title":"混合泛函DFT计算训练机器学习潜力的有效配置采样:在大气化学中的应用。","authors":"Sungwoo Kang, Runlong Cai, Dong Sik Yang, Dong Jin Ham, Markku Kulmala, John H Seinfeld, Jingkun Jiang, Hyun Chul Lee","doi":"10.1002/smtd.202500547","DOIUrl":null,"url":null,"abstract":"<p><p>Machine-learning potentials (MLPs), trained to predict energies from quantum chemical calculations, are widely employed to conduct large-scale MD simulations. However, MLPs are mostly trained on computationally inexpensive local/semilocal functionals, as generating training datasets using higher-accuracy theories, such as hybrid functionals, is challenging due to their high computational cost. Here, an active transfer learning scheme is developed to efficiently sample configurations for hybrid functional calculations. The proposed method is evaluated on atmospheric secondary aerosol formation reactions: clustering of sulfuric acid (SA), and dimethylamine (DMA), and oxidation of toluene. The accuracy of the trained MLP is shown to be comparable to that of the quantum chemical calculations, with errors within a few meV per atom. The molecular dynamics simulations are executed stably on a nanosecond scale, resulting in the formation of nanometer-size clusters. Thus, this study paves the way for establishing a general protocol to enable high-level atomistic simulations for a wide range of chemical systems.</p>","PeriodicalId":229,"journal":{"name":"Small Methods","volume":" ","pages":"e2500547"},"PeriodicalIF":10.7000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Configuration Sampling for Hybrid Functional DFT Calculations to Train Machine-Learning Potentials: Application to Atmospheric Chemistry.\",\"authors\":\"Sungwoo Kang, Runlong Cai, Dong Sik Yang, Dong Jin Ham, Markku Kulmala, John H Seinfeld, Jingkun Jiang, Hyun Chul Lee\",\"doi\":\"10.1002/smtd.202500547\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Machine-learning potentials (MLPs), trained to predict energies from quantum chemical calculations, are widely employed to conduct large-scale MD simulations. However, MLPs are mostly trained on computationally inexpensive local/semilocal functionals, as generating training datasets using higher-accuracy theories, such as hybrid functionals, is challenging due to their high computational cost. Here, an active transfer learning scheme is developed to efficiently sample configurations for hybrid functional calculations. The proposed method is evaluated on atmospheric secondary aerosol formation reactions: clustering of sulfuric acid (SA), and dimethylamine (DMA), and oxidation of toluene. The accuracy of the trained MLP is shown to be comparable to that of the quantum chemical calculations, with errors within a few meV per atom. The molecular dynamics simulations are executed stably on a nanosecond scale, resulting in the formation of nanometer-size clusters. Thus, this study paves the way for establishing a general protocol to enable high-level atomistic simulations for a wide range of chemical systems.</p>\",\"PeriodicalId\":229,\"journal\":{\"name\":\"Small Methods\",\"volume\":\" \",\"pages\":\"e2500547\"},\"PeriodicalIF\":10.7000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Small Methods\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://doi.org/10.1002/smtd.202500547\",\"RegionNum\":2,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Small Methods","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/smtd.202500547","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

机器学习电位(mlp)被训练来预测量子化学计算的能量,被广泛用于进行大规模的MD模拟。然而,mlp大多是在计算成本低廉的局部/半局部泛函上进行训练,因为使用更高精度的理论(如混合泛函)生成训练数据集具有挑战性,因为它们的计算成本很高。本文提出了一种主动迁移学习方案,用于混合泛函计算的有效采样配置。对该方法进行了大气二次气溶胶形成反应的评价:硫酸(SA)和二甲胺(DMA)的聚类,以及甲苯的氧化。经过训练的MLP的精度可以与量子化学计算相媲美,误差在每个原子几兆电子伏特以内。分子动力学模拟在纳秒尺度上稳定地进行,从而形成纳米尺寸的团簇。因此,本研究为建立一种通用协议铺平了道路,使高水平的原子模拟能够适用于广泛的化学系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Efficient Configuration Sampling for Hybrid Functional DFT Calculations to Train Machine-Learning Potentials: Application to Atmospheric Chemistry.

Machine-learning potentials (MLPs), trained to predict energies from quantum chemical calculations, are widely employed to conduct large-scale MD simulations. However, MLPs are mostly trained on computationally inexpensive local/semilocal functionals, as generating training datasets using higher-accuracy theories, such as hybrid functionals, is challenging due to their high computational cost. Here, an active transfer learning scheme is developed to efficiently sample configurations for hybrid functional calculations. The proposed method is evaluated on atmospheric secondary aerosol formation reactions: clustering of sulfuric acid (SA), and dimethylamine (DMA), and oxidation of toluene. The accuracy of the trained MLP is shown to be comparable to that of the quantum chemical calculations, with errors within a few meV per atom. The molecular dynamics simulations are executed stably on a nanosecond scale, resulting in the formation of nanometer-size clusters. Thus, this study paves the way for establishing a general protocol to enable high-level atomistic simulations for a wide range of chemical systems.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Small Methods
Small Methods Materials Science-General Materials Science
CiteScore
17.40
自引率
1.60%
发文量
347
期刊介绍: Small Methods is a multidisciplinary journal that publishes groundbreaking research on methods relevant to nano- and microscale research. It welcomes contributions from the fields of materials science, biomedical science, chemistry, and physics, showcasing the latest advancements in experimental techniques. With a notable 2022 Impact Factor of 12.4 (Journal Citation Reports, Clarivate Analytics, 2023), Small Methods is recognized for its significant impact on the scientific community. The online ISSN for Small Methods is 2366-9608.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信