Enhancing SchNet-Based Structure Prediction for Doped Clusters via Transfer Learning and Fine-Tuning

IF 2.8 2区 化学 Q3 CHEMISTRY, PHYSICAL
Zi-Xin Wen, , , Hui-Fang Li, , , Kai-Le Jiang, , and , Huai-Qian Wang*, 
{"title":"Enhancing SchNet-Based Structure Prediction for Doped Clusters via Transfer Learning and Fine-Tuning","authors":"Zi-Xin Wen,&nbsp;, ,&nbsp;Hui-Fang Li,&nbsp;, ,&nbsp;Kai-Le Jiang,&nbsp;, and ,&nbsp;Huai-Qian Wang*,&nbsp;","doi":"10.1021/acs.jpca.5c05018","DOIUrl":null,"url":null,"abstract":"<p >Doped clusters regulate their electronic structures and magnetic properties via heteroatoms, optimizing stability and core physicochemical performances to suit practical applications. Accurate structural prediction is a key foundation for elucidating structure–property relationships and advancing industrial applications. Despite the advancements of machine learning (ML) in cluster structure prediction, two key challenges remain: (1) predicting heterogeneous clusters demands massive data and computational resources; (2) the lack of standardized approaches for ML frameworks on heterogeneous clusters hinders the portability and efficiency of ML models. To address these challenges, we propose a method based on the SchNet model─which offers a well-established framework well-suited for physicochemical tasks (e.g., potential energy surface (PES) fitting and cluster dynamics simulations)─and integrate transfer learning into this method. By freezing neural network layers and fine-tuning with a minimal data set, we optimize the model for EuSi<sub><i>n</i></sub> (<i>n</i> = 3–12) clusters. The data set was constructed via ABCluster and Gaussian, with energy validation performed at the PBEPBE/3-21G//LANL2DZ and PBEPBE/6-311G(d)//SDD levels to ensure diversity and accuracy. The transfer-learned ML model successfully predicts the global minimum structures of EuSi<sub><i>n</i></sub> (<i>n</i> = 3–12) clusters, matching results from traditional density functional theory calculations. Compared to the original SchNet model, the method reduces computational time by 54.09% and data requirements by 88.89%, demonstrating significant efficiency gains. This work overcomes traditional doped cluster calculation bottlenecks, and provides a paradigm for doped cluster ML studies.</p>","PeriodicalId":59,"journal":{"name":"The Journal of Physical Chemistry A","volume":"129 41","pages":"9616–9624"},"PeriodicalIF":2.8000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry A","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jpca.5c05018","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Doped clusters regulate their electronic structures and magnetic properties via heteroatoms, optimizing stability and core physicochemical performances to suit practical applications. Accurate structural prediction is a key foundation for elucidating structure–property relationships and advancing industrial applications. Despite the advancements of machine learning (ML) in cluster structure prediction, two key challenges remain: (1) predicting heterogeneous clusters demands massive data and computational resources; (2) the lack of standardized approaches for ML frameworks on heterogeneous clusters hinders the portability and efficiency of ML models. To address these challenges, we propose a method based on the SchNet model─which offers a well-established framework well-suited for physicochemical tasks (e.g., potential energy surface (PES) fitting and cluster dynamics simulations)─and integrate transfer learning into this method. By freezing neural network layers and fine-tuning with a minimal data set, we optimize the model for EuSin (n = 3–12) clusters. The data set was constructed via ABCluster and Gaussian, with energy validation performed at the PBEPBE/3-21G//LANL2DZ and PBEPBE/6-311G(d)//SDD levels to ensure diversity and accuracy. The transfer-learned ML model successfully predicts the global minimum structures of EuSin (n = 3–12) clusters, matching results from traditional density functional theory calculations. Compared to the original SchNet model, the method reduces computational time by 54.09% and data requirements by 88.89%, demonstrating significant efficiency gains. This work overcomes traditional doped cluster calculation bottlenecks, and provides a paradigm for doped cluster ML studies.

Abstract Image

通过迁移学习和微调增强基于schnet的掺杂簇结构预测。
掺杂团簇通过杂原子调节其电子结构和磁性能,优化稳定性和核心物理化学性能,以适应实际应用。准确的结构预测是阐明结构-性能关系和推进工业应用的重要基础。尽管机器学习(ML)在集群结构预测方面取得了进展,但仍然存在两个关键挑战:(1)预测异构集群需要大量数据和计算资源;(2)异构集群上机器学习框架的标准化方法的缺乏阻碍了机器学习模型的可移植性和效率。为了解决这些挑战,我们提出了一种基于SchNet模型的方法,该模型提供了一个非常适合物理化学任务(例如,势能面(PES)拟合和聚类动力学模拟)的完善框架,并将迁移学习集成到该方法中。通过冻结神经网络层并使用最小数据集进行微调,我们优化了EuSin (n = 3-12)集群的模型。数据集通过ABCluster和Gaussian构建,在PBEPBE/3-21G//LANL2DZ和PBEPBE/6-311G(d)//SDD级别进行能量验证,以确保多样性和准确性。迁移学习的ML模型成功地预测了EuSin (n = 3-12)簇的全局最小结构,与传统密度泛函理论计算的结果相匹配。与最初的SchNet模型相比,该方法减少了54.09%的计算时间和88.89%的数据需求,显示出显著的效率提升。这项工作克服了传统掺杂簇计算的瓶颈,为掺杂簇的机器学习研究提供了一个范例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
The Journal of Physical Chemistry A
The Journal of Physical Chemistry A 化学-物理:原子、分子和化学物理
CiteScore
5.20
自引率
10.30%
发文量
922
审稿时长
1.3 months
期刊介绍: The Journal of Physical Chemistry A is devoted to reporting new and original experimental and theoretical basic research of interest to physical chemists, biophysical chemists, and chemical physicists.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信