Predefined attention-focused mechanism using center-environment features: a machine learning study of alloying effects on the stability of Nb5Si3 alloys†

IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY
Yuchao Tang, Bin Xiao, Shuizhou Chen, Quan Qian and Yi Liu
{"title":"Predefined attention-focused mechanism using center-environment features: a machine learning study of alloying effects on the stability of Nb5Si3 alloys†","authors":"Yuchao Tang, Bin Xiao, Shuizhou Chen, Quan Qian and Yi Liu","doi":"10.1039/D5DD00079C","DOIUrl":null,"url":null,"abstract":"<p >Digital encoding of material structures using graph-based features combined with deep neural networks often lacks local specificity. Additionally, incorporating a self-attention mechanism increases architectural complexity and demands extensive data. To overcome these challenges, we developed a Center-Environment (CE) feature representation—a less data-intensive, physics-informed predefined attention mechanism. The pre-attention mechanism underlying the CE model shifts attention from complex black-box machine learning (ML) algorithms to explicit feature models with physical meaning, reducing data requirements while enhancing the transparency and interpretability of ML models. This CE-based ML approach was employed to investigate the alloying effects on the structural stability of Nb<small><sub>5</sub></small>Si<small><sub>3</sub></small>, guiding data-driven compositional design for ultra-high-temperature NbSi superalloys. The CE features leveraged the Atomic Environment Type (AET) method to characterize the local low-symmetry physical environments of atoms. The optimized CE<small><sub>AET</sub></small> models reasonably predicted double-site substitution energies in α-Nb<small><sub>5</sub></small>Si<small><sub>3</sub></small>, achieving a mean absolute error (MAE) of 329.43 meV per cell. The robust transferability of the CE<small><sub>AET</sub></small> models was demonstrated by their successful prediction of untrained β-Nb<small><sub>5</sub></small>Si<small><sub>3</sub></small> structures. Site occupancy preferences were identified for B, Si, and Al at Si sites and for Ti, Hf, and Zr at Nb sites within β-Nb<small><sub>5</sub></small>Si<small><sub>3</sub></small>. This CE-based ML approach represents a broadly applicable and intelligent computational design method capable of handling complex crystal structures with strong transferability, even when working with small datasets.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 7","pages":" 1870-1883"},"PeriodicalIF":6.2000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00079c?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital discovery","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/dd/d5dd00079c","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Digital encoding of material structures using graph-based features combined with deep neural networks often lacks local specificity. Additionally, incorporating a self-attention mechanism increases architectural complexity and demands extensive data. To overcome these challenges, we developed a Center-Environment (CE) feature representation—a less data-intensive, physics-informed predefined attention mechanism. The pre-attention mechanism underlying the CE model shifts attention from complex black-box machine learning (ML) algorithms to explicit feature models with physical meaning, reducing data requirements while enhancing the transparency and interpretability of ML models. This CE-based ML approach was employed to investigate the alloying effects on the structural stability of Nb5Si3, guiding data-driven compositional design for ultra-high-temperature NbSi superalloys. The CE features leveraged the Atomic Environment Type (AET) method to characterize the local low-symmetry physical environments of atoms. The optimized CEAET models reasonably predicted double-site substitution energies in α-Nb5Si3, achieving a mean absolute error (MAE) of 329.43 meV per cell. The robust transferability of the CEAET models was demonstrated by their successful prediction of untrained β-Nb5Si3 structures. Site occupancy preferences were identified for B, Si, and Al at Si sites and for Ti, Hf, and Zr at Nb sites within β-Nb5Si3. This CE-based ML approach represents a broadly applicable and intelligent computational design method capable of handling complex crystal structures with strong transferability, even when working with small datasets.

Abstract Image

使用中心-环境特征的预定义注意聚焦机制:一种机器学习研究合金化对Nb5Si3合金稳定性的影响
利用基于图的特征与深度神经网络相结合的材料结构数字编码往往缺乏局部特异性。此外,合并自关注机制会增加体系结构的复杂性,并需要大量的数据。为了克服这些挑战,我们开发了一种中心环境(CE)特征表示——一种数据密集度较低、物理信息完备的预定义注意力机制。CE模型的前注意机制将注意力从复杂的黑箱机器学习(ML)算法转移到具有物理意义的显式特征模型,减少了数据需求,同时提高了ML模型的透明度和可解释性。采用这种基于ce的ML方法研究了合金化对Nb5Si3结构稳定性的影响,指导了数据驱动的超高温NbSi高温合金成分设计。CE特征利用原子环境类型(AET)方法来表征原子的局部低对称性物理环境。优化后的CEAET模型合理地预测了α-Nb5Si3的双位点取代能,平均绝对误差(MAE)为329.43 meV / cell。CEAET模型成功预测了未经训练的β-Nb5Si3结构,证明了CEAET模型的鲁棒可转移性。在β-Nb5Si3中,B、Si和Al在Si位点和Ti、Hf和Zr在Nb位点的占位偏好被确定。这种基于ce的ML方法代表了一种广泛适用的智能计算设计方法,能够处理具有强可转移性的复杂晶体结构,即使在处理小数据集时也是如此。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信