机器学习结合固溶强化模型预测高熵合金硬度

IF 0.8 4区 物理与天体物理 Q3 PHYSICS, MULTIDISCIPLINARY
Zhang Yi-Fan, Ren Wei, Wang Wei-Li, Ding Shu-Jian, Li Nan, Chang Liang, Zhou Qian
{"title":"机器学习结合固溶强化模型预测高熵合金硬度","authors":"Zhang Yi-Fan, Ren Wei, Wang Wei-Li, Ding Shu-Jian, Li Nan, Chang Liang, Zhou Qian","doi":"10.7498/aps.72.20230646","DOIUrl":null,"url":null,"abstract":"Traditional material calculation methods, such as first principles and thermodynamic simulations, have accelerated the discovery of new materials. However, it is difficult for these methods to construct models flexibly based on various target properties. And they will consume plenty of computational resources while their prediction accuracy is not good. In last decade, data-driven machine learning techniques have gradually been applied in materials science, which has accumulated a large amount of theoretical and experimental data. Machine learning is able to dig out the hidden information in these data and help to predict the properties of materials. In this work, the data source was obtained through the published references. And several performance-oriented algorithms were selected to build a prediction model for the hardness of high entropy alloys. A high entropy alloy hardness dataset containing 19 candidate features was trained, tested, and evaluated using an ensemble learning algorithm: a genetic algorithm was selected to filter the 19 candidate features to obtain an optimized feature set of 8 features; a two-stage feature selection approach was then combined with a traditional solid solution strengthening theory to optimize the features, three most representative feature parameters were chosen and then used to build a Random Forest model for hardness prediction. The prediction accuracy achieved an R2 value of 0.9416 under the ten-fold cross-validation method. To better understand the prediction mechanism, solid solution strengthening theory of the alloy was used to explain the hardness differences. Further, the atomic size, electronegativity and modulus mismatch features were found to have very important effects on the solid solution strengthening of high entropy alloys when using genetic algorithms for feature selection. The machine learning algorithm and features were also further used for prediction of solid solution strengthening properties, resulting in an R2 of 0.8811 using the ten-fold cross-validation method. These screened-out parameters have good transferability for various high entropy alloy system. In view of the poor interpretability of the random forest algorithm, the SHAP interpretable machine learning method was used to dig out the internal reasoning logic of established machine learning model and clarify the mechanism of the influence of each feature on hardness. Especially, the valence electron concentration is found to have the most significant weakening effect on the hardness of high entropy alloys.","PeriodicalId":6995,"journal":{"name":"物理学报","volume":"4 1","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Machine learning combined with solid solution strengthening model to predict hardness of high entropy alloys\",\"authors\":\"Zhang Yi-Fan, Ren Wei, Wang Wei-Li, Ding Shu-Jian, Li Nan, Chang Liang, Zhou Qian\",\"doi\":\"10.7498/aps.72.20230646\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional material calculation methods, such as first principles and thermodynamic simulations, have accelerated the discovery of new materials. However, it is difficult for these methods to construct models flexibly based on various target properties. And they will consume plenty of computational resources while their prediction accuracy is not good. In last decade, data-driven machine learning techniques have gradually been applied in materials science, which has accumulated a large amount of theoretical and experimental data. Machine learning is able to dig out the hidden information in these data and help to predict the properties of materials. In this work, the data source was obtained through the published references. And several performance-oriented algorithms were selected to build a prediction model for the hardness of high entropy alloys. A high entropy alloy hardness dataset containing 19 candidate features was trained, tested, and evaluated using an ensemble learning algorithm: a genetic algorithm was selected to filter the 19 candidate features to obtain an optimized feature set of 8 features; a two-stage feature selection approach was then combined with a traditional solid solution strengthening theory to optimize the features, three most representative feature parameters were chosen and then used to build a Random Forest model for hardness prediction. The prediction accuracy achieved an R2 value of 0.9416 under the ten-fold cross-validation method. To better understand the prediction mechanism, solid solution strengthening theory of the alloy was used to explain the hardness differences. Further, the atomic size, electronegativity and modulus mismatch features were found to have very important effects on the solid solution strengthening of high entropy alloys when using genetic algorithms for feature selection. The machine learning algorithm and features were also further used for prediction of solid solution strengthening properties, resulting in an R2 of 0.8811 using the ten-fold cross-validation method. These screened-out parameters have good transferability for various high entropy alloy system. In view of the poor interpretability of the random forest algorithm, the SHAP interpretable machine learning method was used to dig out the internal reasoning logic of established machine learning model and clarify the mechanism of the influence of each feature on hardness. Especially, the valence electron concentration is found to have the most significant weakening effect on the hardness of high entropy alloys.\",\"PeriodicalId\":6995,\"journal\":{\"name\":\"物理学报\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"物理学报\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.7498/aps.72.20230646\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHYSICS, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"物理学报","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.7498/aps.72.20230646","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 1

摘要

传统的材料计算方法,如第一性原理和热力学模拟,加速了新材料的发现。然而,这些方法难以根据目标的不同属性灵活地构建模型。这种方法会消耗大量的计算资源,而且预测精度不高。近十年来,数据驱动的机器学习技术逐渐在材料科学领域得到应用,积累了大量的理论和实验数据。机器学习能够挖掘出这些数据中隐藏的信息,并帮助预测材料的性能。在这项工作中,数据源是通过已发表的参考文献获得的。选择了几种面向性能的算法,建立了高熵合金硬度预测模型。采用集成学习算法对包含19个候选特征的高熵合金硬度数据集进行训练、测试和评估:选择遗传算法对19个候选特征进行过滤,得到8个特征的优化特征集;然后,将两阶段特征选择方法与传统的固溶强化理论相结合,对特征进行优化,选择3个最具代表性的特征参数,构建随机森林模型进行硬度预测。10倍交叉验证法预测精度R2值为0.9416。为了更好地理解预测机理,采用合金的固溶强化理论来解释硬度差异。此外,采用遗传算法进行特征选择时,发现原子尺寸、电负性和模量失配特征对高熵合金的固溶强化有重要影响。机器学习算法和特征也进一步用于预测固溶体强化性能,使用十倍交叉验证方法得到R2为0.8811。筛选出的参数对各种高熵合金体系具有良好的可移植性。针对随机森林算法可解释性较差的问题,采用SHAP可解释性机器学习方法,挖掘所建立机器学习模型的内部推理逻辑,阐明各特征对硬度的影响机制。其中,价电子浓度对高熵合金硬度的削弱作用最为显著。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine learning combined with solid solution strengthening model to predict hardness of high entropy alloys
Traditional material calculation methods, such as first principles and thermodynamic simulations, have accelerated the discovery of new materials. However, it is difficult for these methods to construct models flexibly based on various target properties. And they will consume plenty of computational resources while their prediction accuracy is not good. In last decade, data-driven machine learning techniques have gradually been applied in materials science, which has accumulated a large amount of theoretical and experimental data. Machine learning is able to dig out the hidden information in these data and help to predict the properties of materials. In this work, the data source was obtained through the published references. And several performance-oriented algorithms were selected to build a prediction model for the hardness of high entropy alloys. A high entropy alloy hardness dataset containing 19 candidate features was trained, tested, and evaluated using an ensemble learning algorithm: a genetic algorithm was selected to filter the 19 candidate features to obtain an optimized feature set of 8 features; a two-stage feature selection approach was then combined with a traditional solid solution strengthening theory to optimize the features, three most representative feature parameters were chosen and then used to build a Random Forest model for hardness prediction. The prediction accuracy achieved an R2 value of 0.9416 under the ten-fold cross-validation method. To better understand the prediction mechanism, solid solution strengthening theory of the alloy was used to explain the hardness differences. Further, the atomic size, electronegativity and modulus mismatch features were found to have very important effects on the solid solution strengthening of high entropy alloys when using genetic algorithms for feature selection. The machine learning algorithm and features were also further used for prediction of solid solution strengthening properties, resulting in an R2 of 0.8811 using the ten-fold cross-validation method. These screened-out parameters have good transferability for various high entropy alloy system. In view of the poor interpretability of the random forest algorithm, the SHAP interpretable machine learning method was used to dig out the internal reasoning logic of established machine learning model and clarify the mechanism of the influence of each feature on hardness. Especially, the valence electron concentration is found to have the most significant weakening effect on the hardness of high entropy alloys.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
物理学报
物理学报 物理-物理:综合
CiteScore
1.70
自引率
30.00%
发文量
31245
审稿时长
1.9 months
期刊介绍: Acta Physica Sinica (Acta Phys. Sin.) is supervised by Chinese Academy of Sciences and sponsored by Chinese Physical Society and Institute of Physics, Chinese Academy of Sciences. Published by Chinese Physical Society and launched in 1933, it is a semimonthly journal with about 40 articles per issue. It publishes original and top quality research papers, rapid communications and reviews in all branches of physics in Chinese. Acta Phys. Sin. enjoys high reputation among Chinese physics journals and plays a key role in bridging China and rest of the world in physics research. Specific areas of interest include: Condensed matter and materials physics; Atomic, molecular, and optical physics; Statistical, nonlinear, and soft matter physics; Plasma physics; Interdisciplinary physics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信