Zhang Yi-Fan, Ren Wei, Wang Wei-Li, Ding Shu-Jian, Li Nan, Chang Liang, Zhou Qian
{"title":"Machine learning combined with solid solution strengthening model to predict hardness of high entropy alloys","authors":"Zhang Yi-Fan, Ren Wei, Wang Wei-Li, Ding Shu-Jian, Li Nan, Chang Liang, Zhou Qian","doi":"10.7498/aps.72.20230646","DOIUrl":null,"url":null,"abstract":"Traditional material calculation methods, such as first principles and thermodynamic simulations, have accelerated the discovery of new materials. However, it is difficult for these methods to construct models flexibly based on various target properties. And they will consume plenty of computational resources while their prediction accuracy is not good. In last decade, data-driven machine learning techniques have gradually been applied in materials science, which has accumulated a large amount of theoretical and experimental data. Machine learning is able to dig out the hidden information in these data and help to predict the properties of materials. In this work, the data source was obtained through the published references. And several performance-oriented algorithms were selected to build a prediction model for the hardness of high entropy alloys. A high entropy alloy hardness dataset containing 19 candidate features was trained, tested, and evaluated using an ensemble learning algorithm: a genetic algorithm was selected to filter the 19 candidate features to obtain an optimized feature set of 8 features; a two-stage feature selection approach was then combined with a traditional solid solution strengthening theory to optimize the features, three most representative feature parameters were chosen and then used to build a Random Forest model for hardness prediction. The prediction accuracy achieved an R2 value of 0.9416 under the ten-fold cross-validation method. To better understand the prediction mechanism, solid solution strengthening theory of the alloy was used to explain the hardness differences. Further, the atomic size, electronegativity and modulus mismatch features were found to have very important effects on the solid solution strengthening of high entropy alloys when using genetic algorithms for feature selection. The machine learning algorithm and features were also further used for prediction of solid solution strengthening properties, resulting in an R2 of 0.8811 using the ten-fold cross-validation method. These screened-out parameters have good transferability for various high entropy alloy system. In view of the poor interpretability of the random forest algorithm, the SHAP interpretable machine learning method was used to dig out the internal reasoning logic of established machine learning model and clarify the mechanism of the influence of each feature on hardness. Especially, the valence electron concentration is found to have the most significant weakening effect on the hardness of high entropy alloys.","PeriodicalId":6995,"journal":{"name":"物理学报","volume":"4 1","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"物理学报","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.7498/aps.72.20230646","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 1
Abstract
Traditional material calculation methods, such as first principles and thermodynamic simulations, have accelerated the discovery of new materials. However, it is difficult for these methods to construct models flexibly based on various target properties. And they will consume plenty of computational resources while their prediction accuracy is not good. In last decade, data-driven machine learning techniques have gradually been applied in materials science, which has accumulated a large amount of theoretical and experimental data. Machine learning is able to dig out the hidden information in these data and help to predict the properties of materials. In this work, the data source was obtained through the published references. And several performance-oriented algorithms were selected to build a prediction model for the hardness of high entropy alloys. A high entropy alloy hardness dataset containing 19 candidate features was trained, tested, and evaluated using an ensemble learning algorithm: a genetic algorithm was selected to filter the 19 candidate features to obtain an optimized feature set of 8 features; a two-stage feature selection approach was then combined with a traditional solid solution strengthening theory to optimize the features, three most representative feature parameters were chosen and then used to build a Random Forest model for hardness prediction. The prediction accuracy achieved an R2 value of 0.9416 under the ten-fold cross-validation method. To better understand the prediction mechanism, solid solution strengthening theory of the alloy was used to explain the hardness differences. Further, the atomic size, electronegativity and modulus mismatch features were found to have very important effects on the solid solution strengthening of high entropy alloys when using genetic algorithms for feature selection. The machine learning algorithm and features were also further used for prediction of solid solution strengthening properties, resulting in an R2 of 0.8811 using the ten-fold cross-validation method. These screened-out parameters have good transferability for various high entropy alloy system. In view of the poor interpretability of the random forest algorithm, the SHAP interpretable machine learning method was used to dig out the internal reasoning logic of established machine learning model and clarify the mechanism of the influence of each feature on hardness. Especially, the valence electron concentration is found to have the most significant weakening effect on the hardness of high entropy alloys.
期刊介绍:
Acta Physica Sinica (Acta Phys. Sin.) is supervised by Chinese Academy of Sciences and sponsored by Chinese Physical Society and Institute of Physics, Chinese Academy of Sciences. Published by Chinese Physical Society and launched in 1933, it is a semimonthly journal with about 40 articles per issue.
It publishes original and top quality research papers, rapid communications and reviews in all branches of physics in Chinese. Acta Phys. Sin. enjoys high reputation among Chinese physics journals and plays a key role in bridging China and rest of the world in physics research. Specific areas of interest include: Condensed matter and materials physics; Atomic, molecular, and optical physics; Statistical, nonlinear, and soft matter physics; Plasma physics; Interdisciplinary physics.