High-throughput calculation integrated with stacking ensemble machine learning for predicting elastic properties of refractory multi-principal element alloys
Chengchen Jin, Kai Xiong, Congtao Luo, Hui Fang, Chaoguang Pu, Hua Dai, Aimin Zhang, Shunmeng Zhang, Yingwu Wang
{"title":"High-throughput calculation integrated with stacking ensemble machine learning for predicting elastic properties of refractory multi-principal element alloys","authors":"Chengchen Jin, Kai Xiong, Congtao Luo, Hui Fang, Chaoguang Pu, Hua Dai, Aimin Zhang, Shunmeng Zhang, Yingwu Wang","doi":"10.1002/mgea.70004","DOIUrl":null,"url":null,"abstract":"<p>The traditional trial-and-error method for designing refractory multi-principal element alloys (RMPEAs) is inefficient due to a vast compositional design space and high experimental costs. To surmount this challenge, the data-driven material design based on machine learning (ML) has emerged as a critical tool for accelerating materials design. However, the absence of robust datasets impedes the exploitation of machine learning in designing novel RMPEAs. High-throughput (HTP) calculations have enabled the creation of such datasets. This study addresses these challenges by developing a data-driven framework for predicting the elastic properties of RMPEAs, integrating HTP calculations with ML. A big dataset of RMPEAs including 4536 compositions was constructed using the new proposed HTP method. A novel stacking ensemble regression algorithm combining multilayer perceptron (MLP) and gradient boosting decision tree (GBDT) was developed, which achieved 92.9% accuracy in predicting the elastic properties of Ti-V-Nb-Ta alloys. Verification experiments confirmed the ML model's accuracy and robustness. This integration of HTP calculations and ML provides a cost-effective, efficient, and precise alloy design strategy, advancing RMPEAs development.</p>","PeriodicalId":100889,"journal":{"name":"Materials Genome Engineering Advances","volume":"3 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/mgea.70004","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Materials Genome Engineering Advances","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mgea.70004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The traditional trial-and-error method for designing refractory multi-principal element alloys (RMPEAs) is inefficient due to a vast compositional design space and high experimental costs. To surmount this challenge, the data-driven material design based on machine learning (ML) has emerged as a critical tool for accelerating materials design. However, the absence of robust datasets impedes the exploitation of machine learning in designing novel RMPEAs. High-throughput (HTP) calculations have enabled the creation of such datasets. This study addresses these challenges by developing a data-driven framework for predicting the elastic properties of RMPEAs, integrating HTP calculations with ML. A big dataset of RMPEAs including 4536 compositions was constructed using the new proposed HTP method. A novel stacking ensemble regression algorithm combining multilayer perceptron (MLP) and gradient boosting decision tree (GBDT) was developed, which achieved 92.9% accuracy in predicting the elastic properties of Ti-V-Nb-Ta alloys. Verification experiments confirmed the ML model's accuracy and robustness. This integration of HTP calculations and ML provides a cost-effective, efficient, and precise alloy design strategy, advancing RMPEAs development.