Machine Learning Models for Efficient Property Prediction of ABX3 Materials: A High-Throughput Approach

IF 3.7 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Soundous Touati, Ali Benghia, Zoulikha Hebboul, Ibn Khaldoun Lefkaier, Mohammed Benali Kanoun and Souraya Goumri-Said*, 
{"title":"Machine Learning Models for Efficient Property Prediction of ABX3 Materials: A High-Throughput Approach","authors":"Soundous Touati,&nbsp;Ali Benghia,&nbsp;Zoulikha Hebboul,&nbsp;Ibn Khaldoun Lefkaier,&nbsp;Mohammed Benali Kanoun and Souraya Goumri-Said*,&nbsp;","doi":"10.1021/acsomega.4c0613910.1021/acsomega.4c06139","DOIUrl":null,"url":null,"abstract":"<p >Recently, ABX<sub>3</sub> materials have garnered significant attention due to their diverse applications in photovoltaics, catalysis, and optoelectronics as well as their remarkable efficiency in energy conversion. However, progress has been somewhat slow due to the high expenses of the experiment or the time-consuming density functional theory (DFT) calculation. In this study, we utilized the extreme gradient boosting (XGBoost) algorithm to facilitate the discovery and characterization of ABX<sub>3</sub> compounds based on vast data sets generated by DFT calculations. While the XGBoost algorithm provides a powerful tool for accelerating the discovery of ABX<sub>3</sub> compounds, it is crucial to acknowledge that different DFT approximation levels can significantly impact the predicted band gaps, potentially introducing discrepancies when compared with experimental values. In the first step, we predict the space group of 13947 oxides and halides using the Open Quantum Materials Database and elemental features. Our analysis yields classification accuracies ranging from 82.39% to 99.14% across these materials. Following this, XGBoost regression algorithms are employed to interrogate the data set, enabling predictions of volume (achieving an optimal accuracy of 98.41%, with a mean absolute error (MAE) of 2.395 Å<sup>3</sup> and a root-mean-square error (RMSE) of 4.416 Å<sup>3</sup>), formation energy (an optimal accuracy of 97.36%, with an MAE of 0.075 eV/atom and an RMSE of 0.132 eV/atom), and band gap energy (an optimal accuracy of 87.00%, an MAE of 0.391 eV, and an RMSE of 0.574 eV). Finally, these prediction models are employed to identify the possible space groups for each of the 1252 new ABX<sub>3</sub> formulas. Then, we predict the volume, the formation energy, and the band gap energy for each candidate space group. Through these predictive models, machine learning accelerates the exploration of new materials with enhanced performance and functionality.</p>","PeriodicalId":22,"journal":{"name":"ACS Omega","volume":"9 48","pages":"47519–47531 47519–47531"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsomega.4c06139","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Omega","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsomega.4c06139","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, ABX3 materials have garnered significant attention due to their diverse applications in photovoltaics, catalysis, and optoelectronics as well as their remarkable efficiency in energy conversion. However, progress has been somewhat slow due to the high expenses of the experiment or the time-consuming density functional theory (DFT) calculation. In this study, we utilized the extreme gradient boosting (XGBoost) algorithm to facilitate the discovery and characterization of ABX3 compounds based on vast data sets generated by DFT calculations. While the XGBoost algorithm provides a powerful tool for accelerating the discovery of ABX3 compounds, it is crucial to acknowledge that different DFT approximation levels can significantly impact the predicted band gaps, potentially introducing discrepancies when compared with experimental values. In the first step, we predict the space group of 13947 oxides and halides using the Open Quantum Materials Database and elemental features. Our analysis yields classification accuracies ranging from 82.39% to 99.14% across these materials. Following this, XGBoost regression algorithms are employed to interrogate the data set, enabling predictions of volume (achieving an optimal accuracy of 98.41%, with a mean absolute error (MAE) of 2.395 Å3 and a root-mean-square error (RMSE) of 4.416 Å3), formation energy (an optimal accuracy of 97.36%, with an MAE of 0.075 eV/atom and an RMSE of 0.132 eV/atom), and band gap energy (an optimal accuracy of 87.00%, an MAE of 0.391 eV, and an RMSE of 0.574 eV). Finally, these prediction models are employed to identify the possible space groups for each of the 1252 new ABX3 formulas. Then, we predict the volume, the formation energy, and the band gap energy for each candidate space group. Through these predictive models, machine learning accelerates the exploration of new materials with enhanced performance and functionality.

求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Omega
ACS Omega Chemical Engineering-General Chemical Engineering
CiteScore
6.60
自引率
4.90%
发文量
3945
审稿时长
2.4 months
期刊介绍: ACS Omega is an open-access global publication for scientific articles that describe new findings in chemistry and interfacing areas of science, without any perceived evaluation of immediate impact.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信