High-Accuracy Bandgap Prediction and Classification in Hybrid and Inorganic Halide Perovskites Using Advanced Machine Learning Techniques

IF 4.3 3区 工程技术 Q2 ENERGY & FUELS
Alireza Sabagh Moeini, Fatemeh Shariatmadar Tehrani, Alireza Naeimi-Sadigh
{"title":"High-Accuracy Bandgap Prediction and Classification in Hybrid and Inorganic Halide Perovskites Using Advanced Machine Learning Techniques","authors":"Alireza Sabagh Moeini,&nbsp;Fatemeh Shariatmadar Tehrani,&nbsp;Alireza Naeimi-Sadigh","doi":"10.1155/er/1215175","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Hybrid and inorganic halide perovskites (HP) have garnered significant attention for their applications in solar cells, LEDs, and sensors due to their exceptional electronic and optical properties. The accurate prediction and classification of bandgaps in these materials are crucial for advancing their technological potential. Traditional methods like Density Functional Theory (DFT) are computationally expensive, motivating the use of machine learning (ML) as a faster and more efficient alternative. In this study, we analyze 7382 hybrid and inorganic HP using a diverse set of ML models to classify materials based on whether they exhibit zero or nonzero bandgaps, and to predict their bandgap values. For regression tasks, AdaBoost Regressor (ABR), decision tree regressor (DTR), and gradient boosting regressor (GBR) were employed, while gradient boosting machines (GBM), decision tree (DT), and Multilayer Perceptron (MLP) were used for classification. Evaluation metrics for prediction included mean absolute error (MAE), mean squared error (MSE), and the <i>R</i><sup>2</sup>. For classification, metrics, such as accuracy, precision, recall, F1-score, area under the ROC curve (AUC-ROC), and area under the precision-recall curve (AUC-PR) were utilized. Results indicate that ABR achieved the highest prediction accuracy (MSE ≈ 0.074 eV, MAE ≈ 0.088 eV, <i>R</i><sup>2</sup> ≈ 91.1% for direct bandgaps; MSE ≈ 0.041 eV, MAE ≈ 0.076 eV, <i>R</i><sup>2</sup> ≈ 93.4% for indirect bandgaps). In classification, the GBM model outperformed others, achieving 96% and 97% accuracy for direct and indirect bandgaps, respectively. Feature analysis revealed that elemental properties, such as valence and group of constituent elements, particularly their mean and standard deviation, play a dominant role in bandgap determination. These findings highlight the potential of ML-driven approaches in accelerating perovskite material discovery and optimizing their electronic properties for future optoelectronic applications.</p>\n </div>","PeriodicalId":14051,"journal":{"name":"International Journal of Energy Research","volume":"2025 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/er/1215175","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Energy Research","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/er/1215175","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

Hybrid and inorganic halide perovskites (HP) have garnered significant attention for their applications in solar cells, LEDs, and sensors due to their exceptional electronic and optical properties. The accurate prediction and classification of bandgaps in these materials are crucial for advancing their technological potential. Traditional methods like Density Functional Theory (DFT) are computationally expensive, motivating the use of machine learning (ML) as a faster and more efficient alternative. In this study, we analyze 7382 hybrid and inorganic HP using a diverse set of ML models to classify materials based on whether they exhibit zero or nonzero bandgaps, and to predict their bandgap values. For regression tasks, AdaBoost Regressor (ABR), decision tree regressor (DTR), and gradient boosting regressor (GBR) were employed, while gradient boosting machines (GBM), decision tree (DT), and Multilayer Perceptron (MLP) were used for classification. Evaluation metrics for prediction included mean absolute error (MAE), mean squared error (MSE), and the R2. For classification, metrics, such as accuracy, precision, recall, F1-score, area under the ROC curve (AUC-ROC), and area under the precision-recall curve (AUC-PR) were utilized. Results indicate that ABR achieved the highest prediction accuracy (MSE ≈ 0.074 eV, MAE ≈ 0.088 eV, R2 ≈ 91.1% for direct bandgaps; MSE ≈ 0.041 eV, MAE ≈ 0.076 eV, R2 ≈ 93.4% for indirect bandgaps). In classification, the GBM model outperformed others, achieving 96% and 97% accuracy for direct and indirect bandgaps, respectively. Feature analysis revealed that elemental properties, such as valence and group of constituent elements, particularly their mean and standard deviation, play a dominant role in bandgap determination. These findings highlight the potential of ML-driven approaches in accelerating perovskite material discovery and optimizing their electronic properties for future optoelectronic applications.

Abstract Image

利用先进的机器学习技术对杂化和无机卤化物钙钛矿进行高精度带隙预测和分类
混合和无机卤化物钙钛矿(HP)由于其特殊的电子和光学特性,在太阳能电池,led和传感器中的应用引起了极大的关注。这些材料中带隙的准确预测和分类对于提高其技术潜力至关重要。像密度泛函理论(DFT)这样的传统方法在计算上是昂贵的,这促使使用机器学习(ML)作为更快、更有效的替代方法。在这项研究中,我们使用一组不同的ML模型来分析7382混合和无机HP,根据它们是否表现出零或非零带隙来分类材料,并预测它们的带隙值。回归任务采用AdaBoost回归器(ABR)、决策树回归器(DTR)和梯度增强回归器(GBR),分类任务采用梯度增强机(GBM)、决策树(DT)和多层感知器(MLP)。预测的评价指标包括平均绝对误差(MAE)、均方误差(MSE)和R2。分类采用正确率、精密度、召回率、f1评分、ROC曲线下面积(AUC-ROC)和精密度-召回率曲线下面积(AUC-PR)等指标。结果表明,ABR对直接带隙的预测精度最高,MSE≈0.074 eV, MAE≈0.088 eV, R2≈91.1%;间接带隙MSE≈0.041 eV, MAE≈0.076 eV, R2≈93.4%)。在分类方面,GBM模型优于其他模型,对直接带隙和间接带隙的准确率分别达到96%和97%。特征分析表明,元素性质,如组成元素的价和族,特别是它们的平均值和标准差,在带隙测定中起主导作用。这些发现突出了机器学习驱动方法在加速钙钛矿材料发现和优化其电子特性以用于未来光电应用方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Energy Research
International Journal of Energy Research 工程技术-核科学技术
CiteScore
9.80
自引率
8.70%
发文量
1170
审稿时长
3.1 months
期刊介绍: The International Journal of Energy Research (IJER) is dedicated to providing a multidisciplinary, unique platform for researchers, scientists, engineers, technology developers, planners, and policy makers to present their research results and findings in a compelling manner on novel energy systems and applications. IJER covers the entire spectrum of energy from production to conversion, conservation, management, systems, technologies, etc. We encourage papers submissions aiming at better efficiency, cost improvements, more effective resource use, improved design and analysis, reduced environmental impact, and hence leading to better sustainability. IJER is concerned with the development and exploitation of both advanced traditional and new energy sources, systems, technologies and applications. Interdisciplinary subjects in the area of novel energy systems and applications are also encouraged. High-quality research papers are solicited in, but are not limited to, the following areas with innovative and novel contents: -Biofuels and alternatives -Carbon capturing and storage technologies -Clean coal technologies -Energy conversion, conservation and management -Energy storage -Energy systems -Hybrid/combined/integrated energy systems for multi-generation -Hydrogen energy and fuel cells -Hydrogen production technologies -Micro- and nano-energy systems and technologies -Nuclear energy -Renewable energies (e.g. geothermal, solar, wind, hydro, tidal, wave, biomass) -Smart energy system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信