Identifying the best ML model for predicting the bandgap in a perovskite solar cell†

Nita Samantaray, Arjun Singh and Anu Tonk
{"title":"Identifying the best ML model for predicting the bandgap in a perovskite solar cell†","authors":"Nita Samantaray, Arjun Singh and Anu Tonk","doi":"10.1039/D4SU00370E","DOIUrl":null,"url":null,"abstract":"<p >Perovskite solar cells (PSCs) have gained attention for their characteristics of high efficiency and commercial viability. However, the efficiency of a PSC depends on various factors. One such important parameter is the bandgap of the active layer as it plays an important role in PSCs with regards to the amount of light absorption. Thus, it influences the overall performance of the solar cell. It is important to predict the bandgap of the active layer in PSCs to achieve an effective fabrication process. In this study, we compared six machine learning (ML) models to predict the bandgap. The models were created using a dataset of 500 devices, such as MAPbI<small><sub>3</sub></small>, FAPbI<small><sub>3</sub></small>, CsSnI<small><sub>3</sub></small> and CsMAPbI<small><sub>3</sub></small>, obtained from The Perovskite Database Project. These models were further validated using a different dataset of 50 devices. The models were created using ML methods: random forest, gradient boosting regressor, k-nearest neighbours (KNN), AdaBoost, Gaussian process regressor, and bagging. The feature parameters considered for the models were the A coefficient, B coefficient, and C coefficient, out of various other parameters such as the perovskite dimension, perovskite thickness, perovskite deposition temperature, and perovskite deposition time. The random forest model showed better results compared to other models with a low mean absolute error (MAE) of 0.000775, low mean squared error (MSE) of 0.00000920, and high coefficient of determination (<em>r</em><small><sup>2</sup></small>) of 0.9994.</p>","PeriodicalId":74745,"journal":{"name":"RSC sustainability","volume":" 11","pages":" 3520-3524"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/su/d4su00370e?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"RSC sustainability","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2024/su/d4su00370e","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Perovskite solar cells (PSCs) have gained attention for their characteristics of high efficiency and commercial viability. However, the efficiency of a PSC depends on various factors. One such important parameter is the bandgap of the active layer as it plays an important role in PSCs with regards to the amount of light absorption. Thus, it influences the overall performance of the solar cell. It is important to predict the bandgap of the active layer in PSCs to achieve an effective fabrication process. In this study, we compared six machine learning (ML) models to predict the bandgap. The models were created using a dataset of 500 devices, such as MAPbI3, FAPbI3, CsSnI3 and CsMAPbI3, obtained from The Perovskite Database Project. These models were further validated using a different dataset of 50 devices. The models were created using ML methods: random forest, gradient boosting regressor, k-nearest neighbours (KNN), AdaBoost, Gaussian process regressor, and bagging. The feature parameters considered for the models were the A coefficient, B coefficient, and C coefficient, out of various other parameters such as the perovskite dimension, perovskite thickness, perovskite deposition temperature, and perovskite deposition time. The random forest model showed better results compared to other models with a low mean absolute error (MAE) of 0.000775, low mean squared error (MSE) of 0.00000920, and high coefficient of determination (r2) of 0.9994.

Abstract Image

确定预测过氧化物太阳能电池带隙的最佳 ML 模型†。
过氧化物太阳能电池(PSC)因其高效率和商业可行性的特点而备受关注。然而,PSC 的效率取决于多种因素。其中一个重要参数是有源层的带隙,因为它在 PSC 的光吸收量方面发挥着重要作用。因此,它影响着太阳能电池的整体性能。预测 PSC 中有源层的带隙对于实现有效的制造工艺非常重要。在本研究中,我们比较了六种机器学习(ML)模型来预测带隙。这些模型是利用从包晶体数据库项目获得的 500 个器件数据集创建的,这些器件包括 MAPbI3、FAPbI3、CsSnI3 和 CsMAPbI3。这些模型使用不同的 50 个器件数据集进行了进一步验证。这些模型是使用多模型方法创建的:随机森林、梯度提升回归器、k-近邻(KNN)、AdaBoost、高斯过程回归器和袋集。这些模型考虑的特征参数是 A 系数、B 系数和 C 系数,以及其他各种参数,如包晶石尺寸、包晶石厚度、包晶石沉积温度和包晶石沉积时间。与其他模型相比,随机森林模型显示出更好的结果,平均绝对误差(MAE)低至 0.000775,平均平方误差(MSE)低至 0.00000920,判定系数(r2)高达 0.9994。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信