{"title":"基于机器学习和片段分子指纹的有机太阳能电池供体和非富勒烯受体设计","authors":"Cai-Rong Zhang, Rui Cao, Xiao-Meng Liu, Mei-Ling Zhang, Ji-Jun Gong, Zi-Jiang Liu, You-Zhi Wu, Hong-Shan Chen","doi":"10.1002/solr.202400846","DOIUrl":null,"url":null,"abstract":"<p>The molecular structures and properties of donor and acceptor materials for organic solar cells (OSCs) determine their photovoltaic performance; however, the complex relationship between them has hindered the development of OSC materials. To study this, we constructed the database comprising 544 donor and non-fullerene acceptor pairs. Based on the principle of minimal rings and molecular units, each molecule in the database is cut into different fragments and defined as a new molecular fingerprint, where each bit corresponds to a fragment number in the molecule. Accordingly, the defined molecular fingerprint length is 234 and 723 bits for donors and acceptors, respectively. Random forest and extreme tree regression (ETR) are applied to predict the photovoltaic parameters, with ETR being the most effective. Through SHapley Additive exPlanations (SHAP) importance analysis, eight (10) important donor (acceptor) fragments are identified. Furthermore, by computing the cut fragment similarities with that of the important fragments obtained from SHAP analysis, fragments with similarity exceeding 0.6 are collected in order to design new molecules. By assembling the collected fragments, we designed 21 168 D-<i>π</i>-A-<i>π</i>-type donors and 1 156 400 A-<i>π</i>-D-<i>π</i>-A-type nonfullerene acceptors, generating 24 478 675 200 donor–acceptor pairs. Based on predictions using the trained ETR model, the highest power conversion efficiency reaches 13.2%.</p>","PeriodicalId":230,"journal":{"name":"Solar RRL","volume":"9 4","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing Donors and Nonfullerene Acceptors for Organic Solar Cells Assisted by Machine Learning and Fragment-Based Molecular Fingerprints\",\"authors\":\"Cai-Rong Zhang, Rui Cao, Xiao-Meng Liu, Mei-Ling Zhang, Ji-Jun Gong, Zi-Jiang Liu, You-Zhi Wu, Hong-Shan Chen\",\"doi\":\"10.1002/solr.202400846\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The molecular structures and properties of donor and acceptor materials for organic solar cells (OSCs) determine their photovoltaic performance; however, the complex relationship between them has hindered the development of OSC materials. To study this, we constructed the database comprising 544 donor and non-fullerene acceptor pairs. Based on the principle of minimal rings and molecular units, each molecule in the database is cut into different fragments and defined as a new molecular fingerprint, where each bit corresponds to a fragment number in the molecule. Accordingly, the defined molecular fingerprint length is 234 and 723 bits for donors and acceptors, respectively. Random forest and extreme tree regression (ETR) are applied to predict the photovoltaic parameters, with ETR being the most effective. Through SHapley Additive exPlanations (SHAP) importance analysis, eight (10) important donor (acceptor) fragments are identified. Furthermore, by computing the cut fragment similarities with that of the important fragments obtained from SHAP analysis, fragments with similarity exceeding 0.6 are collected in order to design new molecules. By assembling the collected fragments, we designed 21 168 D-<i>π</i>-A-<i>π</i>-type donors and 1 156 400 A-<i>π</i>-D-<i>π</i>-A-type nonfullerene acceptors, generating 24 478 675 200 donor–acceptor pairs. Based on predictions using the trained ETR model, the highest power conversion efficiency reaches 13.2%.</p>\",\"PeriodicalId\":230,\"journal\":{\"name\":\"Solar RRL\",\"volume\":\"9 4\",\"pages\":\"\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-01-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Solar RRL\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/solr.202400846\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Solar RRL","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/solr.202400846","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
Designing Donors and Nonfullerene Acceptors for Organic Solar Cells Assisted by Machine Learning and Fragment-Based Molecular Fingerprints
The molecular structures and properties of donor and acceptor materials for organic solar cells (OSCs) determine their photovoltaic performance; however, the complex relationship between them has hindered the development of OSC materials. To study this, we constructed the database comprising 544 donor and non-fullerene acceptor pairs. Based on the principle of minimal rings and molecular units, each molecule in the database is cut into different fragments and defined as a new molecular fingerprint, where each bit corresponds to a fragment number in the molecule. Accordingly, the defined molecular fingerprint length is 234 and 723 bits for donors and acceptors, respectively. Random forest and extreme tree regression (ETR) are applied to predict the photovoltaic parameters, with ETR being the most effective. Through SHapley Additive exPlanations (SHAP) importance analysis, eight (10) important donor (acceptor) fragments are identified. Furthermore, by computing the cut fragment similarities with that of the important fragments obtained from SHAP analysis, fragments with similarity exceeding 0.6 are collected in order to design new molecules. By assembling the collected fragments, we designed 21 168 D-π-A-π-type donors and 1 156 400 A-π-D-π-A-type nonfullerene acceptors, generating 24 478 675 200 donor–acceptor pairs. Based on predictions using the trained ETR model, the highest power conversion efficiency reaches 13.2%.
Solar RRLPhysics and Astronomy-Atomic and Molecular Physics, and Optics
CiteScore
12.10
自引率
6.30%
发文量
460
期刊介绍:
Solar RRL, formerly known as Rapid Research Letters, has evolved to embrace a broader and more encompassing format. We publish Research Articles and Reviews covering all facets of solar energy conversion. This includes, but is not limited to, photovoltaics and solar cells (both established and emerging systems), as well as the development, characterization, and optimization of materials and devices. Additionally, we cover topics such as photovoltaic modules and systems, their installation and deployment, photocatalysis, solar fuels, photothermal and photoelectrochemical solar energy conversion, energy distribution, grid issues, and other relevant aspects. Join us in exploring the latest advancements in solar energy conversion research.