Advancements in solar spectral irradiance modelling for photovoltaic systems: A machine learning approach utilising on-site data

Haoxiang Zhang , Sunny Chaudhary , Carlos D. Rodríguez-Gallegos , Tasmiat Rahman
{"title":"Advancements in solar spectral irradiance modelling for photovoltaic systems: A machine learning approach utilising on-site data","authors":"Haoxiang Zhang ,&nbsp;Sunny Chaudhary ,&nbsp;Carlos D. Rodríguez-Gallegos ,&nbsp;Tasmiat Rahman","doi":"10.1016/j.nxener.2025.100320","DOIUrl":null,"url":null,"abstract":"<div><div>Energy yield estimation for photovoltaics (PV) plays a crucial role in the growth of renewable energy. To reduce uncertainty in these estimations, having a spectral resolved irradiance is key. In the field of PV, radiative transfer models (RTMs) and spectroradiometers are commonly utilised to determine spectral solar irradiance, which is crucial for assessing spectral effects. However, these methodologies have inherent limitations; RTMs require precise and complex inputs of aerosol and meteorological data, while spectroradiometers entail significant costs. With the advancement of machine learning (ML) techniques, a data-driven spectral irradiance model is proposed in this study, which only requires the global horizontal irradiance (<em>GHI</em>) measured by pyranometer and the reference cell as input. Spectral data and meteorological data collected by Solar Energy Research Institute of Singapore (SERIS) at four sites across three continents are used for the training and testing of our models. We examined the viability on spectra modelling of three ML techniques including Long Short-Term Memory networks (LSTM), Random Forest (RF) algorithms and Extreme Gradient Boost (XGBoost). XGBoost achieves relatively good accuracy; additionally, the computational cost is much lower compared to LSTM and RF. The proposed ML model shows an overall <em>R</em><sup>2</sup> of 0.974 in comparison with 0.646 of the SMARTS model in the spectrum range 350.4–1052.4 nm. The ML models outperform the SMARTS model particularly under intermediate and overcast conditions. We have also shown that a model trained on data from a specific site cannot be effectively applied to other locations.</div></div>","PeriodicalId":100957,"journal":{"name":"Next Energy","volume":"8 ","pages":"Article 100320"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Next Energy","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949821X25000833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Energy yield estimation for photovoltaics (PV) plays a crucial role in the growth of renewable energy. To reduce uncertainty in these estimations, having a spectral resolved irradiance is key. In the field of PV, radiative transfer models (RTMs) and spectroradiometers are commonly utilised to determine spectral solar irradiance, which is crucial for assessing spectral effects. However, these methodologies have inherent limitations; RTMs require precise and complex inputs of aerosol and meteorological data, while spectroradiometers entail significant costs. With the advancement of machine learning (ML) techniques, a data-driven spectral irradiance model is proposed in this study, which only requires the global horizontal irradiance (GHI) measured by pyranometer and the reference cell as input. Spectral data and meteorological data collected by Solar Energy Research Institute of Singapore (SERIS) at four sites across three continents are used for the training and testing of our models. We examined the viability on spectra modelling of three ML techniques including Long Short-Term Memory networks (LSTM), Random Forest (RF) algorithms and Extreme Gradient Boost (XGBoost). XGBoost achieves relatively good accuracy; additionally, the computational cost is much lower compared to LSTM and RF. The proposed ML model shows an overall R2 of 0.974 in comparison with 0.646 of the SMARTS model in the spectrum range 350.4–1052.4 nm. The ML models outperform the SMARTS model particularly under intermediate and overcast conditions. We have also shown that a model trained on data from a specific site cannot be effectively applied to other locations.
光伏系统太阳光谱辐照度建模的进展:利用现场数据的机器学习方法
光伏发电的发电量估算对可再生能源的发展起着至关重要的作用。为了减少这些估计的不确定性,具有光谱分辨辐照度是关键。在PV领域,通常使用辐射传输模型(RTMs)和光谱辐射计来确定光谱太阳辐照度,这对于评估光谱效应至关重要。然而,这些方法有其固有的局限性;rtm需要精确和复杂的气溶胶和气象数据输入,而光谱辐射计则需要大量成本。随着机器学习(ML)技术的进步,本研究提出了一种数据驱动的光谱辐照度模型,该模型只需要由辐射计和参考单元测量的全球水平辐照度(GHI)作为输入。新加坡太阳能研究所(SERIS)在三大洲的四个地点收集的光谱数据和气象数据用于我们的模型的训练和测试。我们研究了长短期记忆网络(LSTM)、随机森林(RF)算法和极限梯度增强(XGBoost)三种机器学习技术在光谱建模上的可行性。XGBoost实现了相对较好的精度;此外,与LSTM和RF相比,计算成本要低得多。在350.4-1052.4 nm光谱范围内,与SMARTS模型的0.646相比,本文提出的ML模型的总体R2为0.974。ML模型优于SMARTS模型,特别是在中间和阴天条件下。我们还表明,在特定地点的数据上训练的模型不能有效地应用于其他地点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信