Vision transformers for estimating irradiance using data scarce sky images

IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
David Hamlyn, Sunny Chaudhary, Tasmiat Rahman
{"title":"Vision transformers for estimating irradiance using data scarce sky images","authors":"David Hamlyn,&nbsp;Sunny Chaudhary,&nbsp;Tasmiat Rahman","doi":"10.1016/j.egyai.2025.100560","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate estimation of diffuse horizontal irradiance (DHI) is critical for optimising photovoltaic system performance and energy forecasting yet remains challenging in regions lacking comprehensive ground-based instrumentation. Recent advancements using Vision Transformers (ViTs) trained on extensive sky image datasets have shown promise in replacing costly irradiance measurement equipment, but the scarcity of long-term, high-quality sky imagery significantly restricts practical implementation. Addressing this critical gap, this study proposes a novel dual-framework approach designed for data-scarce scenarios. First, calculated atmospheric parameters, including extraterrestrial irradiance and cyclic time encodings, are integrated to represent sky conditions without utilising any instrumentation. Next, a sequential pipeline initially predicts synthetic global horizontal irradiance (GHI) and uses it as a feature, to refine DHI estimation. Finally, a dual-parallel architecture simultaneously processes raw and overlay-enhanced fisheye sky images. Overlays are generated through unsupervised, physics-informed cloud segmentation to highlight dynamic sky features. Empirical validation is performed using data from the Chilbolton Observatory, chosen for its temperate climate and frequent cloud variability. To simulate data-scarce conditions, models are trained on a single month (e.g., January) and evaluated across a temporally disjoint, full-year test set. Under this setup, the sequential and dual-parallel frameworks achieve RMSE values within 2–3 W/m² and 1–6 W/m², respectively, of a state-of-the-art ViT trained on the complete dataset. By combining physics-informed modelling with unsupervised segmentation, the proposed method provides a scalable and cost-effective solution for DHI estimation, advancing solar resource assessment in data-constrained environments.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"21 ","pages":"Article 100560"},"PeriodicalIF":9.6000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546825000928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate estimation of diffuse horizontal irradiance (DHI) is critical for optimising photovoltaic system performance and energy forecasting yet remains challenging in regions lacking comprehensive ground-based instrumentation. Recent advancements using Vision Transformers (ViTs) trained on extensive sky image datasets have shown promise in replacing costly irradiance measurement equipment, but the scarcity of long-term, high-quality sky imagery significantly restricts practical implementation. Addressing this critical gap, this study proposes a novel dual-framework approach designed for data-scarce scenarios. First, calculated atmospheric parameters, including extraterrestrial irradiance and cyclic time encodings, are integrated to represent sky conditions without utilising any instrumentation. Next, a sequential pipeline initially predicts synthetic global horizontal irradiance (GHI) and uses it as a feature, to refine DHI estimation. Finally, a dual-parallel architecture simultaneously processes raw and overlay-enhanced fisheye sky images. Overlays are generated through unsupervised, physics-informed cloud segmentation to highlight dynamic sky features. Empirical validation is performed using data from the Chilbolton Observatory, chosen for its temperate climate and frequent cloud variability. To simulate data-scarce conditions, models are trained on a single month (e.g., January) and evaluated across a temporally disjoint, full-year test set. Under this setup, the sequential and dual-parallel frameworks achieve RMSE values within 2–3 W/m² and 1–6 W/m², respectively, of a state-of-the-art ViT trained on the complete dataset. By combining physics-informed modelling with unsupervised segmentation, the proposed method provides a scalable and cost-effective solution for DHI estimation, advancing solar resource assessment in data-constrained environments.

Abstract Image

利用数据稀缺的天空图像估计辐照度的视觉变压器
准确估计漫射水平辐照度(DHI)对于优化光伏系统性能和能源预测至关重要,但在缺乏综合地面仪器的地区仍然具有挑战性。使用视觉变压器(ViTs)在广泛的天空图像数据集上训练的最新进展已经显示出取代昂贵的辐照度测量设备的希望,但长期高质量天空图像的稀缺严重限制了实际实施。为了解决这一关键问题,本研究提出了一种新的双框架方法,用于数据稀缺的场景。首先,在不使用任何仪器的情况下,将计算出的大气参数(包括地外辐照度和循环时间编码)集成在一起以表示天空状况。接下来,顺序管道首先预测合成全球水平辐照度(GHI),并将其用作特征,以改进DHI估计。最后,双并行架构同时处理原始和叠加增强的鱼眼天空图像。叠加是通过无监督、物理信息云分割生成的,以突出动态天空特征。使用Chilbolton天文台的数据进行经验验证,选择Chilbolton天文台是因为其气候温和,云变化频繁。为了模拟数据稀缺的情况,模型在单个月(例如1月)进行训练,并在一个时间上不相交的全年测试集上进行评估。在这种设置下,顺序和双并行框架分别在完整数据集上训练的最先进的ViT的2-3 W/m²和1-6 W/m²内实现RMSE值。通过将物理信息建模与无监督分割相结合,该方法为DHI估计提供了一种可扩展且具有成本效益的解决方案,推进了数据约束环境下的太阳能资源评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Energy and AI
Energy and AI Engineering-Engineering (miscellaneous)
CiteScore
16.50
自引率
0.00%
发文量
64
审稿时长
56 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信