A new machine-learning based cloud mask using harmonized data of two Meteosat generations shows a general decrease in cloudiness over Europe in recent decades
Sheetabh Gaurav, Boris Thies, Sebastian Egli, Jörg Bendix
{"title":"A new machine-learning based cloud mask using harmonized data of two Meteosat generations shows a general decrease in cloudiness over Europe in recent decades","authors":"Sheetabh Gaurav, Boris Thies, Sebastian Egli, Jörg Bendix","doi":"10.1016/j.rse.2025.114599","DOIUrl":null,"url":null,"abstract":"Mid-latitude stratus clouds with large spatial extent are important cooling engines in a warming world, while other types of clouds may accelerate warming. However, our understanding of cloud feedback in a changing climate remains incomplete in both space and time. A key factor contributing to this knowledge gap is the lack of long-term observations with spatio-temporally continuous information over large areas. Satellite data from the geostationary orbit could help in this regard, but they were never originally intended for climatological studies and, as a result, provide inconsistent data between individual satellites of different satellite generations. However, for investigations on a time scale of 30 years and more, a homogeneous dataset of gross cloud occurrence is essential to assess changes in cloud cover over the last decades. In addition, such a dataset is the basis for further analyzing long-term changes in other cloud types such as fog and low stratus (FLS). The generation of temporally homogeneous cloud information over Europe requires a dataset that is consistent in space and time. The current study develops a new cloud detection scheme based on harmonized radiances obtained by cross-calibrating Meteosat First (MFG) MVIRI (Meteosat Visible Infra-Red Imager) and Second Generation (MSG) SEVIRI (Spinning Enhanced Visible and Infra-Red Imager) data. The harmonized data set consists of two MFG bands (thermal infrared IR and water vapour WV), which guarantee long-term (1991–2020) availability over the full diurnal cycle (24 h). The new cloud classification scheme is based on eXtreme Gradient Boosting (XGBoost) and uses the two MFG channels as primary predictors or features. While cloud detection using only two MFG channels is a challenging task, additional features such as temporal trends in brightness temperature (BT), its spatial heterogeneity, clear sky reference BTs, topographic variables, and solar and satellite angles are also considered in the XGBoost model. The EUMETSAT CM SAF SEVIRI cloud mask based on MSG SEVIRI is used in part as the binary target variable to train the XGBoost model (cloudy/clear-sky) and as a benchmark to test the performance of the newly developed cloud detection scheme. Test results show very good agreement with the benchmark CM SAF SEVIRI cloud mask, with an average Heidke Skill Score (HSS) of 0.83 for day-time and 0.8 for night-time cloud occurrence. Further testing shows that the new cloud mask clearly outperforms the existing EUMETSAT Optimal Cloud Analysis (OCA) dataset based on MSG visible and IR 10.8 μm channels. In particular, the FLS detection in our cloud mask was found to be superior to the OCA during night and boreal winter. Based on the trend analysis of the generated time series of cloud frequencies, we found a general decrease in cloudiness over the last 30 years in many parts of Europe.","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"46 1","pages":""},"PeriodicalIF":11.1000,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.rse.2025.114599","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Mid-latitude stratus clouds with large spatial extent are important cooling engines in a warming world, while other types of clouds may accelerate warming. However, our understanding of cloud feedback in a changing climate remains incomplete in both space and time. A key factor contributing to this knowledge gap is the lack of long-term observations with spatio-temporally continuous information over large areas. Satellite data from the geostationary orbit could help in this regard, but they were never originally intended for climatological studies and, as a result, provide inconsistent data between individual satellites of different satellite generations. However, for investigations on a time scale of 30 years and more, a homogeneous dataset of gross cloud occurrence is essential to assess changes in cloud cover over the last decades. In addition, such a dataset is the basis for further analyzing long-term changes in other cloud types such as fog and low stratus (FLS). The generation of temporally homogeneous cloud information over Europe requires a dataset that is consistent in space and time. The current study develops a new cloud detection scheme based on harmonized radiances obtained by cross-calibrating Meteosat First (MFG) MVIRI (Meteosat Visible Infra-Red Imager) and Second Generation (MSG) SEVIRI (Spinning Enhanced Visible and Infra-Red Imager) data. The harmonized data set consists of two MFG bands (thermal infrared IR and water vapour WV), which guarantee long-term (1991–2020) availability over the full diurnal cycle (24 h). The new cloud classification scheme is based on eXtreme Gradient Boosting (XGBoost) and uses the two MFG channels as primary predictors or features. While cloud detection using only two MFG channels is a challenging task, additional features such as temporal trends in brightness temperature (BT), its spatial heterogeneity, clear sky reference BTs, topographic variables, and solar and satellite angles are also considered in the XGBoost model. The EUMETSAT CM SAF SEVIRI cloud mask based on MSG SEVIRI is used in part as the binary target variable to train the XGBoost model (cloudy/clear-sky) and as a benchmark to test the performance of the newly developed cloud detection scheme. Test results show very good agreement with the benchmark CM SAF SEVIRI cloud mask, with an average Heidke Skill Score (HSS) of 0.83 for day-time and 0.8 for night-time cloud occurrence. Further testing shows that the new cloud mask clearly outperforms the existing EUMETSAT Optimal Cloud Analysis (OCA) dataset based on MSG visible and IR 10.8 μm channels. In particular, the FLS detection in our cloud mask was found to be superior to the OCA during night and boreal winter. Based on the trend analysis of the generated time series of cloud frequencies, we found a general decrease in cloudiness over the last 30 years in many parts of Europe.
空间范围大的中纬度层云是全球变暖的重要降温引擎,而其他类型的云可能加速全球变暖。然而,我们对气候变化中的云反馈的理解在空间和时间上都是不完整的。造成这种知识差距的一个关键因素是缺乏具有大面积时空连续信息的长期观测。来自地球静止轨道的卫星数据在这方面可以提供帮助,但它们最初从未打算用于气候研究,因此,不同卫星世代的单个卫星之间提供的数据不一致。然而,对于30年或更长的时间尺度的调查,一个均匀的总云量数据集对于评估过去几十年云量的变化是必不可少的。此外,这样的数据集是进一步分析雾和低层云(FLS)等其他云类型的长期变化的基础。在欧洲产生时间上同质的云信息需要一个在空间和时间上一致的数据集。目前的研究开发了一种新的云检测方案,该方案基于交叉校准气象卫星第一代(MFG) MVIRI(气象卫星可见红外成像仪)和第二代(MSG) SEVIRI(旋转增强可见和红外成像仪)数据获得的协调辐射。统一的数据集由两个MFG波段(热红外IR和水蒸气WV)组成,保证了整个昼夜周期(24 h)的长期(1991-2020)可用性。新的云分类方案基于极端梯度增强(XGBoost),并使用两个MFG通道作为主要预测因子或特征。虽然仅使用两个MFG通道进行云探测是一项具有挑战性的任务,但XGBoost模型还考虑了其他特征,如亮度温度(BT)的时间趋势、空间异质性、晴空参考BT、地形变量以及太阳和卫星角度。基于MSG SEVIRI的EUMETSAT CM SAF SEVIRI云掩模部分用作二元目标变量来训练XGBoost模型(多云/晴空),并作为测试新开发的云检测方案性能的基准。测试结果显示与基准CM SAF SEVIRI云掩膜非常吻合,白天的平均海德克技能分数(HSS)为0.83,夜间云出现的平均海德克技能分数(HSS)为0.8。进一步的测试表明,新的云掩模明显优于现有的基于MSG可见和IR 10.8 μm通道的EUMETSAT最优云分析(OCA)数据集。特别是,在夜间和北方冬季,我们的云掩膜中的FLS检测结果优于OCA。根据生成的云频率时间序列的趋势分析,我们发现过去30年来欧洲许多地区的云量普遍减少。
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.